Keep field name for csv timestamp column (don't implicitly convert) #8440

stephanie-engel · 2020-11-19T21:30:36Z

Required for all PRs:

Signed CLA.
Associated README.md updated.
Has appropriate unit tests.

Root Cause Analysis:
Within the csv parser, the value in the csv_timestamp_column is initially converted from a string into an int. As a result, a timestamp such as 202011131605 is interpreted as a Unix timestamp instead of being parsed with the Go reference-time layout 200601021504.

The Fix:

Prevent implicit type conversion on the timestamp column, so that parseTimestamp receives the original string and can parse it correctly whether the configured format is a Unix format or a Go reference-time layout.
Add a unit test that verifies TimestampFormat: "200601021504".

closes #7288

srebhan

Hey @stephanie-engel, nice work! However, I think we also have to handle the cases where the column-type is given explicitly. This is the section above your change. How about handling both cases by adding

		if fieldName == p.TimestampColumn {
			recordFields[fieldName] = value
			continue
		}

directly before line 208

			if len(p.ColumnTypes) > 0 {

This way you can leave lines 241-250 unchanged.
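
For readers without the file open, the placement can be sketched roughly as follows. This is a simplified, hypothetical stand-in and not the parser's real code: the Parser struct, the convertRecord method, and the map-based ColumnTypes are assumptions made so the example is self-contained. Only the guard itself and the len(p.ColumnTypes) > 0 check are taken from the snippets above.

```go
package main

import (
	"fmt"
	"strconv"
)

// Parser is a stripped-down, hypothetical stand-in for the csv parser.
// Only the two settings discussed in this thread are modeled.
type Parser struct {
	TimestampColumn string
	// ColumnTypes is simplified to a name -> type map for this sketch.
	ColumnTypes map[string]string
}

// convertRecord (hypothetical) converts raw string values into typed
// fields, leaving the timestamp column untouched.
func (p *Parser) convertRecord(record map[string]string) map[string]interface{} {
	recordFields := make(map[string]interface{})
	for fieldName, value := range record {
		// The guard sits before both conversion paths, so the timestamp
		// column keeps its original string in either case.
		if fieldName == p.TimestampColumn {
			recordFields[fieldName] = value
			continue
		}

		// Explicit conversion: column types were configured.
		if len(p.ColumnTypes) > 0 {
			if p.ColumnTypes[fieldName] == "int" {
				if n, err := strconv.ParseInt(value, 10, 64); err == nil {
					recordFields[fieldName] = n
					continue
				}
			}
			recordFields[fieldName] = value
			continue
		}

		// Implicit conversion: try an integer first, else keep the string.
		if n, err := strconv.ParseInt(value, 10, 64); err == nil {
			recordFields[fieldName] = n
		} else {
			recordFields[fieldName] = value
		}
	}
	return recordFields
}

func main() {
	p := &Parser{TimestampColumn: "time"}
	out := p.convertRecord(map[string]string{"time": "202011131605", "count": "7"})
	fmt.Printf("time=%v (%T), count=%v (%T)\n", out["time"], out["time"], out["count"], out["count"])
}
```

Because the guard runs first, a single check covers both the explicit and the implicit path, and neither conversion branch needs to know about the timestamp column.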

stephanie-engel · 2020-11-20T15:05:25Z

Thanks for the great feedback, @srebhan! I went ahead and made the changes you requested. I confirmed that the timestamp is still parsed correctly and that the unit tests still pass 😄

ssoroka

Looks great, congratulations. 🥇

(cherry picked from commit 247230c)

stephanie-engel requested a review from ssoroka November 19, 2020 21:30

srebhan requested changes Nov 20, 2020

srebhan self-assigned this Nov 20, 2020

keep field name as is for csv timestamp column

3daf53a

stephanie-engel force-pushed the se-csv-7288 branch from 6bcdfdb to 3daf53a Compare November 20, 2020 15:03

ssoroka approved these changes Nov 20, 2020

stephanie-engel merged commit 247230c into master Nov 20, 2020

stephanie-engel deleted the se-csv-7288 branch November 20, 2020 15:52

ssoroka pushed a commit that referenced this pull request Dec 1, 2020

keep field name as is for csv timestamp column (#8440)

da202db

(cherry picked from commit 247230c)

Hipska added the area/csv csv parser/serialiser related label Feb 15, 2022

arstercz pushed a commit to arstercz/telegraf that referenced this pull request Mar 5, 2023

keep field name as is for csv timestamp column (influxdata#8440)

f520249
