From 9a8a79f9c30db293a11c953e4032fcd2bb8d9ada Mon Sep 17 00:00:00 2001
From: Allan Bowe
Date: Sat, 20 Mar 2021 14:56:16 +0100
Subject: [PATCH] chore: automated commit

---
 docs/dcu-fileupload.md | 36 +++++++++++++++++++-----------------
 draw/excelcost.drawio  |  0
 2 files changed, 19 insertions(+), 17 deletions(-)
 create mode 100644 draw/excelcost.drawio

diff --git a/docs/dcu-fileupload.md b/docs/dcu-fileupload.md
index 274b641..421d93f 100644
--- a/docs/dcu-fileupload.md
+++ b/docs/dcu-fileupload.md
@@ -4,6 +4,23 @@ Files can be uploaded via the Editor interface - first choose the library and ta
 
 
 
+## Excel Uploads
+
+Thanks to our pro license of [sheetJS](https://sheetjs.com/), we can support all versions of excel, large workbooks, and extract data extremely fast.
+
+The rules for data extraction are:
+
+* Scan the spreadsheet until a row is found with all the target columns (not case sensitive)
+* Extract data below that until the *first row containing a blank primary key value*
+
+This is incredibly flexible, and means:
+
+* data can be anywhere, on any worksheet
+* data can contain additional columns (they are just ignored)
+* data can be completely surrounded by other data
+
+A copy of the original Excel file is also uploaded to the staging area. This means that a complete audit trail can be captured, right back to the original source data.
+
 ## CSV Uploads
 
 The following should be considered when uploading data in this way:
@@ -16,31 +33,16 @@ The following should be considered when uploading data in this way:
  - The order of variables does not matter
  - The delimiter is extracted from the header row - so for `var1;var2;var3` the delimeter would be assumed to be a semicolon
  - The above assumes the delimiter is the first special character! So `var,1;var2;var3` would fail
- - The following characters should not be used as delimiters
+ - The following characters should **not** be used as delimiters
     - doublequote
     - quote
     - space
     - underscore
 
-When loading dates, be aware that the data controller makes use of the `ANYDTDTE` and `ANYDTDTTME` informats (width 19).
+When loading dates, be aware that Data Controller makes use of the `ANYDTDTE` and `ANYDTDTTME` informats (width 19).
 This means that uploaded date / datetime values should be unambiguous (eg `01FEB1942` vs `01/02/42`) to avoid confusion - as the latter could be interpreted as `02JAN2042` depending on your locale and options `YEARCUTOFF` settings.
 
 Note that UTC dates with offset values (eg `2018-12-26T09:19:25.123+0100`) are not currently supported. If this is a feature you would like to see, contact us.
 
 !!! tip
     To get a copy of a file in the right format for upload, use the [file download](/dc-userguide/#usage) feature in the Viewer tab
-## Excel Uploads
-Thanks to our pro license of [sheetJS](https://sheetjs.com/), we can support all versions of excel, and extract the data super quickly to boot.
-
-The rules for data extraction are:
-
-* Scan the spreadsheet until a row is found with all the target columns (with no blank cells between columns)
-* Extract data below that row up until the first blank primary key value
-
-This is incredibly flexible, and means:
-
-* data can be anywhere, on any worksheet
-* data can contain additional columns (they are just ignored)
-* data can be completely surrounded by other data
-
-A copy of the original Excel file is also uploaded to the staging area. This means that a complete audit trail can be captured, right back to the original source data.
diff --git a/draw/excelcost.drawio b/draw/excelcost.drawio
new file mode 100644
index 0000000..e69de29
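
Review note: the two extraction rules added under `## Excel Uploads` (find a header row containing every target column, case-insensitively, then read rows until the first one with a blank primary key value) could be illustrated with a small sketch. The code below is only an assumption-laden model of those rules, not Data Controller's implementation and not the sheetJS API: the worksheet is represented as a plain list of rows, and `extract_table`, `target_cols`, and `pk_cols` are hypothetical names chosen for illustration.

```python
from typing import Any, Dict, List, Optional, Sequence


def _norm(cell: Any) -> str:
    """Normalise a cell so header matching is not case sensitive."""
    return "" if cell is None else str(cell).strip().lower()


def _is_blank(cell: Any) -> bool:
    """Treat None and whitespace-only values as blank."""
    return cell is None or str(cell).strip() == ""


def extract_table(
    rows: Sequence[Sequence[Any]],
    target_cols: Sequence[str],
    pk_cols: Sequence[str],
) -> Optional[List[Dict[str, Any]]]:
    """Hypothetical model of the documented rules, on a list-of-rows grid.

    Returns the extracted records, or None if no row holds all target columns.
    """
    wanted = {c.lower() for c in target_cols}
    pk = {c.lower() for c in pk_cols}

    for i, row in enumerate(rows):
        # Rule 1: look for a row that contains every target column name.
        positions: Dict[str, int] = {}
        for j, cell in enumerate(row):
            name = _norm(cell)
            if name in wanted and name not in positions:
                positions[name] = j
        if len(positions) < len(wanted):
            continue  # not the header row, keep scanning

        # Rule 2: read rows below the header until a primary key is blank.
        records: List[Dict[str, Any]] = []
        for data_row in rows[i + 1:]:
            record = {}
            for col in target_cols:
                idx = positions[col.lower()]
                record[col] = data_row[idx] if idx < len(data_row) else None
            if any(_is_blank(v) for k, v in record.items() if k.lower() in pk):
                break
            records.append(record)
        return records

    return None


# Example grid: the table is offset, has extra columns, and is followed by
# unrelated data after a row with a blank primary key.
sheet = [
    ["Report title", None, None],
    [None, "notes", "ID", "extra", "Name"],
    [None, "x", 1, "ignored", "alpha"],
    [None, "y", 2, "ignored", "beta"],
    [None, "z", None, "ignored", "gamma"],
    [None, "w", 4, "ignored", "delta"],
]
result = extract_table(sheet, ["id", "name"], ["id"])
```

In this sketch the extra columns (`notes`, `extra`) are simply never read, which mirrors the documented behaviour that additional columns are ignored and that the table may be surrounded by other data.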