docs: LEAP-1657: Add a note to Text tag about \r\n (#6645)

Co-authored-by: robot-ci-heartex <[email protected]> Co-authored-by: Max Tkachenko <[email protected]>
HumanSignal · Nov 14, 2024 · 525dd19
1 parent 92c85c5
commit 525dd19

Showing 2 changed files with 15 additions and 0 deletions.
diff --git a/docs/source/tags/text.md b/docs/source/tags/text.md
@@ -12,6 +12,13 @@ Every space in the text sample is counted when calculating result offsets, for e
 
 Use with the following data types: text.
 
+### How to read my text files in python?
+The Label Studio editor counts `\r\n` as two different symbols, displaying them as `\n\n`, making it look like there is extra margin between lines.
+You should either preprocess your files to replace `\r\n` with `\n` completely, or open files in Python with `newline=''` to avoid converting `\r\n` to `\n`:
+`with open('my-file.txt', encoding='utf-8', newline='') as f: text = f.read()`
+This is especially important when you are doing span NER labeling and need to get the correct offsets:
+`text[start_offset:end_offset]`
+
 ### Parameters
 
 | Param | Type | Default | Description |

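The note added to `text.md` above can be checked with a small script. This is a minimal sketch, not part of the commit: the sample text, the temporary file, and the variable names `translated` and `untranslated` are illustrative assumptions. It also assumes the span offsets were computed against text that still contains `\r\n`, as the note describes.

```python
import os
import tempfile

# Hypothetical sample with Windows line endings, written as raw bytes
# so that no newline translation happens on the way to disk.
raw = b"Alice met Bob.\r\nBob lives in Paris.\r\n"

with tempfile.NamedTemporaryFile(delete=False, suffix=".txt") as tmp:
    tmp.write(raw)
    path = tmp.name

try:
    # Default text mode: universal newlines turn "\r\n" into "\n" on read.
    with open(path, encoding="utf-8") as f:
        translated = f.read()

    # newline='' returns line endings untranslated, so "\r\n" survives.
    with open(path, encoding="utf-8", newline="") as f:
        untranslated = f.read()
finally:
    os.remove(path)

# Offsets of a span, assumed to be counted with "\r\n" as two characters.
start_offset = untranslated.index("Paris")
end_offset = start_offset + len("Paris")

print(repr(untranslated[start_offset:end_offset]))  # slice of the untranslated text
print(repr(translated[start_offset:end_offset]))    # same offsets, translated text
```

With the default mode, each `\r\n` before the span shortens the string by one character, so the same offsets drift to the right by that amount. Replacing `\r\n` with `\n` before importing the data avoids the mismatch in the other direction, as the note suggests.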
diff --git a/web/libs/editor/src/tags/object/Text.js b/web/libs/editor/src/tags/object/Text.js
@@ -6,6 +6,14 @@
  * Every space in the text sample is counted when calculating result offsets, for example for NER labeling tasks.
  *
  * Use with the following data types: text.
+ *
+ * ### How to read my text files in python?
+ * The Label Studio editor counts `\r\n` as two different symbols, displaying them as `\n\n`, making it look like there is extra margin between lines.
+ * You should either preprocess your files to replace `\r\n` with `\n` completely, or open files in Python with `newline=''` to avoid converting `\r\n` to `\n`:
+ * `with open('my-file.txt', encoding='utf-8', newline='') as f: text = f.read()`
+ * This is especially important when you are doing span NER labeling and need to get the correct offsets:
+ * `text[start_offset:end_offset]`
+ *
  * @example
  * <!--Labeling configuration to label text for NER tasks with a word-level granularity -->
  * <View>