Specify what constitutes a new-line #70

@tahonermann

The C++ standard defines behavior that depends on the presence of a new-line, but does not define what characters or character sequences constitute a new-line. Uses of the "newline" and "new-line" terms appear in:

P2178 proposal 2 sought to clarify the set of characters and character sequences that constitute new-lines and proposed the following set. This set was derived from information in UAX #14.

  • U+000A: LINE FEED (LF)
  • U+000B: LINE TABULATION
  • U+000C: FORM FEED (FF)
  • U+000D: CARRIAGE RETURN (CR)
  • U+000D: CARRIAGE RETURN (CR) then U+000A: LINE FEED (LF)
  • U+0085: NEXT LINE (NEL)
  • U+2028: LINE SEPARATOR
  • U+2029: PARAGRAPH SEPARATOR

This set matches the guidance in Unicode 13, section 5.8, "Newline Guidelines".
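As a rough illustration of what treating this set as new-lines could look like in code, here is a minimal sketch. It is not proposed wording and is not taken from P2178. The names `new_line_length` and `count_new_lines` are hypothetical, and the sketch assumes the input has already been decoded to UTF-32 code points. CR immediately followed by LF is treated as a single new-line, matching the list above.

```cpp
#include <cstddef>
#include <string_view>

// Hypothetical helper: returns the length, in code points, of the new-line
// sequence starting at text[pos], or 0 if no new-line starts there.
// Assumes text holds decoded UTF-32 code points.
constexpr std::size_t new_line_length(std::u32string_view text, std::size_t pos) {
    if (pos >= text.size()) return 0;
    switch (text[pos]) {
    case 0x000D: // CARRIAGE RETURN, possibly followed by LINE FEED
        return (pos + 1 < text.size() && text[pos + 1] == 0x000A) ? 2 : 1;
    case 0x000A: // LINE FEED
    case 0x000B: // LINE TABULATION
    case 0x000C: // FORM FEED
    case 0x0085: // NEXT LINE
    case 0x2028: // LINE SEPARATOR
    case 0x2029: // PARAGRAPH SEPARATOR
        return 1;
    default:
        return 0;
    }
}

// Hypothetical helper: counts new-line sequences in text, consuming CR LF
// as one sequence rather than two.
constexpr std::size_t count_new_lines(std::u32string_view text) {
    std::size_t count = 0;
    for (std::size_t pos = 0; pos < text.size();) {
        const std::size_t len = new_line_length(text, pos);
        if (len != 0) ++count;
        pos += (len != 0) ? len : 1; // skip the whole sequence, or one code point
    }
    return count;
}
```

Under these assumptions, the order of the pair matters: `U"\r\n"` is one new-line, while `U"\n\r"` is two, because only CR followed by LF is listed as a sequence.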

When addressing this issue, we should settle on either "newline" or "new-line". The Unicode standard uses "newline". ISO 10646 only mentions newline in character names.

Labels: clarification, help wanted, paper needed
