ceph-diff-sorted - Man Page

compare two sorted files line by line


Synopsis

ceph-diff-sorted file1 file2


Description

ceph-diff-sorted is a simplified diff utility optimized for comparing two files whose lines are lexically sorted.

The output is simplified in comparison to that of the standard diff tool available in POSIX systems. Angle brackets ('<' and '>') are used to show lines that appear in one file but not the other. The output is not compatible with the patch tool.

This tool was created to diff very large files (e.g., containing billions of lines) that the standard diff tool cannot handle efficiently. Because the lines are known to be sorted, the two files can be compared with minimal memory overhead.
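To illustrate why sorted input keeps memory use small, here is a minimal bash sketch of a merge-style comparison that holds only one line from each file at a time. It is not the Ceph implementation: the function name diff_sorted_sketch, the exact "< line" and "> line" output layout, and the direction of the brackets ('<' for lines only in the first file, as in standard diff) are assumptions made for illustration.

```bash
#!/usr/bin/env bash
# Sketch only: diff_sorted_sketch is a hypothetical name, not part of Ceph.
# Assumes both inputs are sorted in C-locale (byte) order and end with a
# newline; a final line without a trailing newline is ignored by this sketch.
diff_sorted_sketch() {
    local LC_ALL=C                      # make [[ a < b ]] compare in byte order
    local a b have_a=0 have_b=0 differ=0
    exec 3<"$1" 4<"$2"                  # open both files on their own descriptors
    IFS= read -r a <&3 && have_a=1
    IFS= read -r b <&4 && have_b=1
    while (( have_a || have_b )); do
        if (( !have_b )) || { (( have_a )) && [[ $a < $b ]]; }; then
            # current line exists only in the first file
            printf '< %s\n' "$a"; differ=1
            have_a=0; IFS= read -r a <&3 && have_a=1
        elif (( !have_a )) || [[ $a > $b ]]; then
            # current line exists only in the second file
            printf '> %s\n' "$b"; differ=1
            have_b=0; IFS= read -r b <&4 && have_b=1
        else
            # same line in both files: advance both, print nothing
            have_a=0; IFS= read -r a <&3 && have_a=1
            have_b=0; IFS= read -r b <&4 && have_b=1
        fi
    done
    exec 3<&- 4<&-                      # close the descriptors
    return "$differ"                    # 0 if no differences were printed, else 1
}
```

Each file is read once from start to end, and no line is revisited, which is what makes the approach practical for inputs far larger than memory.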

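Each file must be sorted lexically. On most POSIX systems the sort tool takes its sorting order from the locale, which is commonly set through the LANG environment variable (LC_ALL and LC_COLLATE take precedence over LANG when they are set, in which case use LC_ALL=C instead). To sort lexically, use something such as: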

$ LANG=C sort some-file.txt >some-file-sorted.txt
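Before comparing very large files it can be worth confirming that each input really is in lexical order, since unsorted order and empty lines are reported as bad file content. The following is a minimal sketch using the POSIX sort -c check and grep; the helper name check_input is hypothetical and is not part of Ceph.

```bash
#!/usr/bin/env bash
# Sketch only: check_input is a hypothetical helper name, not part of Ceph.
check_input() {
    local file=$1
    # '^$' matches a line with no characters, i.e. an empty line
    if grep -q '^$' -- "$file"; then
        echo "$file: contains empty lines" >&2
        return 1
    fi
    # sort -c only checks the order; it exits non-zero at the first
    # out-of-order line and never writes a sorted copy
    if ! LC_ALL=C sort -c -- "$file" 2>/dev/null; then
        echo "$file: not sorted in C-locale (byte) order" >&2
        return 1
    fi
}
```

A call such as check_input some-file-sorted.txt returns 0 when the file passes both checks and 1 otherwise.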


Examples

Compare two files:

$ ceph-diff-sorted fileA.txt fileB.txt

Exit Status

When complete, the exit status will be set to one of the following:


0  files same

1  files different

2  usage problem (e.g., wrong number of command-line arguments)

3  problem opening input file

4  bad file content (e.g., unsorted order or empty lines)
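A script can branch on the exit status to separate "files differ" from real failures. The sketch below assumes, following the convention of the standard diff tool and the order of the list above, that 0 means the files are the same and 1 means they differ, and it treats every other status as an error. The wrapper name compare_sorted and the DIFF_SORTED_CMD variable are hypothetical; the variable exists only so the wrapper can be exercised without Ceph installed.

```bash
#!/usr/bin/env bash
# Sketch only: compare_sorted and DIFF_SORTED_CMD are hypothetical names.
# Usage: compare_sorted file1 file2 output-file
compare_sorted() {
    local out=$3 status=0
    # Run the comparison, saving its output; capture a non-zero exit status
    "${DIFF_SORTED_CMD:-ceph-diff-sorted}" "$1" "$2" >"$out" || status=$?
    case $status in
        0) echo "files are the same" ;;            # assumed meaning of 0
        1) echo "files differ; see $out" ;;        # assumed meaning of 1
        *) echo "comparison failed with exit status $status" >&2 ;;
    esac
    return "$status"                               # pass the status through
}
```

For example, compare_sorted fileA.txt fileB.txt differences.txt prints a one-line summary and returns the tool's own exit status to the caller.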


Availability

ceph-diff-sorted is part of Ceph, a massively scalable, open-source, distributed storage system. Please refer to the Ceph documentation at http://ceph.com/docs for more information.

May 06, 2021 dev Ceph