Duplicated protein sequences for A. thaliana
The "protein" field of the GFF also seems to be used to extract CDS sequences.
The N0.tsv file from OrthoFinder shows orthogroups in which the same A. thaliana ID appears twice, once as is and once with a "-Protein" suffix:
grep "AT5G45890" N0.tsv
N0.HOG0000159 OG0000092 n67 AT5G45890.1, AT5G45890.1-Protein Ca_v2.0_17488.1 Lcu.2RBY.4g047180.1 MtrunA17Chr4g0040851.1 Psat7g166640.1 Vfaba.Hedin2.R1.6g078440.1
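To check how many orthogroups are affected, something like the following Python sketch could be used. It assumes that N0.tsv is tab-separated, that gene IDs within a cell are comma-separated, and that the duplicates always carry the "-Protein" suffix seen above. The function name `find_suffix_duplicates` is made up for this example and is not part of OrthoFinder or AGAT.

```python
import csv
import io

SUFFIX = "-Protein"  # suffix seen on the duplicated IDs above (assumed constant)


def find_suffix_duplicates(tsv_text, suffix=SUFFIX):
    """Return (hog, base_id) pairs where base_id and base_id + suffix share a row."""
    hits = []
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        if not row:
            continue
        hog = row[0]
        # Collect every ID in the row; cells may hold several comma-separated IDs.
        ids = set()
        for cell in row[1:]:
            for gene in cell.split(","):
                gene = gene.strip()
                if gene:
                    ids.add(gene)
        # Report IDs that exist both with and without the suffix.
        for gene in sorted(ids):
            if gene.endswith(suffix) and gene[: -len(suffix)] in ids:
                hits.append((hog, gene[: -len(suffix)]))
    return hits


# Shortened version of the row shown above, with tabs between columns.
example = (
    "N0.HOG0000159\tOG0000092\tn67\t"
    "AT5G45890.1, AT5G45890.1-Protein\tCa_v2.0_17488.1\n"
)
print(find_suffix_duplicates(example))
# prints [('N0.HOG0000159', 'AT5G45890.1')]
```

To run it on the real file, the content can be passed in directly, e.g. `find_suffix_duplicates(open("N0.tsv").read())`.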
Related to issue: https://github.com/NBISweden/AGAT/issues/189