CuGenDBv2

Spg006464 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg006464
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold2:47623368..47638925
RNA-Seq Expression: Spg006464
Synteny: Spg006464
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG7022388.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | identity: 87.57%
Query:  LIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFG
        LIHE I HGSTKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH G
Subjt:  LIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFG

Query:  RKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARS
        RKVFDTMPERNVVPWT IIG YS QGDV IAF+MFKQMRENEI PTSVT LSLLPGI ELPLLQ LHC I+LYGFGS+LALSNSMV+MYGRCGS+D A S
Subjt:  RKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARS

Query:  LFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLAL
        LFESMD RDIVSWNSLLSAYSKI GIEEILQL+H+MRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTLVVLYL+C  LD AL
Subjt:  LFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLAL

Query:  KVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQS
        KVFE TIEKDVVLWTAMISGLVQNDCSDKAL VFY M+E+NVE GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRLEQS
Subjt:  KVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQS

Query:  CAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLE
        CAIFNK+VEK+LVSWNAI+AGHAKNGYLSKAIFFF+EMR T+FQRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG++E
Subjt:  CAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLE

Query:  NAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGK
        NAQKCFDYML KDLVTWSTLIAGYG NGKG+IALRKYSEFL TGMEPNHVIFLSVLSACSHSGLI+QGLSIYESM KDF MPTNLEH+ACIIDLLSRAGK
Subjt:  NAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGK

Query:  VEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        VEEAYSFY  MF+EPSIDVLGILL ACRVNGSV+LGEVIARDM ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWSCIE
Subjt:  VEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

XP_022931568.1 pentatricopeptide repeat-containing protein At4g04370 [Cucurbita moschata] | e value: 0.0e+00 | identity: 87.3%
Query:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHE I HGSTKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH 
Subjt:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTMPERNVVPWT IIGCYS QGDV IAF+MFKQMRENEI PTSVT LSLLPGI ELPLLQ LHC I+LYGFGS+LALSNSMV+MYGRCGS+D A 
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA
        SLFESMD RDIVSWNSLLSAYSKI GIEEILQL+H+MRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTLVVLYL+C  LD A
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA

Query:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
        LKVFE TIEKDVVLWTAMISGLVQNDCSD AL VFY M+E+NVE GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRLEQ
Subjt:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL
        SCAIFNK+VEK+LVSWNAI+AGHAKNGYLSKAIFFF+EMR T+FQRPDSITV SLLQACG  GALCQGKWIH+F+FRSSL+PCIM ETALVDMYFKCG++
Subjt:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL

Query:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG
        ENAQKCFDYMLQKDLVTWSTLIAGYGFNG+G+IALRKYSEFL TGMEPNHVIFLSVLSACSHSGLI+QGLSIYESM KDF MPTNLEH+ACIIDLLSRAG
Subjt:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG

Query:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        KVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+RDM ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWSCIE
Subjt:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

XP_022989320.1 pentatricopeptide repeat-containing protein At4g04370 isoform X1 [Cucurbita maxima] | e value: 0.0e+00 | identity: 77.13%
Query:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHE I HGSTKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FSDGLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH 
Subjt:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTMPERNVVPWT IIGCYS QGDV IAF+MFKQMRE+EI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+LALSNSMV MYGRCGS+D A 
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA
        SLFESMD RDIVSWNSLLSAYSKI GIEEILQL+HAMRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTLVVLYL+C  LD A
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA

Query:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
        LKVFE +IEKDVVLWTAMISGLVQNDCSD AL VFYRM+E+NV+ GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRLEQ
Subjt:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL
        SCAIFNK+VEK+LVSWNAI+AGHAKNGYL+KAIFFF EMR T+FQRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG++
Subjt:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL

Query:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG
        E AQKCFD MLQKDLVTWSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSAC+HSGLI+QGLSIYESM KDF MPTNLEH+ACIIDLLSRAG
Subjt:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG

Query:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE-------
        KVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+RDM ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWSCIE       
Subjt:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE-------

Query:  ------------------ELMDRN---------------------------------------------------------RVKSAYGIVKKNDLRKEFL
                          +L+ ++                                                         +VKSAYGI+KKNDL  EFL
Subjt:  ------------------ELMDRN---------------------------------------------------------RVKSAYGIVKKNDLRKEFL

Query:  NIYQKCEER
        +IY+KC+ER
Subjt:  NIYQKCEER

XP_023520788.1 pentatricopeptide repeat-containing protein At4g04370 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | identity: 87.16%
Query:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHE I HGSTKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH 
Subjt:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTMPERNVVPWT IIGCYS QGDV IAF+MFKQMRENEI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+L+LSNSMV+MYGRCGS+D A 
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA
        SLFESMD RDIVSWNSLLS YSKI  IEEILQL+H+MRIE IKPDK+TFCS LS SA KCDIRLGKLVHGL+LK GLDMDQQVETTLVVLYL+C  LD A
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA

Query:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
        LKVFE TIEKDVVLWTAMISGLVQNDCSDKAL VFYRM+E+NVE GTATLAS LAACAQLGCY IGTSIHGY+LR GIM DIPAQNSLVTMYAKCNRLEQ
Subjt:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL
        SCAIFNK+VEK+LVSWNAI+AGHAKNGYLSKAIFFF+EMR T+FQRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG++
Subjt:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL

Query:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG
        ENAQKCFDYMLQKDLVTWSTLIAGYG NGKG+IALRKYSEFL T MEPNHVIFLSVLSACSHSGLI+QGLSIYESM KDF MPTNLEH+ACIIDLLSRAG
Subjt:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG

Query:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        KVEEAYSFY  MF+EPSIDVLGILL ACRVNGSV+LGEVIARDM ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWSCIE
Subjt:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

XP_038878475.1 pentatricopeptide repeat-containing protein At4g04370 [Benincasa hispida] | e value: 0.0e+00 | identity: 86.58%
Query:  WQLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIH
        ++LIHEPIAHGSTKSFN+L+NRLSSQGAHHQVLQTYIS Q  NTPPDAYTFPSLLKACT LN+FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH
Subjt:  WQLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIH

Query:  FGRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDA
        FGRKVFDTMPERNVVPWTT+IGCYS QGD+DIAFSMFKQMRE+ IQPTSVT LSLLPGI ELPLL CLHC I+LYGF S+LAL NSMVNMYG+CG I DA
Subjt:  FGRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDA

Query:  RSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDL
        RSLFESMDYRD+VSWNSLLSAYSKIGGIEEIL+ +  MRIEDIKPDKQTFCSALSASAIK D+R GKLVH LILK G D+DQQVET L+VLYLRCR LDL
Subjt:  RSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDL

Query:  ALKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLE
        A +VF+ T EKD VLWTAMISGLVQNDC+DKALGVFY+M+E+NVEP TATLASAL+ACAQL C DIGTSIHGYVLRQGI+LDIPAQNSLVTMYAKCN+LE
Subjt:  ALKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLE

Query:  QSCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL
        QSC+IFN+MVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRT+FQRPDSITV SLLQACG  GAL QGKWIHNFV RSSL+PCIM ETALVDMYFKCGNL
Subjt:  QSCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL

Query:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG
          AQKCFDYMLQKDLVTWS LIAGYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSACSHSGLI QGL+IYESM KDFRM  NLEH+ACIIDLLSRAG
Subjt:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG

Query:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        KV+EAYSFYKMMFKEP+IDVLGILL ACRVNGSV+LG+VIARDM ELKPVDAGN+VQLAHSYASM+RWDGVE AWTQMRSLGLKKLPGWS IE
Subjt:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DUX5 pentatricopeptide repeat-containing protein At4g04370 | e value: 0.0e+00 | identity: 85.4%
Query:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHE IAHGSTKSFN+L++RLSSQGAHHQVLQTYISMQK +TP DAYTFPSL KACT LNLFS GLSLHQS++VNGLS+DSYIGSSLI+FYAKFGCIH 
Subjt:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTM +RNVVPWTTIIG YS +GD+DIAFSMFKQMRE+ IQPTSVTLLSLLPGI +LPLL CLHC I LYGF S+LALSNSMVNMYG+CG I DAR
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA
        SLFES+DYRDIVSWNSLLSAYSKIG  EEILQL+ AM+IEDIKPDKQTFCSALSASAIK D+RLGKLVHGL+LK GL++DQ VE+ LVVLYLRCR LDLA
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA

Query:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
         KVF+ T EKDVV+WTAMISGLVQNDC+DKALGVFY+M+E+NV+P TATLASALAACAQLGC DIG SIHGYVLRQGIMLDIPAQNSLVTMYAKCN+L+Q
Subjt:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLE
        SC+IFNKMVEKD+VSWNAIVAG+AKNGYLSKAIFFFNEMRT+FQRPDSITV SLLQACG  GALCQGKWIHNFV RSSL+PCIM ETALVDMYFKCGNLE
Subjt:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLE

Query:  NAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGK
        NAQKCFD M Q+DLV WSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIF+SVLSACSHSGLISQGLSIYESM KDFRMP NLEH+ACI+DLLSRAGK
Subjt:  NAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGK

Query:  VEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        V+EAYSFYKMMFKEPS+ VLG LL ACRVNGSV+LG+VIARDM ELKPVD GN+VQLA+SYASMNRWDGVE+AWTQMRSLGLKK PGWS IE
Subjt:  VEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

A0A6J1E522 pentatricopeptide repeat-containing protein At4g04370 | e value: 0.0e+00 | identity: 86.75%
Query:  PIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVF
        PIA+GSTKSFN++INRLSSQGAHHQVLQTY SMQK +TPPDAYTFPSLLKACTILNLF DGLSLHQSIIVNG S DSYIGSSLI+FYAKFGCI  GRKVF
Subjt:  PIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVF

Query:  DTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFES
        D MPERNVVPWTTIIGCYS +G++D+AFSMFKQMR   IQPTSVTLLSLLP I ELPLLQCLHCWIILYGF SNL+LSNSMVN+YGRCGSI+DARSLFES
Subjt:  DTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFES

Query:  MDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFE
        MDYRDIVSWNSLLSAYSKIG IEEILQL+  MR EDIKPDKQTFCSALSASAIK DIRLGKLVHGLI+K GL +DQQVET L+VLYLRC+SLDLALKVF+
Subjt:  MDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFE

Query:  VTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIF
         T EKD+VLWTAMISGLVQNDC+DKAL VFY+M+E+N+EPGTATLASALAACAQLGCYDIGT IHGY+LRQGIMLDIPAQN+LVTMYAKCNRLEQSC IF
Subjt:  VTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIF

Query:  NKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKC
        NKMVE+DLVSWNAIVAGHAKNGYLSKAI FFNEMRT+ QRPDSITV SLLQACG  GAL QGKWIHNFVFRSSLMPCIMIETAL+DMYFKCGNLE AQKC
Subjt:  NKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKC

Query:  FDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAY
        FDYM  +DLVTWSTLI+GYGFNG G+IALRKYSEFL TG+EPNHVIFLSVLSACSHSGL++QGL IYESM +DF MP NLEH+ACI+DLLSRAGKVEEAY
Subjt:  FDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAY

Query:  SFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        SFYKMMF+EPSIDVLGILL ACRVNGSV+LGE IARD+  LKPVD GNYVQLAHSYASM RWDGVEEAWTQMRSLGLKKLPGWS IE
Subjt:  SFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

A0A6J1EZ22 pentatricopeptide repeat-containing protein At4g04370 | e value: 0.0e+00 | identity: 87.3%
Query:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHE I HGSTKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH 
Subjt:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTMPERNVVPWT IIGCYS QGDV IAF+MFKQMRENEI PTSVT LSLLPGI ELPLLQ LHC I+LYGFGS+LALSNSMV+MYGRCGS+D A 
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA
        SLFESMD RDIVSWNSLLSAYSKI GIEEILQL+H+MRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTLVVLYL+C  LD A
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA

Query:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
        LKVFE TIEKDVVLWTAMISGLVQNDCSD AL VFY M+E+NVE GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRLEQ
Subjt:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL
        SCAIFNK+VEK+LVSWNAI+AGHAKNGYLSKAIFFF+EMR T+FQRPDSITV SLLQACG  GALCQGKWIH+F+FRSSL+PCIM ETALVDMYFKCG++
Subjt:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL

Query:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG
        ENAQKCFDYMLQKDLVTWSTLIAGYGFNG+G+IALRKYSEFL TGMEPNHVIFLSVLSACSHSGLI+QGLSIYESM KDF MPTNLEH+ACIIDLLSRAG
Subjt:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG

Query:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        KVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+RDM ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWSCIE
Subjt:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

A0A6J1JJR3 pentatricopeptide repeat-containing protein At4g04370 isoform X2 | e value: 0.0e+00 | identity: 86.87%
Query:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHE I HGSTKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FSDGLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH 
Subjt:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTMPERNVVPWT IIGCYS QGDV IAF+MFKQMRE+EI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+LALSNSMV MYGRCGS+D A 
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA
        SLFESMD RDIVSWNSLLSAYSKI GIEEILQL+HAMRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTLVVLYL+C  LD A
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA

Query:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
        LKVFE +IEKDVVLWTAMISGLVQNDCSD AL VFYRM+E+NV+ GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRLEQ
Subjt:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL
        SCAIFNK+VEK+LVSWNAI+AGHAKNGYL+KAIFFF EMR T+FQRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG++
Subjt:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL

Query:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG
        E AQKCFD MLQKDLVTWSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSAC+HSGLI+QGLSIYESM KDF MPTNLEH+ACIIDLLSRAG
Subjt:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG

Query:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        KVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+RDM ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWSCIE
Subjt:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

A0A6J1JM14 pentatricopeptide repeat-containing protein At4g04370 isoform X1 | e value: 0.0e+00 | identity: 77.13%
Query:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHE I HGSTKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FSDGLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIH 
Subjt:  QLIHEPIAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTMPERNVVPWT IIGCYS QGDV IAF+MFKQMRE+EI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+LALSNSMV MYGRCGS+D A 
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA
        SLFESMD RDIVSWNSLLSAYSKI GIEEILQL+HAMRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTLVVLYL+C  LD A
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLA

Query:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
        LKVFE +IEKDVVLWTAMISGLVQNDCSD AL VFYRM+E+NV+ GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRLEQ
Subjt:  LKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL
        SCAIFNK+VEK+LVSWNAI+AGHAKNGYL+KAIFFF EMR T+FQRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG++
Subjt:  SCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNL

Query:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG
        E AQKCFD MLQKDLVTWSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSAC+HSGLI+QGLSIYESM KDF MPTNLEH+ACIIDLLSRAG
Subjt:  ENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAG

Query:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE-------
        KVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+RDM ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWSCIE       
Subjt:  KVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE-------

Query:  ------------------ELMDRN---------------------------------------------------------RVKSAYGIVKKNDLRKEFL
                          +L+ ++                                                         +VKSAYGI+KKNDL  EFL
Subjt:  ------------------ELMDRN---------------------------------------------------------RVKSAYGIVKKNDLRKEFL

Query:  NIYQKCEER
        +IY+KC+ER
Subjt:  NIYQKCEER

SwissProt top hits (e value, % identity, alignment)
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | e value: 1.1e-116 | identity: 33.64%
Query:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRE
        P + Y  P+  LL+ C+ L      L L   +  NGL  + +  + L++ + ++G +    +VF+ +  +  V + T++  ++   D+D A   F +MR 
Subjt:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRE

Query:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMR
        ++++P       LL +     EL + + +H  ++  GF  +L     + NMY +C  +++AR +F+ M  RD+VSWN++++ YS+ G     L+++ +M 
Subjt:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMR

Query:  IEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRM
         E++KP   T  S L A +    I +GK +HG  ++ G D    + T LV +Y +C SL+ A ++F+  +E++VV W +MI   VQN+   +A+ +F +M
Subjt:  IEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRM

Query:  METNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE
        ++  V+P   ++  AL ACA LG  + G  IH   +  G+  ++   NSL++MY KC  ++ + ++F K+  + LVSWNA++ G A+NG    A+ +F++
Subjt:  METNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE

Query:  MRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS
        MR+   +PD+ T +S++ A          KWIH  V RS L   + + TALVDMY KCG +  A+  FD M ++ + TW+ +I GYG +G GK AL  + 
Subjt:  MRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS

Query:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV
        E     ++PN V FLSV+SACSHSGL+  GL  +  M +++ +  +++H   ++DLL RAG++ EA+ F   M  +P+++V G +LGAC+++ +V   E 
Subjt:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV

Query:  IARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
         A  + EL P D G +V LA+ Y + + W+ V +    M   GL+K PG S +E
Subjt:  IARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic | e value: 1.8e-114 | identity: 33.19%
Query:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGD
        + + TY+ M  +   PD Y FP+LLKA   L     G  +H  +   G   DS  + ++L+N Y K G      KVFD + ERN V W ++I        
Subjt:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGD

Query:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK
         ++A   F+ M +  ++P+S TL+S++     LP+ + L     ++ +G      N  + N++V MYG+ G +  ++ L  S   RD+V+WN++LS+  +
Subjt:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK

Query:  IGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGL
           + E L+ L  M +E ++PD+ T  S L A +    +R GK +H   LK G LD +  V + LV +Y  C+ +    +VF+   ++ + LW AMI+G 
Subjt:  IGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGL

Query:  VQNDCSDKALGVFYRMMET-NVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVA
         QN+   +AL +F  M E+  +   + T+A  + AC + G +    +IHG+V+++G+  D   QN+L+ MY++  +++ +  IF KM ++DLV+WN ++ 
Subjt:  VQNDCSDKALGVFYRMMET-NVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVA

Query:  GHAKNGYLSKAIFFFNEMRTNFQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM
        G+  + +   A+   ++M+ N +R            P+SIT++++L +C    AL +GK IH +  +++L   + + +ALVDMY KCG L+ ++K FD +
Subjt:  GHAKNGYLSKAIFFFNEMRTNFQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK
         QK+++TW+ +I  YG +G G+ A+      +  G++PN V F+SV +ACSHSG++ +GL I+  M  D+ +  + +H AC++DLL RAG+++EAY    
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        MM ++         LLGA R++ ++++GE+ A+++++L+P  A +YV LA+ Y+S   WD   E    M+  G++K PG S IE
Subjt:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

Q9SS60 Pentatricopeptide repeat-containing protein At3g03580 | e value: 8.2e-112 | identity: 31.57%
Query:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV
        +N++I   S  G   + L+ Y  +++    PD YTFPS++KAC  L     G  +++ I+  G   D ++G++L++ Y++ G +   R+VFD MP R++V
Subjt:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV

Query:  PWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI
         W ++I  YS  G  + A  ++ +++ + I P S T+ S+LP    L ++   Q LH + +  G  S + ++N +V MY +     DAR +F+ MD RD 
Subjt:  PWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI

Query:  VSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKD
        VS+N+++  Y K+  +EE +++     ++  KPD  T  S L A     D+ L K ++  +LK G  ++  V   L+ +Y +C  +  A  VF     KD
Subjt:  VSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKD

Query:  VVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEK
         V W ++ISG +Q+    +A+ +F  MM    +    T    ++   +L     G  +H   ++ GI +D+   N+L+ MYAKC  +  S  IF+ M   
Subjt:  VVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEK

Query:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQ
        D V+WN +++   + G  +  +    +MR +   PD  T +  L  C    A   GK IH  + R      + I  AL++MY KCG LEN+ + F+ M +
Subjt:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQ

Query:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMM
        +D+VTW+ +I  YG  G+G+ AL  +++   +G+ P+ V+F++++ ACSHSGL+ +GL+ +E M   +++   +EH AC++DLLSR+ K+ +A  F + M
Subjt:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMM

Query:  FKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
          +P   +   +L ACR +G ++  E ++R ++EL P D G  +  +++YA++ +WD V      ++   + K PG+S IE
Subjt:  FKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300    6.3e-112    31.74
Query:  IAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFD
        +   S + +N++I+     G  +Q L  Y  M      PD  TFP L+KAC  L  F     L  ++   G+  + ++ SSLI  Y ++G I    K+FD
Subjt:  IAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFD

Query:  TMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLF
         + +++ V W  ++  Y+  G +D     F  MR ++I P +VT   +LS+    L + L   LH  +++ G     ++ NS+++MY +CG  DDA  LF
Subjt:  TMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLF

Query:  ESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKV
          M   D V+WN ++S Y + G +EE L   + M    + PD  TF S L + +   ++   K +H  I++  + +D  + + L+  Y +CR + +A  +
Subjt:  ESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKV

Query:  FEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCA
        F      DVV++TAMISG + N     +L +F  +++  + P   TL S L     L    +G  +HG+++++G         +++ MYAKC R+  +  
Subjt:  FEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCA

Query:  IFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQ
        IF ++ ++D+VSWN+++   A++   S AI  F +M  +    D +++ + L AC    +   GK IH F+ + SL   +  E+ L+DMY KCGNL+ A 
Subjt:  IFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQ

Query:  KCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVE
          F  M +K++V+W+++IA  G +GK K +L  + E +  +G+ P+ + FL ++S+C H G + +G+  + SM +D+ +    EH AC++DL  RAG++ 
Subjt:  KCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVE

Query:  EAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        EAY   K M   P   V G LLGACR++ +V+L EV +  +++L P ++G YV +++++A+   W+ V +  + M+   ++K+PG+S IE
Subjt:  EAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

Q9XE98 Pentatricopeptide repeat-containing protein At4g04370    3.3e-222    54.32
Query:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE
        STK FN+ IN LSS G H QVL T+ SM      PD +TFPSLLKAC  L   S GLS+HQ ++VNG S D YI SSL+N YAKFG +   RKVF+ M E
Subjt:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE

Query:  RNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD
        R+VV WT +IGCYS  G V  A S+  +MR   I+P  VTLL +L G+LE+  LQCLH + ++YGF  ++A+ NSM+N+Y +C  + DA+ LF+ M+ RD
Subjt:  RNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD

Query:  IVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEK
        +VSWN+++S Y+ +G + EIL+LL+ MR + ++PD+QTF ++LS S   CD+ +G+++H  I+K G D+D  ++T L+ +YL+C   + + +V E    K
Subjt:  IVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEK

Query:  DVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVE
        DVV WT MISGL++   ++KAL VF  M+++  +  +  +AS +A+CAQLG +D+G S+HGYVLR G  LD PA NSL+TMYAKC  L++S  IF +M E
Subjt:  DVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVE

Query:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM
        +DLVSWNAI++G+A+N  L KA+  F EM+    Q+ DS TV+SLLQAC   GAL  GK IH  V RS + PC +++TALVDMY KCG LE AQ+CFD +
Subjt:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK
          KD+V+W  LIAGYGF+GKG IAL  YSEFL +GMEPNHVIFL+VLS+CSH+G++ QGL I+ SM++DF +  N EH AC++DLL RA ++E+A+ FYK
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
          F  PSIDVLGI+L ACR NG  ++ ++I  DM+ELKP DAG+YV+L HS+A+M RWD V E+W QMRSLGLKKLPGWS IE
Subjt:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

Arabidopsis top hits    e value    %identity    Alignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein    7.9e-118    33.64
Query:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRE
        P + Y  P+  LL+ C+ L      L L   +  NGL  + +  + L++ + ++G +    +VF+ +  +  V + T++  ++   D+D A   F +MR 
Subjt:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRE

Query:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMR
        ++++P       LL +     EL + + +H  ++  GF  +L     + NMY +C  +++AR +F+ M  RD+VSWN++++ YS+ G     L+++ +M 
Subjt:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMR

Query:  IEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRM
         E++KP   T  S L A +    I +GK +HG  ++ G D    + T LV +Y +C SL+ A ++F+  +E++VV W +MI   VQN+   +A+ +F +M
Subjt:  IEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRM

Query:  METNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE
        ++  V+P   ++  AL ACA LG  + G  IH   +  G+  ++   NSL++MY KC  ++ + ++F K+  + LVSWNA++ G A+NG    A+ +F++
Subjt:  METNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE

Query:  MRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS
        MR+   +PD+ T +S++ A          KWIH  V RS L   + + TALVDMY KCG +  A+  FD M ++ + TW+ +I GYG +G GK AL  + 
Subjt:  MRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS

Query:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV
        E     ++PN V FLSV+SACSHSGL+  GL  +  M +++ +  +++H   ++DLL RAG++ EA+ F   M  +P+++V G +LGAC+++ +V   E 
Subjt:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV

Query:  IARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
         A  + EL P D G +V LA+ Y + + W+ V +    M   GL+K PG S +E
Subjt:  IARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein    5.8e-113    31.57
Query:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV
        +N++I   S  G   + L+ Y  +++    PD YTFPS++KAC  L     G  +++ I+  G   D ++G++L++ Y++ G +   R+VFD MP R++V
Subjt:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV

Query:  PWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI
         W ++I  YS  G  + A  ++ +++ + I P S T+ S+LP    L ++   Q LH + +  G  S + ++N +V MY +     DAR +F+ MD RD 
Subjt:  PWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI

Query:  VSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKD
        VS+N+++  Y K+  +EE +++     ++  KPD  T  S L A     D+ L K ++  +LK G  ++  V   L+ +Y +C  +  A  VF     KD
Subjt:  VSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKD

Query:  VVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEK
         V W ++ISG +Q+    +A+ +F  MM    +    T    ++   +L     G  +H   ++ GI +D+   N+L+ MYAKC  +  S  IF+ M   
Subjt:  VVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEK

Query:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQ
        D V+WN +++   + G  +  +    +MR +   PD  T +  L  C    A   GK IH  + R      + I  AL++MY KCG LEN+ + F+ M +
Subjt:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQ

Query:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMM
        +D+VTW+ +I  YG  G+G+ AL  +++   +G+ P+ V+F++++ ACSHSGL+ +GL+ +E M   +++   +EH AC++DLLSR+ K+ +A  F + M
Subjt:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMM

Query:  FKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
          +P   +   +L ACR +G ++  E ++R ++EL P D G  +  +++YA++ +WD V      ++   + K PG+S IE
Subjt:  FKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.3e-115    33.19
Query:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGD
        + + TY+ M  +   PD Y FP+LLKA   L     G  +H  +   G   DS  + ++L+N Y K G      KVFD + ERN V W ++I        
Subjt:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGD

Query:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK
         ++A   F+ M +  ++P+S TL+S++     LP+ + L     ++ +G      N  + N++V MYG+ G +  ++ L  S   RD+V+WN++LS+  +
Subjt:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK

Query:  IGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGL
           + E L+ L  M +E ++PD+ T  S L A +    +R GK +H   LK G LD +  V + LV +Y  C+ +    +VF+   ++ + LW AMI+G 
Subjt:  IGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGL

Query:  VQNDCSDKALGVFYRMMET-NVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVA
         QN+   +AL +F  M E+  +   + T+A  + AC + G +    +IHG+V+++G+  D   QN+L+ MY++  +++ +  IF KM ++DLV+WN ++ 
Subjt:  VQNDCSDKALGVFYRMMET-NVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVA

Query:  GHAKNGYLSKAIFFFNEMRTNFQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM
        G+  + +   A+   ++M+ N +R            P+SIT++++L +C    AL +GK IH +  +++L   + + +ALVDMY KCG L+ ++K FD +
Subjt:  GHAKNGYLSKAIFFFNEMRTNFQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK
         QK+++TW+ +I  YG +G G+ A+      +  G++PN V F+SV +ACSHSG++ +GL I+  M  D+ +  + +H AC++DLL RAG+++EAY    
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        MM ++         LLGA R++ ++++GE+ A+++++L+P  A +YV LA+ Y+S   WD   E    M+  G++K PG S IE
Subjt:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

AT4G04370.1 Tetratricopeptide repeat (TPR)-like superfamily protein    2.4e-223    54.32
Query:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE
        STK FN+ IN LSS G H QVL T+ SM      PD +TFPSLLKAC  L   S GLS+HQ ++VNG S D YI SSL+N YAKFG +   RKVF+ M E
Subjt:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE

Query:  RNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD
        R+VV WT +IGCYS  G V  A S+  +MR   I+P  VTLL +L G+LE+  LQCLH + ++YGF  ++A+ NSM+N+Y +C  + DA+ LF+ M+ RD
Subjt:  RNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD

Query:  IVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEK
        +VSWN+++S Y+ +G + EIL+LL+ MR + ++PD+QTF ++LS S   CD+ +G+++H  I+K G D+D  ++T L+ +YL+C   + + +V E    K
Subjt:  IVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEK

Query:  DVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVE
        DVV WT MISGL++   ++KAL VF  M+++  +  +  +AS +A+CAQLG +D+G S+HGYVLR G  LD PA NSL+TMYAKC  L++S  IF +M E
Subjt:  DVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVE

Query:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM
        +DLVSWNAI++G+A+N  L KA+  F EM+    Q+ DS TV+SLLQAC   GAL  GK IH  V RS + PC +++TALVDMY KCG LE AQ+CFD +
Subjt:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK
          KD+V+W  LIAGYGF+GKG IAL  YSEFL +GMEPNHVIFL+VLS+CSH+G++ QGL I+ SM++DF +  N EH AC++DLL RA ++E+A+ FYK
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
          F  PSIDVLGI+L ACR NG  ++ ++I  DM+ELKP DAG+YV+L HS+A+M RWD V E+W QMRSLGLKKLPGWS IE
Subjt:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein    4.5e-113    31.74
Query:  IAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFD
        +   S + +N++I+     G  +Q L  Y  M      PD  TFP L+KAC  L  F     L  ++   G+  + ++ SSLI  Y ++G I    K+FD
Subjt:  IAHGSTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFD

Query:  TMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLF
         + +++ V W  ++  Y+  G +D     F  MR ++I P +VT   +LS+    L + L   LH  +++ G     ++ NS+++MY +CG  DDA  LF
Subjt:  TMPERNVVPWTTIIGCYSHQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLF

Query:  ESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKV
          M   D V+WN ++S Y + G +EE L   + M    + PD  TF S L + +   ++   K +H  I++  + +D  + + L+  Y +CR + +A  +
Subjt:  ESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKV

Query:  FEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCA
        F      DVV++TAMISG + N     +L +F  +++  + P   TL S L     L    +G  +HG+++++G         +++ MYAKC R+  +  
Subjt:  FEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCA

Query:  IFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQ
        IF ++ ++D+VSWN+++   A++   S AI  F +M  +    D +++ + L AC    +   GK IH F+ + SL   +  E+ L+DMY KCGNL+ A 
Subjt:  IFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQ

Query:  KCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVE
          F  M +K++V+W+++IA  G +GK K +L  + E +  +G+ P+ + FL ++S+C H G + +G+  + SM +D+ +    EH AC++DL  RAG++ 
Subjt:  KCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMPTNLEHQACIIDLLSRAGKVE

Query:  EAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE
        EAY   K M   P   V G LLGACR++ +V+L EV +  +++L P ++G YV +++++A+   W+ V +  + M+   ++K+PG+S IE
Subjt:  EAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSCIE


Sequences
CDS sequence
ATGGCTAACGAGTTAAGATGGAGGAGGAAGATGAAAATGGCAGTGATGAAGATAAAGAAGAAGGGCACAGAATCCTTCATTGTAATTTGGATTACTGGGAATTGTGATGC
CTGTGTTTGGTGCACTGTATTTGGTCTGATGCCTGTTCGTCCTCCATCGAAACTCCTGGTATGGACTGCAGCTCTGTATAAACTCCTCTGGGGTGTTCGTGATGGCCCTG
CACCAACAGCTCCAGCTCTTGCTGTCAATGATGATTTTCTTCTATTTCAGCTTGTTGCTTCCATTGTTGCCATTCAATGTCGTTCGACACTCATCCACCTTAGCAACCAT
TCATTTGCCACAGTTGCAATCCTCGTCGTTGCACTGTACTCCTTCAACATCAACCCCTTCGTGGAACTGTTAGAAAACAATCAAGACATGGATGTAGGCCTTGTTGCCGA
ACCACTTACATGCCGAATACTTACCTCCTTTACAGTGGATGTAGGCCTTGTTGCCGAACCACTTACATGCAATGTTGTCGCTCCTCAAAATGAAATCGATGGCGACCACA
AAGAGAAGGAATATTATCGACATGAACTTTGCAAATTTCGGCTAAGAGATTTTACCATAATTGCTTATGTGGTGGTCGTAAAAACACGGAATAATGTGGTGGGATTTTCT
AGGAGGAAGCTGAAGTGGTTTAACCAGAAAGAACCCAACCATGCTTCACGCCGTCTCCTCCTCCGGTTCCGTTCACTGCTGCCTCCACCGTCATCTTTCTTCCGCTTGTG
CCACATCCACCATCGTCTTTCTTCCGTTCGTGCCGCCGCAGCCTCCCTTCTAGCGGTTTTGCCCTCATTTTCTACTCCCTCGTTGTTACTGGTGTTGAGAACCTCTATGA
GCTTGGTGATGACGAAGAGGAGTCGTGATGAAAAAGACCATTGGCAATTAATCCATGAGCCGATAGCCCATGGCAGCACAAAATCATTCAACGCCCTCATAAATCGCCTT
TCGTCCCAAGGCGCTCACCATCAAGTTCTTCAAACCTACATTTCTATGCAAAAGATAAATACACCACCAGATGCTTACACTTTTCCCAGCCTTCTGAAAGCTTGTACCAT
TTTGAACTTGTTTTCAGATGGCCTCTCGCTTCACCAATCTATCATTGTTAATGGCCTTTCTTATGATTCCTACATTGGGTCTTCGCTCATTAATTTCTATGCCAAATTTG
GGTGCATTCATTTTGGTCGCAAGGTGTTTGATACAATGCCCGAAAGAAATGTTGTTCCTTGGACTACCATTATTGGGTGCTATTCACATCAGGGAGATGTTGACATTGCT
TTTTCAATGTTCAAACAAATGCGGGAGAATGAAATTCAGCCCACTTCAGTAACCTTGTTGAGTCTGCTCCCTGGTATTTTAGAGCTTCCCCTTCTTCAGTGTTTGCATTG
TTGGATTATTTTGTATGGTTTTGGGTCAAATTTAGCTTTATCGAACTCCATGGTGAATATGTATGGTAGATGTGGCAGCATTGATGATGCAAGAAGTTTGTTTGAGTCAA
TGGATTATAGAGACATAGTTTCTTGGAATTCATTATTATCAGCCTATTCGAAAATTGGCGGGATTGAAGAAATATTGCAGCTTCTACATGCCATGAGGATTGAAGATATC
AAACCTGACAAACAAACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAATGTGATATTAGATTAGGTAAGTTGGTGCATGGTTTGATTCTTAAAGGTGGATTAGATAT
GGATCAACAAGTAGAGACGACACTCGTAGTTTTATACTTGAGATGTAGGAGTTTGGATCTCGCACTTAAAGTTTTCGAAGTAACTATTGAAAAGGATGTGGTCCTCTGGA
CAGCAATGATATCAGGACTTGTCCAGAACGATTGTTCTGACAAGGCATTGGGGGTCTTCTATCGAATGATGGAAACAAACGTGGAGCCAGGTACTGCCACCTTAGCTAGT
GCTCTGGCAGCCTGTGCTCAACTTGGTTGTTATGATATTGGTACCTCGATTCATGGTTATGTATTAAGGCAAGGAATAATGCTAGACATACCTGCTCAAAACTCCCTTGT
CACCATGTATGCAAAGTGTAATAGGTTGGAGCAAAGCTGTGCAATTTTTAATAAGATGGTTGAAAAGGATTTAGTTTCTTGGAATGCTATTGTGGCTGGACATGCTAAAA
ATGGTTATTTAAGCAAGGCCATCTTTTTCTTCAATGAAATGAGAACAAACTTTCAAAGGCCTGACTCAATAACAGTGATCTCACTTCTTCAAGCTTGTGGTTTTACTGGT
GCACTTTGCCAGGGAAAGTGGATTCACAACTTCGTTTTTAGAAGTTCCCTTATGCCATGCATTATGATCGAAACGGCTCTAGTTGACATGTATTTCAAATGTGGAAACTT
AGAGAATGCTCAGAAGTGTTTCGATTATATGTTGCAAAAAGATCTTGTAACATGGAGCACACTTATTGCTGGATATGGTTTTAATGGAAAGGGAAAAATTGCTTTGAGAA
AATATTCAGAGTTTCTTGCCACAGGGATGGAACCGAATCATGTTATTTTCCTTTCAGTTCTTTCTGCTTGTAGCCACAGTGGGCTTATTAGCCAAGGTTTGAGCATATAC
GAGTCAATGATTAAAGATTTCAGAATGCCAACAAATCTCGAGCACCAAGCTTGTATCATTGACCTCCTTAGTCGAGCTGGAAAGGTTGAAGAGGCATATAGCTTCTATAA
AATGATGTTTAAAGAACCCTCAATAGATGTTTTAGGCATACTCCTTGGTGCTTGTCGTGTTAATGGCAGTGTCAAACTTGGTGAGGTTATTGCTAGAGATATGGTTGAAT
TAAAGCCCGTGGATGCTGGAAACTATGTGCAACTGGCTCATAGTTATGCATCCATGAATAGATGGGATGGAGTGGAGGAGGCATGGACCCAAATGAGGTCTCTTGGTCTG
AAAAAGCTTCCTGGATGGAGTTGTATTGAGGAGTTGATGGATAGAAACAGGGTAAAAAGTGCATATGGCATTGTAAAGAAAAATGATCTTCGGAAGGAATTCCTCAATAT
CTATCAGAAGTGTGAAGAGAGGTACTTTGATGACCCCTATGCTCCATCTTTCTTCTTTGCACTCAAATTTTCTTTTGGCGCAAGGCACCCCCTTAGCGCCTTGCTTCGAG
GCACGGTGAGGAAGGCGCGGCCAAGGCCTGCGCTCAAGGTTTACTTAGCATTGGAGGCTGGTTATTTTGAAAAGGTTGATGAACTTTGCTATAGCCTATGGGAGGCAATT
GCCAAAGGTAAATCCGAAGACCTTCCAATACAGAGCCGGTTGGTAAGTGATAAACCTCAATTAGTTTATCTTGAAGAATTATAG
mRNA sequence
ATGGCTAACGAGTTAAGATGGAGGAGGAAGATGAAAATGGCAGTGATGAAGATAAAGAAGAAGGGCACAGAATCCTTCATTGTAATTTGGATTACTGGGAATTGTGATGC
CTGTGTTTGGTGCACTGTATTTGGTCTGATGCCTGTTCGTCCTCCATCGAAACTCCTGGTATGGACTGCAGCTCTGTATAAACTCCTCTGGGGTGTTCGTGATGGCCCTG
CACCAACAGCTCCAGCTCTTGCTGTCAATGATGATTTTCTTCTATTTCAGCTTGTTGCTTCCATTGTTGCCATTCAATGTCGTTCGACACTCATCCACCTTAGCAACCAT
TCATTTGCCACAGTTGCAATCCTCGTCGTTGCACTGTACTCCTTCAACATCAACCCCTTCGTGGAACTGTTAGAAAACAATCAAGACATGGATGTAGGCCTTGTTGCCGA
ACCACTTACATGCCGAATACTTACCTCCTTTACAGTGGATGTAGGCCTTGTTGCCGAACCACTTACATGCAATGTTGTCGCTCCTCAAAATGAAATCGATGGCGACCACA
AAGAGAAGGAATATTATCGACATGAACTTTGCAAATTTCGGCTAAGAGATTTTACCATAATTGCTTATGTGGTGGTCGTAAAAACACGGAATAATGTGGTGGGATTTTCT
AGGAGGAAGCTGAAGTGGTTTAACCAGAAAGAACCCAACCATGCTTCACGCCGTCTCCTCCTCCGGTTCCGTTCACTGCTGCCTCCACCGTCATCTTTCTTCCGCTTGTG
CCACATCCACCATCGTCTTTCTTCCGTTCGTGCCGCCGCAGCCTCCCTTCTAGCGGTTTTGCCCTCATTTTCTACTCCCTCGTTGTTACTGGTGTTGAGAACCTCTATGA
GCTTGGTGATGACGAAGAGGAGTCGTGATGAAAAAGACCATTGGCAATTAATCCATGAGCCGATAGCCCATGGCAGCACAAAATCATTCAACGCCCTCATAAATCGCCTT
TCGTCCCAAGGCGCTCACCATCAAGTTCTTCAAACCTACATTTCTATGCAAAAGATAAATACACCACCAGATGCTTACACTTTTCCCAGCCTTCTGAAAGCTTGTACCAT
TTTGAACTTGTTTTCAGATGGCCTCTCGCTTCACCAATCTATCATTGTTAATGGCCTTTCTTATGATTCCTACATTGGGTCTTCGCTCATTAATTTCTATGCCAAATTTG
GGTGCATTCATTTTGGTCGCAAGGTGTTTGATACAATGCCCGAAAGAAATGTTGTTCCTTGGACTACCATTATTGGGTGCTATTCACATCAGGGAGATGTTGACATTGCT
TTTTCAATGTTCAAACAAATGCGGGAGAATGAAATTCAGCCCACTTCAGTAACCTTGTTGAGTCTGCTCCCTGGTATTTTAGAGCTTCCCCTTCTTCAGTGTTTGCATTG
TTGGATTATTTTGTATGGTTTTGGGTCAAATTTAGCTTTATCGAACTCCATGGTGAATATGTATGGTAGATGTGGCAGCATTGATGATGCAAGAAGTTTGTTTGAGTCAA
TGGATTATAGAGACATAGTTTCTTGGAATTCATTATTATCAGCCTATTCGAAAATTGGCGGGATTGAAGAAATATTGCAGCTTCTACATGCCATGAGGATTGAAGATATC
AAACCTGACAAACAAACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAATGTGATATTAGATTAGGTAAGTTGGTGCATGGTTTGATTCTTAAAGGTGGATTAGATAT
GGATCAACAAGTAGAGACGACACTCGTAGTTTTATACTTGAGATGTAGGAGTTTGGATCTCGCACTTAAAGTTTTCGAAGTAACTATTGAAAAGGATGTGGTCCTCTGGA
CAGCAATGATATCAGGACTTGTCCAGAACGATTGTTCTGACAAGGCATTGGGGGTCTTCTATCGAATGATGGAAACAAACGTGGAGCCAGGTACTGCCACCTTAGCTAGT
GCTCTGGCAGCCTGTGCTCAACTTGGTTGTTATGATATTGGTACCTCGATTCATGGTTATGTATTAAGGCAAGGAATAATGCTAGACATACCTGCTCAAAACTCCCTTGT
CACCATGTATGCAAAGTGTAATAGGTTGGAGCAAAGCTGTGCAATTTTTAATAAGATGGTTGAAAAGGATTTAGTTTCTTGGAATGCTATTGTGGCTGGACATGCTAAAA
ATGGTTATTTAAGCAAGGCCATCTTTTTCTTCAATGAAATGAGAACAAACTTTCAAAGGCCTGACTCAATAACAGTGATCTCACTTCTTCAAGCTTGTGGTTTTACTGGT
GCACTTTGCCAGGGAAAGTGGATTCACAACTTCGTTTTTAGAAGTTCCCTTATGCCATGCATTATGATCGAAACGGCTCTAGTTGACATGTATTTCAAATGTGGAAACTT
AGAGAATGCTCAGAAGTGTTTCGATTATATGTTGCAAAAAGATCTTGTAACATGGAGCACACTTATTGCTGGATATGGTTTTAATGGAAAGGGAAAAATTGCTTTGAGAA
AATATTCAGAGTTTCTTGCCACAGGGATGGAACCGAATCATGTTATTTTCCTTTCAGTTCTTTCTGCTTGTAGCCACAGTGGGCTTATTAGCCAAGGTTTGAGCATATAC
GAGTCAATGATTAAAGATTTCAGAATGCCAACAAATCTCGAGCACCAAGCTTGTATCATTGACCTCCTTAGTCGAGCTGGAAAGGTTGAAGAGGCATATAGCTTCTATAA
AATGATGTTTAAAGAACCCTCAATAGATGTTTTAGGCATACTCCTTGGTGCTTGTCGTGTTAATGGCAGTGTCAAACTTGGTGAGGTTATTGCTAGAGATATGGTTGAAT
TAAAGCCCGTGGATGCTGGAAACTATGTGCAACTGGCTCATAGTTATGCATCCATGAATAGATGGGATGGAGTGGAGGAGGCATGGACCCAAATGAGGTCTCTTGGTCTG
AAAAAGCTTCCTGGATGGAGTTGTATTGAGGAGTTGATGGATAGAAACAGGGTAAAAAGTGCATATGGCATTGTAAAGAAAAATGATCTTCGGAAGGAATTCCTCAATAT
CTATCAGAAGTGTGAAGAGAGGTACTTTGATGACCCCTATGCTCCATCTTTCTTCTTTGCACTCAAATTTTCTTTTGGCGCAAGGCACCCCCTTAGCGCCTTGCTTCGAG
GCACGGTGAGGAAGGCGCGGCCAAGGCCTGCGCTCAAGGTTTACTTAGCATTGGAGGCTGGTTATTTTGAAAAGGTTGATGAACTTTGCTATAGCCTATGGGAGGCAATT
GCCAAAGGTAAATCCGAAGACCTTCCAATACAGAGCCGGTTGGTAAGTGATAAACCTCAATTAGTTTATCTTGAAGAATTATAG
Protein sequence
MANELRWRRKMKMAVMKIKKKGTESFIVIWITGNCDACVWCTVFGLMPVRPPSKLLVWTAALYKLLWGVRDGPAPTAPALAVNDDFLLFQLVASIVAIQCRSTLIHLSNH
SFATVAILVVALYSFNINPFVELLENNQDMDVGLVAEPLTCRILTSFTVDVGLVAEPLTCNVVAPQNEIDGDHKEKEYYRHELCKFRLRDFTIIAYVVVVKTRNNVVGFS
RRKLKWFNQKEPNHASRRLLLRFRSLLPPPSSFFRLCHIHHRLSSVRAAAASLLAVLPSFSTPSLLLVLRTSMSLVMTKRSRDEKDHWQLIHEPIAHGSTKSFNALINRL
SSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSHQGDVDIA
FSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLHAMRIEDI
KPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLVVLYLRCRSLDLALKVFEVTIEKDVVLWTAMISGLVQNDCSDKALGVFYRMMETNVEPGTATLAS
ALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSCAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNFQRPDSITVISLLQACGFTG
ALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLENAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIY
ESMIKDFRMPTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIARDMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGL
KKLPGWSCIEELMDRNRVKSAYGIVKKNDLRKEFLNIYQKCEERYFDDPYAPSFFFALKFSFGARHPLSALLRGTVRKARPRPALKVYLALEAGYFEKVDELCYSLWEAI
AKGKSEDLPIQSRLVSDKPQLVYLEEL