; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036506 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036506
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr3:47589156..47591940
RNA-Seq ExpressionLag0036506
SyntenyLag0036506
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022159804.1 pentatricopeptide repeat-containing protein At4g04370 [Momordica charantia]0.0e+0085.59Show/hide
Query:  MNRLIH-------EPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINF
        MNRLIH        PIA+ STKSFN++INRLSSQGAHHQVLQTY SMQK +TPPDAYTFPSLLKACTILNLF DGLSLHQSIIVNG S DSYIGSSLI+F
Subjt:  MNRLIH-------EPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINF

Query:  YAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYG
        YAKFGCI  GRKVFD MPERNVVPWTTIIGCYS++G++D+AFSMFKQMR   IQPTSVTLLSLLP I ELPLLQCLHCWIILYGF SNL+LSNSMVN+YG
Subjt:  YAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYG

Query:  RCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLY
        RCGSI+DARSLFESMDYRDIVSWNSLLSAYSKIG IEEILQL+  MR E IKPDKQTFCSALSASAIK DIRLGKLVHGLI+K GL +DQQVET L+VLY
Subjt:  RCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLY

Query:  LRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTM
        LRC+SLDLALKVF++T EKD+VLWTAMISGLVQNDC+DKAL VFYQM+E+++EPGTATLASALAACAQLGCYDIGT IHGY+LRQGIMLDIPAQN+LVTM
Subjt:  LRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTM

Query:  YAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVD
        YAKCNRLEQS  IFNKMVE+DLVSWNAIVAGHAKNGYLSKAI FFNEMRT+LQRPDSITV SLLQACG  GAL QGKWIHNFVFRSSLMPCIMIETAL+D
Subjt:  YAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVD

Query:  MYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACI
        MYFKCGNLEIAQKCFDYM  +DLVTWSTLI+GYGFNG G+IALRKYSEFL TG+EPNHVIFLSVLSACSHSGL++QGL IYESM +DF M  NLEH+ACI
Subjt:  MYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACI

Query:  IDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSI
        +DLLSRAGKVEEAYSFYKMMF+EPSIDVLGILL ACRVNGSV+LGE IAR++  LKPVD GNYVQLAHSYASM RWDGVEEAWTQMRSLGLKKLPGWSSI
Subjt:  IDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSI

Query:  E
        E
Subjt:  E

XP_022931568.1 pentatricopeptide repeat-containing protein At4g04370 [Cucurbita moschata]0.0e+0086.04Show/hide
Query:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI
        MNRLIHE I H STKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCI
Subjt:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI

Query:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD
        H GRKVFDTMPERNVVPWT IIGCYS+QGDV IAF+MFKQMRENEI PTSVT LSLLPGI ELPLLQ LHC I+LYGFGS+LALSNSMV+MYGRCGS+D 
Subjt:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD

Query:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD
        A SLFESMD RDIVSWNSLLSAYSKI GIEEILQL+++MRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTL+VLYL+C  LD
Subjt:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD

Query:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL
         ALKVFE+TIEKDVVLWTAMISGLVQNDCSD AL VFY M+E++VE GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRL
Subjt:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL

Query:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG
        EQS AIFNK+VEK+LVSWNAI+AGHAKNGYLSKAIFFF+EMR T+ QRPDSITV SLLQACG  GALCQGKWIH+F+FRSSL+PCIM ETALVDMYFKCG
Subjt:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG

Query:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR
        ++E AQKCFDYMLQKDLVTWSTLIAGYGFNG+G+IALRKYSEFL TGMEPNHVIFLSVLSACSHSGLI+QGLSIYESM KDF M TNLEH+ACIIDLLSR
Subjt:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR

Query:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        AGKVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+R+M ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWS IE
Subjt:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

XP_022989320.1 pentatricopeptide repeat-containing protein At4g04370 isoform X1 [Cucurbita maxima]0.0e+0076.08Show/hide
Query:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI
        MNRLIHE I H STKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FSDGLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCI
Subjt:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI

Query:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD
        H GRKVFDTMPERNVVPWT IIGCYS+QGDV IAF+MFKQMRE+EI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+LALSNSMV MYGRCGS+D 
Subjt:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD

Query:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD
        A SLFESMD RDIVSWNSLLSAYSKI GIEEILQL++AMRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTL+VLYL+C  LD
Subjt:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD

Query:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL
         ALKVFE++IEKDVVLWTAMISGLVQNDCSD AL VFY+M+E++V+ GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRL
Subjt:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL

Query:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG
        EQS AIFNK+VEK+LVSWNAI+AGHAKNGYL+KAIFFF EMR T+ QRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG
Subjt:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG

Query:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR
        ++E AQKCFD MLQKDLVTWSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSAC+HSGLI+QGLSIYESM KDF M TNLEH+ACIIDLLSR
Subjt:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR

Query:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE-----
        AGKVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+R+M ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWS IE     
Subjt:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE-----

Query:  --------------------ELMNRN---------------------------------------------------------RVKSAYGIVKKNDLRKE
                            +L++++                                                         +VKSAYGI+KKNDL  E
Subjt:  --------------------ELMNRN---------------------------------------------------------RVKSAYGIVKKNDLRKE

Query:  FLNIYQKCEER
        FL+IY+KC+ER
Subjt:  FLNIYQKCEER

XP_022989321.1 pentatricopeptide repeat-containing protein At4g04370 isoform X2 [Cucurbita maxima]0.0e+0085.61Show/hide
Query:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI
        MNRLIHE I H STKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FSDGLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCI
Subjt:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI

Query:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD
        H GRKVFDTMPERNVVPWT IIGCYS+QGDV IAF+MFKQMRE+EI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+LALSNSMV MYGRCGS+D 
Subjt:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD

Query:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD
        A SLFESMD RDIVSWNSLLSAYSKI GIEEILQL++AMRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTL+VLYL+C  LD
Subjt:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD

Query:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL
         ALKVFE++IEKDVVLWTAMISGLVQNDCSD AL VFY+M+E++V+ GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRL
Subjt:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL

Query:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG
        EQS AIFNK+VEK+LVSWNAI+AGHAKNGYL+KAIFFF EMR T+ QRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG
Subjt:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG

Query:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR
        ++E AQKCFD MLQKDLVTWSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSAC+HSGLI+QGLSIYESM KDF M TNLEH+ACIIDLLSR
Subjt:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR

Query:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        AGKVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+R+M ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWS IE
Subjt:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

XP_038878475.1 pentatricopeptide repeat-containing protein At4g04370 [Benincasa hispida]0.0e+0086.42Show/hide
Query:  RLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF
        +LIHEPIAH STKSFN+L+NRLSSQGAHHQVLQTYIS Q  NTPPDAYTFPSLLKACT LN+FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCIHF
Subjt:  RLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHF

Query:  GRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR
        GRKVFDTMPERNVVPWTT+IGCYS+QGD+DIAFSMFKQMRE+ IQPTSVT LSLLPGI ELPLL CLHC I+LYGF S+LAL NSMVNMYG+CG I DAR
Subjt:  GRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDAR

Query:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLA
        SLFESMDYRD+VSWNSLLSAYSKIGGIEEIL+ +  MRIE IKPDKQTFCSALSASAIK D+R GKLVH LILK G D+DQQVET LIVLYLRCR LDLA
Subjt:  SLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLA

Query:  LKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ
         +VF++T EKD VLWTAMISGLVQNDC+DKALGVFYQM+E++VEP TATLASAL+ACAQL C DIGTSIHGYVLRQGI+LDIPAQNSLVTMYAKCN+LEQ
Subjt:  LKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQ

Query:  SFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLE
        S +IFN+MVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRT+ QRPDSITV SLLQACG  GAL QGKWIHNFV RSSL+PCIM ETALVDMYFKCGNL 
Subjt:  SFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLE

Query:  IAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGK
         AQKCFDYMLQKDLVTWS LIAGYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSACSHSGLI QGL+IYESM KDFRMS NLEH+ACIIDLLSRAGK
Subjt:  IAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGK

Query:  VEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        V+EAYSFYKMMFKEP+IDVLGILL ACRVNGSV+LG+VIAR+M ELKPVDAGN+VQLAHSYASM+RWDGVE AWTQMRSLGLKKLPGWSSIE
Subjt:  VEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

TrEMBL top hitse value%identityAlignment
A0A1S4DUX5 pentatricopeptide repeat-containing protein At4g043700.0e+0084.44Show/hide
Query:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI
        M+RLIHE IAH STKSFN+L++RLSSQGAHHQVLQTYISMQK +TP DAYTFPSL KACT LNLFS GLSLHQS++VNGLS+DSYIGSSLI+FYAKFGCI
Subjt:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI

Query:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD
        H GRKVFDTM +RNVVPWTTIIG YS++GD+DIAFSMFKQMRE+ IQPTSVTLLSLLPGI +LPLL CLHC I LYGF S+LALSNSMVNMYG+CG I D
Subjt:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD

Query:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD
        ARSLFES+DYRDIVSWNSLLSAYSKIG  EEILQL+ AM+IE IKPDKQTFCSALSASAIK D+RLGKLVHGL+LK GL++DQ VE+ L+VLYLRCR LD
Subjt:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD

Query:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL
        LA KVF++T EKDVV+WTAMISGLVQNDC+DKALGVFYQM+E++V+P TATLASALAACAQLGC DIG SIHGYVLRQGIMLDIPAQNSLVTMYAKCN+L
Subjt:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL

Query:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGN
        +QS +IFNKMVEKD+VSWNAIVAG+AKNGYLSKAIFFFNEMRT+ QRPDSITV SLLQACG  GALCQGKWIHNFV RSSL+PCIM ETALVDMYFKCGN
Subjt:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGN

Query:  LEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRA
        LE AQKCFD M Q+DLV WSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIF+SVLSACSHSGLISQGLSIYESM KDFRM  NLEH+ACI+DLLSRA
Subjt:  LEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRA

Query:  GKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        GKV+EAYSFYKMMFKEPS+ VLG LL ACRVNGSV+LG+VIAR+M ELKPVD GN+VQLA+SYASMNRWDGVE+AWTQMRSLGLKK PGWSSIE
Subjt:  GKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

A0A6J1E522 pentatricopeptide repeat-containing protein At4g043700.0e+0085.59Show/hide
Query:  MNRLIH-------EPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINF
        MNRLIH        PIA+ STKSFN++INRLSSQGAHHQVLQTY SMQK +TPPDAYTFPSLLKACTILNLF DGLSLHQSIIVNG S DSYIGSSLI+F
Subjt:  MNRLIH-------EPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINF

Query:  YAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYG
        YAKFGCI  GRKVFD MPERNVVPWTTIIGCYS++G++D+AFSMFKQMR   IQPTSVTLLSLLP I ELPLLQCLHCWIILYGF SNL+LSNSMVN+YG
Subjt:  YAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYG

Query:  RCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLY
        RCGSI+DARSLFESMDYRDIVSWNSLLSAYSKIG IEEILQL+  MR E IKPDKQTFCSALSASAIK DIRLGKLVHGLI+K GL +DQQVET L+VLY
Subjt:  RCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLY

Query:  LRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTM
        LRC+SLDLALKVF++T EKD+VLWTAMISGLVQNDC+DKAL VFYQM+E+++EPGTATLASALAACAQLGCYDIGT IHGY+LRQGIMLDIPAQN+LVTM
Subjt:  LRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTM

Query:  YAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVD
        YAKCNRLEQS  IFNKMVE+DLVSWNAIVAGHAKNGYLSKAI FFNEMRT+LQRPDSITV SLLQACG  GAL QGKWIHNFVFRSSLMPCIMIETAL+D
Subjt:  YAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVD

Query:  MYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACI
        MYFKCGNLEIAQKCFDYM  +DLVTWSTLI+GYGFNG G+IALRKYSEFL TG+EPNHVIFLSVLSACSHSGL++QGL IYESM +DF M  NLEH+ACI
Subjt:  MYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACI

Query:  IDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSI
        +DLLSRAGKVEEAYSFYKMMF+EPSIDVLGILL ACRVNGSV+LGE IAR++  LKPVD GNYVQLAHSYASM RWDGVEEAWTQMRSLGLKKLPGWSSI
Subjt:  IDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSI

Query:  E
        E
Subjt:  E

A0A6J1EZ22 pentatricopeptide repeat-containing protein At4g043700.0e+0086.04Show/hide
Query:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI
        MNRLIHE I H STKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FS+GLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCI
Subjt:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI

Query:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD
        H GRKVFDTMPERNVVPWT IIGCYS+QGDV IAF+MFKQMRENEI PTSVT LSLLPGI ELPLLQ LHC I+LYGFGS+LALSNSMV+MYGRCGS+D 
Subjt:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD

Query:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD
        A SLFESMD RDIVSWNSLLSAYSKI GIEEILQL+++MRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTL+VLYL+C  LD
Subjt:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD

Query:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL
         ALKVFE+TIEKDVVLWTAMISGLVQNDCSD AL VFY M+E++VE GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRL
Subjt:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL

Query:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG
        EQS AIFNK+VEK+LVSWNAI+AGHAKNGYLSKAIFFF+EMR T+ QRPDSITV SLLQACG  GALCQGKWIH+F+FRSSL+PCIM ETALVDMYFKCG
Subjt:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG

Query:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR
        ++E AQKCFDYMLQKDLVTWSTLIAGYGFNG+G+IALRKYSEFL TGMEPNHVIFLSVLSACSHSGLI+QGLSIYESM KDF M TNLEH+ACIIDLLSR
Subjt:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR

Query:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        AGKVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+R+M ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWS IE
Subjt:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

A0A6J1JJR3 pentatricopeptide repeat-containing protein At4g04370 isoform X20.0e+0085.61Show/hide
Query:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI
        MNRLIHE I H STKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FSDGLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCI
Subjt:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI

Query:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD
        H GRKVFDTMPERNVVPWT IIGCYS+QGDV IAF+MFKQMRE+EI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+LALSNSMV MYGRCGS+D 
Subjt:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD

Query:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD
        A SLFESMD RDIVSWNSLLSAYSKI GIEEILQL++AMRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTL+VLYL+C  LD
Subjt:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD

Query:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL
         ALKVFE++IEKDVVLWTAMISGLVQNDCSD AL VFY+M+E++V+ GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRL
Subjt:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL

Query:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG
        EQS AIFNK+VEK+LVSWNAI+AGHAKNGYL+KAIFFF EMR T+ QRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG
Subjt:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG

Query:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR
        ++E AQKCFD MLQKDLVTWSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSAC+HSGLI+QGLSIYESM KDF M TNLEH+ACIIDLLSR
Subjt:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR

Query:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        AGKVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+R+M ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWS IE
Subjt:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

A0A6J1JM14 pentatricopeptide repeat-containing protein At4g04370 isoform X10.0e+0076.08Show/hide
Query:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI
        MNRLIHE I H STKSFNALINRLSSQ AHHQVLQTYISM   NTPPDAYTFPSLLKACT+LN FSDGLS+HQS+IVNGLS+DSYIGSSLI+FYAKFGCI
Subjt:  MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCI

Query:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD
        H GRKVFDTMPERNVVPWT IIGCYS+QGDV IAF+MFKQMRE+EI PTSVT LSLLPGI ELPLLQ LHC IILYGFGS+LALSNSMV MYGRCGS+D 
Subjt:  HFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDD

Query:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD
        A SLFESMD RDIVSWNSLLSAYSKI GIEEILQL++AMRIE IKPDK+TFCS LSASA KCDIRLGKLVHGL+LK GLDMDQQVETTL+VLYL+C  LD
Subjt:  ARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLD

Query:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL
         ALKVFE++IEKDVVLWTAMISGLVQNDCSD AL VFY+M+E++V+ GTATLAS LAACAQLGCY IGTSIHGY+LRQGIM DIPAQNSLVTMYAKCNRL
Subjt:  LALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRL

Query:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG
        EQS AIFNK+VEK+LVSWNAI+AGHAKNGYL+KAIFFF EMR T+ QRPDSITV SLLQACG  GALCQGKWIHN +FRSSL+PCIM ETALVDMYFKCG
Subjt:  EQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCG

Query:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR
        ++E AQKCFD MLQKDLVTWSTLI GYGFNGKG+IALRKYSEFL TGMEPNHVIFLSVLSAC+HSGLI+QGLSIYESM KDF M TNLEH+ACIIDLLSR
Subjt:  NLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSR

Query:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE-----
        AGKVEEAYSFY  MF+EPSIDVLGILL ACRVNG+V+LGEVI+R+M ELKPVDAGNYVQLAHSYAS +RWDGVE AWTQMRSLGLKKLPGWS IE     
Subjt:  AGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE-----

Query:  --------------------ELMNRN---------------------------------------------------------RVKSAYGIVKKNDLRKE
                            +L++++                                                         +VKSAYGI+KKNDL  E
Subjt:  --------------------ELMNRN---------------------------------------------------------RVKSAYGIVKKNDLRKE

Query:  FLNIYQKCEER
        FL+IY+KC+ER
Subjt:  FLNIYQKCEER

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic7.3e-11733.64Show/hide
Query:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRE
        P + Y  P+  LL+ C+ L      L L   +  NGL  + +  + L++ + ++G +    +VF+ +  +  V + T++  +++  D+D A   F +MR 
Subjt:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRE

Query:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMR
        ++++P       LL +     EL + + +H  ++  GF  +L     + NMY +C  +++AR +F+ M  RD+VSWN++++ YS+ G     L+++ +M 
Subjt:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMR

Query:  IEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQM
         E +KP   T  S L A +    I +GK +HG  ++ G D    + T L+ +Y +C SL+ A ++F+  +E++VV W +MI   VQN+   +A+ +F +M
Subjt:  IEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQM

Query:  METSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE
        ++  V+P   ++  AL ACA LG  + G  IH   +  G+  ++   NSL++MY KC  ++ + ++F K+  + LVSWNA++ G A+NG    A+ +F++
Subjt:  METSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE

Query:  MRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS
        MR+   +PD+ T +S++ A          KWIH  V RS L   + + TALVDMY KCG + IA+  FD M ++ + TW+ +I GYG +G GK AL  + 
Subjt:  MRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS

Query:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV
        E     ++PN V FLSV+SACSHSGL+  GL  +  M +++ +  +++H   ++DLL RAG++ EA+ F   M  +P+++V G +LGAC+++ +V   E 
Subjt:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV

Query:  IAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
         A  + EL P D G +V LA+ Y + + W+ V +    M   GL+K PG S +E
Subjt:  IAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic1.2e-11433.19Show/hide
Query:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGD
        + + TY+ M  +   PD Y FP+LLKA   L     G  +H  +   G   DS  + ++L+N Y K G      KVFD + ERN V W ++I        
Subjt:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGD

Query:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK
         ++A   F+ M +  ++P+S TL+S++     LP+ + L     ++ +G      N  + N++V MYG+ G +  ++ L  S   RD+V+WN++LS+  +
Subjt:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK

Query:  IGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGL
           + E L+ L  M +E ++PD+ T  S L A +    +R GK +H   LK G LD +  V + L+ +Y  C+ +    +VF+   ++ + LW AMI+G 
Subjt:  IGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGL

Query:  VQNDCSDKALGVFYQMMETS-VEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVA
         QN+   +AL +F  M E++ +   + T+A  + AC + G +    +IHG+V+++G+  D   QN+L+ MY++  +++ +  IF KM ++DLV+WN ++ 
Subjt:  VQNDCSDKALGVFYQMMETS-VEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVA

Query:  GHAKNGYLSKAIFFFNEMRTNLQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM
        G+  + +   A+   ++M+ NL+R            P+SIT++++L +C    AL +GK IH +  +++L   + + +ALVDMY KCG L++++K FD +
Subjt:  GHAKNGYLSKAIFFFNEMRTNLQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK
         QK+++TW+ +I  YG +G G+ A+      +  G++PN V F+SV +ACSHSG++ +GL I+  M  D+ +  + +H AC++DLL RAG+++EAY    
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        MM ++         LLGA R++ ++++GE+ A+ +++L+P  A +YV LA+ Y+S   WD   E    M+  G++K PG S IE
Subjt:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

Q9SS60 Pentatricopeptide repeat-containing protein At3g035801.3e-11031.57Show/hide
Query:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV
        +N++I   S  G   + L+ Y  +++    PD YTFPS++KAC  L     G  +++ I+  G   D ++G++L++ Y++ G +   R+VFD MP R++V
Subjt:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV

Query:  PWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI
         W ++I  YS  G  + A  ++ +++ + I P S T+ S+LP    L ++   Q LH + +  G  S + ++N +V MY +     DAR +F+ MD RD 
Subjt:  PWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI

Query:  VSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKD
        VS+N+++  Y K+  +EE +++     ++  KPD  T  S L A     D+ L K ++  +LK G  ++  V   LI +Y +C  +  A  VF +   KD
Subjt:  VSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKD

Query:  VVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEK
         V W ++ISG +Q+    +A+ +F  MM    +    T    ++   +L     G  +H   ++ GI +D+   N+L+ MYAKC  +  S  IF+ M   
Subjt:  VVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEK

Query:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQ
        D V+WN +++   + G  +  +    +MR +   PD  T +  L  C    A   GK IH  + R      + I  AL++MY KCG LE + + F+ M +
Subjt:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQ

Query:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMM
        +D+VTW+ +I  YG  G+G+ AL  +++   +G+ P+ V+F++++ ACSHSGL+ +GL+ +E M   +++   +EH AC++DLLSR+ K+ +A  F + M
Subjt:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMM

Query:  FKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
          +P   +   +L ACR +G ++  E ++R ++EL P D G  +  +++YA++ +WD V      ++   + K PG+S IE
Subjt:  FKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

Q9STE1 Pentatricopeptide repeat-containing protein At4g213001.6e-11132.07Show/hide
Query:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE
        S + +N++I+     G  +Q L  Y  M      PD  TFP L+KAC  L  F     L  ++   G+  + ++ SSLI  Y ++G I    K+FD + +
Subjt:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE

Query:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMD
        ++ V W  ++  Y++ G +D     F  MR ++I P +VT   +LS+    L + L   LH  +++ G     ++ NS+++MY +CG  DDA  LF  M 
Subjt:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMD

Query:  YRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEAT
          D V+WN ++S Y + G +EE L     M    + PD  TF S L + +   ++   K +H  I++  + +D  + + LI  Y +CR + +A  +F   
Subjt:  YRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEAT

Query:  IEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNK
           DVV++TAMISG + N     +L +F  +++  + P   TL S L     L    +G  +HG+++++G         +++ MYAKC R+  ++ IF +
Subjt:  IEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNK

Query:  MVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFD
        + ++D+VSWN+++   A++   S AI  F +M  +    D +++ + L AC    +   GK IH F+ + SL   +  E+ L+DMY KCGNL+ A   F 
Subjt:  MVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFD

Query:  YMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYS
         M +K++V+W+++IA  G +GK K +L  + E +  +G+ P+ + FL ++S+C H G + +G+  + SM +D+ +    EH AC++DL  RAG++ EAY 
Subjt:  YMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYS

Query:  FYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
          K M   P   V G LLGACR++ +V+L EV + ++++L P ++G YV +++++A+   W+ V +  + M+   ++K+PG+S IE
Subjt:  FYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

Q9XE98 Pentatricopeptide repeat-containing protein At4g043701.9e-22154.32Show/hide
Query:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE
        STK FN+ IN LSS G H QVL T+ SM      PD +TFPSLLKAC  L   S GLS+HQ ++VNG S D YI SSL+N YAKFG +   RKVF+ M E
Subjt:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE

Query:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD
        R+VV WT +IGCYS+ G V  A S+  +MR   I+P  VTLL +L G+LE+  LQCLH + ++YGF  ++A+ NSM+N+Y +C  + DA+ LF+ M+ RD
Subjt:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD

Query:  IVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEK
        +VSWN+++S Y+ +G + EIL+LL  MR + ++PD+QTF ++LS S   CD+ +G+++H  I+K G D+D  ++T LI +YL+C   + + +V E    K
Subjt:  IVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEK

Query:  DVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVE
        DVV WT MISGL++   ++KAL VF +M+++  +  +  +AS +A+CAQLG +D+G S+HGYVLR G  LD PA NSL+TMYAKC  L++S  IF +M E
Subjt:  DVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVE

Query:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM
        +DLVSWNAI++G+A+N  L KA+  F EM+   +Q+ DS TV+SLLQAC   GAL  GK IH  V RS + PC +++TALVDMY KCG LE AQ+CFD +
Subjt:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK
          KD+V+W  LIAGYGF+GKG IAL  YSEFL +GMEPNHVIFL+VLS+CSH+G++ QGL I+ SM++DF +  N EH AC++DLL RA ++E+A+ FYK
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
          F  PSIDVLGI+L ACR NG  ++ ++I  +M+ELKP DAG+YV+L HS+A+M RWD V E+W QMRSLGLKKLPGWS IE
Subjt:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein5.2e-11833.64Show/hide
Query:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRE
        P + Y  P+  LL+ C+ L      L L   +  NGL  + +  + L++ + ++G +    +VF+ +  +  V + T++  +++  D+D A   F +MR 
Subjt:  PPDAYTFPS--LLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRE

Query:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMR
        ++++P       LL +     EL + + +H  ++  GF  +L     + NMY +C  +++AR +F+ M  RD+VSWN++++ YS+ G     L+++ +M 
Subjt:  NEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSKIGGIEEILQLLNAMR

Query:  IEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQM
         E +KP   T  S L A +    I +GK +HG  ++ G D    + T L+ +Y +C SL+ A ++F+  +E++VV W +MI   VQN+   +A+ +F +M
Subjt:  IEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCSDKALGVFYQM

Query:  METSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE
        ++  V+P   ++  AL ACA LG  + G  IH   +  G+  ++   NSL++MY KC  ++ + ++F K+  + LVSWNA++ G A+NG    A+ +F++
Subjt:  METSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE

Query:  MRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS
        MR+   +PD+ T +S++ A          KWIH  V RS L   + + TALVDMY KCG + IA+  FD M ++ + TW+ +I GYG +G GK AL  + 
Subjt:  MRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYS

Query:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV
        E     ++PN V FLSV+SACSHSGL+  GL  +  M +++ +  +++H   ++DLL RAG++ EA+ F   M  +P+++V G +LGAC+++ +V   E 
Subjt:  EFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEV

Query:  IAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
         A  + EL P D G +V LA+ Y + + W+ V +    M   GL+K PG S +E
Subjt:  IAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.5e-11231.57Show/hide
Query:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV
        +N++I   S  G   + L+ Y  +++    PD YTFPS++KAC  L     G  +++ I+  G   D ++G++L++ Y++ G +   R+VFD MP R++V
Subjt:  FNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPERNVV

Query:  PWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI
         W ++I  YS  G  + A  ++ +++ + I P S T+ S+LP    L ++   Q LH + +  G  S + ++N +V MY +     DAR +F+ MD RD 
Subjt:  PWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLL---QCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDI

Query:  VSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKD
        VS+N+++  Y K+  +EE +++     ++  KPD  T  S L A     D+ L K ++  +LK G  ++  V   LI +Y +C  +  A  VF +   KD
Subjt:  VSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKD

Query:  VVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEK
         V W ++ISG +Q+    +A+ +F  MM    +    T    ++   +L     G  +H   ++ GI +D+   N+L+ MYAKC  +  S  IF+ M   
Subjt:  VVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEK

Query:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQ
        D V+WN +++   + G  +  +    +MR +   PD  T +  L  C    A   GK IH  + R      + I  AL++MY KCG LE + + F+ M +
Subjt:  DLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQ

Query:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMM
        +D+VTW+ +I  YG  G+G+ AL  +++   +G+ P+ V+F++++ ACSHSGL+ +GL+ +E M   +++   +EH AC++DLLSR+ K+ +A  F + M
Subjt:  KDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMM

Query:  FKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
          +P   +   +L ACR +G ++  E ++R ++EL P D G  +  +++YA++ +WD V      ++   + K PG+S IE
Subjt:  FKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.3e-11633.19Show/hide
Query:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGD
        + + TY+ M  +   PD Y FP+LLKA   L     G  +H  +   G   DS  + ++L+N Y K G      KVFD + ERN V W ++I        
Subjt:  QVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSY-IGSSLINFYAKFGCIHFGRKVFDTMPERNVVPWTTIIGCYSQQGD

Query:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK
         ++A   F+ M +  ++P+S TL+S++     LP+ + L     ++ +G      N  + N++V MYG+ G +  ++ L  S   RD+V+WN++LS+  +
Subjt:  VDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFG-----SNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLLSAYSK

Query:  IGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGL
           + E L+ L  M +E ++PD+ T  S L A +    +R GK +H   LK G LD +  V + L+ +Y  C+ +    +VF+   ++ + LW AMI+G 
Subjt:  IGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILK-GGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGL

Query:  VQNDCSDKALGVFYQMMETS-VEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVA
         QN+   +AL +F  M E++ +   + T+A  + AC + G +    +IHG+V+++G+  D   QN+L+ MY++  +++ +  IF KM ++DLV+WN ++ 
Subjt:  VQNDCSDKALGVFYQMMETS-VEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVA

Query:  GHAKNGYLSKAIFFFNEMRTNLQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM
        G+  + +   A+   ++M+ NL+R            P+SIT++++L +C    AL +GK IH +  +++L   + + +ALVDMY KCG L++++K FD +
Subjt:  GHAKNGYLSKAIFFFNEMRTNLQR------------PDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK
         QK+++TW+ +I  YG +G G+ A+      +  G++PN V F+SV +ACSHSG++ +GL I+  M  D+ +  + +H AC++DLL RAG+++EAY    
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
        MM ++         LLGA R++ ++++GE+ A+ +++L+P  A +YV LA+ Y+S   WD   E    M+  G++K PG S IE
Subjt:  MMFKE-PSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

AT4G04370.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-22254.32Show/hide
Query:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE
        STK FN+ IN LSS G H QVL T+ SM      PD +TFPSLLKAC  L   S GLS+HQ ++VNG S D YI SSL+N YAKFG +   RKVF+ M E
Subjt:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE

Query:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD
        R+VV WT +IGCYS+ G V  A S+  +MR   I+P  VTLL +L G+LE+  LQCLH + ++YGF  ++A+ NSM+N+Y +C  + DA+ LF+ M+ RD
Subjt:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRD

Query:  IVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEK
        +VSWN+++S Y+ +G + EIL+LL  MR + ++PD+QTF ++LS S   CD+ +G+++H  I+K G D+D  ++T LI +YL+C   + + +V E    K
Subjt:  IVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEK

Query:  DVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVE
        DVV WT MISGL++   ++KAL VF +M+++  +  +  +AS +A+CAQLG +D+G S+HGYVLR G  LD PA NSL+TMYAKC  L++S  IF +M E
Subjt:  DVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVE

Query:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM
        +DLVSWNAI++G+A+N  L KA+  F EM+   +Q+ DS TV+SLLQAC   GAL  GK IH  V RS + PC +++TALVDMY KCG LE AQ+CFD +
Subjt:  KDLVSWNAIVAGHAKNGYLSKAIFFFNEMR-TNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYM

Query:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK
          KD+V+W  LIAGYGF+GKG IAL  YSEFL +GMEPNHVIFL+VLS+CSH+G++ QGL I+ SM++DF +  N EH AC++DLL RA ++E+A+ FYK
Subjt:  LQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYK

Query:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
          F  PSIDVLGI+L ACR NG  ++ ++I  +M+ELKP DAG+YV+L HS+A+M RWD V E+W QMRSLGLKKLPGWS IE
Subjt:  MMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-11232.07Show/hide
Query:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE
        S + +N++I+     G  +Q L  Y  M      PD  TFP L+KAC  L  F     L  ++   G+  + ++ SSLI  Y ++G I    K+FD + +
Subjt:  STKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTMPE

Query:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMD
        ++ V W  ++  Y++ G +D     F  MR ++I P +VT   +LS+    L + L   LH  +++ G     ++ NS+++MY +CG  DDA  LF  M 
Subjt:  RNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVT---LLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMD

Query:  YRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEAT
          D V+WN ++S Y + G +EE L     M    + PD  TF S L + +   ++   K +H  I++  + +D  + + LI  Y +CR + +A  +F   
Subjt:  YRDIVSWNSLLSAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEAT

Query:  IEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNK
           DVV++TAMISG + N     +L +F  +++  + P   TL S L     L    +G  +HG+++++G         +++ MYAKC R+  ++ IF +
Subjt:  IEKDVVLWTAMISGLVQNDCSDKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNK

Query:  MVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFD
        + ++D+VSWN+++   A++   S AI  F +M  +    D +++ + L AC    +   GK IH F+ + SL   +  E+ L+DMY KCGNL+ A   F 
Subjt:  MVEKDLVSWNAIVAGHAKNGYLSKAIFFFNEMRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFD

Query:  YMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYS
         M +K++V+W+++IA  G +GK K +L  + E +  +G+ P+ + FL ++S+C H G + +G+  + SM +D+ +    EH AC++DL  RAG++ EAY 
Subjt:  YMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFL-ATGMEPNHVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYS

Query:  FYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE
          K M   P   V G LLGACR++ +V+L EV + ++++L P ++G YV +++++A+   W+ V +  + M+   ++K+PG+S IE
Subjt:  FYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLAHSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGATTAATCCATGAGCCGATAGCCCATATCAGCACAAAATCATTCAACGCCCTCATAAATCGCCTTTCGTCCCAAGGCGCTCACCATCAAGTTCTTCAAACCTA
CATTTCTATGCAAAAGATAAATACACCACCAGATGCTTACACTTTTCCCAGCCTTCTCAAAGCTTGTACCATTTTGAACTTGTTTTCAGATGGCCTCTCGCTTCACCAAT
CTATCATTGTTAATGGCCTTTCTTATGATTCCTACATTGGGTCTTCGCTCATTAATTTCTATGCCAAATTTGGGTGCATTCATTTTGGTCGCAAGGTGTTTGATACAATG
CCCGAAAGAAATGTTGTTCCTTGGACTACCATTATTGGGTGCTATTCACAGCAGGGAGATGTTGACATTGCTTTTTCAATGTTCAAACAAATGCGGGAGAATGAAATTCA
GCCCACTTCTGTAACCTTGTTGAGTCTGCTCCCTGGTATTTTAGAGCTTCCCCTTCTTCAGTGTTTGCATTGTTGGATTATTTTGTATGGTTTTGGGTCAAATTTAGCTT
TATCGAACTCCATGGTGAATATGTATGGTAGATGTGGCAGCATTGACGATGCAAGAAGTTTGTTTGAGTCAATGGATTATAGAGACATAGTTTCTTGGAATTCATTATTA
TCAGCCTATTCGAAAATTGGCGGGATTGAAGAAATATTGCAGCTTCTAAATGCTATGAGGATTGAAGTTATCAAACCTGACAAACAAACTTTTTGCTCTGCTTTGTCTGC
TTCTGCTATAAAATGTGATATTAGATTAGGTAAGTTGGTGCATGGTTTGATTCTTAAAGGTGGATTAGATATGGATCAACAAGTAGAGACGACACTCATAGTTTTATACT
TGAGATGTAGGAGTTTGGATCTCGCACTTAAAGTTTTCGAAGCAACTATTGAAAAGGATGTGGTCCTCTGGACAGCAATGATATCAGGACTTGTCCAGAACGATTGTTCT
GACAAGGCATTGGGGGTCTTCTATCAAATGATGGAAACAAGCGTGGAGCCAGGTACTGCCACCTTAGCTAGTGCTCTGGCAGCCTGTGCTCAACTTGGTTGTTATGATAT
TGGTACCTCGATTCATGGTTATGTATTAAGGCAAGGAATAATGCTAGACATACCTGCTCAAAACTCCCTTGTCACCATGTATGCAAAATGTAATAGGTTGGAGCAGAGCT
TTGCAATTTTTAATAAGATGGTTGAAAAGGATTTAGTTTCTTGGAATGCTATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAGGCCATCTTTTTCTTCAATGAA
ATGAGAACAAACTTGCAAAGGCCTGACTCAATAACAGTGATCTCACTTCTTCAAGCTTGTGGTTTTACTGGTGCACTTTGCCAGGGAAAGTGGATTCACAACTTCGTTTT
TAGAAGTTCCCTTATGCCATGCATTATGATCGAAACGGCTCTAGTTGACATGTATTTCAAATGTGGAAACTTAGAGATTGCTCAGAAGTGTTTCGATTATATGTTGCAAA
AAGATCTTGTAACATGGAGCACACTTATTGCTGGATATGGTTTTAATGGAAAGGGAAAAATTGCTTTGAGAAAATATTCAGAGTTTCTTGCCACAGGGATGGAACCAAAT
CATGTTATTTTCCTTTCAGTTCTTTCTGCTTGTAGCCACAGTGGGCTTATTAGCCAAGGTTTGAGCATATACGAGTCAATGATTAAAGATTTCAGAATGTCAACAAATCT
CGAGCACCAAGCTTGTATCATTGACCTCCTTAGTCGAGCTGGAAAGGTTGAAGAGGCATATAGCTTCTATAAAATGATGTTTAAAGAACCCTCAATAGATGTTTTAGGCA
TACTCCTTGGTGCTTGTCGTGTTAATGGCAGTGTCAAACTTGGTGAGGTTATTGCTAGAGAAATGGTTGAATTAAAGCCCGTGGATGCTGGAAACTATGTGCAACTGGCT
CATAGTTATGCATCCATGAATAGATGGGATGGAGTGGAGGAGGCATGGACCCAAATGAGGTCTCTTGGTCTGAAAAAGCTTCCTGGATGGAGTTCTATTGAGGAGTTGAT
GAATAGAAACAGGGTAAAAAGTGCATATGGCATTGTAAAGAAAAATGATCTTCGGAAGGAATTCCTCAATATCTATCAGAAGTGTGAAGAGAGGCGCAAGGCACCCCCTA
GCGCCTTGCTTCGAAGAAGTGAGGCACAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGATTAATCCATGAGCCGATAGCCCATATCAGCACAAAATCATTCAACGCCCTCATAAATCGCCTTTCGTCCCAAGGCGCTCACCATCAAGTTCTTCAAACCTA
CATTTCTATGCAAAAGATAAATACACCACCAGATGCTTACACTTTTCCCAGCCTTCTCAAAGCTTGTACCATTTTGAACTTGTTTTCAGATGGCCTCTCGCTTCACCAAT
CTATCATTGTTAATGGCCTTTCTTATGATTCCTACATTGGGTCTTCGCTCATTAATTTCTATGCCAAATTTGGGTGCATTCATTTTGGTCGCAAGGTGTTTGATACAATG
CCCGAAAGAAATGTTGTTCCTTGGACTACCATTATTGGGTGCTATTCACAGCAGGGAGATGTTGACATTGCTTTTTCAATGTTCAAACAAATGCGGGAGAATGAAATTCA
GCCCACTTCTGTAACCTTGTTGAGTCTGCTCCCTGGTATTTTAGAGCTTCCCCTTCTTCAGTGTTTGCATTGTTGGATTATTTTGTATGGTTTTGGGTCAAATTTAGCTT
TATCGAACTCCATGGTGAATATGTATGGTAGATGTGGCAGCATTGACGATGCAAGAAGTTTGTTTGAGTCAATGGATTATAGAGACATAGTTTCTTGGAATTCATTATTA
TCAGCCTATTCGAAAATTGGCGGGATTGAAGAAATATTGCAGCTTCTAAATGCTATGAGGATTGAAGTTATCAAACCTGACAAACAAACTTTTTGCTCTGCTTTGTCTGC
TTCTGCTATAAAATGTGATATTAGATTAGGTAAGTTGGTGCATGGTTTGATTCTTAAAGGTGGATTAGATATGGATCAACAAGTAGAGACGACACTCATAGTTTTATACT
TGAGATGTAGGAGTTTGGATCTCGCACTTAAAGTTTTCGAAGCAACTATTGAAAAGGATGTGGTCCTCTGGACAGCAATGATATCAGGACTTGTCCAGAACGATTGTTCT
GACAAGGCATTGGGGGTCTTCTATCAAATGATGGAAACAAGCGTGGAGCCAGGTACTGCCACCTTAGCTAGTGCTCTGGCAGCCTGTGCTCAACTTGGTTGTTATGATAT
TGGTACCTCGATTCATGGTTATGTATTAAGGCAAGGAATAATGCTAGACATACCTGCTCAAAACTCCCTTGTCACCATGTATGCAAAATGTAATAGGTTGGAGCAGAGCT
TTGCAATTTTTAATAAGATGGTTGAAAAGGATTTAGTTTCTTGGAATGCTATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAGGCCATCTTTTTCTTCAATGAA
ATGAGAACAAACTTGCAAAGGCCTGACTCAATAACAGTGATCTCACTTCTTCAAGCTTGTGGTTTTACTGGTGCACTTTGCCAGGGAAAGTGGATTCACAACTTCGTTTT
TAGAAGTTCCCTTATGCCATGCATTATGATCGAAACGGCTCTAGTTGACATGTATTTCAAATGTGGAAACTTAGAGATTGCTCAGAAGTGTTTCGATTATATGTTGCAAA
AAGATCTTGTAACATGGAGCACACTTATTGCTGGATATGGTTTTAATGGAAAGGGAAAAATTGCTTTGAGAAAATATTCAGAGTTTCTTGCCACAGGGATGGAACCAAAT
CATGTTATTTTCCTTTCAGTTCTTTCTGCTTGTAGCCACAGTGGGCTTATTAGCCAAGGTTTGAGCATATACGAGTCAATGATTAAAGATTTCAGAATGTCAACAAATCT
CGAGCACCAAGCTTGTATCATTGACCTCCTTAGTCGAGCTGGAAAGGTTGAAGAGGCATATAGCTTCTATAAAATGATGTTTAAAGAACCCTCAATAGATGTTTTAGGCA
TACTCCTTGGTGCTTGTCGTGTTAATGGCAGTGTCAAACTTGGTGAGGTTATTGCTAGAGAAATGGTTGAATTAAAGCCCGTGGATGCTGGAAACTATGTGCAACTGGCT
CATAGTTATGCATCCATGAATAGATGGGATGGAGTGGAGGAGGCATGGACCCAAATGAGGTCTCTTGGTCTGAAAAAGCTTCCTGGATGGAGTTCTATTGAGGAGTTGAT
GAATAGAAACAGGGTAAAAAGTGCATATGGCATTGTAAAGAAAAATGATCTTCGGAAGGAATTCCTCAATATCTATCAGAAGTGTGAAGAGAGGCGCAAGGCACCCCCTA
GCGCCTTGCTTCGAAGAAGTGAGGCACAGTGA
Protein sequenceShow/hide protein sequence
MNRLIHEPIAHISTKSFNALINRLSSQGAHHQVLQTYISMQKINTPPDAYTFPSLLKACTILNLFSDGLSLHQSIIVNGLSYDSYIGSSLINFYAKFGCIHFGRKVFDTM
PERNVVPWTTIIGCYSQQGDVDIAFSMFKQMRENEIQPTSVTLLSLLPGILELPLLQCLHCWIILYGFGSNLALSNSMVNMYGRCGSIDDARSLFESMDYRDIVSWNSLL
SAYSKIGGIEEILQLLNAMRIEVIKPDKQTFCSALSASAIKCDIRLGKLVHGLILKGGLDMDQQVETTLIVLYLRCRSLDLALKVFEATIEKDVVLWTAMISGLVQNDCS
DKALGVFYQMMETSVEPGTATLASALAACAQLGCYDIGTSIHGYVLRQGIMLDIPAQNSLVTMYAKCNRLEQSFAIFNKMVEKDLVSWNAIVAGHAKNGYLSKAIFFFNE
MRTNLQRPDSITVISLLQACGFTGALCQGKWIHNFVFRSSLMPCIMIETALVDMYFKCGNLEIAQKCFDYMLQKDLVTWSTLIAGYGFNGKGKIALRKYSEFLATGMEPN
HVIFLSVLSACSHSGLISQGLSIYESMIKDFRMSTNLEHQACIIDLLSRAGKVEEAYSFYKMMFKEPSIDVLGILLGACRVNGSVKLGEVIAREMVELKPVDAGNYVQLA
HSYASMNRWDGVEEAWTQMRSLGLKKLPGWSSIEELMNRNRVKSAYGIVKKNDLRKEFLNIYQKCEERRKAPPSALLRRSEAQ