; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034475 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034475
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr3:7655975..7659233
RNA-Seq ExpressionLag0034475
SyntenyLag0034475
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051836.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0089.05Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS LRQ+VDLLCSR  ATSEAYTQLVLECVRTN+++QAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDA+NLFDKML+RD FSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+Q+L+ATFD+MPFRDSVSYNT IAGF+GNS P+ESL+LFKRMQREGF  TEYT VS LNASAQLLDLR GKQIHGS+IVRNFLGNVFIWN LTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L  KNLVSWNLMISGY KNGQPE+CIGLLH+MRLSGHMP+QVTMSTIIAAYCQCGRVDEAR+VFSEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEHI PDSYTLSSVVSSCAKLASL+HGQAVHGKSIL+GL+NNLLVSSALIDMYSKCGFI+DARSVF LMPTRNV+SWNAM+VG AQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NGHDKDALELFENMLQQKFKPDNVTFIG+LSACLH NW+EQGQEYFDSISNQHGLTPTLDHYACMVNLLGR GRI+QAVSLIK+M HEPD+LIWSTLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
         +TKGDI NAEMAARHLFELDP +AVPY+MLSNMYASMGRWK VA+VR LMKSKNVKKFAG+SWIEID EVH+FTSEDRTHPE+E IYEELN+LI KLQ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        EGFTPNT LVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNG+SPIRIIKNIRIC+DCHEFMKFAS IIGRQIILRDSNRFHHFSTGKCSC D+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

XP_004147314.1 putative pentatricopeptide repeat-containing protein At1g68930 [Cucumis sativus]0.0e+0088.76Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS LRQ+VDLLCSR  ATSEAYTQLVLECVRTN+++QAKRLQSHMEHHLFQPTD FLHNQLLHLYAKFGKLRDA+NLFDKML+RD+FSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+Q+L+ATFD+MPFRDSVSYNT IAGF+GNS P+ESLELFKRMQREGF  TEYT VS LNASAQL DLR GKQIHGS+IVRNFLGNVFIWNALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L  KNLVSWNLMISGY KNGQPE+CIGLLH+MRLSGHMPDQVTMSTIIAAYCQCGRVDEAR+VFSEFKEKDIVCWTAM+VGYAKN
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEHI PDSYTLSSVVSSCAKLASL+HGQAVHGKSIL+GL+NNLLVSSALIDMYSKCGFI+DARSVF LMPTRNV+SWNAM+VG AQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NGHDKDALELFENMLQQKFKPDNVTFIG+LSACLH NW+EQGQEYFDSI+NQHG+TPTLDHYACMVNLLGR GRI+QAV+LIK+M H+PD+LIWSTLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
         +TKGDI NAE+AARHLFELDP  AVPY+MLSNMYASMGRWKDVA+VR LMKSKNVKKFAG+SWIEIDNEVH+FTSEDRTHPE+E IYE+LNMLI KLQ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNG+SPIRIIKNIRIC+DCHEFMKFAS IIGRQIILRDSNRFHHFSTGKCSC D+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

XP_022145099.1 pentatricopeptide repeat-containing protein At4g02750-like [Momordica charantia]0.0e+0089.91Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        M++K +LRQA+DLLCSRG A+SEAYT L+LECVRTN+VDQAKRLQSHMEHHLFQP DPFL NQLLHLYAKFGK+RDA+NLFDKMLERDVFSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+Q+L+ATFD+MPFRDSVSYNT IAGFAGN  PKESLELF+RMQ EGFV TEYTNVSALNA+AQLLDLRRGK+IHGSVIV  FLGN FIWNALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L NKNL+SWNLMISGYVKNGQPE+CIGLLHEM++SGHMPDQVTMSTIIAAYCQC  VDEARKVFSEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEH++PDSYTLSSVVSSCAKLASLYHGQAVHGKSIL+GLDNNLLVSSALIDMYSKCGF+++ARSVF +MPTRNVISWNAM+VGYAQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NGHDKDAL  FENMLQQKFKPDNVTFIGVLSACLHSNW+E+GQ YFDSISNQHGL PT+DHYACMVNLLGRLGRIDQAV LIKSMPHEPD LIWSTLLSV
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
        S  KGDIANAEMAAR+LFELDPLNAVPYVMLSNMYA MGRWKDVA+VRTLMKSKNVKKFAGYSWIEIDN+VHKFTSEDRTHPETEKIYEELNMLIRK Q 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        +GFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGL+KKPNGV+PIRIIKNIRICSDCHEFMKFAS II RQIILRDSNRFHHF+TGKCSCKD+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

XP_022960689.1 pentatricopeptide repeat-containing protein At4g02750-like isoform X1 [Cucurbita moschata]0.0e+0088.76Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS+LRQAVDLLCSR  ATSEAYTQLVLECVR N++DQAKRLQSHMEHHLFQP DPFLHNQLLHLYAKFGKLRDA+NLFDKMLERDVFSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+QDLRATFD+MP+RDSVSYNTIIAG +GNSFPKESLELF+RMQREG   TEYTNVSALNASAQLLDLRRGKQIHGSVIV N+LGNVFI NALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFDRL NKNLVSWNLMISGYVKNGQPE+CIGLLH+MRLSGHMPDQVT+ST+IAAYCQCGR DEAR+VF+EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEH  PDSYTLSSVVSSCAKLASLYHGQA+HGKSIL+GLDNNLLVSSALIDMYSKCG IEDARSVF +MPTRNVI+WNAM+VGYAQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NG DKD LELFENMLQ+KFKPDNVTF+GVLSACLHSN++EQGQ +FDSISNQHGLTP+LDHYACMVNLLGR GRIDQAV LIKSMPHEPD+LIWSTLLSV
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
        S TKGD+A+AEM  RHLFELDP NAVPY+MLSNMYASMGRWKDVA VR++MK+KNVKKFAGYSWIEIDNEVHKFTSEDRTHPETE+IYEEL +LIRKL+ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        +GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGLIKK NGVSPIRIIKNIRICSDCHEFMKFASM I RQIILRDSNRFHHFS GKCSCKD+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

XP_038895252.1 pentatricopeptide repeat-containing protein At2g22070-like [Benincasa hispida]0.0e+0089.91Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS+LRQA+DLLCS+  ATSEAYTQLVLECVR N ++QAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDA+NLFDKMLERDVFSWNA+LSA+A
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+Q+LRATFDQMPFRDSVSYNT IAGFAGNS PKESLELFKRMQREGF  TEYT VS LNAS QLLDLRRGKQIHGSVIV NFLGNVFI NALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD   NKNLVSWNLMISGY KNG+PE+CIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVD ARKVFSEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDAL LFNEMLLEHI PDSYTLSSVVSSCAKLA L+HGQAVHGKSIL+GL+NNLLVSSALIDMYSKCGFI+DARSVF LMPTRNV+SWNAM+VG AQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLH NW+EQGQ YFDSISNQHGLTPTLDHYACMVNLLGR GRI QAVSLIK+M HEPD+LIWSTLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
        S+TKGD+ NAEMAA+HLFELDP +AVPY+MLSNMYASMGRWKDVA+VR LM SKNVKKFAGYSWIEIDNEV +FTSEDRTHPETEKIYEELNMLI KLQ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNG+SPIRIIKNIRIC+DCHEFMKFAS II RQIILRDSNRFHHFSTGKCSCKD+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

TrEMBL top hitse value%identityAlignment
A0A0A0LUY3 DYW_deaminase domain-containing protein0.0e+0088.76Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS LRQ+VDLLCSR  ATSEAYTQLVLECVRTN+++QAKRLQSHMEHHLFQPTD FLHNQLLHLYAKFGKLRDA+NLFDKML+RD+FSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+Q+L+ATFD+MPFRDSVSYNT IAGF+GNS P+ESLELFKRMQREGF  TEYT VS LNASAQL DLR GKQIHGS+IVRNFLGNVFIWNALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L  KNLVSWNLMISGY KNGQPE+CIGLLH+MRLSGHMPDQVTMSTIIAAYCQCGRVDEAR+VFSEFKEKDIVCWTAM+VGYAKN
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEHI PDSYTLSSVVSSCAKLASL+HGQAVHGKSIL+GL+NNLLVSSALIDMYSKCGFI+DARSVF LMPTRNV+SWNAM+VG AQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NGHDKDALELFENMLQQKFKPDNVTFIG+LSACLH NW+EQGQEYFDSI+NQHG+TPTLDHYACMVNLLGR GRI+QAV+LIK+M H+PD+LIWSTLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
         +TKGDI NAE+AARHLFELDP  AVPY+MLSNMYASMGRWKDVA+VR LMKSKNVKKFAG+SWIEIDNEVH+FTSEDRTHPE+E IYE+LNMLI KLQ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNG+SPIRIIKNIRIC+DCHEFMKFAS IIGRQIILRDSNRFHHFSTGKCSC D+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

A0A5A7UC76 Pentatricopeptide repeat-containing protein0.0e+0089.05Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS LRQ+VDLLCSR  ATSEAYTQLVLECVRTN+++QAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDA+NLFDKML+RD FSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+Q+L+ATFD+MPFRDSVSYNT IAGF+GNS P+ESL+LFKRMQREGF  TEYT VS LNASAQLLDLR GKQIHGS+IVRNFLGNVFIWN LTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L  KNLVSWNLMISGY KNGQPE+CIGLLH+MRLSGHMP+QVTMSTIIAAYCQCGRVDEAR+VFSEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEHI PDSYTLSSVVSSCAKLASL+HGQAVHGKSIL+GL+NNLLVSSALIDMYSKCGFI+DARSVF LMPTRNV+SWNAM+VG AQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NGHDKDALELFENMLQQKFKPDNVTFIG+LSACLH NW+EQGQEYFDSISNQHGLTPTLDHYACMVNLLGR GRI+QAVSLIK+M HEPD+LIWSTLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
         +TKGDI NAEMAARHLFELDP +AVPY+MLSNMYASMGRWK VA+VR LMKSKNVKKFAG+SWIEID EVH+FTSEDRTHPE+E IYEELN+LI KLQ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        EGFTPNT LVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNG+SPIRIIKNIRIC+DCHEFMKFAS IIGRQIILRDSNRFHHFSTGKCSC D+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

A0A6J1CU81 pentatricopeptide repeat-containing protein At4g02750-like0.0e+0089.91Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        M++K +LRQA+DLLCSRG A+SEAYT L+LECVRTN+VDQAKRLQSHMEHHLFQP DPFL NQLLHLYAKFGK+RDA+NLFDKMLERDVFSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+Q+L+ATFD+MPFRDSVSYNT IAGFAGN  PKESLELF+RMQ EGFV TEYTNVSALNA+AQLLDLRRGK+IHGSVIV  FLGN FIWNALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L NKNL+SWNLMISGYVKNGQPE+CIGLLHEM++SGHMPDQVTMSTIIAAYCQC  VDEARKVFSEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEH++PDSYTLSSVVSSCAKLASLYHGQAVHGKSIL+GLDNNLLVSSALIDMYSKCGF+++ARSVF +MPTRNVISWNAM+VGYAQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NGHDKDAL  FENMLQQKFKPDNVTFIGVLSACLHSNW+E+GQ YFDSISNQHGL PT+DHYACMVNLLGRLGRIDQAV LIKSMPHEPD LIWSTLLSV
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
        S  KGDIANAEMAAR+LFELDPLNAVPYVMLSNMYA MGRWKDVA+VRTLMKSKNVKKFAGYSWIEIDN+VHKFTSEDRTHPETEKIYEELNMLIRK Q 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        +GFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGL+KKPNGV+PIRIIKNIRICSDCHEFMKFAS II RQIILRDSNRFHHF+TGKCSCKD+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

A0A6J1HBT0 pentatricopeptide repeat-containing protein At4g02750-like isoform X10.0e+0088.76Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS+LRQAVDLLCSR  ATSEAYTQLVLECVR N++DQAKRLQSHMEHHLFQP DPFLHNQLLHLYAKFGKLRDA+NLFDKMLERDVFSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+QDLRATFD+MP+RDSVSYNTIIAG +GNSFPKESLELF+RMQREG   TEYTNVSALNASAQLLDLRRGKQIHGSVIV N+LGNVFI NALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFDRL NKNLVSWNLMISGYVKNGQPE+CIGLLH+MRLSGHMPDQVT+ST+IAAYCQCGR DEAR+VF+EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEH  PDSYTLSSVVSSCAKLASLYHGQA+HGKSIL+GLDNNLLVSSALIDMYSKCG IEDARSVF +MPTRNVI+WNAM+VGYAQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NG DKD LELFENMLQ+KFKPDNVTF+GVLSACLHSN++EQGQ +FDSISNQHGLTP+LDHYACMVNLLGR GRIDQAV LIKSMPHEPD+LIWSTLLSV
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
        S TKGD+A+AEM  RHLFELDP NAVPY+MLSNMYASMGRWKDVA VR++MK+KNVKKFAGYSWIEIDNEVHKFTSEDRTHPETE+IYEEL +LIRKL+ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        +GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGLIKK NGVSPIRIIKNIRICSDCHEFMKFASM I RQIILRDSNRFHHFS GKCSCKD+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

A0A6J1JAW4 pentatricopeptide repeat-containing protein At4g02750-like isoform X10.0e+0088.33Show/hide
Query:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA
        MK+KS+LRQAV LLCSR  ATSEAYTQLVLECVR N++DQAKRLQSHMEHHLFQP DPFLHNQLLHLYAKFGKLRDA+NLFDKMLERDVFSWNALLSAYA
Subjt:  MKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYA

Query:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGS+QDLRATFD+MP+RDSVSYNTIIAG +GNSFPKESLELF+RMQREG   TEYTNVSALNASAQLLDLRRGKQIHGSVIV N+LGNVFI NALTDMY
Subjt:  KSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN
        AKCGEIE ARWLFDRL NKNLVSWNLMISGYVKNGQPE+CIGLLHEMRLSGHMPDQVT+ST+IAAYCQCGR DEAR+VF+EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ
        GREEDALLLFNEMLLEH  PDSYT SSVVSSCAKLASLYHGQA+HGKSIL+GLDNNLLVSSALIDMYSKCG I+DARSVF +MPTRNVI+WNAM+VGYAQ
Subjt:  GREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV
        NG DKD LELFENMLQ+KFKPDNVTF+GVLSACLHSN +EQGQ +FDSISNQHGLTP+LDHYACMVNLLGR GRIDQAV+LIKSMPHEPD+LIWSTLLSV
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSV

Query:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN
        S TKGD+A AEMA RHLFELD  NAVPY+MLSNMYASMGRWKDVA VR++MK+KNVKKFAGYSWIEIDNEVHKFTSEDRTHPETE+IYEEL +LIRKL+ 
Subjt:  STTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQN

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS
        +GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGLIKK NGVSPIRIIKNIRICSDCHEFMKFASM I RQIILRDSNRFHHFS GKCSCKD+
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDS

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.0e-13635.75Show/hide
Query:  KMKSKSQLRQAVDLLCSRGGATSEAY-TQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDP---FLHNQLLHLYAKFGKLRDARNLFDKM-----------
        +  S  +LRQ + L+   G      + T+LV    R   VD+A R        +F+P D     L++ +L  +AK   L  A   F +M           
Subjt:  KMKSKSQLRQAVDLLCSRGGATSEAY-TQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDP---FLHNQLLHLYAKFGKLRDARNLFDKM-----------

Query:  ----------------------------LERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEY
                                       D+F+   L + YAK   V + R  FD+MP RD VS+NTI+AG++ N   + +LE+ K M  E    +  
Subjt:  ----------------------------LERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEY

Query:  TNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQ
        T VS L A + L  +  GK+IHG  +   F   V I  AL DMYAKCG +E AR LFD ++ +N+VSWN MI  YV+N  P+  + +  +M   G  P  
Subjt:  TNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQ

Query:  VT-----------------------------------MSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPD
        V+                                   ++++I+ YC+C  VD A  +F + + + +V W AM++G+A+NGR  DAL  F++M    ++PD
Subjt:  VT-----------------------------------MSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPD

Query:  SYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKP
        ++T  SV+++ A+L+  +H + +HG  + S LD N+ V++AL+DMY+KCG I  AR +F +M  R+V +WNAM+ GY  +G  K ALELFE M +   KP
Subjt:  SYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKP

Query:  DNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELD
        + VTF+ V+SAC HS  +E G + F  +   + +  ++DHY  MV+LLGR GR+++A   I  MP +P   ++  +L       ++  AE AA  LFEL+
Subjt:  DNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELD

Query:  PLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFK
        P +   +V+L+N+Y +   W+ V  VR  M  + ++K  G S +EI NEVH F S    HP+++KIY  L  LI  ++  G+ P+TNLVL  V  D K +
Subjt:  PLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFK

Query:  SICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
         +  HSEKLA++FGL+    G + I + KN+R+C+DCH   K+ S++ GR+I++RD  RFHHF  G CSC D
Subjt:  SICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g689307.1e-14236.12Show/hide
Query:  SEAYTQLVLECV---RTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFR
        S  Y+  + +C+     N+    K +  ++   L  P + FL+N ++H YA       AR +FD++ + ++FSWN LL AY+K+G + ++ +TF+++P R
Subjt:  SEAYTQLVLECV---RTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFR

Query:  DSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVT-TEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAK---------------
        D V++N +I G++ +     +++ +  M R+     T  T ++ L  S+    +  GKQIHG VI   F   + + + L  MYA                
Subjt:  DSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVT-TEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAK---------------

Query:  ----------------CGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAA---------------------
                        CG IE A  LF R + K+ VSW  MI G  +NG  +  I    EM++ G   DQ    +++ A                     
Subjt:  ----------------CGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAA---------------------

Query:  --------------YCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILS
                      YC+C  +  A+ VF   K+K++V WTAM+VGY + GR E+A+ +F +M    I PD YTL   +S+CA ++SL  G   HGK+I S
Subjt:  --------------YCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILS

Query:  GLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISN
        GL + + VS++L+ +Y KCG I+D+  +F  M  R+ +SW AMV  YAQ G   + ++LF+ M+Q   KPD VT  GV+SAC  +  +E+GQ YF  +++
Subjt:  GLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISN

Query:  QHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM
        ++G+ P++ HY+CM++L  R GR+++A+  I  MP  PD + W+TLLS    KG++   + AA  L ELDP +   Y +LS++YAS G+W  VA +R  M
Subjt:  QHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM

Query:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN
        + KNVKK  G SWI+   ++H F+++D + P  ++IY +L  L  K+ + G+ P+T+ V HDV E  K K + +HSE+LA+AFGLI  P+G  PIR+ KN
Subjt:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN

Query:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        +R+C DCH   K  S + GR+I++RD+ RFH F  G CSC D
Subjt:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g099503.8e-13539.56Show/hide
Query:  NALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFI
        N L++ YAK GS+ D R  F  M  +DSVS+N++I G   N    E++E +K M+R   +   +T +S+L++ A L   + G+QIHG  +      NV +
Subjt:  NALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFI

Query:  WNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQ--PERCIGLL---------------------------------HEMRLSGHMPDQV
         NAL  +YA+ G + + R +F  +   + VSWN +I    ++ +  PE  +  L                                 H + L  ++ D+ 
Subjt:  WNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQ--PERCIGLL---------------------------------HEMRLSGHMPDQV

Query:  TM-STIIAAYCQCGRVDEARKVFSEFKE-KDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDN
        T  + +IA Y +CG +D   K+FS   E +D V W +M+ GY  N     AL L   ML    R DS+  ++V+S+ A +A+L  G  VH  S+ + L++
Subjt:  TM-STIIAAYCQCGRVDEARKVFSEFKE-KDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDN

Query:  NLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENM-LQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHG
        +++V SAL+DMYSKCG ++ A   F  MP RN  SWN+M+ GYA++G  ++AL+LFE M L  +  PD+VTF+GVLSAC H+  +E+G ++F+S+S+ +G
Subjt:  NLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENM-LQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHG

Query:  LTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEM---AARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM
        L P ++H++CM ++LGR G +D+    I+ MP +P+ LIW T+L  +  + +   AE+   AA  LF+L+P NAV YV+L NMYA+ GRW+D+   R  M
Subjt:  LTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEM---AARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM

Query:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN
        K  +VKK AGYSW+ + + VH F + D++HP+ + IY++L  L RK+++ G+ P T   L+D+ ++ K + + +HSEKLA+AF L  + +   PIRI+KN
Subjt:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN

Query:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        +R+C DCH   K+ S I GRQIILRDSNRFHHF  G CSC D
Subjt:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220701.7e-14336.28Show/hide
Query:  FLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTN
        +L N L+++Y+K G    AR LFD+M  R  FSWN +LSAY+K G +      FDQ+P RDSVS+ T+I G+       +++ +   M +EG   T++T 
Subjt:  FLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTN

Query:  VSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLI-------------------------------NKNLVSWNLM
         + L + A    +  GK++H  ++     GNV + N+L +MYAKCG+   A+++FDR++                                +++V+WN M
Subjt:  VSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLI-------------------------------NKNLVSWNLM

Query:  ISGYVKNGQPERCIGLLHEM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQCGRVDEARK--------------
        ISG+ + G   R + +  +M R S   PD+ T++++++A                                   Y +CG V+ AR+              
Subjt:  ISGYVKNGQPERCIGLLHEM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQCGRVDEARK--------------

Query:  -------------------VFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLL
                           +F   K++D+V WTAM+VGY ++G   +A+ LF  M+    RP+SYTL++++S  + LASL HG+ +HG ++ SG   ++ 
Subjt:  -------------------VFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLL

Query:  VSSALIDMYSKCGFIEDARSVFKLMP-TRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTP
        VS+ALI MY+K G I  A   F L+   R+ +SW +M++  AQ+GH ++ALELFE ML +  +PD++T++GV SAC H+  + QG++YFD + +   + P
Subjt:  VSSALIDMYSKCGFIEDARSVFKLMP-TRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTP

Query:  TLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVK
        TL HYACMV+L GR G + +A   I+ MP EPD + W +LLS      +I   ++AA  L  L+P N+  Y  L+N+Y++ G+W++ A +R  MK   VK
Subjt:  TLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVK

Query:  KFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSD
        K  G+SWIE+ ++VH F  ED THPE  +IY  +  +  +++  G+ P+T  VLHD+ E+ K + +  HSEKLA+AFGLI  P+  + +RI+KN+R+C+D
Subjt:  KFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSD

Query:  CHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        CH  +KF S ++GR+II+RD+ RFHHF  G CSC+D
Subjt:  CHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

Q9SY02 Pentatricopeptide repeat-containing protein At4g027504.3e-14738.14Show/hide
Query:  TSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDS
        +S +Y  ++   +R  + + A++L   M        D    N ++  Y +   L  AR LF+ M ERDV SWN +LS YA++G V D R+ FD+MP ++ 
Subjt:  TSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDS

Query:  VSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKN
        VS+N +++ +  NS  +E+  LFK   RE +    +     L    +   +   +Q   S+ VR    +V  WN +   YA+ G+I++AR LFD    ++
Subjt:  VSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKN

Query:  LVSWNLMISGYVKNGQPERCIGLLHEM----------RLSGHMPDQ-----------------VTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAM
        + +W  M+SGY++N   E    L  +M           L+G++  +                  T +T+I  Y QCG++ EA+ +F +  ++D V W AM
Subjt:  LVSWNLMISGYVKNGQPERCIGLLHEM----------RLSGHMPDQ-----------------VTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAM

Query:  LVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNA
        + GY+++G   +AL LF +M  E  R +  + SS +S+CA + +L  G+ +HG+ +  G +    V +AL+ MY KCG IE+A  +FK M  ++++SWN 
Subjt:  LVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNA

Query:  MVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLI
        M+ GY+++G  + AL  FE+M ++  KPD+ T + VLSAC H+  +++G++YF +++  +G+ P   HYACMV+LLGR G ++ A +L+K+MP EPD  I
Subjt:  MVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLI

Query:  WSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNM
        W TLL  S   G+   AE AA  +F ++P N+  YV+LSN+YAS GRW DV  +R  M+ K VKK  GYSWIEI N+ H F+  D  HPE ++I+  L  
Subjt:  WSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNM

Query:  LIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        L  +++  G+   T++VLHDV E+EK + + +HSE+LA+A+G+++  +G  PIR+IKN+R+C DCH  +K+ + I GR IILRD+NRFHHF  G CSC D
Subjt:  LIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-13735.75Show/hide
Query:  KMKSKSQLRQAVDLLCSRGGATSEAY-TQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDP---FLHNQLLHLYAKFGKLRDARNLFDKM-----------
        +  S  +LRQ + L+   G      + T+LV    R   VD+A R        +F+P D     L++ +L  +AK   L  A   F +M           
Subjt:  KMKSKSQLRQAVDLLCSRGGATSEAY-TQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDP---FLHNQLLHLYAKFGKLRDARNLFDKM-----------

Query:  ----------------------------LERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEY
                                       D+F+   L + YAK   V + R  FD+MP RD VS+NTI+AG++ N   + +LE+ K M  E    +  
Subjt:  ----------------------------LERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEY

Query:  TNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQ
        T VS L A + L  +  GK+IHG  +   F   V I  AL DMYAKCG +E AR LFD ++ +N+VSWN MI  YV+N  P+  + +  +M   G  P  
Subjt:  TNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQ

Query:  VT-----------------------------------MSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPD
        V+                                   ++++I+ YC+C  VD A  +F + + + +V W AM++G+A+NGR  DAL  F++M    ++PD
Subjt:  VT-----------------------------------MSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPD

Query:  SYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKP
        ++T  SV+++ A+L+  +H + +HG  + S LD N+ V++AL+DMY+KCG I  AR +F +M  R+V +WNAM+ GY  +G  K ALELFE M +   KP
Subjt:  SYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKP

Query:  DNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELD
        + VTF+ V+SAC HS  +E G + F  +   + +  ++DHY  MV+LLGR GR+++A   I  MP +P   ++  +L       ++  AE AA  LFEL+
Subjt:  DNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELD

Query:  PLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFK
        P +   +V+L+N+Y +   W+ V  VR  M  + ++K  G S +EI NEVH F S    HP+++KIY  L  LI  ++  G+ P+TNLVL  V  D K +
Subjt:  PLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFK

Query:  SICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
         +  HSEKLA++FGL+    G + I + KN+R+C+DCH   K+ S++ GR+I++RD  RFHHF  G CSC D
Subjt:  SICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein5.0e-14336.12Show/hide
Query:  SEAYTQLVLECV---RTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFR
        S  Y+  + +C+     N+    K +  ++   L  P + FL+N ++H YA       AR +FD++ + ++FSWN LL AY+K+G + ++ +TF+++P R
Subjt:  SEAYTQLVLECV---RTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFR

Query:  DSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVT-TEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAK---------------
        D V++N +I G++ +     +++ +  M R+     T  T ++ L  S+    +  GKQIHG VI   F   + + + L  MYA                
Subjt:  DSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVT-TEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAK---------------

Query:  ----------------CGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAA---------------------
                        CG IE A  LF R + K+ VSW  MI G  +NG  +  I    EM++ G   DQ    +++ A                     
Subjt:  ----------------CGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAA---------------------

Query:  --------------YCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILS
                      YC+C  +  A+ VF   K+K++V WTAM+VGY + GR E+A+ +F +M    I PD YTL   +S+CA ++SL  G   HGK+I S
Subjt:  --------------YCQCGRVDEARKVFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILS

Query:  GLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISN
        GL + + VS++L+ +Y KCG I+D+  +F  M  R+ +SW AMV  YAQ G   + ++LF+ M+Q   KPD VT  GV+SAC  +  +E+GQ YF  +++
Subjt:  GLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISN

Query:  QHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM
        ++G+ P++ HY+CM++L  R GR+++A+  I  MP  PD + W+TLLS    KG++   + AA  L ELDP +   Y +LS++YAS G+W  VA +R  M
Subjt:  QHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM

Query:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN
        + KNVKK  G SWI+   ++H F+++D + P  ++IY +L  L  K+ + G+ P+T+ V HDV E  K K + +HSE+LA+AFGLI  P+G  PIR+ KN
Subjt:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN

Query:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        +R+C DCH   K  S + GR+I++RD+ RFH F  G CSC D
Subjt:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein1.2e-14436.28Show/hide
Query:  FLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTN
        +L N L+++Y+K G    AR LFD+M  R  FSWN +LSAY+K G +      FDQ+P RDSVS+ T+I G+       +++ +   M +EG   T++T 
Subjt:  FLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTN

Query:  VSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLI-------------------------------NKNLVSWNLM
         + L + A    +  GK++H  ++     GNV + N+L +MYAKCG+   A+++FDR++                                +++V+WN M
Subjt:  VSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLI-------------------------------NKNLVSWNLM

Query:  ISGYVKNGQPERCIGLLHEM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQCGRVDEARK--------------
        ISG+ + G   R + +  +M R S   PD+ T++++++A                                   Y +CG V+ AR+              
Subjt:  ISGYVKNGQPERCIGLLHEM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQCGRVDEARK--------------

Query:  -------------------VFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLL
                           +F   K++D+V WTAM+VGY ++G   +A+ LF  M+    RP+SYTL++++S  + LASL HG+ +HG ++ SG   ++ 
Subjt:  -------------------VFSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLL

Query:  VSSALIDMYSKCGFIEDARSVFKLMP-TRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTP
        VS+ALI MY+K G I  A   F L+   R+ +SW +M++  AQ+GH ++ALELFE ML +  +PD++T++GV SAC H+  + QG++YFD + +   + P
Subjt:  VSSALIDMYSKCGFIEDARSVFKLMP-TRNVISWNAMVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTP

Query:  TLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVK
        TL HYACMV+L GR G + +A   I+ MP EPD + W +LLS      +I   ++AA  L  L+P N+  Y  L+N+Y++ G+W++ A +R  MK   VK
Subjt:  TLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVK

Query:  KFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSD
        K  G+SWIE+ ++VH F  ED THPE  +IY  +  +  +++  G+ P+T  VLHD+ E+ K + +  HSEKLA+AFGLI  P+  + +RI+KN+R+C+D
Subjt:  KFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSD

Query:  CHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        CH  +KF S ++GR+II+RD+ RFHHF  G CSC+D
Subjt:  CHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-14838.14Show/hide
Query:  TSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDS
        +S +Y  ++   +R  + + A++L   M        D    N ++  Y +   L  AR LF+ M ERDV SWN +LS YA++G V D R+ FD+MP ++ 
Subjt:  TSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDARNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDS

Query:  VSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKN
        VS+N +++ +  NS  +E+  LFK   RE +    +     L    +   +   +Q   S+ VR    +V  WN +   YA+ G+I++AR LFD    ++
Subjt:  VSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDRLINKN

Query:  LVSWNLMISGYVKNGQPERCIGLLHEM----------RLSGHMPDQ-----------------VTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAM
        + +W  M+SGY++N   E    L  +M           L+G++  +                  T +T+I  Y QCG++ EA+ +F +  ++D V W AM
Subjt:  LVSWNLMISGYVKNGQPERCIGLLHEM----------RLSGHMPDQ-----------------VTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAM

Query:  LVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNA
        + GY+++G   +AL LF +M  E  R +  + SS +S+CA + +L  G+ +HG+ +  G +    V +AL+ MY KCG IE+A  +FK M  ++++SWN 
Subjt:  LVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNA

Query:  MVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLI
        M+ GY+++G  + AL  FE+M ++  KPD+ T + VLSAC H+  +++G++YF +++  +G+ P   HYACMV+LLGR G ++ A +L+K+MP EPD  I
Subjt:  MVVGYAQNGHDKDALELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLI

Query:  WSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNM
        W TLL  S   G+   AE AA  +F ++P N+  YV+LSN+YAS GRW DV  +R  M+ K VKK  GYSWIEI N+ H F+  D  HPE ++I+  L  
Subjt:  WSTLLSVSTTKGDIANAEMAARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNM

Query:  LIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        L  +++  G+   T++VLHDV E+EK + + +HSE+LA+A+G+++  +G  PIR+IKN+R+C DCH  +K+ + I GR IILRD+NRFHHF  G CSC D
Subjt:  LIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-13639.56Show/hide
Query:  NALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFI
        N L++ YAK GS+ D R  F  M  +DSVS+N++I G   N    E++E +K M+R   +   +T +S+L++ A L   + G+QIHG  +      NV +
Subjt:  NALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFLGNVFI

Query:  WNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQ--PERCIGLL---------------------------------HEMRLSGHMPDQV
         NAL  +YA+ G + + R +F  +   + VSWN +I    ++ +  PE  +  L                                 H + L  ++ D+ 
Subjt:  WNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQ--PERCIGLL---------------------------------HEMRLSGHMPDQV

Query:  TM-STIIAAYCQCGRVDEARKVFSEFKE-KDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDN
        T  + +IA Y +CG +D   K+FS   E +D V W +M+ GY  N     AL L   ML    R DS+  ++V+S+ A +A+L  G  VH  S+ + L++
Subjt:  TM-STIIAAYCQCGRVDEARKVFSEFKE-KDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDN

Query:  NLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENM-LQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHG
        +++V SAL+DMYSKCG ++ A   F  MP RN  SWN+M+ GYA++G  ++AL+LFE M L  +  PD+VTF+GVLSAC H+  +E+G ++F+S+S+ +G
Subjt:  NLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDALELFENM-LQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHG

Query:  LTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEM---AARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM
        L P ++H++CM ++LGR G +D+    I+ MP +P+ LIW T+L  +  + +   AE+   AA  LF+L+P NAV YV+L NMYA+ GRW+D+   R  M
Subjt:  LTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEM---AARHLFELDPLNAVPYVMLSNMYASMGRWKDVATVRTLM

Query:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN
        K  +VKK AGYSW+ + + VH F + D++HP+ + IY++L  L RK+++ G+ P T   L+D+ ++ K + + +HSEKLA+AF L  + +   PIRI+KN
Subjt:  KSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGVSPIRIIKN

Query:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD
        +R+C DCH   K+ S I GRQIILRDSNRFHHF  G CSC D
Subjt:  IRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATTAACCATTAACAGACGACCTTCTCAGGTTTCCTTATCCATAGAGCCACGTAGATCTTTGTTCCTTAACAAAAATTTCAAATTCAAACTGAAAATGAAATCAAA
ATCCCAGCTGCGGCAAGCTGTAGACTTGCTCTGCTCTCGGGGTGGCGCCACCTCTGAGGCATACACCCAATTGGTCCTCGAATGTGTCCGTACAAACAAAGTCGACCAAG
CTAAGAGACTGCAATCCCACATGGAGCATCATCTTTTCCAACCCACTGATCCATTTCTCCATAATCAGCTACTTCATTTGTATGCGAAATTTGGGAAGCTTCGAGATGCA
CGAAACCTGTTTGATAAAATGCTCGAAAGGGATGTTTTCTCCTGGAATGCTCTGCTCTCTGCCTATGCTAAATCGGGGTCTGTCCAGGACTTGAGGGCGACATTTGATCA
AATGCCTTTCCGGGATTCGGTTTCATACAATACGATCATTGCAGGTTTTGCTGGAAATAGTTTTCCAAAAGAGTCGCTTGAGCTTTTTAAAAGAATGCAGAGGGAAGGTT
TTGTGACTACTGAGTATACGAATGTGAGCGCATTGAATGCGTCTGCGCAATTGTTGGATTTGAGGCGTGGGAAACAGATTCATGGGAGTGTTATTGTGCGTAACTTTTTA
GGGAATGTGTTCATTTGGAATGCTTTAACAGACATGTATGCCAAATGTGGTGAGATTGAACAGGCGAGGTGGTTGTTTGATCGTCTCATTAACAAGAATTTGGTTTCTTG
GAACTTGATGATATCTGGGTATGTAAAGAATGGACAGCCTGAGAGGTGCATTGGTTTGTTACATGAAATGCGGTTGTCCGGTCATATGCCCGATCAAGTTACCATGTCAA
CTATAATCGCAGCTTACTGTCAATGTGGACGTGTGGATGAAGCAAGAAAGGTGTTTAGTGAGTTTAAAGAGAAGGATATTGTTTGCTGGACAGCTATGTTGGTGGGTTAT
GCAAAAAATGGCAGAGAAGAGGATGCATTGTTGTTGTTTAATGAAATGCTATTGGAACATATTAGACCTGACAGCTACACTTTATCAAGTGTTGTCAGTTCATGTGCCAA
ATTAGCATCTCTATATCATGGTCAGGCAGTCCATGGAAAATCAATTCTGTCCGGGCTTGATAATAATTTGCTTGTTTCTAGTGCACTAATTGATATGTATTCTAAATGTG
GTTTCATAGAGGATGCAAGGTCAGTCTTCAAATTGATGCCAACTAGGAATGTGATTTCATGGAATGCTATGGTTGTTGGTTATGCACAAAATGGACATGATAAGGATGCC
CTTGAACTCTTTGAAAACATGTTGCAACAGAAATTTAAACCTGATAATGTAACTTTTATAGGCGTTTTATCTGCTTGTCTCCATTCTAATTGGATGGAGCAAGGGCAGGA
GTATTTTGATTCCATAAGCAATCAACATGGACTGACTCCTACTTTGGATCATTATGCATGTATGGTCAATCTCCTAGGACGTTTGGGCCGCATCGATCAAGCAGTTAGTC
TAATAAAAAGTATGCCCCATGAGCCAGATTACCTGATTTGGTCCACACTTCTATCGGTTAGTACAACAAAGGGTGATATTGCAAATGCAGAAATGGCAGCTAGGCATCTC
TTCGAATTGGATCCTCTGAATGCTGTGCCATATGTTATGCTCTCAAATATGTATGCCTCTATGGGTAGATGGAAGGATGTAGCAACAGTTAGGACTCTCATGAAGAGCAA
GAATGTCAAAAAGTTTGCTGGGTACAGTTGGATTGAGATTGATAATGAGGTGCACAAATTCACATCCGAAGACCGGACTCATCCAGAAACAGAAAAAATATATGAGGAAT
TGAACATGTTGATAAGGAAACTTCAAAATGAAGGATTTACCCCTAATACAAATCTGGTTTTGCATGATGTAGGAGAGGACGAAAAGTTCAAATCCATATGTTTCCACAGC
GAGAAACTTGCTCTTGCTTTTGGTTTGATAAAGAAACCTAATGGAGTTAGTCCAATAAGGATCATAAAAAATATCCGCATTTGCAGTGATTGCCATGAATTTATGAAGTT
TGCATCTATGATTATTGGAAGGCAAATAATCTTGAGAGATTCAAATAGGTTTCATCATTTTTCAACTGGGAAGTGTTCCTGCAAGGACAGCTGCTTTGAGCAGAACAAAG
TTTCCTCAGTTTTTTATGAGGAGTATTGTGCAATGAGGAACATCAATTTCCAACTCGCTAACTCCAAGGCAGTAGCAGAAGGTCAAATTCTGATGGTTTATACTTTTTGT
TGGCAATCATCACAGGGGAAAAGTGCCATGTTTGATGTAGTTATCATTGTACTTCGTGCTGCAGAACCACATGCTGGAAACTTGGCTGCTAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAATTAACCATTAACAGACGACCTTCTCAGGTTTCCTTATCCATAGAGCCACGTAGATCTTTGTTCCTTAACAAAAATTTCAAATTCAAACTGAAAATGAAATCAAA
ATCCCAGCTGCGGCAAGCTGTAGACTTGCTCTGCTCTCGGGGTGGCGCCACCTCTGAGGCATACACCCAATTGGTCCTCGAATGTGTCCGTACAAACAAAGTCGACCAAG
CTAAGAGACTGCAATCCCACATGGAGCATCATCTTTTCCAACCCACTGATCCATTTCTCCATAATCAGCTACTTCATTTGTATGCGAAATTTGGGAAGCTTCGAGATGCA
CGAAACCTGTTTGATAAAATGCTCGAAAGGGATGTTTTCTCCTGGAATGCTCTGCTCTCTGCCTATGCTAAATCGGGGTCTGTCCAGGACTTGAGGGCGACATTTGATCA
AATGCCTTTCCGGGATTCGGTTTCATACAATACGATCATTGCAGGTTTTGCTGGAAATAGTTTTCCAAAAGAGTCGCTTGAGCTTTTTAAAAGAATGCAGAGGGAAGGTT
TTGTGACTACTGAGTATACGAATGTGAGCGCATTGAATGCGTCTGCGCAATTGTTGGATTTGAGGCGTGGGAAACAGATTCATGGGAGTGTTATTGTGCGTAACTTTTTA
GGGAATGTGTTCATTTGGAATGCTTTAACAGACATGTATGCCAAATGTGGTGAGATTGAACAGGCGAGGTGGTTGTTTGATCGTCTCATTAACAAGAATTTGGTTTCTTG
GAACTTGATGATATCTGGGTATGTAAAGAATGGACAGCCTGAGAGGTGCATTGGTTTGTTACATGAAATGCGGTTGTCCGGTCATATGCCCGATCAAGTTACCATGTCAA
CTATAATCGCAGCTTACTGTCAATGTGGACGTGTGGATGAAGCAAGAAAGGTGTTTAGTGAGTTTAAAGAGAAGGATATTGTTTGCTGGACAGCTATGTTGGTGGGTTAT
GCAAAAAATGGCAGAGAAGAGGATGCATTGTTGTTGTTTAATGAAATGCTATTGGAACATATTAGACCTGACAGCTACACTTTATCAAGTGTTGTCAGTTCATGTGCCAA
ATTAGCATCTCTATATCATGGTCAGGCAGTCCATGGAAAATCAATTCTGTCCGGGCTTGATAATAATTTGCTTGTTTCTAGTGCACTAATTGATATGTATTCTAAATGTG
GTTTCATAGAGGATGCAAGGTCAGTCTTCAAATTGATGCCAACTAGGAATGTGATTTCATGGAATGCTATGGTTGTTGGTTATGCACAAAATGGACATGATAAGGATGCC
CTTGAACTCTTTGAAAACATGTTGCAACAGAAATTTAAACCTGATAATGTAACTTTTATAGGCGTTTTATCTGCTTGTCTCCATTCTAATTGGATGGAGCAAGGGCAGGA
GTATTTTGATTCCATAAGCAATCAACATGGACTGACTCCTACTTTGGATCATTATGCATGTATGGTCAATCTCCTAGGACGTTTGGGCCGCATCGATCAAGCAGTTAGTC
TAATAAAAAGTATGCCCCATGAGCCAGATTACCTGATTTGGTCCACACTTCTATCGGTTAGTACAACAAAGGGTGATATTGCAAATGCAGAAATGGCAGCTAGGCATCTC
TTCGAATTGGATCCTCTGAATGCTGTGCCATATGTTATGCTCTCAAATATGTATGCCTCTATGGGTAGATGGAAGGATGTAGCAACAGTTAGGACTCTCATGAAGAGCAA
GAATGTCAAAAAGTTTGCTGGGTACAGTTGGATTGAGATTGATAATGAGGTGCACAAATTCACATCCGAAGACCGGACTCATCCAGAAACAGAAAAAATATATGAGGAAT
TGAACATGTTGATAAGGAAACTTCAAAATGAAGGATTTACCCCTAATACAAATCTGGTTTTGCATGATGTAGGAGAGGACGAAAAGTTCAAATCCATATGTTTCCACAGC
GAGAAACTTGCTCTTGCTTTTGGTTTGATAAAGAAACCTAATGGAGTTAGTCCAATAAGGATCATAAAAAATATCCGCATTTGCAGTGATTGCCATGAATTTATGAAGTT
TGCATCTATGATTATTGGAAGGCAAATAATCTTGAGAGATTCAAATAGGTTTCATCATTTTTCAACTGGGAAGTGTTCCTGCAAGGACAGCTGCTTTGAGCAGAACAAAG
TTTCCTCAGTTTTTTATGAGGAGTATTGTGCAATGAGGAACATCAATTTCCAACTCGCTAACTCCAAGGCAGTAGCAGAAGGTCAAATTCTGATGGTTTATACTTTTTGT
TGGCAATCATCACAGGGGAAAAGTGCCATGTTTGATGTAGTTATCATTGTACTTCGTGCTGCAGAACCACATGCTGGAAACTTGGCTGCTAATTGA
Protein sequenceShow/hide protein sequence
MQLTINRRPSQVSLSIEPRRSLFLNKNFKFKLKMKSKSQLRQAVDLLCSRGGATSEAYTQLVLECVRTNKVDQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDA
RNLFDKMLERDVFSWNALLSAYAKSGSVQDLRATFDQMPFRDSVSYNTIIAGFAGNSFPKESLELFKRMQREGFVTTEYTNVSALNASAQLLDLRRGKQIHGSVIVRNFL
GNVFIWNALTDMYAKCGEIEQARWLFDRLINKNLVSWNLMISGYVKNGQPERCIGLLHEMRLSGHMPDQVTMSTIIAAYCQCGRVDEARKVFSEFKEKDIVCWTAMLVGY
AKNGREEDALLLFNEMLLEHIRPDSYTLSSVVSSCAKLASLYHGQAVHGKSILSGLDNNLLVSSALIDMYSKCGFIEDARSVFKLMPTRNVISWNAMVVGYAQNGHDKDA
LELFENMLQQKFKPDNVTFIGVLSACLHSNWMEQGQEYFDSISNQHGLTPTLDHYACMVNLLGRLGRIDQAVSLIKSMPHEPDYLIWSTLLSVSTTKGDIANAEMAARHL
FELDPLNAVPYVMLSNMYASMGRWKDVATVRTLMKSKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEKIYEELNMLIRKLQNEGFTPNTNLVLHDVGEDEKFKSICFHS
EKLALAFGLIKKPNGVSPIRIIKNIRICSDCHEFMKFASMIIGRQIILRDSNRFHHFSTGKCSCKDSCFEQNKVSSVFYEEYCAMRNINFQLANSKAVAEGQILMVYTFC
WQSSQGKSAMFDVVIIVLRAAEPHAGNLAAN