; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012102 (gene) of Snake gourd v1 genome

Gene IDTan0012102
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG04:14872331..14877937
RNA-Seq ExpressionTan0012102
SyntenyTan0012102
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144567.1 pentatricopeptide repeat-containing protein At3g29230 [Momordica charantia]4.2e-24373.93Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS
        MQMC VP+RTPSWFSTR+LFEQKLSDLHKCTDLNQVKQLHAQILKS+LHLDLYVVPKLISAFSLC Q  LA +AF QIQ PNVHLYNTLIRAHA NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS

Query:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------
         AFA FFAMQ  GFYPDNFTFPFLLK CSG +W PV+EMVHAQI+KFGFMSD+FVPNSLIDSYSKCGSRGI++AKK F                      
Subjt:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------

Query:  -----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIR
                                                                                     NLVSWTIIISGFAEKG AREAI 
Subjt:  -----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIR

Query:  LYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKAL
        LYDQMEEAHL LDNGTVI ILAACAESGLL LGEKVH+SI KNNF CTTEISNALVDMYAKCGRL++A++VFNG +NKDVVSWNAMLQGLAMHGHGEKAL
Subjt:  LYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKAL

Query:  ELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVE
        ELFKRMK EGFSPD+VTMIGVLCACTHAGLID+G RYF  MERDYA+VPE+EHYGCM+DLLGRKGRLEEAIRLIH+MPMEPNAVIWGTLLGACRMHNAVE
Subjt:  ELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVE

Query:  LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIG QKPSGASSIEVD+EVHEFTVFDRSHPKSDKIYQ I
Subjt:  LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

XP_022980258.1 pentatricopeptide repeat-containing protein At3g29230 [Cucurbita maxima]1.8e-23872.77Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS
        MQMCSV  RTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKS+LHLDLYVVPKLISAFSLC QMPLA NAF Q+Q+PN HLYNTLIRAHAQNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS

Query:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------
         AF+TFF MQ DG YPDNFTFPFLLK C+G  W PVIEMVHAQI+KFGFMSD+FVPNSLIDSYSKCGS GI +AKKLF                      
Subjt:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------

Query:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI
                                                                                      NLVSWTI+ISGFAEKG A+ AI
Subjt:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI

Query:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA
         L+DQMEEA + LDNG VI ILAACAESGLLGLGEK+HASI+ +NFKCTTEISNALVDMYAKCGRL+IAY+VFN I+NKDVVSWNAML GLAMHGHGEKA
Subjt:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA

Query:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV
        LELFKRMK +GFSPDKVTMIGVLCAC+HAGLID+GIRYFS+ME+DYA+V E+EHYGCM+DLLGRKGRLEEAIRLI TMPMEPN +IWGTLLGACRMHNAV
Subjt:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEV+NEVHEFTVFDRSHPKSDKIYQ I
Subjt:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

XP_023526333.1 pentatricopeptide repeat-containing protein At3g29230 [Cucurbita pepo subsp. pepo]1.1e-23572.16Show/hide
Query:  MCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHA
        MCSV  RTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKS+LHLDLYV PKLISAFSLC QMPLA NAF Q+Q+PNVHLYNTLIRAHAQNSQPS A
Subjt:  MCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHA

Query:  FATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------------------------
        F+TFF MQ DG YPDNFTFP+LLK C+G  W PVIEMVHAQI+KFGFMS++FVPNSLIDSYSKCGS GI +AKKLF                        
Subjt:  FATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------------------------

Query:  ----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRL
                                                                                    NLVSWTIIISGFAEKG A+ AI L
Subjt:  ----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRL

Query:  YDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALE
        +DQMEEA + LDNG VI ILA+CAESGLLGLGEK+HASIK +NFKCTTEISNALVDMYAKCGRL+IAY+VFN I+NKDVVSWNAML GLAMHGHGEKALE
Subjt:  YDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALE

Query:  LFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVEL
        LFKRMK +GFSPDKVTMIGVLCAC+HAGLID+GIRYFS+ME++YA+V E+EHYGCM+DLLGRKGRLEEAIRLI TMPMEPN +IWGTLLGACRMHNAVEL
Subjt:  LFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVEL

Query:  AREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        AREVLDHLVKLEPS+PGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEV+NEVHEFTVFDRSHPKSDKIYQ I
Subjt:  AREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

XP_031738476.1 pentatricopeptide repeat-containing protein At3g29230 [Cucumis sativus]2.5e-23571.92Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS
        MQMCSVPIRTPSWFSTRKL EQKLSDLHKCT+LNQVKQLHAQILKS+LH+DL+VVPKLISAFSLC QM LA NAF Q+Q+PNVHLYNT+IRAH+ NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS

Query:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------
         AFATFFAMQ DG Y DNFTFPFLLK C+G +W PVIE VHAQI+KFGFMSD+FVPNSLIDSYSKCGS GI++AKKLF                      
Subjt:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------

Query:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI
                                                                                      NLVSWTII+SGFAEKG AREAI
Subjt:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI

Query:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA
         L+DQME+A L LDNGTV+ ILAACAESGLLGLGEK+HASIK NNFKCTTEISNALVDMYAKCGRLNIAY VFN IKNKDVVSWNAMLQGLAMHGHG KA
Subjt:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA

Query:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV
        LELFKRMK EGFSP+KVTMIGVLCACTHAGLID+GIRYFSTMERDY +VPEVEHYGCM+DLLGRKGRLEEAIRLI  MPM PNA+IWGTLLGACRMHNAV
Subjt:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        ELAREVLDHLV+LEP+D GN SMLSNIYAAAGDW+CVA+ RLRMRSIGT+KPSGASSIEV+NEVHEFTVFDRSHPKSD IYQ I
Subjt:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

XP_038900175.1 pentatricopeptide repeat-containing protein At3g29230-like isoform X2 [Benincasa hispida]1.1e-23872.09Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS
        MQMCSVPIR PSWFSTRKLFEQKLSDLHKCTDLNQVKQ+HAQILKS+LH+DLYVVPKLISAFSLC QMPLA +AF Q+Q+PNVHLYNT+IRAH  NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS

Query:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------
         AFATFFAMQ DGFYPDNFTFPFLLK C+G +W  V+EMVHAQI+KFGFMSD+FVPNSLIDSYSKCGSRGI++AKKLF                      
Subjt:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------

Query:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI
                                                                                      NLVSWTIIISGFAEKG AREA+
Subjt:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI

Query:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA
         L++QME+A L  D+GTVI IL+ACAESGLLGLGEK+HASIK NNFKCT EISNALV+MYAKCGRLNIAY VFN IKNKDVVSWNAMLQGLAMHGHG KA
Subjt:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA

Query:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV
        LELFKRMK EGFSPDK+TMIGVLCACTHAGLID+GI+YFSTMERDYA+VPEVEHYGCM+DLLGRKGRLEEA+RLI  MPMEPN +IWG LLGACRMHNAV
Subjt:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        ELAREVLDHLVKLEPSD GNLSMLSNIYAAAGDWDCVAD+RLRMRS GTQKPSGASSIEVDNEVHEFTVFDRSHPKSD IY  I
Subjt:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

TrEMBL top hitse value%identityAlignment
A0A0A0L7H7 Uncharacterized protein2.4e-23670.88Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS
        MQMCSVPIRTPSWFSTRKL EQKLSDLHKCT+LNQVKQLHAQILKS+LH+DL+VVPKLISAFSLC QM LA NAF Q+Q+PNVHLYNT+IRAH+ NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS

Query:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------
         AFATFFAMQ DG Y DNFTFPFLLK C+G +W PVIE VHAQI+KFGFMSD+FVPNSLIDSYSKCGS GI++AKKLF                      
Subjt:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------

Query:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI
                                                                                      NLVSWTII+SGFAEKG AREAI
Subjt:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI

Query:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA
         L+DQME+A L LDNGTV+ ILAACAESGLLGLGEK+HASIK NNFKCTTEISNALVDMYAKCGRLNIAY VFN IKNKDVVSWNAMLQGLAMHGHG KA
Subjt:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA

Query:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV
        LELFKRMK EGFSP+KVTMIGVLCACTHAGLID+GIRYFSTMERDY +VPEVEHYGCM+DLLGRKGRLEEAIRLI  MPM PNA+IWGTLLGACRMHNAV
Subjt:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIMGGSRLH
        ELAREVLDHLV+LEP+D GN SMLSNIYAAAGDW+CVA+ RLRMRSIGT+KPSGASSIEV+NEVHEFTVFDRSHPKSD IYQ + +  G + +H
Subjt:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIMGGSRLH

A0A5N6QW65 Uncharacterized protein4.5e-20661.58Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFA
        S PIR+P+W S R+L E+KLSDLHK T+L Q+KQ+HAQILK++LH DL+V PKLI+AFSLC QM LA+NAF QIQ PNVHLYNTLIRAH QNSQP  AFA
Subjt:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFA

Query:  TFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF--------------------------
         FF MQS G +PDNFT+PFLLK C G  W P+++M+H+ I+KFGF SD+FVPNSL+DSYSKCGS G+ +AKKLF                          
Subjt:  TFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF--------------------------

Query:  -------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRLYDQ
                                                                                 NLV+WTIIISG+A+KG A EAI LYD 
Subjt:  -------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRLYDQ

Query:  MEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFK
        MEE+ L  D+G +I +LAACAESGLLGLG+KVHASIK+  FKC+T++SNAL+DMYAKCG L+ AY VF+GI  +DVVSWNAMLQGLA HGHGEKALELF 
Subjt:  MEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFK

Query:  RMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELARE
        RMK EGF PDKVT++GVLCACTHAGL++ GI+YF TME +Y IVP+VEHYGCMIDLLGR GRL+EA RL+H+MPMEPNA+IWGTLLGACRMHN V+LA +
Subjt:  RMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELARE

Query:  VLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIM
        V+DHLVKLEPSDPGN SMLSNIYAAAGDWD V++VRLRMRS G QKPSGASSIEVD+EVHEFTV DR+HPKSDKIYQ + +++
Subjt:  VLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIM

A0A6J1CU14 pentatricopeptide repeat-containing protein At3g292302.1e-24373.93Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS
        MQMC VP+RTPSWFSTR+LFEQKLSDLHKCTDLNQVKQLHAQILKS+LHLDLYVVPKLISAFSLC Q  LA +AF QIQ PNVHLYNTLIRAHA NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS

Query:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------
         AFA FFAMQ  GFYPDNFTFPFLLK CSG +W PV+EMVHAQI+KFGFMSD+FVPNSLIDSYSKCGSRGI++AKK F                      
Subjt:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------

Query:  -----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIR
                                                                                     NLVSWTIIISGFAEKG AREAI 
Subjt:  -----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIR

Query:  LYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKAL
        LYDQMEEAHL LDNGTVI ILAACAESGLL LGEKVH+SI KNNF CTTEISNALVDMYAKCGRL++A++VFNG +NKDVVSWNAMLQGLAMHGHGEKAL
Subjt:  LYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKAL

Query:  ELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVE
        ELFKRMK EGFSPD+VTMIGVLCACTHAGLID+G RYF  MERDYA+VPE+EHYGCM+DLLGRKGRLEEAIRLIH+MPMEPNAVIWGTLLGACRMHNAVE
Subjt:  ELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVE

Query:  LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIG QKPSGASSIEVD+EVHEFTVFDRSHPKSDKIYQ I
Subjt:  LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

A0A6J1EA54 pentatricopeptide repeat-containing protein At3g292301.7e-23471.99Show/hide
Query:  MCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHA
        MCSV  RTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKS LHLDLYVVPKLISAFSLC QMPLA NAF Q+Q+PNVHLYNTLIRAHAQNSQPS A
Subjt:  MCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHA

Query:  FATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------------------------
        F+TFF MQ DG YPDNFTFPFLLK C+G  W PVIEMVHAQI+KFGFMS++ VPNSLIDSYSKCGS GI +AKKLF                        
Subjt:  FATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------------------------

Query:  ----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRL
                                                                                    NLVSWTI+ISGFAEKG A+ AI L
Subjt:  ----------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRL

Query:  YDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALE
        +DQMEEA + LDNG VI ILAA AESGLLGLGEK+HASIK +NFKCTTEISNALVDMYAKCGRL+IAY+VFN I+NKDVVSWNAML GLAMHGHGEKALE
Subjt:  YDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALE

Query:  LFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVEL
        LFKRMK +GFSPDKVTMIGVLCAC+HAGLID+GIRYFS+ME++YA+V E+EHYGCM+DLLGRKGRLEEAIRLI TMPMEPN +IWGTLLGACRMHNAVEL
Subjt:  LFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVEL

Query:  AREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        AREVLDHLVKLEPS+PGNLSMLSNIYAAAGDWDCVADVRLRMRS GTQKPSGASSIEV+NEVHEFTVFDRSHPKSDKIYQ I
Subjt:  AREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

A0A6J1IYS0 pentatricopeptide repeat-containing protein At3g292308.9e-23972.77Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS
        MQMCSV  RTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKS+LHLDLYVVPKLISAFSLC QMPLA NAF Q+Q+PN HLYNTLIRAHAQNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPS

Query:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------
         AF+TFF MQ DG YPDNFTFPFLLK C+G  W PVIEMVHAQI+KFGFMSD+FVPNSLIDSYSKCGS GI +AKKLF                      
Subjt:  HAFATFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF----------------------

Query:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI
                                                                                      NLVSWTI+ISGFAEKG A+ AI
Subjt:  ------------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAI

Query:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA
         L+DQMEEA + LDNG VI ILAACAESGLLGLGEK+HASI+ +NFKCTTEISNALVDMYAKCGRL+IAY+VFN I+NKDVVSWNAML GLAMHGHGEKA
Subjt:  RLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKA

Query:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV
        LELFKRMK +GFSPDKVTMIGVLCAC+HAGLID+GIRYFS+ME+DYA+V E+EHYGCM+DLLGRKGRLEEAIRLI TMPMEPN +IWGTLLGACRMHNAV
Subjt:  LELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI
        ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEV+NEVHEFTVFDRSHPKSDKIYQ I
Subjt:  ELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKI

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210651.3e-9338.45Show/hide
Query:  TDLNQVKQLHAQILK-----SSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHP-NVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFY-PDNFTFPF
        + + +++Q+HA  ++     S   L  +++  L+S  S    M  A   F +I+ P NV ++NTLIR +A+      AF+ +  M+  G   PD  T+PF
Subjt:  TDLNQVKQLHAQILK-----SSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHP-NVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFY-PDNFTFPF

Query:  LLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGT
        L+K  +      + E +H+ + + GF S ++V NSL+  Y+ CG   +ASA K+F      +LV+W  +I+GFAE G+  EA+ LY +M    +  D  T
Subjt:  LLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGT

Query:  VIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKG-EGFSPDK
        ++ +L+ACA+ G L LG++VH  + K         SN L+D+YA+CGR+  A ++F+ + +K+ VSW +++ GLA++G G++A+ELFK M+  EG  P +
Subjt:  VIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKG-EGFSPDK

Query:  VTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPS
        +T +G+L AC+H G++  G  YF  M  +Y I P +EH+GCM+DLL R G++++A   I +MPM+PN VIW TLLGAC +H   +LA      +++LEP+
Subjt:  VTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPS

Query:  DPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIMGGSR
          G+  +LSN+YA+   W  V  +R +M   G +K  G S +EV N VHEF + D+SHP+SD IY K+ ++ G  R
Subjt:  DPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIMGGSR

Q9FG16 Pentatricopeptide repeat-containing protein At5g065407.1e-9234.73Show/hide
Query:  STRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLI------SAFSL-CHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFF
        +T +    KL+ L  C+  + +K +H  +L++ L  D++V  +L+      S F+   + +  A   F QIQ+PN+ ++N LIR  +  ++PS AF  + 
Subjt:  STRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLI------SAFSL-CHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFF

Query:  AMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF-----------------------------
         M     +PDN TFPFL+K  S      V E  H+QI +FGF +D++V NSL+  Y+ CG   IA+A ++F                             
Subjt:  AMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF-----------------------------

Query:  --------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAY
                NL +W+I+I+G+A+     +AI L++ M+   +  +   ++ ++++CA  G L  GE+ +  + K++      +  ALVDM+ +CG +  A 
Subjt:  --------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAY

Query:  SVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEE
         VF G+   D +SW+++++GLA+HGH  KA+  F +M   GF P  VT   VL AC+H GL++ G+  +  M++D+ I P +EHYGC++D+LGR G+L E
Subjt:  SVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEE

Query:  AIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTV-
        A   I  M ++PNA I G LLGAC+++   E+A  V + L+K++P   G   +LSNIYA AG WD +  +R  M+    +KP G S IE+D ++++FT+ 
Subjt:  AIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTV-

Query:  FDRSHPKSDKIYQKIWKIMGGSRL
         D+ HP+  KI +K  +I+G  RL
Subjt:  FDRSHPKSDKIYQKIWKIMGGSRL

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665202.9e-9335.28Show/hide
Query:  FSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLIS---AFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQ
        FS      + +S L +C+   ++KQ+HA++LK+ L  D Y + K +S   + +    +P A   F     P+  L+N +IR  + + +P  +   +  M 
Subjt:  FSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLIS---AFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQ

Query:  SDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGS-----------------------RGIASAKKL----------
              + +TFP LLK CS    F     +HAQI K G+ +D++  NSLI+SY+  G+                       +G   A K+          
Subjt:  SDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGS-----------------------RGIASAKKL----------

Query:  --FNLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNG
           N +SWT +ISG+ +    +EA++L+ +M+ + +  DN ++   L+ACA+ G L  G+ +H+ + K   +  + +   L+DMYAKCG +  A  VF  
Subjt:  --FNLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNG

Query:  IKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLI
        IK K V +W A++ G A HGHG +A+  F  M+  G  P+ +T   VL AC++ GL++ G   F +MERDY + P +EHYGC++DLLGR G L+EA R I
Subjt:  IKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLI

Query:  HTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHP
          MP++PNAVIWG LL ACR+H  +EL  E+ + L+ ++P   G     +NI+A    WD  A+ R  M+  G  K  G S+I ++   HEF   DRSHP
Subjt:  HTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHP

Query:  KSDKIYQKIWKIM
        + +KI  K W+IM
Subjt:  KSDKIYQKIWKIM

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.8e-9634.39Show/hide
Query:  LSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLC---HQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFYPDNFT
        LS LH C  L  ++ +HAQ++K  LH   Y + KLI    L      +P AI+ FK IQ PN+ ++NT+ R HA +S P  A   +  M S G  P+++T
Subjt:  LSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLC---HQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFYPDNFT

Query:  FPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCG----------------------------SRG-IASAKKLF------NLVSWTI
        FPF+LK C+    F   + +H  + K G   DL+V  SLI  Y + G                            SRG I +A+KLF      ++VSW  
Subjt:  FPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCG----------------------------SRG-IASAKKLF------NLVSWTI

Query:  IISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNA------------------------------
        +ISG+AE G  +EA+ L+  M + ++  D  T++ +++ACA+SG + LG +VH  I  + F    +I NA                              
Subjt:  IISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNA------------------------------

Query:  -------------------------------------------------------------------------LVDMYAKCGRLNIAYSVFNGIKNKDVV
                                                                                 L+DMYAKCG +  A+ VFN I +K + 
Subjt:  -------------------------------------------------------------------------LVDMYAKCGRLNIAYSVFNGIKNKDVV

Query:  SWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEP
        SWNAM+ G AMHG  + + +LF RM+  G  PD +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCMIDLLG  G  +EA  +I+ M MEP
Subjt:  SWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEP

Query:  NAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIY
        + VIW +LL AC+MH  VEL     ++L+K+EP +PG+  +LSNIYA+AG W+ VA  R  +   G +K  G SSIE+D+ VHEF + D+ HP++ +IY
Subjt:  NAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIY

Q9LS72 Pentatricopeptide repeat-containing protein At3g292301.3e-17351.28Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFA
        S+P+R PSW S+R++FE++L DL KC +LNQVKQLHAQI++ +LH DL++ PKLISA SLC Q  LA+  F Q+Q PNVHL N+LIRAHAQNSQP  AF 
Subjt:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFA

Query:  TFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF--------------------------
         F  MQ  G + DNFT+PFLLK CSG  W PV++M+H  I+K G  SD++VPN+LID YS+CG  G+  A KLF                          
Subjt:  TFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF--------------------------

Query:  ---------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRLY
                                                                                   N+V+WTIII+G+AEKG  +EA RL 
Subjt:  ---------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRLY

Query:  DQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALEL
        DQM  + L  D   VI ILAAC ESGLL LG ++H+ +K++N      + NAL+DMYAKCG L  A+ VFN I  KD+VSWN ML GL +HGHG++A+EL
Subjt:  DQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALEL

Query:  FKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELA
        F RM+ EG  PDKVT I VLC+C HAGLID GI YF +ME+ Y +VP+VEHYGC++DLLGR GRL+EAI+++ TMPMEPN VIWG LLGACRMHN V++A
Subjt:  FKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELA

Query:  REVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIM
        +EVLD+LVKL+P DPGN S+LSNIYAAA DW+ VAD+R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD+IYQ +  ++
Subjt:  REVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIM

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-9734.39Show/hide
Query:  LSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLC---HQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFYPDNFT
        LS LH C  L  ++ +HAQ++K  LH   Y + KLI    L      +P AI+ FK IQ PN+ ++NT+ R HA +S P  A   +  M S G  P+++T
Subjt:  LSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLC---HQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFYPDNFT

Query:  FPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCG----------------------------SRG-IASAKKLF------NLVSWTI
        FPF+LK C+    F   + +H  + K G   DL+V  SLI  Y + G                            SRG I +A+KLF      ++VSW  
Subjt:  FPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCG----------------------------SRG-IASAKKLF------NLVSWTI

Query:  IISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNA------------------------------
        +ISG+AE G  +EA+ L+  M + ++  D  T++ +++ACA+SG + LG +VH  I  + F    +I NA                              
Subjt:  IISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNA------------------------------

Query:  -------------------------------------------------------------------------LVDMYAKCGRLNIAYSVFNGIKNKDVV
                                                                                 L+DMYAKCG +  A+ VFN I +K + 
Subjt:  -------------------------------------------------------------------------LVDMYAKCGRLNIAYSVFNGIKNKDVV

Query:  SWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEP
        SWNAM+ G AMHG  + + +LF RM+  G  PD +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCMIDLLG  G  +EA  +I+ M MEP
Subjt:  SWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEP

Query:  NAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIY
        + VIW +LL AC+MH  VEL     ++L+K+EP +PG+  +LSNIYA+AG W+ VA  R  +   G +K  G SSIE+D+ VHEF + D+ HP++ +IY
Subjt:  NAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIY

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.0e-17551.28Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFA
        S+P+R PSW S+R++FE++L DL KC +LNQVKQLHAQI++ +LH DL++ PKLISA SLC Q  LA+  F Q+Q PNVHL N+LIRAHAQNSQP  AF 
Subjt:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFA

Query:  TFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF--------------------------
         F  MQ  G + DNFT+PFLLK CSG  W PV++M+H  I+K G  SD++VPN+LID YS+CG  G+  A KLF                          
Subjt:  TFFAMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF--------------------------

Query:  ---------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRLY
                                                                                   N+V+WTIII+G+AEKG  +EA RL 
Subjt:  ---------------------------------------------------------------------------NLVSWTIIISGFAEKGRAREAIRLY

Query:  DQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALEL
        DQM  + L  D   VI ILAAC ESGLL LG ++H+ +K++N      + NAL+DMYAKCG L  A+ VFN I  KD+VSWN ML GL +HGHG++A+EL
Subjt:  DQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALEL

Query:  FKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELA
        F RM+ EG  PDKVT I VLC+C HAGLID GI YF +ME+ Y +VP+VEHYGC++DLLGR GRL+EAI+++ TMPMEPN VIWG LLGACRMHN V++A
Subjt:  FKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELA

Query:  REVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIM
        +EVLD+LVKL+P DPGN S+LSNIYAAA DW+ VAD+R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD+IYQ +  ++
Subjt:  REVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIM

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.2e-9538.45Show/hide
Query:  TDLNQVKQLHAQILK-----SSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHP-NVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFY-PDNFTFPF
        + + +++Q+HA  ++     S   L  +++  L+S  S    M  A   F +I+ P NV ++NTLIR +A+      AF+ +  M+  G   PD  T+PF
Subjt:  TDLNQVKQLHAQILK-----SSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHP-NVHLYNTLIRAHAQNSQPSHAFATFFAMQSDGFY-PDNFTFPF

Query:  LLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGT
        L+K  +      + E +H+ + + GF S ++V NSL+  Y+ CG   +ASA K+F      +LV+W  +I+GFAE G+  EA+ LY +M    +  D  T
Subjt:  LLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGT

Query:  VIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKG-EGFSPDK
        ++ +L+ACA+ G L LG++VH  + K         SN L+D+YA+CGR+  A ++F+ + +K+ VSW +++ GLA++G G++A+ELFK M+  EG  P +
Subjt:  VIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKG-EGFSPDK

Query:  VTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPS
        +T +G+L AC+H G++  G  YF  M  +Y I P +EH+GCM+DLL R G++++A   I +MPM+PN VIW TLLGAC +H   +LA      +++LEP+
Subjt:  VTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPS

Query:  DPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIMGGSR
          G+  +LSN+YA+   W  V  +R +M   G +K  G S +EV N VHEF + D+SHP+SD IY K+ ++ G  R
Subjt:  DPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIMGGSR

AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein5.0e-9334.73Show/hide
Query:  STRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLI------SAFSL-CHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFF
        +T +    KL+ L  C+  + +K +H  +L++ L  D++V  +L+      S F+   + +  A   F QIQ+PN+ ++N LIR  +  ++PS AF  + 
Subjt:  STRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLI------SAFSL-CHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFF

Query:  AMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF-----------------------------
         M     +PDN TFPFL+K  S      V E  H+QI +FGF +D++V NSL+  Y+ CG   IA+A ++F                             
Subjt:  AMQSDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLF-----------------------------

Query:  --------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAY
                NL +W+I+I+G+A+     +AI L++ M+   +  +   ++ ++++CA  G L  GE+ +  + K++      +  ALVDM+ +CG +  A 
Subjt:  --------NLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAY

Query:  SVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEE
         VF G+   D +SW+++++GLA+HGH  KA+  F +M   GF P  VT   VL AC+H GL++ G+  +  M++D+ I P +EHYGC++D+LGR G+L E
Subjt:  SVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEE

Query:  AIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTV-
        A   I  M ++PNA I G LLGAC+++   E+A  V + L+K++P   G   +LSNIYA AG WD +  +R  M+    +KP G S IE+D ++++FT+ 
Subjt:  AIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTV-

Query:  FDRSHPKSDKIYQKIWKIMGGSRL
         D+ HP+  KI +K  +I+G  RL
Subjt:  FDRSHPKSDKIYQKIWKIMGGSRL

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-9435.28Show/hide
Query:  FSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLIS---AFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQ
        FS      + +S L +C+   ++KQ+HA++LK+ L  D Y + K +S   + +    +P A   F     P+  L+N +IR  + + +P  +   +  M 
Subjt:  FSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLIS---AFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQ

Query:  SDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGS-----------------------RGIASAKKL----------
              + +TFP LLK CS    F     +HAQI K G+ +D++  NSLI+SY+  G+                       +G   A K+          
Subjt:  SDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGS-----------------------RGIASAKKL----------

Query:  --FNLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNG
           N +SWT +ISG+ +    +EA++L+ +M+ + +  DN ++   L+ACA+ G L  G+ +H+ + K   +  + +   L+DMYAKCG +  A  VF  
Subjt:  --FNLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIGILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNG

Query:  IKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLI
        IK K V +W A++ G A HGHG +A+  F  M+  G  P+ +T   VL AC++ GL++ G   F +MERDY + P +EHYGC++DLLGR G L+EA R I
Subjt:  IKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAGLIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLI

Query:  HTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHP
          MP++PNAVIWG LL ACR+H  +EL  E+ + L+ ++P   G     +NI+A    WD  A+ R  M+  G  K  G S+I ++   HEF   DRSHP
Subjt:  HTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHP

Query:  KSDKIYQKIWKIM
        + +KI  K W+IM
Subjt:  KSDKIYQKIWKIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATGTGCAGCGTCCCAATTCGAACTCCCTCTTGGTTCTCCACTCGAAAGCTCTTCGAGCAGAAGCTTTCAGATCTCCACAAGTGCACAGACCTCAACCAAGTGAA
GCAACTCCACGCCCAAATCCTCAAATCCAGTCTCCACCTCGACCTCTATGTTGTTCCTAAGCTCATCTCCGCCTTCTCCCTCTGCCACCAAATGCCCCTCGCCATCAACG
CCTTCAAGCAAATTCAACATCCCAATGTACATTTGTACAACACTCTGATTCGAGCCCACGCCCAGAACTCGCAGCCTTCGCATGCTTTTGCCACTTTCTTCGCTATGCAA
TCTGATGGATTCTACCCCGATAATTTCACTTTCCCATTTCTTCTGAAGCCTTGTTCTGGGCCATTGTGGTTCCCAGTTATTGAAATGGTACACGCCCAAATCCAGAAATT
TGGGTTCATGTCGGATTTGTTTGTGCCCAATTCTCTTATTGATTCCTATTCCAAATGTGGGTCTCGTGGAATTGCGTCTGCCAAGAAGTTGTTTAATTTGGTTTCTTGGA
CTATAATTATCTCTGGTTTTGCTGAGAAGGGCCGGGCCAGAGAAGCCATTCGCTTGTACGATCAAATGGAAGAGGCTCACTTGAATTTAGACAATGGGACTGTAATAGGT
ATATTGGCTGCTTGTGCTGAGTCTGGTTTGCTTGGGCTTGGTGAGAAAGTTCATGCCTCTATTAAGAAGAACAATTTCAAATGCACTACTGAAATCTCCAATGCTTTAGT
TGATATGTATGCAAAATGTGGAAGGTTGAATATAGCTTACAGTGTCTTCAATGGCATAAAAAACAAAGATGTTGTGTCTTGGAATGCGATGCTCCAAGGGCTGGCAATGC
ATGGGCATGGAGAGAAAGCGCTTGAGCTTTTCAAAAGAATGAAAGGAGAGGGCTTCTCTCCCGACAAAGTTACAATGATCGGAGTCTTGTGTGCTTGTACGCACGCGGGA
TTGATCGACAATGGCATTCGATACTTCTCTACCATGGAAAGGGACTATGCTATTGTTCCTGAAGTTGAGCATTACGGTTGTATGATAGACCTTCTAGGTCGCAAGGGAAG
GCTTGAAGAAGCCATCAGGCTCATTCACACCATGCCAATGGAACCAAATGCTGTCATTTGGGGCACCCTTTTAGGGGCGTGTCGAATGCACAATGCTGTTGAACTTGCGA
GGGAGGTTCTTGATCATTTAGTTAAGTTGGAACCGTCTGATCCAGGTAATTTATCCATGTTATCGAACATATACGCTGCAGCCGGGGACTGGGATTGTGTTGCTGACGTG
AGGCTGAGAATGCGGAGTATTGGAACCCAAAAACCGTCGGGTGCCAGTTCCATCGAGGTTGACAATGAGGTTCATGAGTTTACAGTGTTCGATAGATCGCATCCGAAATC
TGATAAGATATATCAGAAAATATGGAAAATTATGGGTGGAAGTAGGTTACACAGAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAATGTGCAGCGTCCCAATTCGAACTCCCTCTTGGTTCTCCACTCGAAAGCTCTTCGAGCAGAAGCTTTCAGATCTCCACAAGTGCACAGACCTCAACCAAGTGAA
GCAACTCCACGCCCAAATCCTCAAATCCAGTCTCCACCTCGACCTCTATGTTGTTCCTAAGCTCATCTCCGCCTTCTCCCTCTGCCACCAAATGCCCCTCGCCATCAACG
CCTTCAAGCAAATTCAACATCCCAATGTACATTTGTACAACACTCTGATTCGAGCCCACGCCCAGAACTCGCAGCCTTCGCATGCTTTTGCCACTTTCTTCGCTATGCAA
TCTGATGGATTCTACCCCGATAATTTCACTTTCCCATTTCTTCTGAAGCCTTGTTCTGGGCCATTGTGGTTCCCAGTTATTGAAATGGTACACGCCCAAATCCAGAAATT
TGGGTTCATGTCGGATTTGTTTGTGCCCAATTCTCTTATTGATTCCTATTCCAAATGTGGGTCTCGTGGAATTGCGTCTGCCAAGAAGTTGTTTAATTTGGTTTCTTGGA
CTATAATTATCTCTGGTTTTGCTGAGAAGGGCCGGGCCAGAGAAGCCATTCGCTTGTACGATCAAATGGAAGAGGCTCACTTGAATTTAGACAATGGGACTGTAATAGGT
ATATTGGCTGCTTGTGCTGAGTCTGGTTTGCTTGGGCTTGGTGAGAAAGTTCATGCCTCTATTAAGAAGAACAATTTCAAATGCACTACTGAAATCTCCAATGCTTTAGT
TGATATGTATGCAAAATGTGGAAGGTTGAATATAGCTTACAGTGTCTTCAATGGCATAAAAAACAAAGATGTTGTGTCTTGGAATGCGATGCTCCAAGGGCTGGCAATGC
ATGGGCATGGAGAGAAAGCGCTTGAGCTTTTCAAAAGAATGAAAGGAGAGGGCTTCTCTCCCGACAAAGTTACAATGATCGGAGTCTTGTGTGCTTGTACGCACGCGGGA
TTGATCGACAATGGCATTCGATACTTCTCTACCATGGAAAGGGACTATGCTATTGTTCCTGAAGTTGAGCATTACGGTTGTATGATAGACCTTCTAGGTCGCAAGGGAAG
GCTTGAAGAAGCCATCAGGCTCATTCACACCATGCCAATGGAACCAAATGCTGTCATTTGGGGCACCCTTTTAGGGGCGTGTCGAATGCACAATGCTGTTGAACTTGCGA
GGGAGGTTCTTGATCATTTAGTTAAGTTGGAACCGTCTGATCCAGGTAATTTATCCATGTTATCGAACATATACGCTGCAGCCGGGGACTGGGATTGTGTTGCTGACGTG
AGGCTGAGAATGCGGAGTATTGGAACCCAAAAACCGTCGGGTGCCAGTTCCATCGAGGTTGACAATGAGGTTCATGAGTTTACAGTGTTCGATAGATCGCATCCGAAATC
TGATAAGATATATCAGAAAATATGGAAAATTATGGGTGGAAGTAGGTTACACAGAGACTAG
Protein sequenceShow/hide protein sequence
MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSSLHLDLYVVPKLISAFSLCHQMPLAINAFKQIQHPNVHLYNTLIRAHAQNSQPSHAFATFFAMQ
SDGFYPDNFTFPFLLKPCSGPLWFPVIEMVHAQIQKFGFMSDLFVPNSLIDSYSKCGSRGIASAKKLFNLVSWTIIISGFAEKGRAREAIRLYDQMEEAHLNLDNGTVIG
ILAACAESGLLGLGEKVHASIKKNNFKCTTEISNALVDMYAKCGRLNIAYSVFNGIKNKDVVSWNAMLQGLAMHGHGEKALELFKRMKGEGFSPDKVTMIGVLCACTHAG
LIDNGIRYFSTMERDYAIVPEVEHYGCMIDLLGRKGRLEEAIRLIHTMPMEPNAVIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADV
RLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDKIYQKIWKIMGGSRLHRD