; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017120 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017120
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr03:11184966..11186555
RNA-Seq ExpressionHG10017120
SyntenyHG10017120
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603691.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]7.3e-17463.71Show/hide
Query:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV
        MTVRLLH+LF IS  SALL RHGFH+V SIRLFS FK  T  TTNLPKPPRLL+LISPK N  SESRQTHLRLI+DFL+TD DQCRS+TLS+G DS SV 
Subjt:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV

Query:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------
        L KDSS V DQERESGHW  QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFI NVYVGSSL                            
Subjt:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
                  HGLSLQAI+LFKAMRK +QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQNFIE MPISPN
Subjt:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN

Query:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL
        SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANL A+AGYLD+AA+LRKMMKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EI G+
Subjt:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL

Query:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS
        MDG+VNHMRS+GCVPEV+DE++D LLATS
Subjt:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS

XP_022949850.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita moschata]1.8e-17263.33Show/hide
Query:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV
        MTVRLLH+LF IS  SALL RHGFH+V SIRLFS FK  T  TTNLPKPPRLL+LISPK N  SESRQTHLRLI+DFL+TD DQCRS+TLS+G DS SV 
Subjt:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV

Query:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------
        L KDSS V DQERESGHW  QLFAGRF+FDANDISS LSLC SQRN RGGIQYHSVAIRTGFI NVYVGSSL                            
Subjt:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
                  HGLSLQAI+LFKAMRK +QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQN IE MPISPN
Subjt:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN

Query:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL
        SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANL A+AGYLDDAA+LRKMMKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EI G+
Subjt:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL

Query:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS
        MDG+VNHMRS+GCVPEV++E++D LLATS
Subjt:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS

XP_022977771.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita maxima]4.3e-17463.71Show/hide
Query:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV
        MTVRLLH+LF ISQ SALL RHGFH+V SIRLFS FK  T  TTNLPKPPRLL+LISPK N  SESRQTHLRLI+DFL+TD DQCRS+TLS+G DS SV 
Subjt:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV

Query:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------
        L KDSS VLDQERESGHWD QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFI NVYVGSSL                            
Subjt:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
                  HGLSL+AI+LF+AMRK +QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
Subjt:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN

Query:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL
        SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANL A+AGYL+DAA+LRKMMKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EI G+
Subjt:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL

Query:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS
        MDG+VNHMRS+ CVPEV+DE++D LLA S
Subjt:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS

XP_023544680.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita pepo subsp. pepo]2.1e-17363.52Show/hide
Query:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV
        MTVRLLH LF IS+ SALL RHGFH+V SIRLFS FK  T  TT+LPKPPRLL+LISPK N  SESRQTHLRLI+DFL+TD DQCRS+TLS+G DS SV 
Subjt:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV

Query:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------
        L KDSS VLDQERESGHW  QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFI NVYVGSSL                            
Subjt:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
                  HGLSLQAI+LF+AMRK +QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
Subjt:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN

Query:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL
        SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANL A+AGYLDDAA+LRKMMKD+GLKT+PGYSWIEIQNKVYRFKAEDKSNPVM+EI G+
Subjt:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL

Query:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS
        MDG+VNHMRS+GCVPEV+DE+++ LLATS
Subjt:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS

XP_038881286.1 pentatricopeptide repeat-containing protein At2g37320 isoform X1 [Benincasa hispida]6.9e-17263.69Show/hide
Query:  RLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVVLLK
        RLL ++F ISQ SALLYRHGFHLV SIRLFSNFK KT  +TNLPKPP+LL+LISPK N  +E+RQTHLRLIQDFL+TD DQCRS+TLS+G DSHS+V  K
Subjt:  RLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVVLLK

Query:  DSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL-------------------------------
        DSS VL QERESGHWD+QLFAGRFKFDANDISSVLSLCSSQ NLRGGIQYHSVAIRTGFI NVYVGSSL                               
Subjt:  DSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIV
               HGLSLQAI+L+  MRK KQVEADAITFLGVLSSCRHAGLVEEG+YYFNLMVEL +KPELDHY+CVIDLLGRAGLLKEAQNFIE+MPISPNSIV
Subjt:  -------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIV

Query:  WGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDG
        WGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQ+ANL AKAGYL+DAA+LRKMMKD+GLKT PGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDG
Subjt:  WGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDG

Query:  IVNHMRSIGCVPEVEDEVDDVLLATS
        I+NHMR IG   EVEDEVDD+ LATS
Subjt:  IVNHMRSIGCVPEVEDEVDDVLLATS

TrEMBL top hitse value%identityAlignment
A0A0A0KX36 Uncharacterized protein7.4e-16462.24Show/hide
Query:  LLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVVLLKD
        LLH+LF ISQ SA  YRHGF+++ SIR FSNFKS TH TTNLPKPPRLL+LISPK +V  ESRQTHLRLIQDFL+TD  QCRS+TL  G DS S+ L KD
Subjt:  LLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVVLLKD

Query:  SSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL--------------------------------
        SS VLDQE ESGHWDVQ FAGRFKF+ANDISSVLSLC+SQRNLRGGIQYHSVAIRTGFI NVYVGSSL                                
Subjt:  SSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVW
              HGLSL+AI+LFKAMRK KQVEADAITFLGVLSSCRHAG VEEGR+YFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPI+PNSIVW
Subjt:  ------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVW

Query:  GSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGI
        GSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQL NL AKAGYLDDAA+LRK+MKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEI GL+DG+
Subjt:  GSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGI

Query:  VNHMRSIGCVPEVEDEVDD
        VNHMR +GC  E+ED+V++
Subjt:  VNHMRSIGCVPEVEDEVDD

A0A1S3BGK7 pentatricopeptide repeat-containing protein At2g373201.4e-16562.57Show/hide
Query:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV
        M   LLH+LFAISQ SA  YRHGF++V S+R FSNFKS TH TTNLPKP RLL+LISPK +V  ESRQTHLRLIQDFL+TD DQCRS+TLS G DS SV 
Subjt:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV

Query:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------
        L KDSS VLDQE ESGHWDVQ FAGRFKF ANDISSVLSLC+SQRNLRGG+QYHSVAIRTGFI NVYVGSSL                            
Subjt:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
                  HGLS +AI+LFKAMRK KQVEADAITFLGVLSSCRHAG VEEGR+YFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMP+SPN
Subjt:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN

Query:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL
        SI+WGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQL  L AKAGYLDDAA+LRK+MKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEI GL
Subjt:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL

Query:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS
        MD +VNHMR +G   E+EDEVDDVLLATS
Subjt:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS

A0A5A7U1F7 Pentatricopeptide repeat-containing protein7.4e-16462.64Show/hide
Query:  LLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVVLLKDSSC
        +LFAISQ SA  YRHGF++V S+R FSNFKS TH TTNLPKP RLL+LISPK +V  ESRQTHLRLIQDFL+TD DQCRS+TLS G DS SV L KDSS 
Subjt:  LLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVVLLKDSSC

Query:  VLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL-----------------------------------
        VLDQE ESGHWDVQ FAGRFKF ANDISSVLSLC+SQRNLRGG+QYHSVAIRTGFI NVYVGSSL                                   
Subjt:  VLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSL
           HGLS +AI+LFKAMRK KQVEADAITFLGVLSSCRHAG VEEGR+YFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMP+SPNSI+WGSL
Subjt:  ---HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSL

Query:  LSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNH
        LSACRLHGNVWIGLKAAESRLLLQPDCASTHLQL  L AKAGYLDDAA+LRK+MKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEI GLMD +VNH
Subjt:  LSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNH

Query:  MRSIGCVPEVEDEVDDVLLATS
        MR +G   E+EDEVDDVLLATS
Subjt:  MRSIGCVPEVEDEVDDVLLATS

A0A6J1GDY6 pentatricopeptide repeat-containing protein At2g373208.8e-17363.33Show/hide
Query:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV
        MTVRLLH+LF IS  SALL RHGFH+V SIRLFS FK  T  TTNLPKPPRLL+LISPK N  SESRQTHLRLI+DFL+TD DQCRS+TLS+G DS SV 
Subjt:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV

Query:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------
        L KDSS V DQERESGHW  QLFAGRF+FDANDISS LSLC SQRN RGGIQYHSVAIRTGFI NVYVGSSL                            
Subjt:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
                  HGLSLQAI+LFKAMRK +QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQN IE MPISPN
Subjt:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN

Query:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL
        SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANL A+AGYLDDAA+LRKMMKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EI G+
Subjt:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL

Query:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS
        MDG+VNHMRS+GCVPEV++E++D LLATS
Subjt:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS

A0A6J1IKW6 pentatricopeptide repeat-containing protein At2g373202.1e-17463.71Show/hide
Query:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV
        MTVRLLH+LF ISQ SALL RHGFH+V SIRLFS FK  T  TTNLPKPPRLL+LISPK N  SESRQTHLRLI+DFL+TD DQCRS+TLS+G DS SV 
Subjt:  MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVV

Query:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------
        L KDSS VLDQERESGHWD QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFI NVYVGSSL                            
Subjt:  LLKDSSCVLDQERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
                  HGLSL+AI+LF+AMRK +QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN
Subjt:  ----------HGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPN

Query:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL
        SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANL A+AGYL+DAA+LRKMMKD+GLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EI G+
Subjt:  SIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGL

Query:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS
        MDG+VNHMRS+ CVPEV+DE++D LLA S
Subjt:  MDGIVNHMRSIGCVPEVEDEVDDVLLATS

SwissProt top hitse value%identityAlignment
Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220701.3e-5339.38Show/hide
Query:  ISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL--------------------------------------HGLSLQAIELFKAMRKLKQVEAD
        ++++LS+ SS  +L  G Q H  A+++G I++V V ++L                                      HG + +A+ELF+ M  ++ +  D
Subjt:  ISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL--------------------------------------HGLSLQAIELFKAMRKLKQVEAD

Query:  AITFLGVLSSCRHAGLVEEGRYYFNLMVELD-LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQP
         IT++GV S+C HAGLV +GR YF++M ++D + P L HY+C++DL GRAGLL+EAQ FIEKMPI P+ + WGSLLSACR+H N+ +G  AAE  LLL+P
Subjt:  AITFLGVLSSCRHAGLVEEGRYYFNLMVELD-LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQP

Query:  DCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVEDEVDDV
        + +  +  LANL +  G  ++AAK+RK MKD  +K   G+SWIE+++KV+ F  ED ++P   EI   M  I + ++ +G VP+    + D+
Subjt:  DCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVEDEVDDV

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136008.8e-4542.44Show/hide
Query:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVW
        +G + +G   +A+ELF+ M +  + + D IT +GVLS+C HAG VEEGR+YF+ M  +  + P  DHY+C++DLLGRAG L+EA++ IE+MP+ P+S++W
Subjt:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVW

Query:  GSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGI
        GSLL+AC++H N+ +G   AE  L ++P  +  ++ L+N+ A+ G  +D   +RK M+  G+   PG SWI+IQ   + F  +DKS+P   +I  L+D +
Subjt:  GSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGI

Query:  VNHMR
        +  MR
Subjt:  VNHMR

Q9ZUT4 Pentatricopeptide repeat-containing protein At2g373208.4e-7261.17Show/hide
Query:  NVYVGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI
        ++  G + HGL++QAIELF+ M      + DAIT+LGVLSSCRHAGLV+EGR +FNLM E  LKPEL+HYSC++DLLGR GLL+EA   IE MP+ PNS+
Subjt:  NVYVGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD
        +WGSLL +CR+HG+VW G++AAE RL+L+PDCA+TH+QLANL A  GY  +AA +RK+MKD+GLKT PG SWIEI N V+ FKAED SN  M+EI+ ++ 
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD

Query:  GIVNHM
         +++HM
Subjt:  GIVNHM

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276102.1e-4632.46Show/hide
Query:  QLFAGRFKFDANDISSVLSLCSSQRNLRG-GIQYHSVAIRTGFIFNVYVGSSL-------------------------------------HGLSLQAIEL
        +L  G  K +    SS+L++C++     G G Q+H  AI++    ++ V S+L                                     HG +++A+++
Subjt:  QLFAGRFKFDANDISSVLSLCSSQRNLRG-GIQYHSVAIRTGFIFNVYVGSSL-------------------------------------HGLSLQAIEL

Query:  FKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWI
        FK M+K ++V+ D +TF+GV ++C HAGLVEEG  YF++MV +  + P  +H SC++DL  RAG L++A   IE MP    S +W ++L+ACR+H    +
Subjt:  FKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWI

Query:  GLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVED
        G  AAE  + ++P+ ++ ++ L+N+ A++G   + AK+RK+M +R +K  PGYSWIE++NK Y F A D+S+P+  +I   ++ +   ++ +G  P+   
Subjt:  GLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVED

Query:  EVDDV
         + D+
Subjt:  EVDDV

Q9ZVF4 Pentatricopeptide repeat-containing protein At2g01510, mitochondrial3.3e-4441.44Show/hide
Query:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV---ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI
        VG +++G S +A+ LF  M+  + +  + +TFLGVLS+C HAGLV EG+ YF+LMV   + +L+P  +HY+C++DLLGR+GLL+EA  FI+KMP+ P++ 
Subjt:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV---ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD
        +WG+LL AC +H ++ +G K A+  +   PD  S H+ L+N+ A AG  D   K+R  M+  G K    YS +E + K++ F   DKS+P    I   +D
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD

Query:  GIVNHMRSIGCVPEVEDEVDDV
         I+  +R +G VP+      DV
Subjt:  GIVNHMRSIGCVPEVEDEVDDV

Arabidopsis top hitse value%identityAlignment
AT2G01510.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-4541.44Show/hide
Query:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV---ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI
        VG +++G S +A+ LF  M+  + +  + +TFLGVLS+C HAGLV EG+ YF+LMV   + +L+P  +HY+C++DLLGR+GLL+EA  FI+KMP+ P++ 
Subjt:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV---ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD
        +WG+LL AC +H ++ +G K A+  +   PD  S H+ L+N+ A AG  D   K+R  M+  G K    YS +E + K++ F   DKS+P    I   +D
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD

Query:  GIVNHMRSIGCVPEVEDEVDDV
         I+  +R +G VP+      DV
Subjt:  GIVNHMRSIGCVPEVEDEVDDV

AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein6.2e-4642.44Show/hide
Query:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVW
        +G + +G   +A+ELF+ M +  + + D IT +GVLS+C HAG VEEGR+YF+ M  +  + P  DHY+C++DLLGRAG L+EA++ IE+MP+ P+S++W
Subjt:  VGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVW

Query:  GSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGI
        GSLL+AC++H N+ +G   AE  L ++P  +  ++ L+N+ A+ G  +D   +RK M+  G+   PG SWI+IQ   + F  +DKS+P   +I  L+D +
Subjt:  GSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGI

Query:  VNHMR
        +  MR
Subjt:  VNHMR

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein9.6e-5539.38Show/hide
Query:  ISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL--------------------------------------HGLSLQAIELFKAMRKLKQVEAD
        ++++LS+ SS  +L  G Q H  A+++G I++V V ++L                                      HG + +A+ELF+ M  ++ +  D
Subjt:  ISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSL--------------------------------------HGLSLQAIELFKAMRKLKQVEAD

Query:  AITFLGVLSSCRHAGLVEEGRYYFNLMVELD-LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQP
         IT++GV S+C HAGLV +GR YF++M ++D + P L HY+C++DL GRAGLL+EAQ FIEKMPI P+ + WGSLLSACR+H N+ +G  AAE  LLL+P
Subjt:  AITFLGVLSSCRHAGLVEEGRYYFNLMVELD-LKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQP

Query:  DCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVEDEVDDV
        + +  +  LANL +  G  ++AAK+RK MKD  +K   G+SWIE+++KV+ F  ED ++P   EI   M  I + ++ +G VP+    + D+
Subjt:  DCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVEDEVDDV

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-4732.46Show/hide
Query:  QLFAGRFKFDANDISSVLSLCSSQRNLRG-GIQYHSVAIRTGFIFNVYVGSSL-------------------------------------HGLSLQAIEL
        +L  G  K +    SS+L++C++     G G Q+H  AI++    ++ V S+L                                     HG +++A+++
Subjt:  QLFAGRFKFDANDISSVLSLCSSQRNLRG-GIQYHSVAIRTGFIFNVYVGSSL-------------------------------------HGLSLQAIEL

Query:  FKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWI
        FK M+K ++V+ D +TF+GV ++C HAGLVEEG  YF++MV +  + P  +H SC++DL  RAG L++A   IE MP    S +W ++L+ACR+H    +
Subjt:  FKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-ELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWI

Query:  GLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVED
        G  AAE  + ++P+ ++ ++ L+N+ A++G   + AK+RK+M +R +K  PGYSWIE++NK Y F A D+S+P+  +I   ++ +   ++ +G  P+   
Subjt:  GLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVED

Query:  EVDDV
         + D+
Subjt:  EVDDV

AT2G37320.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.0e-7361.17Show/hide
Query:  NVYVGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI
        ++  G + HGL++QAIELF+ M      + DAIT+LGVLSSCRHAGLV+EGR +FNLM E  LKPEL+HYSC++DLLGR GLL+EA   IE MP+ PNS+
Subjt:  NVYVGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD
        +WGSLL +CR+HG+VW G++AAE RL+L+PDCA+TH+QLANL A  GY  +AA +RK+MKD+GLKT PG SWIEI N V+ FKAED SN  M+EI+ ++ 
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMD

Query:  GIVNHM
         +++HM
Subjt:  GIVNHM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTTCGTCTTCTTCATTTGCTCTTCGCGATATCTCAACCTTCTGCACTTCTTTATAGGCATGGCTTCCACTTAGTACTCTCCATTAGGCTTTTCTCTAACTTCAA
ATCCAAGACCCATCAGACCACTAACTTACCTAAACCTCCGAGACTTTTGGAGCTCATTTCTCCAAAGGCAAATGTCGTCTCTGAAAGTCGCCAAACCCATCTTCGGCTCA
TTCAGGACTTTTTAAAAACAGATTTAGATCAATGTCGATCTAAAACCCTTTCCAACGGTTTAGATTCTCATTCAGTTGTTTTATTGAAGGATTCCTCCTGTGTTCTTGAT
CAAGAACGTGAATCTGGTCATTGGGATGTTCAGTTGTTCGCAGGAAGATTTAAATTTGATGCTAATGATATATCCAGCGTTTTGAGTTTGTGCAGTTCTCAACGCAATCT
TCGTGGTGGAATTCAGTATCATTCTGTGGCGATACGAACTGGGTTTATTTTCAATGTGTATGTAGGAAGTTCGCTGCATGGACTTTCTCTGCAAGCCATTGAGCTTTTTA
AAGCAATGAGGAAGCTGAAGCAAGTGGAAGCCGATGCCATCACTTTCCTTGGCGTCCTGTCCTCATGTAGACATGCAGGGCTTGTGGAAGAGGGCAGATACTACTTCAAT
CTTATGGTCGAGCTCGATTTGAAACCAGAGTTGGATCATTATTCATGTGTTATTGATCTGCTTGGCCGAGCTGGACTACTCAAAGAGGCTCAAAACTTCATTGAGAAGAT
GCCCATATCTCCCAATTCAATTGTTTGGGGATCACTTCTCTCTGCTTGCAGGCTTCATGGGAATGTATGGATAGGACTGAAGGCTGCAGAGAGTAGATTGTTACTGCAAC
CTGATTGCGCCTCGACACACTTGCAATTGGCTAATTTATGTGCAAAGGCAGGATACTTGGATGATGCTGCAAAGTTGCGGAAGATGATGAAAGACAGAGGGTTGAAGACT
GCTCCTGGATATAGCTGGATTGAGATTCAGAATAAAGTTTACAGATTCAAAGCAGAAGATAAGTCAAACCCTGTAATGGTTGAGATTTTAGGTCTTATGGATGGCATAGT
GAATCACATGAGATCTATAGGTTGTGTTCCTGAAGTGGAGGACGAAGTTGATGATGTTTTACTAGCAACATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGTTCGTCTTCTTCATTTGCTCTTCGCGATATCTCAACCTTCTGCACTTCTTTATAGGCATGGCTTCCACTTAGTACTCTCCATTAGGCTTTTCTCTAACTTCAA
ATCCAAGACCCATCAGACCACTAACTTACCTAAACCTCCGAGACTTTTGGAGCTCATTTCTCCAAAGGCAAATGTCGTCTCTGAAAGTCGCCAAACCCATCTTCGGCTCA
TTCAGGACTTTTTAAAAACAGATTTAGATCAATGTCGATCTAAAACCCTTTCCAACGGTTTAGATTCTCATTCAGTTGTTTTATTGAAGGATTCCTCCTGTGTTCTTGAT
CAAGAACGTGAATCTGGTCATTGGGATGTTCAGTTGTTCGCAGGAAGATTTAAATTTGATGCTAATGATATATCCAGCGTTTTGAGTTTGTGCAGTTCTCAACGCAATCT
TCGTGGTGGAATTCAGTATCATTCTGTGGCGATACGAACTGGGTTTATTTTCAATGTGTATGTAGGAAGTTCGCTGCATGGACTTTCTCTGCAAGCCATTGAGCTTTTTA
AAGCAATGAGGAAGCTGAAGCAAGTGGAAGCCGATGCCATCACTTTCCTTGGCGTCCTGTCCTCATGTAGACATGCAGGGCTTGTGGAAGAGGGCAGATACTACTTCAAT
CTTATGGTCGAGCTCGATTTGAAACCAGAGTTGGATCATTATTCATGTGTTATTGATCTGCTTGGCCGAGCTGGACTACTCAAAGAGGCTCAAAACTTCATTGAGAAGAT
GCCCATATCTCCCAATTCAATTGTTTGGGGATCACTTCTCTCTGCTTGCAGGCTTCATGGGAATGTATGGATAGGACTGAAGGCTGCAGAGAGTAGATTGTTACTGCAAC
CTGATTGCGCCTCGACACACTTGCAATTGGCTAATTTATGTGCAAAGGCAGGATACTTGGATGATGCTGCAAAGTTGCGGAAGATGATGAAAGACAGAGGGTTGAAGACT
GCTCCTGGATATAGCTGGATTGAGATTCAGAATAAAGTTTACAGATTCAAAGCAGAAGATAAGTCAAACCCTGTAATGGTTGAGATTTTAGGTCTTATGGATGGCATAGT
GAATCACATGAGATCTATAGGTTGTGTTCCTGAAGTGGAGGACGAAGTTGATGATGTTTTACTAGCAACATCCTGA
Protein sequenceShow/hide protein sequence
MTVRLLHLLFAISQPSALLYRHGFHLVLSIRLFSNFKSKTHQTTNLPKPPRLLELISPKANVVSESRQTHLRLIQDFLKTDLDQCRSKTLSNGLDSHSVVLLKDSSCVLD
QERESGHWDVQLFAGRFKFDANDISSVLSLCSSQRNLRGGIQYHSVAIRTGFIFNVYVGSSLHGLSLQAIELFKAMRKLKQVEADAITFLGVLSSCRHAGLVEEGRYYFN
LMVELDLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPDCASTHLQLANLCAKAGYLDDAAKLRKMMKDRGLKT
APGYSWIEIQNKVYRFKAEDKSNPVMVEILGLMDGIVNHMRSIGCVPEVEDEVDDVLLATS