; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg07881 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg07881
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionscarecrow-like protein 13
Genome locationCarg_Chr07:1552894..1556666
RNA-Seq ExpressionCarg07881
SyntenyCarg07881
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR005202 - Transcription factor GRAS
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR030005 - Scarecrow-like protein 13


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594700.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.1e-27499.79Show/hide
Query:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK
        MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK
Subjt:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK

Query:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
        GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
Subjt:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE

Query:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV
        HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFN MINEGYRPNYVTFLAVLTACGHVGFV
Subjt:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV

Query:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
        SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
Subjt:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG

Query:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
        LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
Subjt:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL

KAG6594701.1 Scarecrow-like protein 13, partial [Cucurbita argyrosperma subsp. sororia]1.8e-269100Show/hide
Query:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
        MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
Subjt:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD

Query:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
        QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
Subjt:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF

Query:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
        MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
Subjt:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI

Query:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
        PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
Subjt:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR

Query:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
        DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
Subjt:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA

KAG7026668.1 Scarecrow-like protein 13, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
        MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
Subjt:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD

Query:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
        QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
Subjt:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF

Query:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
        MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
Subjt:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI

Query:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
        PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
Subjt:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR

Query:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVASATASLLRIAVFPSNVPSETSFVLF
        DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVASATASLLRIAVFPSNVPSETSFVLF
Subjt:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVASATASLLRIAVFPSNVPSETSFVLF

Query:  HRNAGMRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFR
        HRNAGMRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFR
Subjt:  HRNAGMRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFR

Query:  KEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACAS
        KEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACAS
Subjt:  KEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACAS

Query:  LASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACG
        LASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACG
Subjt:  LASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACG

Query:  HVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANG
        HVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANG
Subjt:  HVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANG

Query:  FAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
        FAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
Subjt:  FAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL

XP_022926503.1 pentatricopeptide repeat-containing protein At4g16470 [Cucurbita moschata]6.8e-26997.89Show/hide
Query:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK
        MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDR QVKPHQKDSSSWDRTFRSLCITGRL+EAVALLCCMPF+FHSKTYCLLLQECIFRKEYMK
Subjt:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK

Query:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
        GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETAN+LHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
Subjt:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE

Query:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV
        HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSI DGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGH GFV
Subjt:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV

Query:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
        SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
Subjt:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG

Query:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
        LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSH+QAEEIYRTIHSIT ILKDAGSI ELSENSL
Subjt:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL

XP_023518629.1 pentatricopeptide repeat-containing protein At4g16470 [Cucurbita pepo subsp. pepo]1.7e-26797.24Show/hide
Query:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK
        MRLCGRPSSSGV+HLFTKSIVAGATATIRRRHKSEY NDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK
Subjt:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK

Query:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
        GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETAN+LHEKLLENSLVSWNALIAGYVQKGFGEVGLE+YFKMRRTGL+PDQYTFASVFRACASLASLE
Subjt:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE

Query:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV
        HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDG KVFNKS+TRNVITWTALISGYGHHGRVSEVLESFN MINEGYRPNYVTFLAVLTACGH GFV
Subjt:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV

Query:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
        SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
Subjt:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG

Query:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN
        LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPI+KDAGS  ELSEN
Subjt:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN

TrEMBL top hitse value%identityAlignment
A0A6J1EF80 scarecrow-like protein 131.1e-26798.95Show/hide
Query:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
        MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENN SPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
Subjt:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD

Query:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
        QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGE+RDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
Subjt:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF

Query:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
        MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
Subjt:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI

Query:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
        PLIEDLA+RPGGPPVLL ITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
Subjt:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR

Query:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
        DRLLRLVKSLSPK+VTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
Subjt:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA

A0A6J1EI86 pentatricopeptide repeat-containing protein At4g164703.3e-26997.89Show/hide
Query:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK
        MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDR QVKPHQKDSSSWDRTFRSLCITGRL+EAVALLCCMPF+FHSKTYCLLLQECIFRKEYMK
Subjt:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK

Query:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
        GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETAN+LHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
Subjt:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE

Query:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV
        HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSI DGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGH GFV
Subjt:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV

Query:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
        SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
Subjt:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG

Query:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL
        LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSH+QAEEIYRTIHSIT ILKDAGSI ELSENSL
Subjt:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL

A0A6J1GE16 scarecrow-like protein 134.2e-24088.84Show/hide
Query:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
        M+A+ N Q+SS IHEM+HQSVQ +D +YLSH H+LENN SPDASSQGNSV  SS+KDQFFTLESFPATA LSACNSPSAVS LSSRSPFSPQGSQ+CSSD
Subjt:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD

Query:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
        Q HSFDNTCGSPQSGCSVTDDD ELK+KLKELEISLLGPE+DI+DSCYCSFRGG  +DAS+ARWNWNQ+ E IP L+LRDTLI CAQAIH++DLN AT F
Subjt:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF

Query:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
        MDVLG+MVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEP+SSELMSYMS+LFQICPYFKFAYTSANA IWEAMVNEP+IHIIDFQIAQGSQYI
Subjt:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI

Query:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
        PLI DLA+RPGGPPVLLRITGVDDSQS+HARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGC+VERS+LRIRPGEALAVNFPY LHHMPDESVSTQNHR
Subjt:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR

Query:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
        DRLLRLVKSLSPKVVTIVEQESNTNTSPF +RF+ETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
Subjt:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA

A0A6J1KS92 pentatricopeptide repeat-containing protein At4g164703.1e-25995.33Show/hide
Query:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK
        MRLCGR SSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLC MPF+FHSKTYCLLLQECIFRKEYMK
Subjt:  MRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMK

Query:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE
        GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETA +LHEKLL+NSLVSWNALIAG VQKG GEVGLELYFKMRRTGLIPDQYTFASV RACASLASLE
Subjt:  GKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLE

Query:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV
        HGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSSISDGHKVF+KS+TRNVITWTALISGYGHHGRVSEVLESFN MINEGYRPNYVTFLAVLTACGH GFV
Subjt:  HGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFV

Query:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG
        SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAY+FVVDAPCKEH+VIWGALVGGCKVHEDIDLMKHAAA+YLALDA NAGKYVVLANGFAASG
Subjt:  SEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASG

Query:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN
        LWDNVAEIR MMKKSGMNKEPGYSRIEIQREFHFFVKSDKSH+QA EIYRTIHSITPILKDAGSI ELSEN
Subjt:  LWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN

A0A6J1KW95 scarecrow-like protein 133.5e-26398.11Show/hide
Query:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
        MQA+PNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENN SPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
Subjt:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD

Query:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF
        QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIV+SCYCSFRGGE+RDASMARWNWNQMIETIP+LSLRDTLIRCAQAIHEADLNAATSF
Subjt:  QHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSF

Query:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
        MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI
Subjt:  MDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYI

Query:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR
        PLIEDLASRPGGPPV LRITGVDDSQSAHARGGGLQIVGQKLA LAQSKGIPFQFHAAAMSGCKVE SDLRIRP EALAVNFPYALHHMPDESVSTQNHR
Subjt:  PLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHR

Query:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
        DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
Subjt:  DRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA

SwissProt top hitse value%identityAlignment
O23491 Pentatricopeptide repeat-containing protein At4g164704.3e-14155.76Show/hide
Query:  SIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEY
        S+ +G   TI RR  +E    R QV+ +Q+ +   D+T + LC+TGRL EAV LL     +   +TY +LLQEC  RKEY KGKRIHAQM VVG   NEY
Subjt:  SIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEY

Query:  LKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNV
        LK KLLILYA  GDL+TA +L   L    L+ WNA+I+GYVQKG  + GL +Y+ MR+  ++PDQYTFASVFRAC++L  LEHGKRAH V+IK  I  N+
Subjt:  LKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNV

Query:  VVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEP
        +V SALVDMYFKCSS SDGH+VF++ +TRNVITWT+LISGYG+HG+VSEVL+ F +M  EG RPN VTFL VLTAC H G V + W +   MK  Y IEP
Subjt:  VVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEP

Query:  RGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMN
         GQHYAAM D L RAGRLQEAY FV+ +PCKEH  +WG+L+G C++H ++ L++ AA  +L LD  N G YVV ANG+A+ GL +  +++R  M+ +G+ 
Subjt:  RGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMN

Query:  KEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKD
        K+PGYS+IE+Q E H F+K D SH  +E+IY+ +H +T    D
Subjt:  KEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKD

Q8GVE1 Chitin-inducible gibberellin-responsive protein 26.5e-11350.55Show/hide
Query:  DSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYE
        D+  + H + L+++ SPDA  +  +  +       +TL+S      +   +SPS+ S       F+ +     S +  HS D+T GSP     VT+D  +
Subjt:  DSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYE

Query:  LKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLE
        LK KLK+LE  +LGP+++IV+    S     +   S+    W +M+  IP+ +L++ LI CA+A+ E +  A    +  L ++VSVSG+P +RLGAY++E
Subjt:  LKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLE

Query:  GLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDD
        GL ARL  SG +IYKALKC+EP SS+L+SYM  L++ CPYFKF Y SAN  I EA+  E  IHIIDF I+QG+Q+I L++ LA+RPGGPP  +RITG+DD
Subjt:  GLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDD

Query:  SQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNT
        S SA+ARGGGL++VG++L+ +A    +PF+FH  A+SG KVE + L + PGEALAVNF   LHH+PDESVST NHRDRLLR+VKSLSPKV+T+VE ESNT
Subjt:  SQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNT

Query:  NTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
        NT+PF  RF ETLDYYTA+FESID+   RDD++RI  EQHC+AR+IVN++A
Subjt:  NTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA

Q8H125 Scarecrow-like protein 52.5e-10950.94Show/hide
Query:  DQFFTLESFPAT----AYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTD-DDYELKHKLKELEISLLGPETDIVDSCYCSF
        D + TLES   T       +  NS S  S  S+ SP S   +   S   +HS +    SP SG S T+ ++ EL   LK+LE +++ P+   VD+ Y + 
Subjt:  DQFFTLESFPAT----AYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTD-DDYELKHKLKELEISLLGPETDIVDSCYCSF

Query:  RGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSEL
        +GG  +   +      + +E I +  L+  L  CA+A+   DL      +  L QMVSVSG+P QRLGAY+LEGL ARL  SGS+IYKAL+C++P+  EL
Subjt:  RGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSEL

Query:  MSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGI
        ++YM IL++ CPYFKF Y SAN  I EA+ NE  +HIIDFQI+QG Q++ LI  L +RPGGPP  +RITG+DD +S+ AR GGL++VGQ+L +LA+  G+
Subjt:  MSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGI

Query:  PFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVAR
        PF+FH AA+   +VE   L +R GEALAVNFP  LHHMPDESV+ +NHRDRLLRLVK LSP VVT+VEQE+NTNT+PF  RF+ET+++Y A+FESIDV  
Subjt:  PFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVAR

Query:  SRDDKQRIRAEQHCVARDIVNMVA
        +RD K+RI  EQHC+AR++VN++A
Subjt:  SRDDKQRIRAEQHCVARDIVNMVA

Q9LDL7 Scarecrow-like transcription factor PAT18.2e-11654.42Show/hide
Query:  FFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPET-DIVDSCYCSFRGGESR
        +F   S     YL   NS      L    P SP  +   ++    ++D+TCGS      VTD+  + KHK++E+E  ++GP++ D++  C  SF    S+
Subjt:  FFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPET-DIVDSCYCSFRGGESR

Query:  DASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKAL-KCEEPSSSELMSYMS
        + +     W   +E I +  LR  L+ CA+A+ E DL  A S M+ L QMVSVSG+P QRLGAYLLEGL A+L  SGS+IYKAL +C EP+S+EL+SYM 
Subjt:  DASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKAL-KCEEPSSSELMSYMS

Query:  ILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFH
        IL+++CPYFKF Y SAN  I EAM  E  +HIIDFQI QGSQ++ LI+  A+RPGGPP  +RITG+DD  SA+ARGGGL IVG +LA+LA+   +PF+F+
Subjt:  ILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFH

Query:  AAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDK
        + ++S  +V+  +L +RPGEALAVNF + LHHMPDESVST+NHRDRLLR+VKSLSPKVVT+VEQESNTNT+ FF RF+ET++YY AMFESIDV   RD K
Subjt:  AAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDK

Query:  QRIRAEQHCVARDIVNMVA
        QRI  EQHC+ARD+VN++A
Subjt:  QRIRAEQHCVARDIVNMVA

Q9M0M5 Scarecrow-like protein 132.3e-14758.82Show/hide
Query:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
        MQ +    S++ +H +Y Q       +    F   +N G  D  S          K+ FFTLES  A+  L + +SPS VS+ S RSPFSPQGSQ+C SD
Subjt:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD

Query:  QHHSFDNTCGSPQSG-CSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATS
         HHS DN  GSP SG  S+  D+  +K K++ELE+SLL  +T + +    S   G+S       WNW++++   P+L L++ L+  A+A+ + D   A  
Subjt:  QHHSFDNTCGSPQSG-CSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATS

Query:  FMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQY
        F+DVL QMVSVSG P QRLG Y+ EGLRARLE SGS IYK+LKC EP+  ELMSYMS+L++ICPY+KFAYT+AN  I EA+  E  +HIIDFQIAQGSQY
Subjt:  FMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQY

Query:  IPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNH
        + LI++LA RPGGPP LLR+TGVDDSQS +ARGGGL +VG++LA LAQS G+PF+FH A MSGCKV+R  L + PG A+ VNFPY LHHMPDESVS +NH
Subjt:  IPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNH

Query:  RDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
        RDRLL L+KSLSPK+VT+VEQESNTNTSPF  RF+ETLDYYTAMFESID AR RDDKQRI AEQHCVARDIVNM+A
Subjt:  RDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA

Arabidopsis top hitse value%identityAlignment
AT1G50600.1 scarecrow-like 51.8e-11050.94Show/hide
Query:  DQFFTLESFPAT----AYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTD-DDYELKHKLKELEISLLGPETDIVDSCYCSF
        D + TLES   T       +  NS S  S  S+ SP S   +   S   +HS +    SP SG S T+ ++ EL   LK+LE +++ P+   VD+ Y + 
Subjt:  DQFFTLESFPAT----AYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTD-DDYELKHKLKELEISLLGPETDIVDSCYCSF

Query:  RGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSEL
        +GG  +   +      + +E I +  L+  L  CA+A+   DL      +  L QMVSVSG+P QRLGAY+LEGL ARL  SGS+IYKAL+C++P+  EL
Subjt:  RGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSEL

Query:  MSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGI
        ++YM IL++ CPYFKF Y SAN  I EA+ NE  +HIIDFQI+QG Q++ LI  L +RPGGPP  +RITG+DD +S+ AR GGL++VGQ+L +LA+  G+
Subjt:  MSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGI

Query:  PFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVAR
        PF+FH AA+   +VE   L +R GEALAVNFP  LHHMPDESV+ +NHRDRLLRLVK LSP VVT+VEQE+NTNT+PF  RF+ET+++Y A+FESIDV  
Subjt:  PFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVAR

Query:  SRDDKQRIRAEQHCVARDIVNMVA
        +RD K+RI  EQHC+AR++VN++A
Subjt:  SRDDKQRIRAEQHCVARDIVNMVA

AT4G16470.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-14255.76Show/hide
Query:  SIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEY
        S+ +G   TI RR  +E    R QV+ +Q+ +   D+T + LC+TGRL EAV LL     +   +TY +LLQEC  RKEY KGKRIHAQM VVG   NEY
Subjt:  SIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEY

Query:  LKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNV
        LK KLLILYA  GDL+TA +L   L    L+ WNA+I+GYVQKG  + GL +Y+ MR+  ++PDQYTFASVFRAC++L  LEHGKRAH V+IK  I  N+
Subjt:  LKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNV

Query:  VVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEP
        +V SALVDMYFKCSS SDGH+VF++ +TRNVITWT+LISGYG+HG+VSEVL+ F +M  EG RPN VTFL VLTAC H G V + W +   MK  Y IEP
Subjt:  VVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEP

Query:  RGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMN
         GQHYAAM D L RAGRLQEAY FV+ +PCKEH  +WG+L+G C++H ++ L++ AA  +L LD  N G YVV ANG+A+ GL +  +++R  M+ +G+ 
Subjt:  RGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMN

Query:  KEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKD
        K+PGYS+IE+Q E H F+K D SH  +E+IY+ +H +T    D
Subjt:  KEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKD

AT4G17230.1 SCARECROW-like 131.7e-14858.82Show/hide
Query:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD
        MQ +    S++ +H +Y Q       +    F   +N G  D  S          K+ FFTLES  A+  L + +SPS VS+ S RSPFSPQGSQ+C SD
Subjt:  MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSD

Query:  QHHSFDNTCGSPQSG-CSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATS
         HHS DN  GSP SG  S+  D+  +K K++ELE+SLL  +T + +    S   G+S       WNW++++   P+L L++ L+  A+A+ + D   A  
Subjt:  QHHSFDNTCGSPQSG-CSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATS

Query:  FMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQY
        F+DVL QMVSVSG P QRLG Y+ EGLRARLE SGS IYK+LKC EP+  ELMSYMS+L++ICPY+KFAYT+AN  I EA+  E  +HIIDFQIAQGSQY
Subjt:  FMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQY

Query:  IPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNH
        + LI++LA RPGGPP LLR+TGVDDSQS +ARGGGL +VG++LA LAQS G+PF+FH A MSGCKV+R  L + PG A+ VNFPY LHHMPDESVS +NH
Subjt:  IPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNH

Query:  RDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA
        RDRLL L+KSLSPK+VT+VEQESNTNTSPF  RF+ETLDYYTAMFESID AR RDDKQRI AEQHCVARDIVNM+A
Subjt:  RDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVA

AT5G48150.1 GRAS family transcription factor5.8e-11754.42Show/hide
Query:  FFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPET-DIVDSCYCSFRGGESR
        +F   S     YL   NS      L    P SP  +   ++    ++D+TCGS      VTD+  + KHK++E+E  ++GP++ D++  C  SF    S+
Subjt:  FFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPET-DIVDSCYCSFRGGESR

Query:  DASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKAL-KCEEPSSSELMSYMS
        + +     W   +E I +  LR  L+ CA+A+ E DL  A S M+ L QMVSVSG+P QRLGAYLLEGL A+L  SGS+IYKAL +C EP+S+EL+SYM 
Subjt:  DASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKAL-KCEEPSSSELMSYMS

Query:  ILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFH
        IL+++CPYFKF Y SAN  I EAM  E  +HIIDFQI QGSQ++ LI+  A+RPGGPP  +RITG+DD  SA+ARGGGL IVG +LA+LA+   +PF+F+
Subjt:  ILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFH

Query:  AAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDK
        + ++S  +V+  +L +RPGEALAVNF + LHHMPDESVST+NHRDRLLR+VKSLSPKVVT+VEQESNTNT+ FF RF+ET++YY AMFESIDV   RD K
Subjt:  AAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDK

Query:  QRIRAEQHCVARDIVNMVA
        QRI  EQHC+ARD+VN++A
Subjt:  QRIRAEQHCVARDIVNMVA

AT5G48150.2 GRAS family transcription factor5.8e-11754.42Show/hide
Query:  FFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPET-DIVDSCYCSFRGGESR
        +F   S     YL   NS      L    P SP  +   ++    ++D+TCGS      VTD+  + KHK++E+E  ++GP++ D++  C  SF    S+
Subjt:  FFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCGSPQSGCSVTDDDYELKHKLKELEISLLGPET-DIVDSCYCSFRGGESR

Query:  DASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKAL-KCEEPSSSELMSYMS
        + +     W   +E I +  LR  L+ CA+A+ E DL  A S M+ L QMVSVSG+P QRLGAYLLEGL A+L  SGS+IYKAL +C EP+S+EL+SYM 
Subjt:  DASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGAYLLEGLRARLERSGSAIYKAL-KCEEPSSSELMSYMS

Query:  ILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFH
        IL+++CPYFKF Y SAN  I EAM  E  +HIIDFQI QGSQ++ LI+  A+RPGGPP  +RITG+DD  SA+ARGGGL IVG +LA+LA+   +PF+F+
Subjt:  ILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHARGGGLQIVGQKLAQLAQSKGIPFQFH

Query:  AAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDK
        + ++S  +V+  +L +RPGEALAVNF + LHHMPDESVST+NHRDRLLR+VKSLSPKVVT+VEQESNTNT+ FF RF+ET++YY AMFESIDV   RD K
Subjt:  AAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYYTAMFESIDVARSRDDK

Query:  QRIRAEQHCVARDIVNMVA
        QRI  EQHC+ARD+VN++A
Subjt:  QRIRAEQHCVARDIVNMVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGCAGCTCCGAATCGTCAGTCTTCATCTGTGATCCATGAGATGTATCACCAGTCTGTGCAAGGGATGGATTCTTTCTATTTATCTCATTTCCACGTCTTGGAAAA
CAATGGCTCTCCCGATGCAAGCAGTCAGGGAAACAGTGTTATTTCTTCTTCCTATAAGGATCAGTTCTTCACTCTCGAATCGTTCCCGGCTACTGCTTATCTGAGCGCCT
GTAATTCCCCTTCTGCTGTTAGTGTTCTGTCTTCCAGGAGCCCCTTCTCACCTCAAGGGTCTCAAACTTGCTCTTCTGATCAACATCATTCCTTTGACAACACATGTGGA
TCGCCGCAAAGTGGATGCTCGGTAACTGATGATGATTATGAATTGAAGCACAAGCTGAAGGAATTAGAGATTTCTTTGTTGGGACCAGAAACTGATATTGTGGATAGCTG
CTACTGTTCTTTCAGGGGTGGGGAGAGCCGAGATGCTTCGATGGCAAGGTGGAACTGGAATCAAATGATCGAAACAATTCCTAAACTAAGTCTTCGGGACACACTTATCC
GGTGTGCTCAAGCAATTCATGAGGCTGATTTAAATGCGGCAACATCGTTTATGGATGTTTTGGGGCAAATGGTTTCGGTCTCTGGTGATCCAGCCCAGAGGTTGGGAGCT
TACTTGTTGGAAGGGCTTAGAGCGAGGTTGGAGCGATCTGGAAGTGCAATATACAAGGCCTTAAAGTGCGAAGAGCCGTCAAGCTCTGAACTTATGTCCTATATGTCTAT
TCTATTCCAGATATGCCCGTACTTCAAATTTGCTTACACATCTGCAAATGCTTTCATTTGGGAGGCTATGGTAAATGAACCCGTAATCCACATCATTGATTTTCAAATTG
CACAGGGTAGTCAGTATATACCTCTCATTGAGGATCTTGCCAGTCGGCCTGGTGGACCCCCAGTTCTTCTTCGCATTACGGGTGTGGATGATTCCCAATCAGCTCATGCT
CGGGGAGGGGGACTTCAAATTGTGGGACAGAAGCTAGCTCAACTGGCTCAGTCTAAAGGAATTCCCTTCCAATTTCATGCTGCTGCAATGTCTGGTTGCAAGGTCGAGCG
CAGTGATCTTAGAATACGACCTGGAGAAGCCTTGGCTGTAAATTTTCCATATGCCTTGCACCACATGCCAGATGAGAGCGTGAGCACACAGAACCATCGAGATCGTCTTC
TAAGGCTGGTTAAGAGTCTATCACCAAAGGTAGTAACTATTGTTGAGCAGGAATCCAACACCAACACATCCCCGTTCTTTTTACGTTTTATAGAGACGCTGGACTATTAT
ACTGCTATGTTCGAGTCAATAGACGTAGCTCGTTCAAGAGACGACAAGCAACGAATCAGGGCAGAGCAGCACTGTGTTGCCCGAGACATAGTGAACATGGTAGCATCCGC
CACAGCTTCTCTTCTTCGTATCGCCGTCTTCCCTTCGAATGTCCCATCTGAAACCTCCTTTGTTCTGTTTCATCGAAATGCTGGAATGCGCCTCTGTGGTCGGCCTTCTT
CTTCCGGCGTCATTCATCTGTTCACCAAGTCGATTGTAGCCGGCGCAACCGCGACCATTCGTCGTCGGCATAAATCTGAATACGCCAACGACAGGTCTCAGGTGAAGCCA
CATCAGAAAGATTCCTCCTCCTGGGATAGAACCTTTAGGAGCCTATGTATAACGGGGAGATTGAGCGAGGCGGTTGCACTTTTGTGCTGTATGCCCTTCCGATTTCACTC
CAAAACTTACTGCCTTCTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCTCAAATGGTTGTTGTTGGACATTTACCCAATGAGTATC
TCAAAACCAAACTGCTGATATTATATGCCAAATTAGGTGACTTAGAAACTGCAAATGTTCTTCATGAGAAATTGCTGGAGAACAGTCTGGTTTCATGGAATGCATTGATT
GCTGGATATGTACAGAAAGGGTTTGGAGAAGTTGGATTGGAGCTTTACTTTAAGATGAGACGAACTGGTTTAATACCTGATCAATATACCTTTGCATCAGTTTTCAGAGC
CTGTGCTAGCTTAGCTTCTTTGGAACATGGAAAGAGAGCTCATGGAGTTCTGATTAAGTGTCGAATCGGCGACAATGTTGTCGTGTCTAGTGCCCTTGTTGATATGTACT
TCAAATGCAGTAGCATATCAGATGGTCATAAGGTATTTAACAAATCTACAACTAGAAATGTGATTACATGGACTGCTTTAATATCAGGGTATGGCCACCATGGAAGAGTT
TCTGAAGTTTTGGAATCCTTCAATAGGATGATAAATGAAGGTTACCGACCAAATTACGTTACATTCCTTGCGGTTCTTACTGCTTGTGGTCATGTTGGTTTTGTATCGGA
AGCATGGCGATACTTATCGTTAATGAAGACGACGTATGAAATAGAACCAAGAGGGCAACATTATGCTGCCATGGCGGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGG
CATATAATTTTGTCGTCGATGCACCATGCAAGGAGCACGCTGTTATATGGGGTGCTTTGGTCGGGGGTTGTAAGGTTCACGAAGACATAGATTTGATGAAACATGCAGCA
GCAAATTACTTGGCATTGGATGCTGGCAACGCTGGGAAGTATGTGGTTTTAGCAAATGGGTTTGCGGCGTCTGGCTTGTGGGATAATGTTGCGGAGATTAGAGGCATGAT
GAAGAAATCAGGAATGAATAAGGAACCTGGTTACAGCAGAATTGAGATACAACGTGAGTTTCACTTCTTTGTTAAAAGTGATAAATCTCACGAACAAGCCGAGGAGATTT
ATAGAACCATTCACAGCATCACTCCGATTTTAAAGGATGCAGGCTCTATTCATGAACTAAGTGAAAACTCATTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGCAGCTCCGAATCGTCAGTCTTCATCTGTGATCCATGAGATGTATCACCAGTCTGTGCAAGGGATGGATTCTTTCTATTTATCTCATTTCCACGTCTTGGAAAA
CAATGGCTCTCCCGATGCAAGCAGTCAGGGAAACAGTGTTATTTCTTCTTCCTATAAGGATCAGTTCTTCACTCTCGAATCGTTCCCGGCTACTGCTTATCTGAGCGCCT
GTAATTCCCCTTCTGCTGTTAGTGTTCTGTCTTCCAGGAGCCCCTTCTCACCTCAAGGGTCTCAAACTTGCTCTTCTGATCAACATCATTCCTTTGACAACACATGTGGA
TCGCCGCAAAGTGGATGCTCGGTAACTGATGATGATTATGAATTGAAGCACAAGCTGAAGGAATTAGAGATTTCTTTGTTGGGACCAGAAACTGATATTGTGGATAGCTG
CTACTGTTCTTTCAGGGGTGGGGAGAGCCGAGATGCTTCGATGGCAAGGTGGAACTGGAATCAAATGATCGAAACAATTCCTAAACTAAGTCTTCGGGACACACTTATCC
GGTGTGCTCAAGCAATTCATGAGGCTGATTTAAATGCGGCAACATCGTTTATGGATGTTTTGGGGCAAATGGTTTCGGTCTCTGGTGATCCAGCCCAGAGGTTGGGAGCT
TACTTGTTGGAAGGGCTTAGAGCGAGGTTGGAGCGATCTGGAAGTGCAATATACAAGGCCTTAAAGTGCGAAGAGCCGTCAAGCTCTGAACTTATGTCCTATATGTCTAT
TCTATTCCAGATATGCCCGTACTTCAAATTTGCTTACACATCTGCAAATGCTTTCATTTGGGAGGCTATGGTAAATGAACCCGTAATCCACATCATTGATTTTCAAATTG
CACAGGGTAGTCAGTATATACCTCTCATTGAGGATCTTGCCAGTCGGCCTGGTGGACCCCCAGTTCTTCTTCGCATTACGGGTGTGGATGATTCCCAATCAGCTCATGCT
CGGGGAGGGGGACTTCAAATTGTGGGACAGAAGCTAGCTCAACTGGCTCAGTCTAAAGGAATTCCCTTCCAATTTCATGCTGCTGCAATGTCTGGTTGCAAGGTCGAGCG
CAGTGATCTTAGAATACGACCTGGAGAAGCCTTGGCTGTAAATTTTCCATATGCCTTGCACCACATGCCAGATGAGAGCGTGAGCACACAGAACCATCGAGATCGTCTTC
TAAGGCTGGTTAAGAGTCTATCACCAAAGGTAGTAACTATTGTTGAGCAGGAATCCAACACCAACACATCCCCGTTCTTTTTACGTTTTATAGAGACGCTGGACTATTAT
ACTGCTATGTTCGAGTCAATAGACGTAGCTCGTTCAAGAGACGACAAGCAACGAATCAGGGCAGAGCAGCACTGTGTTGCCCGAGACATAGTGAACATGGTAGCATCCGC
CACAGCTTCTCTTCTTCGTATCGCCGTCTTCCCTTCGAATGTCCCATCTGAAACCTCCTTTGTTCTGTTTCATCGAAATGCTGGAATGCGCCTCTGTGGTCGGCCTTCTT
CTTCCGGCGTCATTCATCTGTTCACCAAGTCGATTGTAGCCGGCGCAACCGCGACCATTCGTCGTCGGCATAAATCTGAATACGCCAACGACAGGTCTCAGGTGAAGCCA
CATCAGAAAGATTCCTCCTCCTGGGATAGAACCTTTAGGAGCCTATGTATAACGGGGAGATTGAGCGAGGCGGTTGCACTTTTGTGCTGTATGCCCTTCCGATTTCACTC
CAAAACTTACTGCCTTCTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCTCAAATGGTTGTTGTTGGACATTTACCCAATGAGTATC
TCAAAACCAAACTGCTGATATTATATGCCAAATTAGGTGACTTAGAAACTGCAAATGTTCTTCATGAGAAATTGCTGGAGAACAGTCTGGTTTCATGGAATGCATTGATT
GCTGGATATGTACAGAAAGGGTTTGGAGAAGTTGGATTGGAGCTTTACTTTAAGATGAGACGAACTGGTTTAATACCTGATCAATATACCTTTGCATCAGTTTTCAGAGC
CTGTGCTAGCTTAGCTTCTTTGGAACATGGAAAGAGAGCTCATGGAGTTCTGATTAAGTGTCGAATCGGCGACAATGTTGTCGTGTCTAGTGCCCTTGTTGATATGTACT
TCAAATGCAGTAGCATATCAGATGGTCATAAGGTATTTAACAAATCTACAACTAGAAATGTGATTACATGGACTGCTTTAATATCAGGGTATGGCCACCATGGAAGAGTT
TCTGAAGTTTTGGAATCCTTCAATAGGATGATAAATGAAGGTTACCGACCAAATTACGTTACATTCCTTGCGGTTCTTACTGCTTGTGGTCATGTTGGTTTTGTATCGGA
AGCATGGCGATACTTATCGTTAATGAAGACGACGTATGAAATAGAACCAAGAGGGCAACATTATGCTGCCATGGCGGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGG
CATATAATTTTGTCGTCGATGCACCATGCAAGGAGCACGCTGTTATATGGGGTGCTTTGGTCGGGGGTTGTAAGGTTCACGAAGACATAGATTTGATGAAACATGCAGCA
GCAAATTACTTGGCATTGGATGCTGGCAACGCTGGGAAGTATGTGGTTTTAGCAAATGGGTTTGCGGCGTCTGGCTTGTGGGATAATGTTGCGGAGATTAGAGGCATGAT
GAAGAAATCAGGAATGAATAAGGAACCTGGTTACAGCAGAATTGAGATACAACGTGAGTTTCACTTCTTTGTTAAAAGTGATAAATCTCACGAACAAGCCGAGGAGATTT
ATAGAACCATTCACAGCATCACTCCGATTTTAAAGGATGCAGGCTCTATTCATGAACTAAGTGAAAACTCATTGTAG
Protein sequenceShow/hide protein sequence
MQAAPNRQSSSVIHEMYHQSVQGMDSFYLSHFHVLENNGSPDASSQGNSVISSSYKDQFFTLESFPATAYLSACNSPSAVSVLSSRSPFSPQGSQTCSSDQHHSFDNTCG
SPQSGCSVTDDDYELKHKLKELEISLLGPETDIVDSCYCSFRGGESRDASMARWNWNQMIETIPKLSLRDTLIRCAQAIHEADLNAATSFMDVLGQMVSVSGDPAQRLGA
YLLEGLRARLERSGSAIYKALKCEEPSSSELMSYMSILFQICPYFKFAYTSANAFIWEAMVNEPVIHIIDFQIAQGSQYIPLIEDLASRPGGPPVLLRITGVDDSQSAHA
RGGGLQIVGQKLAQLAQSKGIPFQFHAAAMSGCKVERSDLRIRPGEALAVNFPYALHHMPDESVSTQNHRDRLLRLVKSLSPKVVTIVEQESNTNTSPFFLRFIETLDYY
TAMFESIDVARSRDDKQRIRAEQHCVARDIVNMVASATASLLRIAVFPSNVPSETSFVLFHRNAGMRLCGRPSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKP
HQKDSSSWDRTFRSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNEYLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALI
AGYVQKGFGEVGLELYFKMRRTGLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDGHKVFNKSTTRNVITWTALISGYGHHGRV
SEVLESFNRMINEGYRPNYVTFLAVLTACGHVGFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGALVGGCKVHEDIDLMKHAA
ANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGMNKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSENSL