; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg12528 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg12528
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of unknown function (DUF604)
Genome locationCarg_Chr14:511201..516257
RNA-Seq ExpressionCarg12528
SyntenyCarg12528
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR006740 - Protein of unknown function DUF604


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580525.1 hypothetical protein SDJN03_20527, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0072.78Show/hide
Query:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
        MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
Subjt:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE

Query:  KPNATWPATSPPYRVSADTSEFSYTCWYGS----------------------------------------------------------------------
        KPNATWPATSPPYRVSADTSEFSYTCWYGS                                                                      
Subjt:  KPNATWPATSPPYRVSADTSEFSYTCWYGS----------------------------------------------------------------------

Query:  ------------------------------------------------------------RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA
                                                                    RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA
Subjt:  ------------------------------------------------------------RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA

Query:  LKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR
        LKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR
Subjt:  LKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR

Query:  QLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFETNSLKAFKFMVFRPADVFALFVRTFLVISIVA
        QLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMD+VVNVWMRGCNPFET +   +      PADVFALFVRTFLVISIVA
Subjt:  QLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFETNSLKAFKFMVFRPADVFALFVRTFLVISIVA

Query:  SFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDD
        SFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDD
Subjt:  SFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDD

Query:  TSQFNYTC--------------------------------------------------------------------------------------------
        TSQFNYTC                                                                                            
Subjt:  TSQFNYTC--------------------------------------------------------------------------------------------

Query:  ------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFC
                                 EIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFC
Subjt:  ------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFC

Query:  YDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDY
        YDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDY
Subjt:  YDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDY

Query:  ASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
        ASA+SVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
Subjt:  ASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP

KAG7017275.1 hypothetical protein SDJN02_19138, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
        MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
Subjt:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE

Query:  KPNATWPATSPPYRVSADTSEFSYTCWYGSRGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKALKTAYNLDPGRTLQQSFCYHPARNWSISVS
        KPNATWPATSPPYRVSADTSEFSYTCWYGSRGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKALKTAYNLDPGRTLQQSFCYHPARNWSISVS
Subjt:  KPNATWPATSPPYRVSADTSEFSYTCWYGSRGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKALKTAYNLDPGRTLQQSFCYHPARNWSISVS

Query:  WGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKRQLDVWEKECSRDEFQLAQKVERFRVVTFGP
        WGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKRQLDVWEKECSRDEFQLAQKVERFRVVTFGP
Subjt:  WGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKRQLDVWEKECSRDEFQLAQKVERFRVVTFGP

Query:  FSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFETNSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHR
        FSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFETNSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHR
Subjt:  FSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFETNSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHR

Query:  EVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNI
        EVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNI
Subjt:  EVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNI

Query:  YGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANES
        YGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANES
Subjt:  YGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANES

Query:  FTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVR
        FTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVR
Subjt:  FTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVR

Query:  IRDCNQFESVTPP
        IRDCNQFESVTPP
Subjt:  IRDCNQFESVTPP

XP_022935260.1 uncharacterized protein LOC111442198 [Cucurbita moschata]6.4e-20873.75Show/hide
Query:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
        +G N   T NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLF YLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTW+E
Subjt:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE

Query:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------
        RRHYCELWWNKN+TRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC                                                    
Subjt:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------

Query:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS
                                                                         +IGVPLTKELGFHQVDIRG+IYGILAAHPVAPLVS
Subjt:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS

Query:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
        LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
Subjt:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE

Query:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
        RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNEC QPDYASA+SVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIR CNQFESVTPP
Subjt:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP

XP_022983991.1 uncharacterized protein LOC111482442 [Cucurbita maxima]1.7e-20573.15Show/hide
Query:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
        +G N   T NSLKAFKFMVFRPADVFALFVRT LVIS+VASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
Subjt:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE

Query:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------
        RRHYCELWW KNITRGFVWLEEKPEF W QSSPPYRISDDTSQFNYTC                                                    
Subjt:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------

Query:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS
                                                                         EIGVPLTKELGFHQVDIRGNIYG+LAAHPVAPLVS
Subjt:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS

Query:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
        LHHLDYL+AIFPAMTRPDSIKKLHTAYKTDP RALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
Subjt:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE

Query:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
        RPILYFLDTAERFGGRRWRTLT YRKYVENNT ECKQPDYASA+SVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
Subjt:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP

XP_023526502.1 uncharacterized protein LOC111789987 [Cucurbita pepo subsp. pepo]2.9e-20873.95Show/hide
Query:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
        +G N   T NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGC+GALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
Subjt:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE

Query:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------
        RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC                                                    
Subjt:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------

Query:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS
                                                                         +IGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS
Subjt:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS

Query:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
        LHHLDYLEAIFP MT PDSIKKLHTAYKTDPGRALQHSFCYDMA NWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
Subjt:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE

Query:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
        RPILYFLDTAERFGGRRWRTLT YRKYVENNTNECKQPDYASA+SVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
Subjt:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP

TrEMBL top hitse value%identityAlignment
A0A5D3DN33 Uncharacterized protein7.7e-15957.35Show/hide
Query:  NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWW
        NSLK FKF V RP D+F+  +R  LV+ +VASFSLFFYLT  D+ P C GCY A R SNHR+VKAF AGEQPTNISHLVFGIGGSVKTW+ERRHYCELWW
Subjt:  NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWW

Query:  NKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC-------------------------------------------------------------
         KN+TRGFVWLEEKPE+ WP+SSPPYRIS DTS+FNYTC                                                             
Subjt:  NKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC-------------------------------------------------------------

Query:  -------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEA
                                                                EIGVPLTKELGFHQ+DIRGN YGILAAHP+APLVSLHHLDY+++
Subjt:  -------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEA

Query:  IFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERPILYFLDT
        IFPAMT+PDS+KKL+ AY+TDP RALQHSFCYD  RNWSVSVSWGYSVQLYPWL TAKE++T FLT+QTWKT +NE FTFDT+PVSS+PC+RPILYFL++
Subjt:  IFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERPILYFLDT

Query:  AERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
         ER G R+WRTLT Y++Y E     C +PDYA A++VE FNVSAPEFDRRLW QAPRRQCC+VVHD N+++G ++V IRDC+  ESVTPP
Subjt:  AERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP

A0A6J1F4P4 uncharacterized protein LOC1114421385.1e-18771.43Show/hide
Query:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
        MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
Subjt:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE

Query:  KPNATWPATSPPYRVSADTSEFSYTCWYGS----------------------------------------------------------------------
        KPNATWPATSPPYRVSADTSEFSYTCWYGS                                                                      
Subjt:  KPNATWPATSPPYRVSADTSEFSYTCWYGS----------------------------------------------------------------------

Query:  ------------------------------------------------------------RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA
                                                                    RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA
Subjt:  ------------------------------------------------------------RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA

Query:  LKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR
        LKTAYNLDPGRTLQQSFCY PARNWSISVSWGYTVQLYPWLATPKD+EKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR
Subjt:  LKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR

Query:  QLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFET
        QLDVWEKEC RDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVW RGCNPFET
Subjt:  QLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFET

A0A6J1FA50 uncharacterized protein LOC1114421983.1e-20873.75Show/hide
Query:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
        +G N   T NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLF YLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTW+E
Subjt:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE

Query:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------
        RRHYCELWWNKN+TRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC                                                    
Subjt:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------

Query:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS
                                                                         +IGVPLTKELGFHQVDIRG+IYGILAAHPVAPLVS
Subjt:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS

Query:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
        LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
Subjt:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE

Query:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
        RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNEC QPDYASA+SVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIR CNQFESVTPP
Subjt:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP

A0A6J1J3X9 uncharacterized protein LOC1114824428.4e-20673.15Show/hide
Query:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
        +G N   T NSLKAFKFMVFRPADVFALFVRT LVIS+VASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE
Subjt:  RGCNPFET-NSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDE

Query:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------
        RRHYCELWW KNITRGFVWLEEKPEF W QSSPPYRISDDTSQFNYTC                                                    
Subjt:  RRHYCELWWNKNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------

Query:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS
                                                                         EIGVPLTKELGFHQVDIRGNIYG+LAAHPVAPLVS
Subjt:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS

Query:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
        LHHLDYL+AIFPAMTRPDSIKKLHTAYKTDP RALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
Subjt:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE

Query:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
        RPILYFLDTAERFGGRRWRTLT YRKYVENNT ECKQPDYASA+SVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP
Subjt:  RPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP

A0A6J1J6Y4 uncharacterized protein LOC1114819201.4e-18168.87Show/hide
Query:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
        MK++SQNS+KS KFFKISLVLCSIAFLSLLFSFNQKPPNCT+CHRRKIIAVESSQSQPPTN+SHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE
Subjt:  MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDE

Query:  KPNATWPATSPPYRVSADTSEFSYTCWYGS----------------------------------------------------------------------
        KPNATWPATSPPYRVSADTSEFSYTCWYGS                                                                      
Subjt:  KPNATWPATSPPYRVSADTSEFSYTCWYGS----------------------------------------------------------------------

Query:  ------------------------------------------------------------RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA
                                                                    RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMI+IDALKA
Subjt:  ------------------------------------------------------------RGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKA

Query:  LKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR
        LKTAYNLDPGRTLQQSFCY PARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPC+MPILFFLDL ESPNRTVT+YKR
Subjt:  LKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKSFQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKR

Query:  QLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFET
        QLDVWEKECS+DEFQL QKVERFRVVT GPFS+SHWIKAPRRQCCQVVN TSMDNVVNVWMRGCNPFET
Subjt:  QLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVWMRGCNPFET

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37730.1 Protein of unknown function (DUF604)1.4e-9938.08Show/hide
Query:  FKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRI------------SNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERR
        F     +P+ V +L  + F  I I  S ++  +  ++     CH   G  R+            ++   ++      + T+ISH+ FGIGGS++TW +R 
Subjt:  FKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRI------------SNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERR

Query:  HYCELWWNKNITRGFVWLEEKP--EFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------
         Y ELWW  N+TRGF+WL+E+P     W  +SPPY++S DTS+F+YTC                                                    
Subjt:  HYCELWWNKNITRGFVWLEEKP--EFPWPQSSPPYRISDDTSQFNYTC----------------------------------------------------

Query:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS
                                                                         EIGVPLTKELGFHQVDIRGN YG+LAAHPVAPLV+
Subjt:  ----------------------------------------------------------------CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVS

Query:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE
        LHHLDY++ IFP  T+ D++++L +AYKTDP R +QHSFC+D  RNW VSVSWGY++Q+YP L TAKEL+TPFLTF++W+T ++E F+FDTRP+S +PCE
Subjt:  LHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCE

Query:  RPILYFLDTAERFGGRRWRTLTRYRKYVE-NNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTP
        RP++YFLD     G    +TLT YRK+VE   + +C  PDY+ A  VE+ +VS       LW+ APRRQCC++V+       ++ V+IR  N  ESVTP
Subjt:  RPILYFLDTAERFGGRRWRTLTRYRKYVE-NNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTP

AT3G11420.1 Protein of unknown function (DUF604)2.7e-6331.66Show/hide
Query:  ETNSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHG-CYGALRISNHREVKAF--GAGEQPTNISHLVFGIGGSVKTWDERRHY
        E   L +F FM  RP D   LF R  ++  ++ S SL    T      R +   YG    +  ++  A    A   PTNISH+ F I G+ +TW +R  Y
Subjt:  ETNSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHG-CYGALRISNHREVKAF--GAGEQPTNISHLVFGIGGSVKTWDERRHY

Query:  CELWWNKNITRGFVWLEEKPEFPWPQS----SPPYRISD--------------------------------------------------------DTSQF
          LWW +N TRGFVWL+E  + P   S    S P R+SD                                                        D  Q 
Subjt:  CELWWNKNITRGFVWLEEKPEFPWPQS----SPPYRISD--------------------------------------------------------DTSQF

Query:  NY---------------------------------------------------------TC-CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLH
         Y                                                         +C  EIGVP T+E GFHQ+DIRG+ YG LAAHP+APLVSLH
Subjt:  NY---------------------------------------------------------TC-CEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLH

Query:  HLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERP
        HL YL+ +FP     +S++ L   Y  DP R LQ   C+D  R WS+S+SWGY++Q+Y +  TA EL TP  TF+TW++ ++  F F+TRP+  +PCERP
Subjt:  HLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERP

Query:  ILYFLDTAERF---GGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHD--NNTINGLMQVRIRDCNQFESV
        + YF+D AE     G + W ++       + N   C + ++     V+   V++ + D   W +APRRQCC+V+           M +RIR C   E +
Subjt:  ILYFLDTAERF---GGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHD--NNTINGLMQVRIRDCNQFESV

AT4G23490.1 Protein of unknown function (DUF604)7.9e-4740.34Show/hide
Query:  QSSPPYRISDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMAR
        Q  P    SDD  Q      E+GVPLTKELGFHQ D+ GN++G+LAAHPV P VS+HHLD +E IFP MTR  ++KK+    K D    LQ S CYD  +
Subjt:  QSSPPYRISDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMAR

Query:  NWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEAN-ESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKY-VENNTNECKQPDYASA
        +W++SVSWGY+VQ++  + + +E++ P  TF  W   A+  ++ F+TRPVS NPC++P ++++ ++ +F  +   T++ Y  + V + +   K  + A  
Subjt:  NWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEAN-ESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKY-VENNTNECKQPDYASA

Query:  VSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHD--NNTI
         ++  +    P     LW ++PRR CC V+    NNT+
Subjt:  VSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHD--NNTI

AT4G23490.1 Protein of unknown function (DUF604)1.5e-1035.44Show/hide
Query:  TNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDEKPNATWPATS-----PPYRVSADTSEFSYTCWYGSRGS
        T+++H++FGIA S+K WK+R+ Y ++W+ P   RG+VW+D++   +          PP ++S  T+ F YT   G R +
Subjt:  TNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDEKPNATWPATS-----PPYRVSADTSEFSYTCWYGSRGS

AT5G12460.1 Protein of unknown function (DUF604)3.4e-5030.6Show/hide
Query:  EQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPE---FPWPQSSPPYRIS-------------------------------------
        E PTNISHL F I GS KTW  RR Y E WW  NIT+G+V+LE  P     PWPQ SPP+ ++                                     
Subjt:  EQPTNISHLVFGIGGSVKTWDERRHYCELWWNKNITRGFVWLEEKPE---FPWPQSSPPYRIS-------------------------------------

Query:  DDTSQF------------------------------------------------------------------------NYTC-CEIGVPLTKELGFHQVD
        DDT  F                                                                        ++ C  ++G+ LT E G HQ D
Subjt:  DDTSQF------------------------------------------------------------------------NYTC-CEIGVPLTKELGFHQVD

Query:  IRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKT
        + G+I G+L+AHP +PL+SLHH D ++ IFP M R  S+  L    KTD  R LQ + CY    NWSVSVSWGYSV +Y  +     L  P  TF+ WK 
Subjt:  IRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKT

Query:  EANESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTING
            ++ F+TR V+++PCE P  +F D+      +   T T Y+  +E     C      S+ ++    V A      + +     +CCDV + N+T   
Subjt:  EANESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSVEYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTING

Query:  LMQVRIRDCNQFESV
        +++V+IRDC+  E++
Subjt:  LMQVRIRDCNQFESV

AT5G41460.1 Protein of unknown function (DUF604)9.3e-4840.82Show/hide
Query:  SDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSW
        SDD  Q      E+GVPLTKELGFHQ D+ GN++G+LAAHPVAPLV+LHHLD +E IFP MTR D++K L    K D    +Q S CYD  R W+VSVSW
Subjt:  SDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQHSFCYDMARNWSVSVSW

Query:  GYSVQLYPWLATAKELDTPFLTFQTWKTEAN-ESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKY-VENNTNECKQPDYASAVSVEYFNV
        G++VQ++  + +A+E++ P  TF  W   A+  ++ F+TRPVS +PC++P ++++ T+ R       T++RY  + V +     K  + +   +V  +  
Subjt:  GYSVQLYPWLATAKELDTPFLTFQTWKTEAN-ESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKY-VENNTNECKQPDYASAVSVEYFNV

Query:  SAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESV
          P     LW ++PRR CC V    +  N  +++ +  C + E V
Subjt:  SAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESV

AT5G41460.1 Protein of unknown function (DUF604)9.1e-1133.65Show/hide
Query:  HRRKIIAVESSQSQPP----------TNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDEKP----NATWPATSPPYRVSADTSEFSYTCWYG
        H  +   ++S  S PP          T   H++FGIA S + WK+R+ Y ++W+ PN  R +VW+ EKP    +     + PP ++S DTS+F Y    G
Subjt:  HRRKIIAVESSQSQPP----------TNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDEKP----NATWPATSPPYRVSADTSEFSYTCWYG

Query:  SRGS
         R +
Subjt:  SRGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTTCGGAGTCAGAATTCAGTAAAATCTGGTAAATTCTTCAAAATCTCACTTGTTTTGTGCTCCATAGCATTCCTTTCCCTCCTCTTTTCCTTCAACCAGAAACC
ACCCAACTGTACCAACTGCCACCGCCGGAAAATAATCGCCGTCGAGTCATCGCAATCGCAACCACCGACGAACGTATCCCACCTTCTATTTGGCATAGCGGGGTCCACCA
AGACATGGAAAAAGCGTCAAAGCTACTGCGAGCTCTGGTGGATGCCTAACGTCACCCGTGGGTTCGTCTGGGTGGACGAGAAGCCCAACGCCACGTGGCCAGCCACGTCT
CCTCCGTATAGAGTTTCAGCGGACACGTCGGAATTCAGCTACACCTGCTGGTATGGGTCCAGAGGAAGCCAATATGGGTTATTGGCAGCGCACCCAGTGGCTCCATTGGT
ATCACTCCACCACGTGGACTACCTACCACCCATTTTTCCCACCATGATCCAAATTGACGCCTTAAAGGCTTTAAAAACAGCCTACAACCTCGACCCTGGTCGGACCCTTC
AGCAAAGCTTCTGTTACCACCCGGCACGTAATTGGTCCATTTCGGTCTCATGGGGCTACACCGTCCAGCTCTACCCATGGCTCGCCACGCCCAAAGACATGGAGAAGAGT
TTTCAGACTTTTGAGACGTGGAAGAGCTGGAGCGACGGCCCATTCACTTTCAACACCCGCCCGGTCCAGTCGGACCCTTGCCAGATGCCGATTCTGTTCTTCTTGGACCT
GGCGGAATCCCCGAACCGGACGGTAACCAGTTACAAGAGGCAATTGGACGTTTGGGAAAAAGAGTGTAGCAGAGACGAATTCCAATTAGCCCAGAAAGTGGAGCGTTTCC
GCGTGGTGACTTTCGGACCCTTCAGCGCTTCCCATTGGATTAAGGCCCCACGTAGACAATGTTGCCAAGTGGTGAATGGTACGAGTATGGATAATGTGGTGAATGTATGG
ATGAGAGGCTGCAATCCCTTTGAGACGAATTCATTAAAAGCTTTCAAATTTATGGTGTTCAGGCCAGCTGATGTCTTCGCTCTTTTCGTAAGAACCTTTCTTGTCATCTC
CATCGTAGCCTCTTTTTCTCTCTTCTTTTACCTCACATTATACGACAAAATCCCCCGTTGTCACGGATGTTACGGTGCACTCCGAATCTCAAACCACCGGGAAGTGAAGG
CTTTCGGTGCCGGAGAACAACCGACGAACATTTCCCATCTTGTGTTCGGCATTGGTGGCTCTGTCAAGACGTGGGACGAGCGACGCCATTACTGCGAGCTGTGGTGGAAC
AAGAATATCACTCGTGGGTTTGTTTGGCTTGAAGAGAAGCCTGAATTTCCCTGGCCACAATCCTCTCCGCCCTACCGTATCTCCGACGACACCTCCCAATTCAACTACAC
TTGCTGCGAGATCGGCGTTCCGTTGACCAAAGAACTTGGATTCCACCAGGTGGATATTCGAGGAAACATATATGGTATATTAGCTGCACATCCAGTGGCGCCCTTGGTCT
CGCTCCACCACCTGGATTACCTGGAGGCCATATTTCCAGCCATGACCCGACCCGACTCGATCAAGAAGCTCCACACTGCTTACAAAACGGACCCAGGTCGAGCCCTTCAG
CACAGCTTCTGTTACGACATGGCTCGTAACTGGTCCGTTTCGGTGTCGTGGGGTTACAGTGTTCAGTTATATCCATGGCTGGCCACAGCCAAGGAGCTGGACACGCCCTT
TCTCACGTTCCAAACATGGAAGACAGAAGCCAATGAGTCCTTCACTTTCGATACCCGACCCGTAAGTTCCAACCCGTGTGAAAGGCCCATTCTTTATTTCTTGGATACGG
CGGAGAGATTTGGCGGCCGCAGGTGGCGGACGTTGACCAGGTACCGGAAATACGTGGAGAATAATACTAATGAGTGTAAGCAGCCGGATTACGCCTCTGCAGTGTCGGTT
GAGTATTTCAACGTGTCGGCGCCGGAATTCGACCGCCGTCTGTGGAGGCAGGCACCACGAAGACAGTGCTGTGATGTCGTCCATGATAACAACACAATAAATGGATTGAT
GCAAGTTCGTATTAGAGACTGCAATCAATTTGAAAGTGTGACGCCGCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTTCGGAGTCAGAATTCAGTAAAATCTGGTAAATTCTTCAAAATCTCACTTGTTTTGTGCTCCATAGCATTCCTTTCCCTCCTCTTTTCCTTCAACCAGAAACC
ACCCAACTGTACCAACTGCCACCGCCGGAAAATAATCGCCGTCGAGTCATCGCAATCGCAACCACCGACGAACGTATCCCACCTTCTATTTGGCATAGCGGGGTCCACCA
AGACATGGAAAAAGCGTCAAAGCTACTGCGAGCTCTGGTGGATGCCTAACGTCACCCGTGGGTTCGTCTGGGTGGACGAGAAGCCCAACGCCACGTGGCCAGCCACGTCT
CCTCCGTATAGAGTTTCAGCGGACACGTCGGAATTCAGCTACACCTGCTGGTATGGGTCCAGAGGAAGCCAATATGGGTTATTGGCAGCGCACCCAGTGGCTCCATTGGT
ATCACTCCACCACGTGGACTACCTACCACCCATTTTTCCCACCATGATCCAAATTGACGCCTTAAAGGCTTTAAAAACAGCCTACAACCTCGACCCTGGTCGGACCCTTC
AGCAAAGCTTCTGTTACCACCCGGCACGTAATTGGTCCATTTCGGTCTCATGGGGCTACACCGTCCAGCTCTACCCATGGCTCGCCACGCCCAAAGACATGGAGAAGAGT
TTTCAGACTTTTGAGACGTGGAAGAGCTGGAGCGACGGCCCATTCACTTTCAACACCCGCCCGGTCCAGTCGGACCCTTGCCAGATGCCGATTCTGTTCTTCTTGGACCT
GGCGGAATCCCCGAACCGGACGGTAACCAGTTACAAGAGGCAATTGGACGTTTGGGAAAAAGAGTGTAGCAGAGACGAATTCCAATTAGCCCAGAAAGTGGAGCGTTTCC
GCGTGGTGACTTTCGGACCCTTCAGCGCTTCCCATTGGATTAAGGCCCCACGTAGACAATGTTGCCAAGTGGTGAATGGTACGAGTATGGATAATGTGGTGAATGTATGG
ATGAGAGGCTGCAATCCCTTTGAGACGAATTCATTAAAAGCTTTCAAATTTATGGTGTTCAGGCCAGCTGATGTCTTCGCTCTTTTCGTAAGAACCTTTCTTGTCATCTC
CATCGTAGCCTCTTTTTCTCTCTTCTTTTACCTCACATTATACGACAAAATCCCCCGTTGTCACGGATGTTACGGTGCACTCCGAATCTCAAACCACCGGGAAGTGAAGG
CTTTCGGTGCCGGAGAACAACCGACGAACATTTCCCATCTTGTGTTCGGCATTGGTGGCTCTGTCAAGACGTGGGACGAGCGACGCCATTACTGCGAGCTGTGGTGGAAC
AAGAATATCACTCGTGGGTTTGTTTGGCTTGAAGAGAAGCCTGAATTTCCCTGGCCACAATCCTCTCCGCCCTACCGTATCTCCGACGACACCTCCCAATTCAACTACAC
TTGCTGCGAGATCGGCGTTCCGTTGACCAAAGAACTTGGATTCCACCAGGTGGATATTCGAGGAAACATATATGGTATATTAGCTGCACATCCAGTGGCGCCCTTGGTCT
CGCTCCACCACCTGGATTACCTGGAGGCCATATTTCCAGCCATGACCCGACCCGACTCGATCAAGAAGCTCCACACTGCTTACAAAACGGACCCAGGTCGAGCCCTTCAG
CACAGCTTCTGTTACGACATGGCTCGTAACTGGTCCGTTTCGGTGTCGTGGGGTTACAGTGTTCAGTTATATCCATGGCTGGCCACAGCCAAGGAGCTGGACACGCCCTT
TCTCACGTTCCAAACATGGAAGACAGAAGCCAATGAGTCCTTCACTTTCGATACCCGACCCGTAAGTTCCAACCCGTGTGAAAGGCCCATTCTTTATTTCTTGGATACGG
CGGAGAGATTTGGCGGCCGCAGGTGGCGGACGTTGACCAGGTACCGGAAATACGTGGAGAATAATACTAATGAGTGTAAGCAGCCGGATTACGCCTCTGCAGTGTCGGTT
GAGTATTTCAACGTGTCGGCGCCGGAATTCGACCGCCGTCTGTGGAGGCAGGCACCACGAAGACAGTGCTGTGATGTCGTCCATGATAACAACACAATAAATGGATTGAT
GCAAGTTCGTATTAGAGACTGCAATCAATTTGAAAGTGTGACGCCGCCATAA
Protein sequenceShow/hide protein sequence
MKVRSQNSVKSGKFFKISLVLCSIAFLSLLFSFNQKPPNCTNCHRRKIIAVESSQSQPPTNVSHLLFGIAGSTKTWKKRQSYCELWWMPNVTRGFVWVDEKPNATWPATS
PPYRVSADTSEFSYTCWYGSRGSQYGLLAAHPVAPLVSLHHVDYLPPIFPTMIQIDALKALKTAYNLDPGRTLQQSFCYHPARNWSISVSWGYTVQLYPWLATPKDMEKS
FQTFETWKSWSDGPFTFNTRPVQSDPCQMPILFFLDLAESPNRTVTSYKRQLDVWEKECSRDEFQLAQKVERFRVVTFGPFSASHWIKAPRRQCCQVVNGTSMDNVVNVW
MRGCNPFETNSLKAFKFMVFRPADVFALFVRTFLVISIVASFSLFFYLTLYDKIPRCHGCYGALRISNHREVKAFGAGEQPTNISHLVFGIGGSVKTWDERRHYCELWWN
KNITRGFVWLEEKPEFPWPQSSPPYRISDDTSQFNYTCCEIGVPLTKELGFHQVDIRGNIYGILAAHPVAPLVSLHHLDYLEAIFPAMTRPDSIKKLHTAYKTDPGRALQ
HSFCYDMARNWSVSVSWGYSVQLYPWLATAKELDTPFLTFQTWKTEANESFTFDTRPVSSNPCERPILYFLDTAERFGGRRWRTLTRYRKYVENNTNECKQPDYASAVSV
EYFNVSAPEFDRRLWRQAPRRQCCDVVHDNNTINGLMQVRIRDCNQFESVTPP