; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g17480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g17480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:11893612..11894898
RNA-Seq ExpressionMoc01g17480
SyntenyMoc01g17480
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]2.6e-14081.16Show/hide
Query:  MRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNP
        MRT+M +ME+MY+EM+ AA A SRSENRV R  + EQRG HLGPV++ HPE  E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS NP
Subjt:  MRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNP

Query:  ATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFR
         TP GVITREEFDQL+ + DAQVEAL AKCE+KE S +DG+LGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR F+
Subjt:  ATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFR

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQK+GETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEA

Query:  QATFVEVLQKAKKVIDGQELLQTKTSRPE
         ATFVEVLQKAKK+IDGQELL+TKT RPE
Subjt:  QATFVEVLQKAKKVIDGQELLQTKTSRPE

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]6.2e-12692.25Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAESS NPATPAGVITREEFDQLRG+LDAQVEAL AKCEQKEGSLNDG+LGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCR F+IALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQK+GETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE
         HCSDDSAMCYFLTGLADE  TVKLGEEA ATF EVLQKAKKVIDGQELL+TKT RPE
Subjt:  AHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]7.1e-12282.73Show/hide
Query:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDV
        + E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS NP TP  VITREEFDQL+ + DAQVEAL A CE+KE S +DG+LGE PFT D+
Subjt:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ
        LEAPI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCR F+IA TGSARLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ

Query:  KKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE
        KKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+ LTVKLGEEA ATF EVLQKAKKVIDGQELL+TKT RPE
Subjt:  KKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]9.3e-12270.78Show/hide
Query:  MEAMRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRT+M++MEEMYN+M+  A A SRS ++V    + EQ   H  PV+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  RNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
         NP  P GVITREEF+QL+ + DAQVEAL  +CE+KE + +DG+LGESPFTSD+LEA IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  RNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  VFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLG
         F+IALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQK+G+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADE  TVKLG
Subjt:  VFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLG

Query:  EEAQATFVEVLQKAKKVIDGQELLQTKTSRPE
        EEA ATF EVLQ  KK IDGQELL+TKT RPE
Subjt:  EEAQATFVEVLQKAKKVIDGQELLQTKTSRPE

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]6.6e-15270.09Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTAKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG +++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTAKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA
        RT+M +MEEMYNEM+ A  AGSRSE+R  R                             +RGDLR+HL+RKR SSLRKG+SPS SH++SNQQAESS NP 
Subjt:  RTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRI
         P GVITREEFDQL+ + DAQVE L A+CE K  + +DG+LGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK R F+I
Subjt:  TPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQ
        ALT SARLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQK+ ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEA 
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQ

Query:  ATFVEVLQKAKKVIDGQELLQTKTSRPE
        ATF EVLQKAKKVIDGQEL +TKT R E
Subjt:  ATFVEVLQKAKKVIDGQELLQTKTSRPE

TrEMBL top hitse value%identityAlignment
A0A6J1DDS5 uncharacterized protein LOC1110198423.0e-12692.25Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAESS NPATPAGVITREEFDQLRG+LDAQVEAL AKCEQKEGSLNDG+LGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCR F+IALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQK+GETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE
         HCSDDSAMCYFLTGLADE  TVKLGEEA ATF EVLQKAKKVIDGQELL+TKT RPE
Subjt:  AHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE

A0A6J1DDW5 uncharacterized protein LOC1110196341.3e-14081.16Show/hide
Query:  MRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNP
        MRT+M +ME+MY+EM+ AA A SRSENRV R  + EQRG HLGPV++ HPE  E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS NP
Subjt:  MRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNP

Query:  ATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFR
         TP GVITREEFDQL+ + DAQVEAL AKCE+KE S +DG+LGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR F+
Subjt:  ATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFR

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQK+GETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEA

Query:  QATFVEVLQKAKKVIDGQELLQTKTSRPE
         ATFVEVLQKAKK+IDGQELL+TKT RPE
Subjt:  QATFVEVLQKAKKVIDGQELLQTKTSRPE

A0A6J1DM55 uncharacterized protein LOC1110222673.5e-12282.73Show/hide
Query:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDV
        + E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS NP TP  VITREEFDQL+ + DAQVEAL A CE+KE S +DG+LGE PFT D+
Subjt:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ
        LEAPI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCR F+IA TGSARLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ

Query:  KKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE
        KKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+ LTVKLGEEA ATF EVLQKAKKVIDGQELL+TKT RPE
Subjt:  KKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE

A0A6J1DPN4 uncharacterized protein LOC1110230604.5e-12270.78Show/hide
Query:  MEAMRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRT+M++MEEMYN+M+  A A SRS ++V    + EQ   H  PV+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  RNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
         NP  P GVITREEF+QL+ + DAQVEAL  +CE+KE + +DG+LGESPFTSD+LEA IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  RNPATPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  VFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLG
         F+IALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQK+G+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADE  TVKLG
Subjt:  VFRIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLG

Query:  EEAQATFVEVLQKAKKVIDGQELLQTKTSRPE
        EEA ATF EVLQ  KK IDGQELL+TKT RPE
Subjt:  EEAQATFVEVLQKAKKVIDGQELLQTKTSRPE

A0A6J1DZJ1 uncharacterized protein LOC1110257383.2e-15270.09Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTAKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG +++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTAKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA
        RT+M +MEEMYNEM+ A  AGSRSE+R  R                             +RGDLR+HL+RKR SSLRKG+SPS SH++SNQQAESS NP 
Subjt:  RTKMQSMEEMYNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRI
         P GVITREEFDQL+ + DAQVE L A+CE K  + +DG+LGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK R F+I
Subjt:  TPAGVITREEFDQLRGQLDAQVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRI

Query:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQ
        ALT SARLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQK+ ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEA 
Subjt:  ALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQ

Query:  ATFVEVLQKAKKVIDGQELLQTKTSRPE
        ATF EVLQKAKKVIDGQEL +TKT R E
Subjt:  ATFVEVLQKAKKVIDGQELLQTKTSRPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGTCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCG
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCAGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAAGCGCAGGGTCTCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAATAGAAAAAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAACGCCAAATGTGAGCAGAAAGAAGGTTCATTGAACGATGGCAACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGTCGCGTCTTTCGGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAACTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGAAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGTCCTCACGGTGAAACTTGGAGAGGAGGCACAGGCCACCTTCG
TCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCAAACCAAAACCAGCCGACCAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGTCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCG
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCAGTCCATGGAGGAAATG
TATAACGAAATGATATTAGCTGCAAGCGCAGGGTCTCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAATAGAAAAAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAACGCCAAATGTGAGCAGAAAGAAGGTTCATTGAACGATGGCAACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCA
AATGTCGCGTCTTTCGGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAACTGAGAAGGGAGTTCCTCGCC
CAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGAAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCA
ATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGTCCTCACGGTGAAACTTGGAGAGGAGGCACAGGCCACCTTCG
TCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCAAACCAAAACCAGCCGACCAGAATGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTAKKGARGPAPAPTSENLDALQREMEAMRTKMQSMEEM
YNEMILAASAGSRSENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDA
QVEALNAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFRIALTGSARLWYRRLPARSISTYSQLRREFLA
QFSSRHYDKKTATHLATIRQKKGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEVLTVKLGEEAQATFVEVLQKAKKVIDGQELLQTKTSRPE