; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g25740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g25740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:18526359..18527573
RNA-Seq ExpressionMoc08g25740
SyntenyMoc08g25740
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]3.5e-12679.28Show/hide
Query:  MRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNP
        MRT+M +ME+MY+EM+ AAGA SRSENRV R  + EQRG H+GPV++ HPE  E E +T Q GDLREHLNRKR SSLRKGQSPS SHR+SNQQAES  NP
Subjt:  MRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNP

Query:  ATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFL
         TP GVITREEFDQL+ + DAQVEALKAKCE+KE S +DGDLGESPFTSD+LE  IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR F 
Subjt:  ATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFL

Query:  IALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEA
        IALTGS RLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTA HL TIRQKEGETLREYVTRFQEEQLKVAHCSD SA+CYFLT LADE LTVKL EEA
Subjt:  IALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEA

Query:  LAIF
         A F
Subjt:  LAIF

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]2.6e-11392.31Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAES  NPATPAGVITREEFDQLRG+LDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLE PIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAF IALT S RLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TA HLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAICYFLTGLADEALTVKLGEEALAIFA
         HCSDDSA+CYFLTGLADEA TVKLGEEA A FA
Subjt:  AHCSDDSAICYFLTGLADEALTVKLGEEALAIFA

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]3.6e-10780.31Show/hide
Query:  DNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDV
        + E E +T Q GDLREHLNRKR SSLRKGQSPS SHR+SNQQAES  NP TP  VITREEFDQL+ + DAQVEALKA CE+KE S +DGDLGE PFT D+
Subjt:  DNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDV

Query:  LEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQ
        LE PI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAF IA TGS RLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTA HLATIRQ
Subjt:  LEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEALAIFA
        K+GETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLAD+ LTVKLGEEA A FA
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEALAIFA

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]2.8e-11270.45Show/hide
Query:  MEAMRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESF
        MEAMRT+M +MEEMYN+M+  AGA SRS ++V    + EQ   H  PV+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQAES 
Subjt:  MEAMRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESF

Query:  RNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
         NP  P GVITREEF+QL+ + DAQVEALK +CE+KE + +DGDLGESPFTSD+LE  IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  RNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLG
        AF IALTGS RLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT  HLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS++CYFLTGLADE  TVKLG
Subjt:  AFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLG

Query:  EEALAIFA
        EEALA FA
Subjt:  EEALAIFA

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]7.9e-13969.06Show/hide
Query:  MVQPSNSTNTADRRTLAASDAHQKEVEAAVVEGQGHDGLAAEPLRRSARITTPVLPPAHPPKTSKATRGGGGTSKKGARGPASAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQ+EV A VVEGQ H+GL  EP  RSARITTP L PAH PK  KA RG GG S++   G A AP+ EN DALQ+EMEAM
Subjt:  MVQPSNSTNTADRRTLAASDAHQKEVEAAVVEGQGHDGLAAEPLRRSARITTPVLPPAHPPKTSKATRGGGGTSKKGARGPASAPTSENLDALQREMEAM

Query:  RTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPA
        RT+ML+MEEMYNEM+ A GAGSRSE+R  R                             + GDLR+HL+RKR SSLRKG+SPS SH++SNQQAES  NP 
Subjt:  RTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLI
         P GVITREEFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LE  IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAF I
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLI

Query:  ALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAL
        ALT S RLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTA HLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEA 
Subjt:  ALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAL

Query:  AIFA
        A FA
Subjt:  AIFA

TrEMBL top hitse value%identityAlignment
A0A6J1DDS5 uncharacterized protein LOC1110198421.2e-11392.31Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAES  NPATPAGVITREEFDQLRG+LDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLE PIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAF IALT S RLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TA HLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAICYFLTGLADEALTVKLGEEALAIFA
         HCSDDSA+CYFLTGLADEA TVKLGEEA A FA
Subjt:  AHCSDDSAICYFLTGLADEALTVKLGEEALAIFA

A0A6J1DDW5 uncharacterized protein LOC1110196341.7e-12679.28Show/hide
Query:  MRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNP
        MRT+M +ME+MY+EM+ AAGA SRSENRV R  + EQRG H+GPV++ HPE  E E +T Q GDLREHLNRKR SSLRKGQSPS SHR+SNQQAES  NP
Subjt:  MRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNP

Query:  ATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFL
         TP GVITREEFDQL+ + DAQVEALKAKCE+KE S +DGDLGESPFTSD+LE  IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR F 
Subjt:  ATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFL

Query:  IALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEA
        IALTGS RLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTA HL TIRQKEGETLREYVTRFQEEQLKVAHCSD SA+CYFLT LADE LTVKL EEA
Subjt:  IALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEA

Query:  LAIF
         A F
Subjt:  LAIF

A0A6J1DM55 uncharacterized protein LOC1110222671.7e-10780.31Show/hide
Query:  DNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDV
        + E E +T Q GDLREHLNRKR SSLRKGQSPS SHR+SNQQAES  NP TP  VITREEFDQL+ + DAQVEALKA CE+KE S +DGDLGE PFT D+
Subjt:  DNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDV

Query:  LEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQ
        LE PI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAF IA TGS RLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTA HLATIRQ
Subjt:  LEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEALAIFA
        K+GETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLAD+ LTVKLGEEA A FA
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEALAIFA

A0A6J1DPN4 uncharacterized protein LOC1110230601.4e-11270.45Show/hide
Query:  MEAMRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESF
        MEAMRT+M +MEEMYN+M+  AGA SRS ++V    + EQ   H  PV+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQAES 
Subjt:  MEAMRTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESF

Query:  RNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
         NP  P GVITREEF+QL+ + DAQVEALK +CE+KE + +DGDLGESPFTSD+LE  IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  RNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLG
        AF IALTGS RLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT  HLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS++CYFLTGLADE  TVKLG
Subjt:  AFLIALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLG

Query:  EEALAIFA
        EEALA FA
Subjt:  EEALAIFA

A0A6J1DZJ1 uncharacterized protein LOC1110257383.8e-13969.06Show/hide
Query:  MVQPSNSTNTADRRTLAASDAHQKEVEAAVVEGQGHDGLAAEPLRRSARITTPVLPPAHPPKTSKATRGGGGTSKKGARGPASAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQ+EV A VVEGQ H+GL  EP  RSARITTP L PAH PK  KA RG GG S++   G A AP+ EN DALQ+EMEAM
Subjt:  MVQPSNSTNTADRRTLAASDAHQKEVEAAVVEGQGHDGLAAEPLRRSARITTPVLPPAHPPKTSKATRGGGGTSKKGARGPASAPTSENLDALQREMEAM

Query:  RTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPA
        RT+ML+MEEMYNEM+ A GAGSRSE+R  R                             + GDLR+HL+RKR SSLRKG+SPS SH++SNQQAES  NP 
Subjt:  RTKMLSMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPA

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLI
         P GVITREEFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LE  IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAF I
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLI

Query:  ALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAL
        ALT S RLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTA HLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEA 
Subjt:  ALTGSVRLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEAL

Query:  AIFA
        A FA
Subjt:  AIFA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCTCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCCGCCAGCGATGCCCACCAGAAGGAGGTCGAAGCAGCAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCAGAACCCCTCCGCAGGTCGGCACGAATCACCACGCCTGTTCTACCACCTGCGCACCCCCCAAAGACATCCAAGGCCACCCGTGGCGGA
GGTGGAACCTCTAAGAAGGGCGCCCGGGGTCCAGCCTCGGCCCCGACAAGTGAGAACTTGGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACAAAAATGCTG
TCCATGGAGGAAATGTATAACGAAATGATATTAGCTGCAGGCGCAGGGTCTCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGAGAGCAAAGGGGTTCCCAC
GTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGCGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCT
CTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTTTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAG
TTCGACCAGCTGAGGGGCCAACTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCC
TTCACCTCGGACGTTTTGGAAGAACCGATCCCTCCGAAGTTCAAGGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTT
GAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCTGATCGCGCTTACTGGCAGCGTGCGATTGTGGTATCGGAGACTGCCAGCC
AGGTCGATCTCGACCTATTCTCAACTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGGCCCATCTCGCCACCATCAGACAG
AAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATTTGCTATTTTCTCACCGGT
CTAGCCGACGAAGCCCTCACAGTGAAACTTGGAGAGGAGGCCCTGGCCATCTTCGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCTCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCCGCCAGCGATGCCCACCAGAAGGAGGTCGAAGCAGCAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCAGAACCCCTCCGCAGGTCGGCACGAATCACCACGCCTGTTCTACCACCTGCGCACCCCCCAAAGACATCCAAGGCCACCCGTGGCGGA
GGTGGAACCTCTAAGAAGGGCGCCCGGGGTCCAGCCTCGGCCCCGACAAGTGAGAACTTGGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACAAAAATGCTG
TCCATGGAGGAAATGTATAACGAAATGATATTAGCTGCAGGCGCAGGGTCTCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGAGAGCAAAGGGGTTCCCAC
GTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGCGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCT
CTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTTTCGCAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAG
TTCGACCAGCTGAGGGGCCAACTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCC
TTCACCTCGGACGTTTTGGAAGAACCGATCCCTCCGAAGTTCAAGGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTT
GAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCTGATCGCGCTTACTGGCAGCGTGCGATTGTGGTATCGGAGACTGCCAGCC
AGGTCGATCTCGACCTATTCTCAACTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGGCCCATCTCGCCACCATCAGACAG
AAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATTTGCTATTTTCTCACCGGT
CTAGCCGACGAAGCCCTCACAGTGAAACTTGGAGAGGAGGCCCTGGCCATCTTCGCCTAG
Protein sequenceShow/hide protein sequence
MVQPSNSTNTADRRTLAASDAHQKEVEAAVVEGQGHDGLAAEPLRRSARITTPVLPPAHPPKTSKATRGGGGTSKKGARGPASAPTSENLDALQREMEAMRTKML
SMEEMYNEMILAAGAGSRSENRVTRVGIREQRGSHVGPVEEEHPEDNESEGHTRQSGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESFRNPATPAGVITREE
FDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEEPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFLIALTGSVRLWYRRLPA
RSISTYSQLRREFLAQFSSRHYDKKTAAHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAICYFLTGLADEALTVKLGEEALAIFA