CuGenDBv2

Moc04g07000 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g07000
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:4929090..4934321
RNA-Seq Expression: Moc04g07000
Synteny: Moc04g07000
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005162 - Retrotransposon gag domain
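
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be parsed and the locus span computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed locus (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chrom:start..end' string into its three components."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr4:4929090..4934321")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the locus spans 5232 bp; with half-open coordinates the figure would be one base shorter.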


Homology
GenBank top hits (hit | e-value | % identity | alignment)
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] | e-value: 6.6e-129 | identity: 79.49%
Query:  MRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-
        MRTQM +ME++Y+EM+ AAGA SRSENRV   D+ EQRG HLGP ++  PE  E E YT QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+N 
Subjt:  MRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-

Query:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+K YDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQE+QLKVAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQVLK
        PATF EVLQ  K
Subjt:  PATFAEVLQVLK

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia] | e-value: 3.2e-115 | identity: 91.7%
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGT
        +RGSSLRKGQSPSRSHRSSNQQAESSHN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVK YDG+
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGT

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQE+QLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQ  K
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia] | e-value: 2.1e-111 | identity: 81.23%
Query:  DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV
        + E E YT QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+N   P  +ITREEFDQL+ + DAQVEALKA CE+K+ S +DGDLGE PFT D+
Subjt:  DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV

Query:  LEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ
        LEAPI PKFK PT+K YDG+K+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA TGSARLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK
        K+GETLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLAD+ LTVKLGEEAPATFAEVLQ  K
Subjt:  KEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia] | e-value: 5.4e-115 | identity: 71.43%
Query:  MEAMRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRTQMR+MEE+YN+M+  AGA SRS ++V   DV EQ   H  P +EE              GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCR
        +N   P G+ITREEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IPPKFK PT+KSYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLG
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQKEG+TL+EY+TRFQE+QLKV HCSDDS+MCYFLTGLADE  TVKLG
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLG

Query:  EEAPATFAEVLQVLK
        EEA ATFAEVLQ+ K
Subjt:  EEAPATFAEVLQVLK

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] | e-value: 1.1e-144 | identity: 70.24%
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALNREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL  EP  RSARIT P L PAHP+  KA RGRGG S+++  G APAP+ ENFDAL +EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALNREMEAMR

Query:  TQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-
        TQM +MEE+YNEM+ A GAGSRSE+R                A +E             RGDLR+HL+RKR SSLRKG+SPS SH++SNQQAESS+NP  
Subjt:  TQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-

Query:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
          G+ITREEFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IP KFK PT+K YDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIA
Subjt:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LT SARLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQKE ETLREYVT FQE+QLKVAH SDDSA+CYFLT L DE LTVKLGEEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQVLK
        TFAEVLQ  K
Subjt:  TFAEVLQVLK

TrEMBL top hits (hit | e-value | % identity | alignment)
A0A6J1DDS5 uncharacterized protein LOC111019842 | e-value: 1.5e-115 | identity: 91.7%
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGT
        +RGSSLRKGQSPSRSHRSSNQQAESSHN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVK YDG+
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGT

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQE+QLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQ  K
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK

A0A6J1DDW5 uncharacterized protein LOC111019634 | e-value: 3.2e-129 | identity: 79.49%
Query:  MRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-
        MRTQM +ME++Y+EM+ AAGA SRSENRV   D+ EQRG HLGP ++  PE  E E YT QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+N 
Subjt:  MRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-

Query:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+K YDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQE+QLKVAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQVLK
        PATF EVLQ  K
Subjt:  PATFAEVLQVLK

A0A6J1DM55 uncharacterized protein LOC111022267 | e-value: 1.0e-111 | identity: 81.23%
Query:  DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV
        + E E YT QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+N   P  +ITREEFDQL+ + DAQVEALKA CE+K+ S +DGDLGE PFT D+
Subjt:  DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV

Query:  LEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ
        LEAPI PKFK PT+K YDG+K+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA TGSARLWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK
        K+GETLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLAD+ LTVKLGEEAPATFAEVLQ  K
Subjt:  KEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLK

A0A6J1DPN4 uncharacterized protein LOC111023060 | e-value: 2.6e-115 | identity: 71.43%
Query:  MEAMRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRTQMR+MEE+YN+M+  AGA SRS ++V   DV EQ   H  P +EE              GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCR
        +N   P G+ITREEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IPPKFK PT+KSYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLG
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQKEG+TL+EY+TRFQE+QLKV HCSDDS+MCYFLTGLADE  TVKLG
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLG

Query:  EEAPATFAEVLQVLK
        EEA ATFAEVLQ+ K
Subjt:  EEAPATFAEVLQVLK

A0A6J1DZJ1 uncharacterized protein LOC111025738 | e-value: 5.4e-145 | identity: 70.24%
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALNREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL  EP  RSARIT P L PAHP+  KA RGRGG S+++  G APAP+ ENFDAL +EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALNREMEAMR

Query:  TQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-
        TQM +MEE+YNEM+ A GAGSRSE+R                A +E             RGDLR+HL+RKR SSLRKG+SPS SH++SNQQAESS+NP  
Subjt:  TQMRSMEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-

Query:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
          G+ITREEFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IP KFK PT+K YDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIA
Subjt:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LT SARLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQKE ETLREYVT FQE+QLKVAH SDDSA+CYFLT L DE LTVKLGEEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQVLK
        TFAEVLQ  K
Subjt:  TFAEVLQVLK

SwissProt top hits (hit | e-value | % identity | alignment)
No hits found

Arabidopsis top hits (hit | e-value | % identity | alignment)
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGAGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAATAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAAGAAATTTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGTGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGTCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTTAAAGCTCCTACCGTGAAGTCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCA
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGAGAATATGTCACCAGATTCCAGGAGAAGCAGTTGAAGGTTGCACATTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAA
CAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGAC
CTGCCGAGGAAGGAGTTTGCTGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCATACGAGACCGACCTGGCCAGG
TCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCAGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATT
AGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAACTCGGCGCGACCTCAAAGGTAGGAGGTGCGAGGTGACATTGGCTGCAGTT
CAAGGAAAAGGCAAAGAAATGAAAAATGTTGCCGACACCAAAGTAAAATCGAACAAAGCTTCTTCATTAATGAAGAGCAGAGCCAAGGCTTATAGACCTTGCAGC
TCTGCCCTGACAAGCAGAAAACAAAAGAAAAGGGAAGAAGGACAGCGAAAAGCCTCGTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGAGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAATAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAAGAAATTTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGTGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGTCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGAC
GTTTTGGAAGCACCAATCCCTCCGAAGTTTAAAGCTCCTACCGTGAAGTCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATG
GACTTCCAAGCGGCATCAGACGCAATCAAGTGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCA
ACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAG
ACGCTGCGAGAATATGTCACCAGATTCCAGGAGAAGCAGTTGAAGGTTGCACATTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAA
CAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGAC
CTGCCGAGGAAGGAGTTTGCTGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCATACGAGACCGACCTGGCCAGG
TCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCAGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATT
AGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAACTCGGCGCGACCTCAAAGGTAGGAGGTGCGAGGTGACATTGGCTGCAGTT
CAAGGAAAAGGCAAAGAAATGAAAAATGTTGCCGACACCAAAGTAAAATCGAACAAAGCTTCTTCATTAATGAAGAGCAGAGCCAAGGCTTATAGACCTTGCAGC
TCTGCCCTGACAAGCAGAAAACAAAAGAAAAGGGAAGAAGGACAGCGAAAAGCCTCGTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKSARGPAPAPTSENFDALNREMEAMRTQMRS
MEEIYNEMMLAAGAGSRSENRVTCVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQL
RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKSYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSIS
TYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQVLKYPTPNGVGTVRGE
QTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEISAPESSWMDPIADFI
RGNSPQDPKERRKLARRATRRDLKGRRCEVTLAAVQGKGKEMKNVADTKVKSNKASSLMKSRAKAYRPCSSALTSRKQKKREEGQRKAS