; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g28170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g28170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:21141228..21142598
RNA-Seq ExpressionMoc06g28170
SyntenyMoc06g28170
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.2e-13091.82Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  L DGDLGESPFTSDVLEAPI PKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCS+DSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+RKIGRGRSGKD+E ADPKSKDKGSFS GRAEY
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]4.6e-14377.75Show/hide
Query:  MRTQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-
        MRTQM +ME MY+EMV AAG  SRSENRV   D+ EQRG HLGP ++  PE  E E YT QR DLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+N 
Subjt:  MRTQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-

Query:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S  DGDLGESPFTSD+LEA I  KFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCS+ SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFS
        PATF EVLQKAKK+IDGQELLRTKT RP++KI +GR+ KD  + D K++DKG  S
Subjt:  PATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFS

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]8.9e-13991.38Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGT
        +RGSSLRKGQSPSRSHRSSNQQAESSHN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ SL DGDLGESPFTSDVLEAPI  KFKAPTVKPYDG+
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGT

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSNDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY
         HCS+DSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+RKIGRGRSGKD+ERAD KSKDKGSFS  RA Y
Subjt:  AHCSNDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]5.1e-13497.67Show/hide
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSL DGDLGESPFTSDVLEAPI PKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCS+DSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY
        AEVLQKAKKVIDGQELLRTKTGRP+RKIGRGRSGKDVERADPKSKDKGSFS GRAEY
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]2.6e-15467.7Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITSPALPPVHPRASKAIRARGGTSKKGAWDPAPAPTSENFDALKREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+ L  EP  RSARIT+P L P HP+  KA R RGG S++     APAP+ ENFDAL++EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITSPALPPVHPRASKAIRARGGTSKKGAWDPAPAPTSENFDALKREMEAMR

Query:  TQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-
        TQM +ME MYNEMV A G GSRSE+R      R++RG                        DLR+HL+RKR SSLRKG+SPS SH++SNQQAESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-

Query:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
          G+ITREEFDQL+ + DAQVE LKA+CE K  +  DGDLGESPFTSD+LEA I  KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIA
Subjt:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPA
        LT SARLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH S+DSA+CYFLT L DE LTVKLGEEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSF
        TFAEVLQKAKKVIDGQEL RTKTGR +++I + +  ++  +A+ KSKDK  +
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSF

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.7e-13191.82Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  L DGDLGESPFTSDVLEAPI PKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCS+DSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+RKIGRGRSGKD+E ADPKSKDKGSFS GRAEY
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY

A0A6J1DDS5 uncharacterized protein LOC1110198424.3e-13991.38Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGT
        +RGSSLRKGQSPSRSHRSSNQQAESSHN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ SL DGDLGESPFTSDVLEAPI  KFKAPTVKPYDG+
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGT

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSNDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY
         HCS+DSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+RKIGRGRSGKD+ERAD KSKDKGSFS  RA Y
Subjt:  AHCSNDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY

A0A6J1DDW5 uncharacterized protein LOC1110196342.2e-14377.75Show/hide
Query:  MRTQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-
        MRTQM +ME MY+EMV AAG  SRSENRV   D+ EQRG HLGP ++  PE  E E YT QR DLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+N 
Subjt:  MRTQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHN-

Query:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S  DGDLGESPFTSD+LEA I  KFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCS+ SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFS
        PATF EVLQKAKK+IDGQELLRTKT RP++KI +GR+ KD  + D K++DKG  S
Subjt:  PATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFS

A0A6J1DS95 uncharacterized protein LOC1110234212.5e-13497.67Show/hide
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSL DGDLGESPFTSDVLEAPI PKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCS+DSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY
        AEVLQKAKKVIDGQELLRTKTGRP+RKIGRGRSGKDVERADPKSKDKGSFS GRAEY
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSFSGGRAEY

A0A6J1DZJ1 uncharacterized protein LOC1110257381.3e-15467.7Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITSPALPPVHPRASKAIRARGGTSKKGAWDPAPAPTSENFDALKREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+ L  EP  RSARIT+P L P HP+  KA R RGG S++     APAP+ ENFDAL++EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITSPALPPVHPRASKAIRARGGTSKKGAWDPAPAPTSENFDALKREMEAMR

Query:  TQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-
        TQM +ME MYNEMV A G GSRSE+R      R++RG                        DLR+HL+RKR SSLRKG+SPS SH++SNQQAESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA-

Query:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
          G+ITREEFDQL+ + DAQVE LKA+CE K  +  DGDLGESPFTSD+LEA I  KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIA
Subjt:  --GIITREEFDQLRGELDAQVEALKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPA
        LT SARLWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH S+DSA+CYFLT L DE LTVKLGEEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSF
        TFAEVLQKAKKVIDGQEL RTKTGR +++I + +  ++  +A+ KSKDK  +
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERADPKSKDKGSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTGGAGGGGCAAGGTCACGA
CAGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCTCGCCCGCCCTACCGCCTGTGCACCCGAGGGCGTCCAAGGCCATCCGTGCCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGATCCAGCTCCGGCCCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCGATGTAT
AACGAAATGGTGCTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATAGAGTGACGTGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGAAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTTCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATTACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAAAGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCTTCCGAAGTTCAAAGC
TCCTACCGTGAAACCTTATGATGGGACGAAGGACCCTAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCAATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTAAACGAAAGATCGGCCGGGGCCGAAGTGGAAAAGATGTAGAAAGGGCAGAT
CCCAAGTCCAAGGACAAGGGATCCTTTTCCGGCGGCCGAGCTGAGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTGGAGGGGCAAGGTCACGA
CAGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCTCGCCCGCCCTACCGCCTGTGCACCCGAGGGCGTCCAAGGCCATCCGTGCCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGATCCAGCTCCGGCCCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCGATGTAT
AACGAAATGGTGCTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATAGAGTGACGTGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACG
TCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGAAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTTCGAAAAGGGCAGTCACCATCCCGCT
CCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATTACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAAAGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCTTCCGAAGTTCAAAGC
TCCTACCGTGAAACCTTATGATGGGACGAAGGACCCTAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCT
CGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGC
ACACTGCTCCAATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCC
AGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTAAACGAAAGATCGGCCGGGGCCGAAGTGGAAAAGATGTAGAAAGGGCAGAT
CCCAAGTCCAAGGACAAGGGATCCTTTTCCGGCGGCCGAGCTGAGTATTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITSPALPPVHPRASKAIRARGGTSKKGAWDPAPAPTSENFDALKREMEAMRTQMRSMEAMY
NEMVLAAGVGSRSENRVTCMDVREQRGSHLGPAEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLKDGDLGESPFTSDVLEAPILPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKRKIGRGRSGKDVERAD
PKSKDKGSFSGGRAEY