CuGenDBv2

Moc09g25770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g25770
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr9:19253707..19255016
RNA-Seq Expression: Moc09g25770
Synteny: Moc09g25770
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits | e value | % identity
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] | 8.5e-101 | 63.5
Query:  MRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEK---------------------
        MR QM TME+MY+EMVQAAGARS+ E+++   ++HEQ   +L  + + +  G E+ EY+ Q+ DL   L  + +  + K                     
Subjt:  MRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEK---------------------

Query:  ------------DELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQ
                    D+L+         LKAKCE KES+FDD D GES FTSDILEA IP KFKTPTMKPYDGS+D KDY EVFEGL DFQAATDAIKCR FQ
Subjt:  ------------DELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQ

Query:  ITLTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEA
        I LTGSARLWYRRL ARSISTYSQLRKEFI QFSSRHYDRKTATHL TIRQKEGETLREYVTRFQ+EQLKVAHCSD SAMCYFLT LADETLTVKL EEA
Subjt:  ITLTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEA

Query:  QATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS
         ATF EVLQ AK +ID QEL  TKT R EKKIDQ ++
Subjt:  QATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | 4.4e-105 | 59.2
Query:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR
        MVQPANSTNTADRRALA N G QREV A+V E Q  E LGTE L RSARITT VLPP HPKP+KA                                   
Subjt:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR

Query:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEKDELQHARTRTPTTLKAKCEVKES
              E  YN +      R +F                 D +  ++   D  VE                                   LKA+CE KES
Subjt:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEKDELQHARTRTPTTLKAKCEVKES

Query:  AFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYSQLRKEFISQFSS
        +FDD D GE SF+SDILEA IPPKFKTPTMKPYDGS+D KDY EVFE L DFQAATDAIKC AFQI LTGSARLWYRRL AR ISTYSQLRKEFISQFSS
Subjt:  AFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYSQLRKEFISQFSS

Query:  RHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQK
        RHYDRKT THLATIRQKEGETLREYVTRF +EQLKVAHCSDDSAMCYFLTGLADETLTVKL EEA ATFAEVLQ  K VID QEL  TKTGR EK IDQ 
Subjt:  RHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQK

Query:  KS
        ++
Subjt:  KS

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia] | 2.4e-111 | 68.77
Query:  MEAMRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKND--------------------------LPTILIE
        MEAMR QMRTMEEMYN+MVQ AGARS+  DQ+VHE+VHEQGDL+ D +DEE+LGGD     +R++N                            P  +I 
Subjt:  MEAMRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKND--------------------------LPTILIE

Query:  REARLIEKDELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLT
        RE    E ++L+         LK +CE KESAFDD D GES FTSDILEA IPPKFKTPTMK YDGS+D KDY EVFEGL DFQAATDAIKCRAFQI LT
Subjt:  REARLIEKDELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLT

Query:  GSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATF
        GSARLWYRRL ARSISTYSQLRKEFISQF SRHYDRKT THLATIRQKEG+TL+EY+TRFQ+EQLKV HCSDDS+MCYFLTGLADET TVKLGEEA ATF
Subjt:  GSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATF

Query:  AEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS
        AEVLQ  K  ID QEL  TKT R EK+IDQKKS
Subjt:  AEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS

XP_022159250.1 uncharacterized protein LOC111025663 [Momordica charantia] | 7.0e-95 | 83.78
Query:  LQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRL
        ++H        LKA+CE KE +FDD D GES FTSDILEAPIPPKFKTPTMKPYDGS+D KDY EVFEGL DFQAATDAIKCRAFQI LT SARLWYRRL
Subjt:  LQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRL

Query:  SARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNV
         ARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF +EQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEA ATFAEVLQ AK V
Subjt:  SARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNV

Query:  IDRQELFPTKTGRLEKKIDQKK
        ID QEL  TKT R EK+IDQKK
Subjt:  IDRQELFPTKTGRLEKKIDQKK

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] | 8.2e-136 | 69.73
Query:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR
        MVQP +STNT DRRAL  N G QREV A+V E QI EGLGTE   RSARITT  L P HPKP KANRGRGGASRRTT G APAP++ENFDALQKEMEAMR
Subjt:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR

Query:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDL--YLDSIDEEYLGGDENVEYSRQKND----------LPTILIEREARLIEKDELQHARTRTP
         QM TMEEMYNEMVQA GA S+ ED+   +   E+GDL  +L       L    +   S + ++          +P  +I RE    E D+L+       
Subjt:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDL--YLDSIDEEYLGGDENVEYSRQKND----------LPTILIEREARLIEKDELQHARTRTP

Query:  TTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYS
         TLKA+CEVK S FDD D GES FTSDILEA IP KFKTPTMKPYDGS+D KDY EVFEGL  FQAATDAIK RAFQI LT SARLWYRRL ARSISTYS
Subjt:  TTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYS

Query:  QLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPT
        QLRKEF SQFSSRHY+RKTATHLATIRQKE ETLREYVT FQ+EQLKVAH SDDSA+CYFLT L DETLTVKLGEEA ATFAEVLQ AK VID QELF T
Subjt:  QLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPT

Query:  KTGRLEKKIDQKK
        KTGR EK+IDQKK
Subjt:  KTGRLEKKIDQKK

TrEMBL top hits | e value | % identity
A0A6J1DDW5 uncharacterized protein LOC111019634 | 4.1e-101 | 63.5
Query:  MRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEK---------------------
        MR QM TME+MY+EMVQAAGARS+ E+++   ++HEQ   +L  + + +  G E+ EY+ Q+ DL   L  + +  + K                     
Subjt:  MRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEK---------------------

Query:  ------------DELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQ
                    D+L+         LKAKCE KES+FDD D GES FTSDILEA IP KFKTPTMKPYDGS+D KDY EVFEGL DFQAATDAIKCR FQ
Subjt:  ------------DELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQ

Query:  ITLTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEA
        I LTGSARLWYRRL ARSISTYSQLRKEFI QFSSRHYDRKTATHL TIRQKEGETLREYVTRFQ+EQLKVAHCSD SAMCYFLT LADETLTVKL EEA
Subjt:  ITLTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEA

Query:  QATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS
         ATF EVLQ AK +ID QEL  TKT R EKKIDQ ++
Subjt:  QATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS

A0A6J1DHB3 uncharacterized protein LOC111020479 | 2.1e-105 | 59.2
Query:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR
        MVQPANSTNTADRRALA N G QREV A+V E Q  E LGTE L RSARITT VLPP HPKP+KA                                   
Subjt:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR

Query:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEKDELQHARTRTPTTLKAKCEVKES
              E  YN +      R +F                 D +  ++   D  VE                                   LKA+CE KES
Subjt:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEKDELQHARTRTPTTLKAKCEVKES

Query:  AFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYSQLRKEFISQFSS
        +FDD D GE SF+SDILEA IPPKFKTPTMKPYDGS+D KDY EVFE L DFQAATDAIKC AFQI LTGSARLWYRRL AR ISTYSQLRKEFISQFSS
Subjt:  AFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYSQLRKEFISQFSS

Query:  RHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQK
        RHYDRKT THLATIRQKEGETLREYVTRF +EQLKVAHCSDDSAMCYFLTGLADETLTVKL EEA ATFAEVLQ  K VID QEL  TKTGR EK IDQ 
Subjt:  RHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQK

Query:  KS
        ++
Subjt:  KS

A0A6J1DPN4 uncharacterized protein LOC111023060 | 1.2e-111 | 68.77
Query:  MEAMRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKND--------------------------LPTILIE
        MEAMR QMRTMEEMYN+MVQ AGARS+  DQ+VHE+VHEQGDL+ D +DEE+LGGD     +R++N                            P  +I 
Subjt:  MEAMRAQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKND--------------------------LPTILIE

Query:  REARLIEKDELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLT
        RE    E ++L+         LK +CE KESAFDD D GES FTSDILEA IPPKFKTPTMK YDGS+D KDY EVFEGL DFQAATDAIKCRAFQI LT
Subjt:  REARLIEKDELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLT

Query:  GSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATF
        GSARLWYRRL ARSISTYSQLRKEFISQF SRHYDRKT THLATIRQKEG+TL+EY+TRFQ+EQLKV HCSDDS+MCYFLTGLADET TVKLGEEA ATF
Subjt:  GSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATF

Query:  AEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS
        AEVLQ  K  ID QEL  TKT R EK+IDQKKS
Subjt:  AEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKS

A0A6J1DY58 uncharacterized protein LOC111025663 | 3.4e-95 | 83.78
Query:  LQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRL
        ++H        LKA+CE KE +FDD D GES FTSDILEAPIPPKFKTPTMKPYDGS+D KDY EVFEGL DFQAATDAIKCRAFQI LT SARLWYRRL
Subjt:  LQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRL

Query:  SARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNV
         ARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRF +EQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEA ATFAEVLQ AK V
Subjt:  SARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNV

Query:  IDRQELFPTKTGRLEKKIDQKK
        ID QEL  TKT R EK+IDQKK
Subjt:  IDRQELFPTKTGRLEKKIDQKK

A0A6J1DZJ1 uncharacterized protein LOC111025738 | 4.0e-136 | 69.73
Query:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR
        MVQP +STNT DRRAL  N G QREV A+V E QI EGLGTE   RSARITT  L P HPKP KANRGRGGASRRTT G APAP++ENFDALQKEMEAMR
Subjt:  MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMR

Query:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDL--YLDSIDEEYLGGDENVEYSRQKND----------LPTILIEREARLIEKDELQHARTRTP
         QM TMEEMYNEMVQA GA S+ ED+   +   E+GDL  +L       L    +   S + ++          +P  +I RE    E D+L+       
Subjt:  AQMRTMEEMYNEMVQAAGARSQFEDQMVHEEVHEQGDL--YLDSIDEEYLGGDENVEYSRQKND----------LPTILIEREARLIEKDELQHARTRTP

Query:  TTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYS
         TLKA+CEVK S FDD D GES FTSDILEA IP KFKTPTMKPYDGS+D KDY EVFEGL  FQAATDAIK RAFQI LT SARLWYRRL ARSISTYS
Subjt:  TTLKAKCEVKESAFDDDDFGESSFTSDILEAPIPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYS

Query:  QLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPT
        QLRKEF SQFSSRHY+RKTATHLATIRQKE ETLREYVT FQ+EQLKVAH SDDSA+CYFLT L DETLTVKLGEEA ATFAEVLQ AK VID QELF T
Subjt:  QLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQKEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPT

Query:  KTGRLEKKIDQKK
        KTGR EK+IDQKK
Subjt:  KTGRLEKKIDQKK

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGTTCAACCGGCAAACTCAACCAACACAGCAGACCGAAGAGCTCTGGCTGTTAATAGTGGCCTCCAGAGAGAGGTCGAGGCGAAAGTGGCAGAGGATCAAATTCAGGA
AGGCCTAGGGACCGAACAGCTCCGTAGGTCAGCACGCATCACCACACATGTTCTGCCACCAACACATCCTAAACCAAATAAGGCCAACCGTGGCCGAGGCGGTGCCTCTA
GAAGAACCACTCGAGGAACAGCCCCAGCTCCTACTAAGGAGAATTTTGATGCCCTCCAAAAAGAAATGGAGGCAATGCGCGCCCAGATGCGCACCATGGAAGAGATGTAT
AATGAAATGGTGCAAGCTGCTGGTGCCAGGTCTCAATTTGAAGACCAAATGGTGCACGAAGAAGTGCACGAACAAGGGGATCTTTACCTCGACTCAATCGACGAAGAGTA
CCTCGGAGGCGATGAAAATGTGGAGTATAGTCGCCAAAAGAACGATCTTCCGACCATCTTAATAGAAAGAGAAGCTCGTCTCATTGAGAAGGACGAACTCCAGCATGCTC
GCACAAGAACTCCAACCACCTTGAAAGCAAAGTGCGAGGTGAAAGAGAGCGCATTTGATGATGACGACTTTGGAGAATCGTCATTCACCTCAGATATCTTGGAGGCTCCA
ATCCCTCCAAAGTTCAAGACTCCCACTATGAAGCCATACGATGGGTCTGAGGACCGAAAAGATTATGCTGAGGTCTTTGAAGGCCTCACGGATTTTCAAGCGGCAACAGA
CGCCATAAAATGCCGCGCCTTTCAGATCACGCTTACCGGCAGCGCACGTTTGTGGTACAGAAGATTGTCGGCTAGGTCGATCTCGACCTACTCACAACTGAGGAAAGAAT
TTATTAGTCAATTCTCTTCTCGGCACTATGATAGAAAAACAGCGACTCACCTCGCCACCATCAGACAGAAGGAGGGTGAGACACTTAGAGAATACGTCACCAGGTTCCAG
AAGGAGCAGCTGAAAGTCGCGCACTGCTCCGATGACTCGGCCATGTGTTACTTTCTCACCGGCTTGGCCGATGAGACCCTTACTGTGAAACTTGGAGAGGAGGCTCAAGC
TACCTTCGCCGAGGTCCTGCAAAACGCGAAGAATGTTATTGATAGGCAAGAGCTTTTCCCGACCAAGACTGGCCGGTTAGAGAAGAAGATCGACCAGAAGAAGTCTGACT
AA
mRNA sequence
ATGGTTCAACCGGCAAACTCAACCAACACAGCAGACCGAAGAGCTCTGGCTGTTAATAGTGGCCTCCAGAGAGAGGTCGAGGCGAAAGTGGCAGAGGATCAAATTCAGGA
AGGCCTAGGGACCGAACAGCTCCGTAGGTCAGCACGCATCACCACACATGTTCTGCCACCAACACATCCTAAACCAAATAAGGCCAACCGTGGCCGAGGCGGTGCCTCTA
GAAGAACCACTCGAGGAACAGCCCCAGCTCCTACTAAGGAGAATTTTGATGCCCTCCAAAAAGAAATGGAGGCAATGCGCGCCCAGATGCGCACCATGGAAGAGATGTAT
AATGAAATGGTGCAAGCTGCTGGTGCCAGGTCTCAATTTGAAGACCAAATGGTGCACGAAGAAGTGCACGAACAAGGGGATCTTTACCTCGACTCAATCGACGAAGAGTA
CCTCGGAGGCGATGAAAATGTGGAGTATAGTCGCCAAAAGAACGATCTTCCGACCATCTTAATAGAAAGAGAAGCTCGTCTCATTGAGAAGGACGAACTCCAGCATGCTC
GCACAAGAACTCCAACCACCTTGAAAGCAAAGTGCGAGGTGAAAGAGAGCGCATTTGATGATGACGACTTTGGAGAATCGTCATTCACCTCAGATATCTTGGAGGCTCCA
ATCCCTCCAAAGTTCAAGACTCCCACTATGAAGCCATACGATGGGTCTGAGGACCGAAAAGATTATGCTGAGGTCTTTGAAGGCCTCACGGATTTTCAAGCGGCAACAGA
CGCCATAAAATGCCGCGCCTTTCAGATCACGCTTACCGGCAGCGCACGTTTGTGGTACAGAAGATTGTCGGCTAGGTCGATCTCGACCTACTCACAACTGAGGAAAGAAT
TTATTAGTCAATTCTCTTCTCGGCACTATGATAGAAAAACAGCGACTCACCTCGCCACCATCAGACAGAAGGAGGGTGAGACACTTAGAGAATACGTCACCAGGTTCCAG
AAGGAGCAGCTGAAAGTCGCGCACTGCTCCGATGACTCGGCCATGTGTTACTTTCTCACCGGCTTGGCCGATGAGACCCTTACTGTGAAACTTGGAGAGGAGGCTCAAGC
TACCTTCGCCGAGGTCCTGCAAAACGCGAAGAATGTTATTGATAGGCAAGAGCTTTTCCCGACCAAGACTGGCCGGTTAGAGAAGAAGATCGACCAGAAGAAGTCTGACT
AA
Protein sequence
MVQPANSTNTADRRALAVNSGLQREVEAKVAEDQIQEGLGTEQLRRSARITTHVLPPTHPKPNKANRGRGGASRRTTRGTAPAPTKENFDALQKEMEAMRAQMRTMEEMY
NEMVQAAGARSQFEDQMVHEEVHEQGDLYLDSIDEEYLGGDENVEYSRQKNDLPTILIEREARLIEKDELQHARTRTPTTLKAKCEVKESAFDDDDFGESSFTSDILEAP
IPPKFKTPTMKPYDGSEDRKDYAEVFEGLTDFQAATDAIKCRAFQITLTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQ
KEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAQATFAEVLQNAKNVIDRQELFPTKTGRLEKKIDQKKSD
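
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the CuGenDB record: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the sketch assumes the standard nuclear code with translation stopping at the first stop codon. The two short inputs are the first and last codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    block can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGGTTCAACCGGCAAACTCAACC"))
# Last five codons of the CDS above, including the TAA stop.
print(translate("AAGAAGTCTGACTAA"))
```

Translating the full CDS block the same way and comparing the result with the protein sequence gives a quick consistency check on the record.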