; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g29650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g29650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr6:22357490..22359945
RNA-Seq ExpressionMoc06g29650
SyntenyMoc06g29650
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004386 - helicase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141932.1 uncharacterized protein LOC111012188 [Momordica charantia]2.3e-2335.31Show/hide
Query:  INMESHDARVNKEGSSEKKLGGVNKVYHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEP
        + +E H AR+N+   +EKKL   +KVY RKNQ + + G+ LDE I  + ER +  +K  EIRDKENE + AKI ELN KW+ FMENS+++S         
Subjt:  INMESHDARVNKEGSSEKKLGGVNKVYHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEP

Query:  LEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLG
                                                                      EI + L+E                         DEDLG
Subjt:  LEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLG

Query:  ELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTK
        +LPQEV  +E E+EE+N+DISQ +             +EE P + QEG S P DV ++A +ES S SS+  T S SSLNV DPNFVA AE S+EE  LTK
Subjt:  ELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTK

Query:  VGK
          K
Subjt:  VGK

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]5.2e-12159.17Show/hide
Query:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKT----------------
        QLNVD +DED GELPQEVHGDEFEDEEDNDDISQ EV+VRTPVHESQQVDEEPP KEQEGTSGPVDV S+A+EES S SSQG                  
Subjt:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKT----------------

Query:  -----PS---------------LSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRV
             PS               L   N  +P      +++++  S  K  ++      FTT DILLERGFDE QE V EYVR+R+VENGWE LFAP TRV
Subjt:  -----PS---------------LSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRV

Query:  SEALVKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVLNGIGNEILVHPSDEQVEEARRLICRPYKTWIVSTTGKLSLMTLDINEQATVWMYVV
        SEALVKEFYTAINPNRGD VRVR                        GNEILVHPSDEQVEEARRLICRP+KTW +ST GKLSL  LDINEQATVWMYVV
Subjt:  SEALVKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVLNGIGNEILVHPSDEQVEEARRLICRPYKTWIVSTTGKLSLMTLDINEQATVWMYVV

Query:  KNLLILTSHDSSIKRNRAM---------------------------MTGVAADDANVVVPKKPFTSLRIVRG------------------------EQYD
        KN LI TS+DSSIKRNRAM                           + GV A DANVV+PKKPF SLR VRG                        EQYD
Subjt:  KNLLILTSHDSSIKRNRAM---------------------------MTGVAADDANVVVPKKPFTSLRIVRG------------------------EQYD

Query:  ELRHKYELLLVTQRATCAFLKKIYDDEAPSFPDELVVDLPSSSHFPTDSTNDESSDDE
        ELRHKYELLLVTQRATCAFLKKIY DEAPSFPDEL  DLPSSS  PTDS +DESSDDE
Subjt:  ELRHKYELLLVTQRATCAFLKKIYDDEAPSFPDELVVDLPSSSHFPTDSTNDESSDDE

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]1.5e-3043.69Show/hide
Query:  VHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGK---------------------------
        +HESQQ DEE   +EQEG SG VDV ++A+EES S SS+GK+PSLSSLNVSDPNFVA A TS+E+V LTKV K                           
Subjt:  VHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGK---------------------------

Query:  ------------------------------IGGEE--------------------TYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRVS
                                      +  EE                      FT  +IL+E+GFDE QE V +Y++RRL+ENGWE LFAPT RVS
Subjt:  ------------------------------IGGEE--------------------TYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRVS

Query:  EALVKEFYTAINPNRGDVVRVR
        E LVKEFY  INPNRGD +  R
Subjt:  EALVKEFYTAINPNRGDVVRVR

XP_022156935.1 uncharacterized protein LOC111023761 [Momordica charantia]1.0e-3148.37Show/hide
Query:  KVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQG
        +VSGD EHD EPLEHSDSATV+I+ QIAP  IM ETPPATLQ                                E+LV LNEARGEDPL+DDGNSG    
Subjt:  KVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQG

Query:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVAT
                                                       Q DEEP A+EQEGTSGP+DV S+A+EES S  SQ KT SLSSLNVSDPNFVAT
Subjt:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVAT

Query:  AETSDEEVSLTKVGK
        AE SDEEV+L KV K
Subjt:  AETSDEEVSLTKVGK

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]1.5e-5439.33Show/hide
Query:  MEGSSSSKPHDNEKEKKIVLLHPSTKPGMIPLEPPRISHEKLIFDSREQRRKYEEAIRMNPKRNLSIGVTNFEKINMESHDARVNKEGSSEKKLGGVNKV
        MEGSS SKP D E EKK V+L P       P+ P                                           E H ARVN+ G SEKKL G +KV
Subjt:  MEGSSSSKPHDNEKEKKIVLLHPSTKPGMIPLEPPRISHEKLIFDSREQRRKYEEAIRMNPKRNLSIGVTNFEKINMESHDARVNKEGSSEKKLGGVNKV

Query:  YHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPP
        Y RKNQS+ +K + LDE IAR+ E+ ++ +K  EI DK+NE + AKI ELN KW+ FMENS+++S                  EIQ ++           
Subjt:  YHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPP

Query:  ATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVR
                                 +EQERTTSKI +ILV LNEA GEDPLEDDGNS   QG+LNVDG+DEDLG+LPQEVHGDE E+EE+NDDISQ EVR
Subjt:  ATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVR

Query:  VRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQ
        +   VHESQ+   E P +  EG S PVDV ++A  +S S SS  K  S   +N  +P       +++++ S  K                          
Subjt:  VRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQ

Query:  ESVLEYVRRRLVENGWEALFAPTTRVSEALVKEFYTAINPNRGDVVRVRG
                                RV EALVKEFY AI+PN+GD VRVRG
Subjt:  ESVLEYVRRRLVENGWEALFAPTTRVSEALVKEFYTAINPNRGDVVRVRG

TrEMBL top hitse value%identityAlignment
A0A6J1CL76 uncharacterized protein LOC1110121881.1e-2335.31Show/hide
Query:  INMESHDARVNKEGSSEKKLGGVNKVYHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEP
        + +E H AR+N+   +EKKL   +KVY RKNQ + + G+ LDE I  + ER +  +K  EIRDKENE + AKI ELN KW+ FMENS+++S         
Subjt:  INMESHDARVNKEGSSEKKLGGVNKVYHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEP

Query:  LEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLG
                                                                      EI + L+E                         DEDLG
Subjt:  LEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLG

Query:  ELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTK
        +LPQEV  +E E+EE+N+DISQ +             +EE P + QEG S P DV ++A +ES S SS+  T S SSLNV DPNFVA AE S+EE  LTK
Subjt:  ELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTK

Query:  VGK
          K
Subjt:  VGK

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220072.5e-12159.17Show/hide
Query:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKT----------------
        QLNVD +DED GELPQEVHGDEFEDEEDNDDISQ EV+VRTPVHESQQVDEEPP KEQEGTSGPVDV S+A+EES S SSQG                  
Subjt:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKT----------------

Query:  -----PS---------------LSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRV
             PS               L   N  +P      +++++  S  K  ++      FTT DILLERGFDE QE V EYVR+R+VENGWE LFAP TRV
Subjt:  -----PS---------------LSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRV

Query:  SEALVKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVLNGIGNEILVHPSDEQVEEARRLICRPYKTWIVSTTGKLSLMTLDINEQATVWMYVV
        SEALVKEFYTAINPNRGD VRVR                        GNEILVHPSDEQVEEARRLICRP+KTW +ST GKLSL  LDINEQATVWMYVV
Subjt:  SEALVKEFYTAINPNRGDVVRVRGKVVKFSPSIINTHYGLLDVLNGIGNEILVHPSDEQVEEARRLICRPYKTWIVSTTGKLSLMTLDINEQATVWMYVV

Query:  KNLLILTSHDSSIKRNRAM---------------------------MTGVAADDANVVVPKKPFTSLRIVRG------------------------EQYD
        KN LI TS+DSSIKRNRAM                           + GV A DANVV+PKKPF SLR VRG                        EQYD
Subjt:  KNLLILTSHDSSIKRNRAM---------------------------MTGVAADDANVVVPKKPFTSLRIVRG------------------------EQYD

Query:  ELRHKYELLLVTQRATCAFLKKIYDDEAPSFPDELVVDLPSSSHFPTDSTNDESSDDE
        ELRHKYELLLVTQRATCAFLKKIY DEAPSFPDEL  DLPSSS  PTDS +DESSDDE
Subjt:  ELRHKYELLLVTQRATCAFLKKIYDDEAPSFPDELVVDLPSSSHFPTDSTNDESSDDE

A0A6J1DRR9 uncharacterized protein LOC1110237614.9e-3248.37Show/hide
Query:  KVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQG
        +VSGD EHD EPLEHSDSATV+I+ QIAP  IM ETPPATLQ                                E+LV LNEARGEDPL+DDGNSG    
Subjt:  KVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQG

Query:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVAT
                                                       Q DEEP A+EQEGTSGP+DV S+A+EES S  SQ KT SLSSLNVSDPNFVAT
Subjt:  QLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVAT

Query:  AETSDEEVSLTKVGK
        AE SDEEV+L KV K
Subjt:  AETSDEEVSLTKVGK

A0A6J1DW11 uncharacterized protein LOC1110236207.1e-3143.69Show/hide
Query:  VHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGK---------------------------
        +HESQQ DEE   +EQEG SG VDV ++A+EES S SS+GK+PSLSSLNVSDPNFVA A TS+E+V LTKV K                           
Subjt:  VHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGK---------------------------

Query:  ------------------------------IGGEE--------------------TYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRVS
                                      +  EE                      FT  +IL+E+GFDE QE V +Y++RRL+ENGWE LFAPT RVS
Subjt:  ------------------------------IGGEE--------------------TYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRVS

Query:  EALVKEFYTAINPNRGDVVRVR
        E LVKEFY  INPNRGD +  R
Subjt:  EALVKEFYTAINPNRGDVVRVR

A0A6J1DW79 uncharacterized protein LOC1110249647.0e-5539.33Show/hide
Query:  MEGSSSSKPHDNEKEKKIVLLHPSTKPGMIPLEPPRISHEKLIFDSREQRRKYEEAIRMNPKRNLSIGVTNFEKINMESHDARVNKEGSSEKKLGGVNKV
        MEGSS SKP D E EKK V+L P       P+ P                                           E H ARVN+ G SEKKL G +KV
Subjt:  MEGSSSSKPHDNEKEKKIVLLHPSTKPGMIPLEPPRISHEKLIFDSREQRRKYEEAIRMNPKRNLSIGVTNFEKINMESHDARVNKEGSSEKKLGGVNKV

Query:  YHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPP
        Y RKNQS+ +K + LDE IAR+ E+ ++ +K  EI DK+NE + AKI ELN KW+ FMENS+++S                  EIQ ++           
Subjt:  YHRKNQSLEEKGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPP

Query:  ATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVR
                                 +EQERTTSKI +ILV LNEA GEDPLEDDGNS   QG+LNVDG+DEDLG+LPQEVHGDE E+EE+NDDISQ EVR
Subjt:  ATLQGILSPSFPDPILTKKPLVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVR

Query:  VRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQ
        +   VHESQ+   E P +  EG S PVDV ++A  +S S SS  K  S   +N  +P       +++++ S  K                          
Subjt:  VRTPVHESQQVDEEPPAKEQEGTSGPVDVSSKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQ

Query:  ESVLEYVRRRLVENGWEALFAPTTRVSEALVKEFYTAINPNRGDVVRVRG
                                RV EALVKEFY AI+PN+GD VRVRG
Subjt:  ESVLEYVRRRLVENGWEALFAPTTRVSEALVKEFYTAINPNRGDVVRVRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTTCATCTTCCTCCAAGCCACACGACAATGAGAAGGAAAAGAAGATAGTGTTGTTGCATCCATCAACCAAACCGGGTATGATTCCTCTTGAACCTCCTAGGAT
TTCTCATGAAAAATTAATTTTTGATTCTAGGGAACAAAGAAGAAAATATGAGGAAGCTATAAGAATGAACCCTAAGAGAAATCTATCCATAGGTGTTACAAATTTTGAAA
AAATCAATATGGAATCTCATGATGCTAGGGTTAATAAAGAAGGTTCTAGTGAAAAGAAATTAGGAGGTGTTAATAAAGTTTATCATCGAAAAAATCAATCTCTAGAGGAA
AAAGGTGCTGTTTTAGATGAGGAAATAGCTAGACTTCAAGAGAGAGCGGAGATGTTCAGTAAAAATAACGAAATTAGGGATAAAGAGAATGAGAGGGTTTATGCGAAAAT
TGAGGAATTAAACATGAAATGGCGAGAATTCATGGAAAATTCAAAGAAAGTTAGTGGGGACCCAGAACACGACACGGAGCCCTTGGAGCATTCAGATTCGGCCACGGTCG
AAATTCAACGCCAAATTGCGCCTGGCGCAATTATGGATGAAACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAAAGCCC
CTAGTTTTTGTTGATTCAGAACAGGAAAGGACAACTTCGAAAATTGTCGAAATTTTGGTAGTGTTGAATGAAGCAAGGGGAGAAGATCCTTTAGAGGATGATGGAAACAG
TGGGGCAGTACAAGGACAATTGAATGTTGATGGAGATGATGAAGATCTTGGAGAATTACCCCAAGAAGTGCATGGAGATGAGTTTGAGGACGAAGAAGACAATGACGATA
TCTCTCAATGTGAAGTGAGAGTACGAACTCCAGTGCACGAATCTCAGCAAGTTGATGAGGAGCCCCCTGCAAAAGAGCAAGAAGGAACATCCGGTCCTGTGGATGTCTCT
AGTAAGGCCATAGAGGAATCATTTTCCTTTTCTTCACAAGGTAAAACCCCTTCTTTGTCGAGTTTGAATGTTTCTGACCCAAACTTTGTTGCTACTGCAGAGACTTCAGA
TGAGGAGGTGAGTTTGACCAAAGTGGGTAAGATTGGAGGTGAGGAGACCTACTTCACAACAAGTGATATCCTCCTTGAAAGAGGTTTTGATGAGACCCAAGAGTCGGTGC
TAGAATATGTTAGGAGGAGGCTTGTGGAGAATGGTTGGGAGGCGTTGTTTGCCCCAACTACACGTGTATCAGAGGCCTTGGTGAAAGAGTTTTACACTGCCATCAACCCA
AACCGAGGGGATGTAGTGAGAGTACGGGGTAAAGTGGTAAAATTCTCGCCTTCCATTATTAATACTCACTATGGTTTGTTAGATGTTCTTAATGGCATAGGTAATGAAAT
TTTGGTGCATCCATCGGACGAGCAAGTGGAGGAGGCACGTAGGCTTATTTGTAGACCATATAAGACATGGATCGTCTCAACCACGGGGAAGCTTTCCTTAATGACCCTTG
ACATTAATGAGCAAGCGACGGTTTGGATGTATGTGGTGAAGAATTTGTTGATCCTCACTTCTCACGATTCCTCCATTAAGCGTAATAGGGCGATGATGACGGGAGTGGCG
GCTGATGATGCCAATGTTGTGGTGCCCAAGAAGCCGTTCACATCCCTAAGAATAGTTCGGGGGGAGCAGTATGATGAGCTTAGGCACAAGTATGAGCTTCTTTTGGTTAC
TCAACGTGCCACATGTGCTTTCCTTAAGAAGATATACGATGATGAAGCACCTTCATTCCCCGATGAACTTGTGGTCGACTTACCATCTTCTTCCCATTTTCCTACCGATT
CCACCAACGATGAGTCTTCCGATGATGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTTCATCTTCCTCCAAGCCACACGACAATGAGAAGGAAAAGAAGATAGTGTTGTTGCATCCATCAACCAAACCGGGTATGATTCCTCTTGAACCTCCTAGGAT
TTCTCATGAAAAATTAATTTTTGATTCTAGGGAACAAAGAAGAAAATATGAGGAAGCTATAAGAATGAACCCTAAGAGAAATCTATCCATAGGTGTTACAAATTTTGAAA
AAATCAATATGGAATCTCATGATGCTAGGGTTAATAAAGAAGGTTCTAGTGAAAAGAAATTAGGAGGTGTTAATAAAGTTTATCATCGAAAAAATCAATCTCTAGAGGAA
AAAGGTGCTGTTTTAGATGAGGAAATAGCTAGACTTCAAGAGAGAGCGGAGATGTTCAGTAAAAATAACGAAATTAGGGATAAAGAGAATGAGAGGGTTTATGCGAAAAT
TGAGGAATTAAACATGAAATGGCGAGAATTCATGGAAAATTCAAAGAAAGTTAGTGGGGACCCAGAACACGACACGGAGCCCTTGGAGCATTCAGATTCGGCCACGGTCG
AAATTCAACGCCAAATTGCGCCTGGCGCAATTATGGATGAAACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAAAGCCC
CTAGTTTTTGTTGATTCAGAACAGGAAAGGACAACTTCGAAAATTGTCGAAATTTTGGTAGTGTTGAATGAAGCAAGGGGAGAAGATCCTTTAGAGGATGATGGAAACAG
TGGGGCAGTACAAGGACAATTGAATGTTGATGGAGATGATGAAGATCTTGGAGAATTACCCCAAGAAGTGCATGGAGATGAGTTTGAGGACGAAGAAGACAATGACGATA
TCTCTCAATGTGAAGTGAGAGTACGAACTCCAGTGCACGAATCTCAGCAAGTTGATGAGGAGCCCCCTGCAAAAGAGCAAGAAGGAACATCCGGTCCTGTGGATGTCTCT
AGTAAGGCCATAGAGGAATCATTTTCCTTTTCTTCACAAGGTAAAACCCCTTCTTTGTCGAGTTTGAATGTTTCTGACCCAAACTTTGTTGCTACTGCAGAGACTTCAGA
TGAGGAGGTGAGTTTGACCAAAGTGGGTAAGATTGGAGGTGAGGAGACCTACTTCACAACAAGTGATATCCTCCTTGAAAGAGGTTTTGATGAGACCCAAGAGTCGGTGC
TAGAATATGTTAGGAGGAGGCTTGTGGAGAATGGTTGGGAGGCGTTGTTTGCCCCAACTACACGTGTATCAGAGGCCTTGGTGAAAGAGTTTTACACTGCCATCAACCCA
AACCGAGGGGATGTAGTGAGAGTACGGGGTAAAGTGGTAAAATTCTCGCCTTCCATTATTAATACTCACTATGGTTTGTTAGATGTTCTTAATGGCATAGGTAATGAAAT
TTTGGTGCATCCATCGGACGAGCAAGTGGAGGAGGCACGTAGGCTTATTTGTAGACCATATAAGACATGGATCGTCTCAACCACGGGGAAGCTTTCCTTAATGACCCTTG
ACATTAATGAGCAAGCGACGGTTTGGATGTATGTGGTGAAGAATTTGTTGATCCTCACTTCTCACGATTCCTCCATTAAGCGTAATAGGGCGATGATGACGGGAGTGGCG
GCTGATGATGCCAATGTTGTGGTGCCCAAGAAGCCGTTCACATCCCTAAGAATAGTTCGGGGGGAGCAGTATGATGAGCTTAGGCACAAGTATGAGCTTCTTTTGGTTAC
TCAACGTGCCACATGTGCTTTCCTTAAGAAGATATACGATGATGAAGCACCTTCATTCCCCGATGAACTTGTGGTCGACTTACCATCTTCTTCCCATTTTCCTACCGATT
CCACCAACGATGAGTCTTCCGATGATGAGTAG
Protein sequenceShow/hide protein sequence
MEGSSSSKPHDNEKEKKIVLLHPSTKPGMIPLEPPRISHEKLIFDSREQRRKYEEAIRMNPKRNLSIGVTNFEKINMESHDARVNKEGSSEKKLGGVNKVYHRKNQSLEE
KGAVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNMKWREFMENSKKVSGDPEHDTEPLEHSDSATVEIQRQIAPGAIMDETPPATLQGILSPSFPDPILTKKP
LVFVDSEQERTTSKIVEILVVLNEARGEDPLEDDGNSGAVQGQLNVDGDDEDLGELPQEVHGDEFEDEEDNDDISQCEVRVRTPVHESQQVDEEPPAKEQEGTSGPVDVS
SKAIEESFSFSSQGKTPSLSSLNVSDPNFVATAETSDEEVSLTKVGKIGGEETYFTTSDILLERGFDETQESVLEYVRRRLVENGWEALFAPTTRVSEALVKEFYTAINP
NRGDVVRVRGKVVKFSPSIINTHYGLLDVLNGIGNEILVHPSDEQVEEARRLICRPYKTWIVSTTGKLSLMTLDINEQATVWMYVVKNLLILTSHDSSIKRNRAMMTGVA
ADDANVVVPKKPFTSLRIVRGEQYDELRHKYELLLVTQRATCAFLKKIYDDEAPSFPDELVVDLPSSSHFPTDSTNDESSDDE