; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g15960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g15960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr5:12023520..12027501
RNA-Seq ExpressionMoc05g15960
SyntenyMoc05g15960
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]1.5e-9658.94Show/hide
Query:  MRTQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLR----------------------------------
        MRTQM +ME+MY+EM+ AAGA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QRGDLR                                  
Subjt:  MRTQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLR----------------------------------

Query:  -----IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
             +ITREEFDQL+ + DAQVEALKAKCE+K+ S  DGDL ESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  -----IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPA----------------------------------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPA                                              RFQEEQL+VAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPA----------------------------------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKG--SFSS
        PATF EVLQKAKK+IDGQELLRTKT RPE+KID+GR+ KD  + D K++DKG  SFSS
Subjt:  PATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKG--SFSS

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]5.8e-9652.77Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SKA                                   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGD
              E  YN +                                                   +ITREEFDQL+ + DAQVEALKA+CE+K+ S  DGD
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGD

Query:  LEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------------------
        L E  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPA                         
Subjt:  LEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------------------

Query:  ---------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDV
                             RF EEQL+VAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD 
Subjt:  ---------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDV

Query:  ERADPKSKDKGSFSS
         +AD KS+DKG  SS
Subjt:  ERADPKSKDKGSFSS

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]6.6e-10079.61Show/hide
Query:  IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        IITREEFDQLRGELDAQVEALKAKCEQKDDSL+DGDL ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
Subjt:  IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLP----------------------------------------------ARFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLWYRRLP                                               RFQEEQL+VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLP----------------------------------------------ARFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE
        EVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDVERADPKSKDKGSFSSGR E
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia]2.9e-10381.71Show/hide
Query:  EQRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDG
        E+RGS L     P+   R  + ++E      G   IITREEFDQLRGELDAQVEALKAKCEQKDDSL+DGDL E PFTSDVLEAPIPPKFKAPTVKPYDG
Subjt:  EQRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDG

Query:  TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA             RFQEEQL+VAHCSDDSAMCYF TGLADEALTVKLGEEAP T
Subjt:  TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE
        FAEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDVERADPKSKDKGSFSSGR E
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]5.0e-11658.03Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G AP P+ ENFDAL++EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVT---RVDVRE----QRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKC
        TQM +MEEMYNEM+ A GAGSRSE+R     R D+R+    +R S L     P+   +  + ++E          +ITREEFDQL+ + DAQVE LKA+C
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVT---RVDVRE----QRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKC

Query:  EQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR-------------
        E K  +  DGDL ESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLPAR             
Subjt:  EQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR-------------

Query:  ---------------------------------FQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
                                         FQEEQL+VAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR E+
Subjt:  ---------------------------------FQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER

Query:  KIDRGRSGKDVERADPKSKDKGSF---SSGRDESVP
        +ID+ +  ++  +A+ KSKDK  +    SG   S P
Subjt:  KIDRGRSGKDVERADPKSKDKGSF---SSGRDESVP

TrEMBL top hitse value%identityAlignment
A0A6J1DDW5 uncharacterized protein LOC1110196347.4e-9758.94Show/hide
Query:  MRTQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLR----------------------------------
        MRTQM +ME+MY+EM+ AAGA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QRGDLR                                  
Subjt:  MRTQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLR----------------------------------

Query:  -----IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
             +ITREEFDQL+ + DAQVEALKAKCE+K+ S  DGDL ESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  -----IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPA----------------------------------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPA                                              RFQEEQL+VAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPA----------------------------------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKG--SFSS
        PATF EVLQKAKK+IDGQELLRTKT RPE+KID+GR+ KD  + D K++DKG  SFSS
Subjt:  PATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKG--SFSS

A0A6J1DHB3 uncharacterized protein LOC1110204792.8e-9652.77Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SKA                                   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGD
              E  YN +                                                   +ITREEFDQL+ + DAQVEALKA+CE+K+ S  DGD
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGD

Query:  LEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------------------
        L E  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPA                         
Subjt:  LEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------------------

Query:  ---------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDV
                             RF EEQL+VAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD 
Subjt:  ---------------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDV

Query:  ERADPKSKDKGSFSS
         +AD KS+DKG  SS
Subjt:  ERADPKSKDKGSFSS

A0A6J1DS95 uncharacterized protein LOC1110234213.2e-10079.61Show/hide
Query:  IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        IITREEFDQLRGELDAQVEALKAKCEQKDDSL+DGDL ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
Subjt:  IITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLP----------------------------------------------ARFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
        SARLWYRRLP                                               RFQEEQL+VAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLP----------------------------------------------ARFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE
        EVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDVERADPKSKDKGSFSSGR E
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE

A0A6J1DXR9 uncharacterized protein LOC1110251091.4e-10381.71Show/hide
Query:  EQRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDG
        E+RGS L     P+   R  + ++E      G   IITREEFDQLRGELDAQVEALKAKCEQKDDSL+DGDL E PFTSDVLEAPIPPKFKAPTVKPYDG
Subjt:  EQRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDG

Query:  TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA             RFQEEQL+VAHCSDDSAMCYF TGLADEALTVKLGEEAP T
Subjt:  TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA-------------RFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE
        FAEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKDVERADPKSKDKGSFSSGR E
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDE

A0A6J1DZJ1 uncharacterized protein LOC1110257382.4e-11658.03Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G AP P+ ENFDAL++EMEAMR
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMR

Query:  TQMRSMEEMYNEMMLAAGAGSRSENRVT---RVDVRE----QRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKC
        TQM +MEEMYNEM+ A GAGSRSE+R     R D+R+    +R S L     P+   +  + ++E          +ITREEFDQL+ + DAQVE LKA+C
Subjt:  TQMRSMEEMYNEMMLAAGAGSRSENRVT---RVDVRE----QRGSHL----GPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKC

Query:  EQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR-------------
        E K  +  DGDL ESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLPAR             
Subjt:  EQKDDSLSDGDLEESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR-------------

Query:  ---------------------------------FQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
                                         FQEEQL+VAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR E+
Subjt:  ---------------------------------FQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER

Query:  KIDRGRSGKDVERADPKSKDKGSF---SSGRDESVP
        +ID+ +  ++  +A+ KSKDK  +    SG   S P
Subjt:  KIDRGRSGKDVERADPKSKDKGSF---SSGRDESVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTGGAGGGGCAAGGT
CACGACGGCCTAGTAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGATCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGAAATGTATAACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTATACTCGCCAGAGGGGAGACCTCCGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGG
GGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAGCGATGGCGACTTGGAGGAATCGCCTTTCACCTCGGACGTT
TTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGAC
TTCCAAGCGGCATCAGACGCGATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGATTCCAGGAGGAG
CAGTTGAGAGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCC
ACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAGCGAAAGATCGACCGGGGCAGAAGC
GGGAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGATGAGTCGGTCCCCGTGGAGATCTTAGATAATCCTTCGATC
ACGGAGCCAAATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGCAGA
AAGTTGGCAAGGCGGGCAGCTCGGTTCGTGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTTCGACCT
GGGACGTACATATTGGCCGATTTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTGGAGGGGCAAGGT
CACGACGGCCTAGTAACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGATCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGAAATGTATAACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTC
GGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTATACTCGCCAGAGGGGAGACCTCCGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGG
GGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAGCGATGGCGACTTGGAGGAATCGCCTTTCACCTCGGACGTT
TTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGAC
TTCCAAGCGGCATCAGACGCGATCAAGTGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGATTCCAGGAGGAG
CAGTTGAGAGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCC
ACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAGCGAAAGATCGACCGGGGCAGAAGC
GGGAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGATGAGTCGGTCCCCGTGGAGATCTTAGATAATCCTTCGATC
ACGGAGCCAAATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGCAGA
AAGTTGGCAAGGCGGGCAGCTCGGTTCGTGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTTCGACCT
GGGACGTACATATTGGCCGATTTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPDPTSENFDALKREMEAMRTQMRS
MEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLRIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLEESPFTSDV
LEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
TFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDVERADPKSKDKGSFSSGRDESVPVEILDNPSITEPNLMEIGAPESSWMDPIADFIKGNSPQDPKERR
KLARRAARFVRVQTHVGALDPAWEGPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP