; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:6993342..6994741
RNA-Seq ExpressionMoc07g09030
SyntenyMoc07g09030
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.9e-10175.45Show/hide
Query:  QAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------
        +AES  NPATPAG+ITREEFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD                   
Subjt:  QAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------

Query:  ------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEA
               RI L     LW ++R    + S  +  R   LA       D ++ATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEA
Subjt:  ------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEA

Query:  FTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
         TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFS+GRAEYRRAENGP
Subjt:  FTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]4.2e-9561.56Show/hide
Query:  EEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEG
        E+ G +LGPV++ HPE  E E +T QRGDL EHLNRKR SS RKGQSPS SHR+SNQQAES +NP TP G+ITREEFDQL+ K DAQVEALKAKCE+KE 
Subjt:  EEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEG

Query:  PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSK-------------------------DLRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARD
          +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSK                         D +I L     LW ++R   ++ S  +  R   +    
Subjt:  PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSK-------------------------DLRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARD

Query:  CGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR
            D ++ATHL TIRQKEGETLREYVTRFQEEQLKVAH SD SAMCYFLT LADE  TVKL EEAPATF EVLQKAKK+IDGQELLRTKT RPE+KI +
Subjt:  CGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR

Query:  GRSGKDIEKADPKSKDKGSFS-NGRAEYRRAEN
        GR+ KD  K D K++DKG  S + R  YRR++N
Subjt:  GRSGKDIEKADPKSKDKGSFS-NGRAEYRRAEN

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]2.2e-10774.83Show/hide
Query:  KRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSS RKGQSPSRSHRSSNQQAES HNPATPAG+ITREEFDQLRGKLDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLK
        +D                          +I L     LW ++R   ++ S  +  R   LA       D ++ATHLATIRQKEGETLREYVTRFQEEQLK
Subjt:  KD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLK

Query:  VAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
        V H SDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFS+ RA YRRAENGP
Subjt:  VAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia]3.0e-10176.19Show/hide
Query:  NRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD
        + KRGSS RKGQSPSRSHRSSNQQAES HN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGE PFTSDVLEAPIPPKFKAPTVKPYD
Subjt:  NRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD

Query:  GSKDLRIMLRSLRALWIFKRHQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVK
        G+KD +  +     L  F   Q  S+    R+  +A      G  +        RQKE ETLREYVTRFQEEQLKVAH SDDSAMCYF TGLADEA TVK
Subjt:  GSKDLRIMLRSLRALWIFKRHQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVK

Query:  LGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
        LGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+ADPKSKDKGSFS+GRAEYRRAENGP
Subjt:  LGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]3.2e-11957.95Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAALVQGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPTSAPTSENFDALQREMEAMR
        MVQP +STNT DRR L A+D HQREVGA +V+GQ H+GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G   AP+ ENFDALQ+EMEAMR
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAALVQGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPTSAPTSENFDALQREMEAMR

Query:  TQMRSMEEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAK
        TQM +MEEM + +          +E      +RGDL +HL+RKR SS RKG+SPS SH++SNQQAES +NP  P G+ITREEFDQL+ K DAQVE LKA+
Subjt:  TQMRSMEEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAK

Query:  CEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSR
        CE K    +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKD                          +I L S   LW ++R   ++ S  +  R  
Subjt:  CEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSR

Query:  LLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP
          +       + ++ATHLATIRQKE ETLREYVT FQEEQLKVAH+SDDSA+CYFLT L DE  TVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR 
Subjt:  LLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP

Query:  ERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
        E++I + +  ++  KA+ KSKDK       AEYRR+++GP
Subjt:  ERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.9e-10175.45Show/hide
Query:  QAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------
        +AES  NPATPAG+ITREEFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD                   
Subjt:  QAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------

Query:  ------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEA
               RI L     LW ++R    + S  +  R   LA       D ++ATHLATIRQKEGETLREYVTRFQEEQLKVAH SDDSAMCYFLTGLADEA
Subjt:  ------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEA

Query:  FTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
         TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFS+GRAEYRRAENGP
Subjt:  FTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

A0A6J1DDS5 uncharacterized protein LOC1110198421.0e-10774.83Show/hide
Query:  KRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSS RKGQSPSRSHRSSNQQAES HNPATPAG+ITREEFDQLRGKLDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLK
        +D                          +I L     LW ++R   ++ S  +  R   LA       D ++ATHLATIRQKEGETLREYVTRFQEEQLK
Subjt:  KD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLK

Query:  VAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
        V H SDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFS+ RA YRRAENGP
Subjt:  VAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

A0A6J1DDW5 uncharacterized protein LOC1110196342.0e-9561.56Show/hide
Query:  EEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEG
        E+ G +LGPV++ HPE  E E +T QRGDL EHLNRKR SS RKGQSPS SHR+SNQQAES +NP TP G+ITREEFDQL+ K DAQVEALKAKCE+KE 
Subjt:  EEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEG

Query:  PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSK-------------------------DLRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARD
          +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSK                         D +I L     LW ++R   ++ S  +  R   +    
Subjt:  PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSK-------------------------DLRIMLRSLRALWIFKR--HQTQSNVAPFRSRLLAARD

Query:  CGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR
            D ++ATHL TIRQKEGETLREYVTRFQEEQLKVAH SD SAMCYFLT LADE  TVKL EEAPATF EVLQKAKK+IDGQELLRTKT RPE+KI +
Subjt:  CGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR

Query:  GRSGKDIEKADPKSKDKGSFS-NGRAEYRRAEN
        GR+ KD  K D K++DKG  S + R  YRR++N
Subjt:  GRSGKDIEKADPKSKDKGSFS-NGRAEYRRAEN

A0A6J1DXR9 uncharacterized protein LOC1110251091.5e-10176.19Show/hide
Query:  NRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD
        + KRGSS RKGQSPSRSHRSSNQQAES HN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGE PFTSDVLEAPIPPKFKAPTVKPYD
Subjt:  NRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYD

Query:  GSKDLRIMLRSLRALWIFKRHQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVK
        G+KD +  +     L  F   Q  S+    R+  +A      G  +        RQKE ETLREYVTRFQEEQLKVAH SDDSAMCYF TGLADEA TVK
Subjt:  GSKDLRIMLRSLRALWIFKRHQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVK

Query:  LGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
        LGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+ADPKSKDKGSFS+GRAEYRRAENGP
Subjt:  LGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

A0A6J1DZJ1 uncharacterized protein LOC1110257381.5e-11957.95Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAALVQGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPTSAPTSENFDALQREMEAMR
        MVQP +STNT DRR L A+D HQREVGA +V+GQ H+GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G   AP+ ENFDALQ+EMEAMR
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAALVQGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPTSAPTSENFDALQREMEAMR

Query:  TQMRSMEEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAK
        TQM +MEEM + +          +E      +RGDL +HL+RKR SS RKG+SPS SH++SNQQAES +NP  P G+ITREEFDQL+ K DAQVE LKA+
Subjt:  TQMRSMEEMGSNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAK

Query:  CEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSR
        CE K    +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKD                          +I L S   LW ++R   ++ S  +  R  
Subjt:  CEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKD-------------------------LRIMLRSLRALWIFKR--HQTQSNVAPFRSR

Query:  LLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP
          +       + ++ATHLATIRQKE ETLREYVT FQEEQLKVAH+SDDSA+CYFLT L DE  TVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR 
Subjt:  LLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLTGLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP

Query:  ERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP
        E++I + +  ++  KA+ KSKDK       AEYRR+++GP
Subjt:  ERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAGCATTGGTACAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGGTCCAACCTCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGGGT
TCCAACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCTGTGAGCATCTCAACAGAAAGAGAGGCTCATC
CTTCCGAAAAGGACAGTCACCATCCCGCTCACATCGGAGCTCCAACCAGCAGGCTGAATCTTTTCACAACCCAGCAACTCCCGCAGGGATGATCACAAGGGAGGAGTTCG
ACCAGCTGAGGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCATTGAACGATGGCGACCTAGGAGAATCGCCTTTCACCTCG
GACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCTAAGGATTATGTTGAGGTCTTTGAGGGCCTTATGGAT
TTTCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGTCAGCGACCCATCTCGCCACCA
TCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACTAGGTTCCAGGAGGAGCAATTGAAGGTCGCACACTGGTCCGATGACTCGGCCATGTGCTATTTTCTCACC
GGTCTAGCCGACGAAGCCTTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCG
AACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAATGGCCGAG
CTGAGTATCGAAGGGCGGAGAACGGACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAGCATTGGTACAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCTGGGGTCCAACCTCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGGGT
TCCAACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCTGTGAGCATCTCAACAGAAAGAGAGGCTCATC
CTTCCGAAAAGGACAGTCACCATCCCGCTCACATCGGAGCTCCAACCAGCAGGCTGAATCTTTTCACAACCCAGCAACTCCCGCAGGGATGATCACAAGGGAGGAGTTCG
ACCAGCTGAGGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCATTGAACGATGGCGACCTAGGAGAATCGCCTTTCACCTCG
GACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCAAAGGACCTAAGGATTATGTTGAGGTCTTTGAGGGCCTTATGGAT
TTTCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGTCAGCGACCCATCTCGCCACCA
TCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACTAGGTTCCAGGAGGAGCAATTGAAGGTCGCACACTGGTCCGATGACTCGGCCATGTGCTATTTTCTCACC
GGTCTAGCCGACGAAGCCTTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCG
AACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAATGGCCGAG
CTGAGTATCGAAGGGCGGAGAACGGACCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAALVQGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGAWGPTSAPTSENFDALQREMEAMRTQMRSMEEMG
SNLGPVEEEHPEDNESEGHTRQRGDLCEHLNRKRGSSFRKGQSPSRSHRSSNQQAESFHNPATPAGMITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTS
DVLEAPIPPKFKAPTVKPYDGSKDLRIMLRSLRALWIFKRHQTQSNVAPFRSRLLAARDCGIGDCQSATHLATIRQKEGETLREYVTRFQEEQLKVAHWSDDSAMCYFLT
GLADEAFTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSNGRAEYRRAENGP