CuGenDBv2

Moc11g26700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g26700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr11:19650692..19652717
RNA-Seq Expression: Moc11g26700
Synteny: Moc11g26700
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 1.2e-108; % identity: 80.67)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIPPKFKA TVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------

Query:  --------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
                       RLWYRRL A SISTYSQLR+EFLA FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  --------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY
        TVKLG+EAPATF EVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD+E ADPKSKDKGSFSSG AEY
Subjt:  TVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] (e value: 1.7e-123; % identity: 70.39)
Query:  MRTQMRSVEEMYNEMMLATGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHN-
        MRTQM ++E+MY+EM+ A GA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QRGDLREHLNRKR SSLRKGQSPS S R+ NQQAESS+N 
Subjt:  MRTQMRSVEEMYNEMMLATGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHN-

Query:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGES FTSD+LEA IP KFK  T+KPYDG+KDPKDYVE                     
Subjt:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------

Query:  ------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEA
               RLWYRRL A SISTYSQLRKEF+ QFSSRHYD KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL +EA
Subjt:  ------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEA

Query:  PATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKG--SFSS
        PATFVEVLQKAKK+IDGQELLRTKTDRPE+KI +GR+ KD  + D K++DKG  SFSS
Subjt:  PATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKG--SFSS

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia] (e value: 1.1e-114; % identity: 80)
Query:  KRGSSLRKGQSPSRSLRSFNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGT
        +RGSSLRKGQSPSRS RS NQQAESSHN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGES FTSDVLEAPIP KFKA TVKPYDG+
Subjt:  KRGSSLRKGQSPSRSLRSFNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGT

Query:  KDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVE                            RLWYRRL A SISTYSQLR+EFLAQFSSRHYD++TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY
         HCSDDSAMCYFLTGLADEA TVKLG+EAPATF EVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD+ERAD KSKDKGSFSS  A Y
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] (e value: 8.0e-110; % identity: 84.82)
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------------------
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES FTSDVLEAPIPPKFKA TVKPYDGTKDPKDYVE                         
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------------------

Query:  --PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATF
           RLWYRRL   SISTYSQLR+EFLAQFSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG+EAPATF
Subjt:  --PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATF

Query:  VEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY
         EVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKDVERADPKSKDKGSFSSG AEY
Subjt:  VEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] (e value: 1.4e-125; % identity: 61.4)
Query:  VEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGARGPAPVPTSENFDALKREMEAMRTQMRSVEEMYNEMMLATGAGSRSENRVTRV
        VEGQ H+GL TEP  RSARITTP L PAHP+  KA RGRGG S++   G AP P+ ENFDAL++EMEAMRTQM ++EEMYNEM+ A GAGSRSE+R  R 
Subjt:  VEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGARGPAPVPTSENFDALKREMEAMRTQMRSVEEMYNEMMLATGAGSRSENRVTRV

Query:  DVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQ
                                    +RGDLR+HL+RKR SSLRKG+SPS S ++ NQQAESS+NP    G+ITREEFDQL+ + DAQVE LKA+CE 
Subjt:  DVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQ
        K  + +DGDLGES FTSD+LEA IP KFK  T+KPYDG+KDPKDYVE                            RLWYRRL A SISTYSQLRKEF +Q
Subjt:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQ

Query:  FSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKI
        FSSRHY+ KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLG+EAPATF EVLQKAKKVIDGQEL RTKT R E++I
Subjt:  FSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKI

Query:  GRGRSGKDVERADPKSKDKGSF---SSGPA
         + +  ++  +A+ KSKDK  +    SGP+
Subjt:  GRGRSGKDVERADPKSKDKGSF---SSGPA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 5.6e-109; % identity: 80.67)
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES FTSDVLEAPIPPKFKA TVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------

Query:  --------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
                       RLWYRRL A SISTYSQLR+EFLA FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  --------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY
        TVKLG+EAPATF EVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD+E ADPKSKDKGSFSSG AEY
Subjt:  TVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY

A0A6J1DDS5 uncharacterized protein LOC111019842 (e value: 5.2e-115; % identity: 80)
Query:  KRGSSLRKGQSPSRSLRSFNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGT
        +RGSSLRKGQSPSRS RS NQQAESSHN   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGES FTSDVLEAPIP KFKA TVKPYDG+
Subjt:  KRGSSLRKGQSPSRSLRSFNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGT

Query:  KDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVE                            RLWYRRL A SISTYSQLR+EFLAQFSSRHYD++TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY
         HCSDDSAMCYFLTGLADEA TVKLG+EAPATF EVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKD+ERAD KSKDKGSFSS  A Y
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY

A0A6J1DDW5 uncharacterized protein LOC111019634 (e value: 8.0e-124; % identity: 70.39)
Query:  MRTQMRSVEEMYNEMMLATGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHN-
        MRTQM ++E+MY+EM+ A GA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QRGDLREHLNRKR SSLRKGQSPS S R+ NQQAESS+N 
Subjt:  MRTQMRSVEEMYNEMMLATGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHN-

Query:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------
          P G+ITREEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGES FTSD+LEA IP KFK  T+KPYDG+KDPKDYVE                     
Subjt:  --PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------

Query:  ------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEA
               RLWYRRL A SISTYSQLRKEF+ QFSSRHYD KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL +EA
Subjt:  ------PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEA

Query:  PATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKG--SFSS
        PATFVEVLQKAKK+IDGQELLRTKTDRPE+KI +GR+ KD  + D K++DKG  SFSS
Subjt:  PATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKG--SFSS

A0A6J1DS95 uncharacterized protein LOC111023421 (e value: 3.9e-110; % identity: 84.82)
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------------------
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES FTSDVLEAPIPPKFKA TVKPYDGTKDPKDYVE                         
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE-------------------------

Query:  --PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATF
           RLWYRRL   SISTYSQLR+EFLAQFSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG+EAPATF
Subjt:  --PRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATF

Query:  VEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY
         EVLQKAKKVIDGQELLRTKT RPERKIGRGRSGKDVERADPKSKDKGSFSSG AEY
Subjt:  VEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY

A0A6J1DZJ1 uncharacterized protein LOC111025738 (e value: 6.6e-126; % identity: 61.4)
Query:  VEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGARGPAPVPTSENFDALKREMEAMRTQMRSVEEMYNEMMLATGAGSRSENRVTRV
        VEGQ H+GL TEP  RSARITTP L PAHP+  KA RGRGG S++   G AP P+ ENFDAL++EMEAMRTQM ++EEMYNEM+ A GAGSRSE+R  R 
Subjt:  VEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGARGPAPVPTSENFDALKREMEAMRTQMRSVEEMYNEMMLATGAGSRSENRVTRV

Query:  DVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQ
                                    +RGDLR+HL+RKR SSLRKG+SPS S ++ NQQAESS+NP    G+ITREEFDQL+ + DAQVE LKA+CE 
Subjt:  DVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQ
        K  + +DGDLGES FTSD+LEA IP KFK  T+KPYDG+KDPKDYVE                            RLWYRRL A SISTYSQLRKEF +Q
Subjt:  KDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVE---------------------------PRLWYRRLLAWSISTYSQLRKEFLAQ

Query:  FSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKI
        FSSRHY+ KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLG+EAPATF EVLQKAKKVIDGQEL RTKT R E++I
Subjt:  FSSRHYDEKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKI

Query:  GRGRSGKDVERADPKSKDKGSF---SSGPA
         + +  ++  +A+ KSKDK  +    SGP+
Subjt:  GRGRSGKDVERADPKSKDKGSF---SSGPA

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCTTTCGACACAACAGATTATAGTACCGGTATACAAGTTTCATCATGAAAACCAATTAGAAAATAACAACAATTCAATCATAGTTGTTTGGGGAAGGGAATACCTTGG
ACACGTGGAGGACCCCTATTGGTGCTCCAAGATGGCTCATGAGTCATGCCCAGCTCGGATAGCCGAGATGGCCGCCACTAAGGTCTGCCCAAGTGTTCAGGTCGGTCCGG
AGGCCGAGTTCGAGCTGCAATCTGCAACACACTATTGTGCATATCCTTGCATAAACATTTGGCGCCGTCTGTGGGGAACGACAATCTTAGTCATCCCAATTCTTTTAAAC
CAACACGCGAGCGAACATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGTCGA
GGGGCAAGGTCACGACGGCCTGGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCACGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCC
GAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGTTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCT
GTGGAGGAAATGTATAACGAAATGATGCTAGCTACAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCC
AGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGAGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGC
AGTCACCATCCCGTTCACTCAGGAGCTTCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGAT
GCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCC
TCCGAAGTTCAAAGCTCTTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGCCGCGATTGTGGTATCGGAGACTGCTAGCCTGGTCGATCTCGA
CCTACTCTCAGCTGAGAAAGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACGAAAAGACAGCGACTCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTG
CGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACAGT
GAAACTTGGAAAGGAGGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTTCGAACCAAAACCGACCGACCGGAGCGAA
AGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCCAGCTGAGTATTGA
mRNA sequence
ATGCTTTCGACACAACAGATTATAGTACCGGTATACAAGTTTCATCATGAAAACCAATTAGAAAATAACAACAATTCAATCATAGTTGTTTGGGGAAGGGAATACCTTGG
ACACGTGGAGGACCCCTATTGGTGCTCCAAGATGGCTCATGAGTCATGCCCAGCTCGGATAGCCGAGATGGCCGCCACTAAGGTCTGCCCAAGTGTTCAGGTCGGTCCGG
AGGCCGAGTTCGAGCTGCAATCTGCAACACACTATTGTGCATATCCTTGCATAAACATTTGGCGCCGTCTGTGGGGAACGACAATCTTAGTCATCCCAATTCTTTTAAAC
CAACACGCGAGCGAACATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGTCGA
GGGGCAAGGTCACGACGGCCTGGCAACGGAACCCCTCCGCAGGTCGGCACGGATCACCACGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCC
GAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGTTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCT
GTGGAGGAAATGTATAACGAAATGATGCTAGCTACAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCC
AGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGAGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGC
AGTCACCATCCCGTTCACTCAGGAGCTTCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGAT
GCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCC
TCCGAAGTTCAAAGCTCTTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGCCGCGATTGTGGTATCGGAGACTGCTAGCCTGGTCGATCTCGA
CCTACTCTCAGCTGAGAAAGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACGAAAAGACAGCGACTCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTG
CGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACAGT
GAAACTTGGAAAGGAGGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTTCGAACCAAAACCGACCGACCGGAGCGAA
AGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCCAGCTGAGTATTGA
Protein sequence
MLSTQQIIVPVYKFHHENQLENNNNSIIVVWGREYLGHVEDPYWCSKMAHESCPARIAEMAATKVCPSVQVGPEAEFELQSATHYCAYPCINIWRRLWGTTILVIPILLN
QHASEHGSTSELDQYDRSKDSSCQRCPPEGGRSSSVEGQGHDGLATEPLRRSARITTPALPPAHPRTSKATRGRGGTSKKGARGPAPVPTSENFDALKREMEAMRTQMRS
VEEMYNEMMLATGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESERYTRQRGDLREHLNRKRGSSLRKGQSPSRSLRSFNQQAESSHNPAGIITREEFDQLRGELD
AQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKALTVKPYDGTKDPKDYVEPRLWYRRLLAWSISTYSQLRKEFLAQFSSRHYDEKTATHLATIRQKEGETL
REYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGKEAPATFVEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDVERADPKSKDKGSFSSGPAEY