; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g02280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g02280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:1806951..1808769
RNA-Seq ExpressionMoc07g02280
SyntenyMoc07g02280
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]8.1e-13079.5Show/hide
Query:  MRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD-
        MRTQM +ME MY+EMV AAGA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QR DLREHLNRKR SSLRKGQSPS SH +SNQQAESS++ 
Subjt:  MRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD-

Query:  --LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQ
            G+IT+EEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIKCR FQ
Subjt:  --LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFS RHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDG
        PATF EVLQKAKK+IDG
Subjt:  PATFAEVLQKAKKVIDG

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]1.7e-11691.06Show/hide
Query:  KRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGT
        +RGSSLRKGQSPSRSH SSNQQAESSH+    AG+IT+EEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+
Subjt:  KRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGT

Query:  KDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLM FQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFS RHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDG
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]3.4e-11280.83Show/hide
Query:  DNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV
        + E E YT QR DLREHLNRKR SSLRKGQSPS SH +SNQQAESS++      +IT+EEFDQL+ + DAQVEALKA CE+K+ S +DGDLGE PFT D+
Subjt:  DNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQ
        LEAPI PKFK PT+KPYDG+K+PKDYV+VFEGLM FQAA+DAIKCRAFQIA TGSARLWYRRLPARSISTYSQLR+EF++QFS R+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG
        K+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+ LTVKLGEEAPATFAEVLQKAKKVIDG
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]7.6e-11270Show/hide
Query:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESS
        MEAMRTQMR+ME MYN+MV  AGA SRS ++V   DV EQ   H  P +EE               DLR+HLNRKR SS R  ++ +  H +SNQQAESS
Subjt:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESS

Query:  HD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCR
        ++     G+IT+EEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IPPKFK PT+K YDG+KDPKDYVEVFEGLM FQAA+DAIKCR
Subjt:  HD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF  RHYD+KT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADE  TVKLG
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG

Query:  EEAPATFAEVLQKAKKVIDG
        EEA ATFAEVLQ  KK IDG
Subjt:  EEAPATFAEVLQKAKKVIDG

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]4.1e-14270.4Show/hide
Query:  KTLAASDAHQREVGAAAVEGQGHDGLAMEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEM
        + L A+D HQREVGA  VEGQ H+GL  EP  RSARIT P L PAHP+  KA RGRGG S++   G APAP+ ENFDALQ+EMEAMRTQM +ME MYNEM
Subjt:  KTLAASDAHQREVGAAAVEGQGHDGLAMEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEM

Query:  VLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHDLA---GIITKEEFDQL
        V A GAGSRSE+R      R++RG                        DLR+HL+RKR SSLRKG+SPS SH +SNQQAESS++     G+IT+EEFDQL
Subjt:  VLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHDLA---GIITKEEFDQL

Query:  RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLP
        + + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLP
Subjt:  RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLP

Query:  ARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVI
        ARSISTYSQLR+EF +QFS RHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVI
Subjt:  ARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVI

Query:  DG
        DG
Subjt:  DG

TrEMBL top hitse value%identityAlignment
A0A6J1DDS5 uncharacterized protein LOC1110198428.5e-11791.06Show/hide
Query:  KRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGT
        +RGSSLRKGQSPSRSH SSNQQAESSH+    AG+IT+EEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG+
Subjt:  KRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGT

Query:  KDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLM FQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFS RHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDG
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG

A0A6J1DDW5 uncharacterized protein LOC1110196343.9e-13079.5Show/hide
Query:  MRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD-
        MRTQM +ME MY+EMV AAGA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QR DLREHLNRKR SSLRKGQSPS SH +SNQQAESS++ 
Subjt:  MRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD-

Query:  --LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQ
            G+IT+EEFDQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIKCR FQ
Subjt:  --LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQ

Query:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTYSQLR+EF+ QFS RHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDG
        PATF EVLQKAKK+IDG
Subjt:  PATFAEVLQKAKKVIDG

A0A6J1DM55 uncharacterized protein LOC1110222671.7e-11280.83Show/hide
Query:  DNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV
        + E E YT QR DLREHLNRKR SSLRKGQSPS SH +SNQQAESS++      +IT+EEFDQL+ + DAQVEALKA CE+K+ S +DGDLGE PFT D+
Subjt:  DNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQ
        LEAPI PKFK PT+KPYDG+K+PKDYV+VFEGLM FQAA+DAIKCRAFQIA TGSARLWYRRLPARSISTYSQLR+EF++QFS R+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG
        K+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+ LTVKLGEEAPATFAEVLQKAKKVIDG
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG

A0A6J1DPN4 uncharacterized protein LOC1110230603.7e-11270Show/hide
Query:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESS
        MEAMRTQMR+ME MYN+MV  AGA SRS ++V   DV EQ   H  P +EE               DLR+HLNRKR SS R  ++ +  H +SNQQAESS
Subjt:  MEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESS

Query:  HD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCR
        ++     G+IT+EEF+QL+ + DAQVEALK +CE+K+ + +DGDLGESPFTSD+LEA IPPKFK PT+K YDG+KDPKDYVEVFEGLM FQAA+DAIKCR
Subjt:  HD---LAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF  RHYD+KT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADE  TVKLG
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG

Query:  EEAPATFAEVLQKAKKVIDG
        EEA ATFAEVLQ  KK IDG
Subjt:  EEAPATFAEVLQKAKKVIDG

A0A6J1DZJ1 uncharacterized protein LOC1110257382.0e-14270.4Show/hide
Query:  KTLAASDAHQREVGAAAVEGQGHDGLAMEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEM
        + L A+D HQREVGA  VEGQ H+GL  EP  RSARIT P L PAHP+  KA RGRGG S++   G APAP+ ENFDALQ+EMEAMRTQM +ME MYNEM
Subjt:  KTLAASDAHQREVGAAAVEGQGHDGLAMEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEM

Query:  VLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHDLA---GIITKEEFDQL
        V A GAGSRSE+R      R++RG                        DLR+HL+RKR SSLRKG+SPS SH +SNQQAESS++     G+IT+EEFDQL
Subjt:  VLAAGAGSRSENRVTRMDVREQRGSHLGPPEEERPEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHDLA---GIITKEEFDQL

Query:  RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLP
        + + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IP KFK PT+KPYDG+KDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLP
Subjt:  RGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLP

Query:  ARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVI
        ARSISTYSQLR+EF +QFS RHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVI
Subjt:  ARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVI

Query:  DG
        DG
Subjt:  DG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCCTGTTTGGGGAAGGGAATACCCTGGACACGTGGAGGACCCCATTGGTGCTCCAAGATGGCTCATGAGTCATCCCAGCTCGGATAGCCGAGATAGCTGCCACCA
ACGCCGACGAAAGCTAAGTATGAGGGCCGAGGTGAACCTGGCCCAGGTCCGCCCAAGTGTTCAGGTCGGTCCGGAGGCCGAGTTCGAGCTACAATCAGGAACACACTGTT
GTGCAAATCCTTGCATAAACATTTGGCGCCGTCTATCGAAGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGAC
GGCCTAGCAATGGAACCCCTCCGTAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAA
GAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAATTTTGATGCGCTTCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATA
ACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCACCCGAGGAAGAACGT
CCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGAAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCGTCCCGCTC
CCACGGGAGCTCCAACCAGCAGGCTGAATCCTCTCACGACCTCGCAGGGATAATCACAAAGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCT
TAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCT
CCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGCCTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTT
TCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTTTC
GGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCA
CACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCA
GAAGGCGAAGAAAGTCATCGATGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCCTGTTTGGGGAAGGGAATACCCTGGACACGTGGAGGACCCCATTGGTGCTCCAAGATGGCTCATGAGTCATCCCAGCTCGGATAGCCGAGATAGCTGCCACCA
ACGCCGACGAAAGCTAAGTATGAGGGCCGAGGTGAACCTGGCCCAGGTCCGCCCAAGTGTTCAGGTCGGTCCGGAGGCCGAGTTCGAGCTACAATCAGGAACACACTGTT
GTGCAAATCCTTGCATAAACATTTGGCGCCGTCTATCGAAGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGAC
GGCCTAGCAATGGAACCCCTCCGTAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAA
GAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAATTTTGATGCGCTTCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATA
ACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCACCCGAGGAAGAACGT
CCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGAAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCGTCCCGCTC
CCACGGGAGCTCCAACCAGCAGGCTGAATCCTCTCACGACCTCGCAGGGATAATCACAAAGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCT
TAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCT
CCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGCCTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTT
TCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTTTC
GGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCA
CACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCA
GAAGGCGAAGAAAGTCATCGATGGATAG
Protein sequenceShow/hide protein sequence
MVPVWGREYPGHVEDPIGAPRWLMSHPSSDSRDSCHQRRRKLSMRAEVNLAQVRPSVQVGPEAEFELQSGTHCCANPCINIWRRLSKTLAASDAHQREVGAAAVEGQGHD
GLAMEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPPEEER
PEDNESEGYTRQREDLREHLNRKRGSSLRKGQSPSRSHGSSNQQAESSHDLAGIITKEEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKA
PTVKPYDGTKDPKDYVEVFEGLMAFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSFRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG