CuGenDBv2

Moc04g07960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g07960
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:5792827..5794226
RNA-Seq Expression: Moc04g07960
Synteny: Moc04g07960
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
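
The genome location in the record above can be split into its parts programmatically. The following is a minimal sketch, not part of the database: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end = parse_location("chr4:5792827..5794226")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1400 bp on chr4.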


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]  e value: 5.0e-136  % identity: 91.4
Query:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASD
        +AESS N   P G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY+G+KDPKD VEVFE LMDFQAASD
Subjt:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR+GKD+E ADPKS+DKGSF  GRAEYRRAENGPTRS
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]  e value: 7.9e-142  % identity: 76.62
Query:  SKAIRGRGAGSRSENRVTRMDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEF
        S+ ++  GA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QRGDLREHLNRKR SSLRKGQSPS SH++SNQQAESS+NP+   G+ITREEF
Subjt:  SKAIRGRGAGSRSENRVTRMDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEF

Query:  DQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR
        DQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+KPY+G+KDPKD VEVFEGLMDFQAA+DAIKCR FQIALTGSARLWYR
Subjt:  DQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR

Query:  RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK
        RLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL EEAPATF EVLQKAK
Subjt:  RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK

Query:  KVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKG-SFFGGRAEYRRAEN
        K+IDGQELLRTKT RPE+KI +GR  KD  + D K+ DKG S F  R  YRR++N
Subjt:  KVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKG-SFFGGRAEYRRAEN

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]  e value: 6.5e-144  % identity: 90.67
Query:  KRGSSLRKGQSPSRSHKSSNQQAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGT
        +RGSSLRKGQSPSRSH+SSNQQAESSHN   P G+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPY+G+
Subjt:  KRGSSLRKGQSPSRSHKSSNQQAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGT

Query:  KDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKD VEVFEGLMDFQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR+GKD+ERAD KS+DKGSF   RA YRRAENGPTRS
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]  e value: 9.7e-140  % identity: 97.38
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY+GTKDPKD VEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGR+GKDVERADPKS+DKGSF  GRAEYRRAENGPTRS
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]  e value: 1.4e-138  % identity: 65.53
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITAPALPPAHARASKAIRGRGAGSR-----------SENRVTRMDVCEQRG
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+ L  EP  RSARIT P L PAH +  KA RGRG  SR            EN        E   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITAPALPPAHARASKAIRGRGAGSR-----------SENRVTRMDVCEQRG

Query:  SHLGPAEEERPE---------DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKC
        + +   EE   E          +E      +RGDLR+HL+RKR SSLRKG+SPS SHK+SNQQAESS+NPV   G+ITREEFDQL+ + DAQVE LKA+C
Subjt:  SHLGPAEEERPE---------DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKC

Query:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFL
        E K  + +DGDLGESPFTSD+LEA IP KFK PT+KPY+G+KDPKD VEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLPARSISTYSQLR+EF 
Subjt:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFL

Query:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
        +QFSSRHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR E+
Subjt:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER

Query:  KIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
        +I + +  ++  +A+ KS+DK       AEYRR+++GP+RS
Subjt:  KIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813  e value: 2.4e-136  % identity: 91.4
Query:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASD
        +AESS N   P G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY+G+KDPKD VEVFE LMDFQAASD
Subjt:  QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR+GKD+E ADPKS+DKGSF  GRAEYRRAENGPTRS
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

A0A6J1DDS5 uncharacterized protein LOC111019842  e value: 3.1e-144  % identity: 90.67
Query:  KRGSSLRKGQSPSRSHKSSNQQAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGT
        +RGSSLRKGQSPSRSH+SSNQQAESSHN   P G+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPY+G+
Subjt:  KRGSSLRKGQSPSRSHKSSNQQAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGT

Query:  KDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKD VEVFEGLMDFQAASD IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR+GKD+ERAD KS+DKGSF   RA YRRAENGPTRS
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

A0A6J1DDW5 uncharacterized protein LOC111019634  e value: 3.8e-142  % identity: 76.62
Query:  SKAIRGRGAGSRSENRVTRMDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEF
        S+ ++  GA SRSENRV R D+ EQRG HLGP ++  PE  E E YT QRGDLREHLNRKR SSLRKGQSPS SH++SNQQAESS+NP+   G+ITREEF
Subjt:  SKAIRGRGAGSRSENRVTRMDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEF

Query:  DQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR
        DQL+ + DAQVEALKAKCE+K+ S +DGDLGESPFTSD+LEA IP KFK PT+KPY+G+KDPKD VEVFEGLMDFQAA+DAIKCR FQIALTGSARLWYR
Subjt:  DQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR

Query:  RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK
        RLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL EEAPATF EVLQKAK
Subjt:  RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK

Query:  KVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKG-SFFGGRAEYRRAEN
        K+IDGQELLRTKT RPE+KI +GR  KD  + D K+ DKG S F  R  YRR++N
Subjt:  KVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKG-SFFGGRAEYRRAEN

A0A6J1DS95 uncharacterized protein LOC111023421  e value: 4.7e-140  % identity: 97.38
Query:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALT
        GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPY+GTKDPKD VEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGR+GKDVERADPKS+DKGSF  GRAEYRRAENGPTRS
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

A0A6J1DZJ1 uncharacterized protein LOC111025738  e value: 6.8e-139  % identity: 65.53
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITAPALPPAHARASKAIRGRGAGSR-----------SENRVTRMDVCEQRG
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+ L  EP  RSARIT P L PAH +  KA RGRG  SR            EN        E   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITAPALPPAHARASKAIRGRGAGSR-----------SENRVTRMDVCEQRG

Query:  SHLGPAEEERPE---------DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKC
        + +   EE   E          +E      +RGDLR+HL+RKR SSLRKG+SPS SHK+SNQQAESS+NPV   G+ITREEFDQL+ + DAQVE LKA+C
Subjt:  SHLGPAEEERPE---------DNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKC

Query:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFL
        E K  + +DGDLGESPFTSD+LEA IP KFK PT+KPY+G+KDPKD VEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLPARSISTYSQLR+EF 
Subjt:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKDPKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFL

Query:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
        +QFSSRHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR E+
Subjt:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER

Query:  KIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
        +I + +  ++  +A+ KS+DK       AEYRR+++GP+RS
Subjt:  KIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTGGAGGGGCAAGGTCACGA
CAGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCCGCCCTACCGCCTGCGCACGCGAGGGCGTCCAAGGCCATCCGTGGCCGAGGCGCAGGGTCCC
GATCTGAAAATAGAGTGACGCGCATGGACGTATGCGAGCAAAGGGGTTCCCACCTAGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGC
CAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAAGAGCTCCAACCAGCAGGCTGAATCCTC
TCACAATCCCGTAGGGATAATTACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCAC
TGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGAGGGGACGAAGGAC
CCCAAGGACAATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTG
GTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACTCATCTCG
CCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTTCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTC
CTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCT
CCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGTCGGGGCCGAAATGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCGAGGACAAGGGATCCTTTTTCGGCG
GCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACTAGGAGCTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTGGAGGGGCAAGGTCACGA
CAGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCCGCCCTACCGCCTGCGCACGCGAGGGCGTCCAAGGCCATCCGTGGCCGAGGCGCAGGGTCCC
GATCTGAAAATAGAGTGACGCGCATGGACGTATGCGAGCAAAGGGGTTCCCACCTAGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGC
CAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAAGAGCTCCAACCAGCAGGCTGAATCCTC
TCACAATCCCGTAGGGATAATTACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCAC
TGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGAGGGGACGAAGGAC
CCCAAGGACAATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTG
GTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACTCATCTCG
CCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTTCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTC
CTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCT
CCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGTCGGGGCCGAAATGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCGAGGACAAGGGATCCTTTTTCGGCG
GCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACTAGGAGCTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDSLAAEPLRRSARITAPALPPAHARASKAIRGRGAGSRSENRVTRMDVCEQRGSHLGPAEEERPEDNESEGYTR
QRGDLREHLNRKRGSSLRKGQSPSRSHKSSNQQAESSHNPVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYEGTKD
PKDNVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYF
LTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRNGKDVERADPKSEDKGSFFGGRAEYRRAENGPTRS
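
A quick consistency check between a CDS and its protein sequence, such as the pair listed above, can be sketched as follows. This is an illustrative helper, not something the database provides: `check_cds` and `clean` are hypothetical names, and the check assumes the standard stop codons (TAA, TAG, TGA) and a CDS that includes its terminal stop codon. It compares lengths and boundary codons only and does not translate the sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code

def clean(seq: str) -> str:
    """Remove line breaks and spaces, and upper-case the sequence."""
    return "".join(seq.split()).upper()

def check_cds(cds: str, protein: str) -> dict:
    """Report basic structural checks for a CDS against its protein."""
    cds, protein = clean(cds), clean(protein)
    # Split into consecutive triplets; the last one is partial if the
    # length is not a multiple of three.
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    return {
        "length_multiple_of_3": len(cds) % 3 == 0,
        "starts_with_ATG": cds.startswith("ATG"),
        "ends_with_stop": bool(codons) and codons[-1] in STOP_CODONS,
        # One codon per residue, plus the stop codon.
        "codons_match_protein_length": len(codons) - 1 == len(protein),
    }

# Toy input; paste the CDS and protein sequence blocks in place of these.
print(check_cds("ATGGCT\nTGA", "MA"))
```

Multi-line sequence blocks can be passed in directly, since whitespace is stripped before the checks run.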