CuGenDBv2

Moc09g14430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g14430
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:12433497..12436621
RNA-Seq Expression: Moc09g14430
Synteny: Moc09g14430
Gene Ontology terms: GO:0005737 - cytoplasm (cellular component)
                     GO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domains: NA


Homology
GenBank top hits (columns: e-value, % identity, alignment)
ERM96107.1 hypothetical protein AMTR_s02760p00000080, partial [Amborella trichopoda] (e-value 1.9e-76, identity 51.47%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKD--ESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKR
        ++HKL GSNY  W   +RL+LRSID DDH+T+DPP++  +S+K WLR+DA+L LQI+NSI+ ++I L+N CE VKEL+ YLEFLYSGK N+SRI+++CK 
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKD--ESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKR

Query:  FYQPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALV
        FY+ E  D+SL NYFM  K+ Y E N LLP S D KV+  QREQ+ ++SFL GL   FD  K  +LS S++ +L++ +T VL+ E +    S+  NSALV
Subjt:  FYQPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALV

Query:  GRSTNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLMASSSNP
         R+ +A   N+        N+Q       NSG I+ CNYC KP HTK ECRKL  K ++  SA++AST D + K V I A+EFAKF +YQE+L +SSS+ 
Subjt:  GRSTNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLMASSSNP

Query:  ITAIAES
        +TAIA+S
Subjt:  ITAIAES

XP_010650945.1 PREDICTED: uncharacterized protein LOC104879534 [Vitis vinifera] (e-value 4.0e-79, identity 48.76%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY
        +EHKLNGSNY  W   ++++LRS+  DDH+TE+PP D ++K W++DDA+L LQ+KNSI  DI+GL++ CE+VKEL+ YL+FLYSGKGNVSR++++   F+
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY

Query:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR
         PE G +SLT YFM+ K++Y E NAL+P S D +V+ AQREQ+ V+SFL GLP  F+  K  +LS S+I +L+E ++ VL+ E    V SSQ  + LV +
Subjt:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR

Query:  ---STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLM
           + NA R N   GN    N        ++S   I C YCH+  HTK+ CRKL N+ ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L 
Subjt:  ---STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLM

Query:  ASSSNPITAIAESGSYDEGDYW
        AS+  P++A+AESGSYDE D W
Subjt:  ASSSNPITAIAESGSYDEGDYW

XP_022850817.1 uncharacterized protein LOC111372670 [Olea europaea var. sylvestris] (e-value 1.5e-78, identity 47.2%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY
        +EHKLN +NY  W   +R++LRSID DDH+ +DPPKD++K++WLR+DA+L LQI+NSI+ ++IGL+N CE+VKEL+ YLEFLYS KGNVSRI+E+C+ FY
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY

Query:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR
        + E   +SLT +FM+ KR Y E N LLP S D KV+  QREQ+ V+SFL GLP  F+ +K  +LS SEIP L++ ++ VL+ E + P+   Q N+ LV +
Subjt:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR

Query:  STNANRGNTYQGNSTTSNAQRSSAPKSN-SGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVASTPDNA-GKLVTIPAEEFAKFQQYQESLMASSSNP
              GN Y         +  +    N     + C Y H+P HTK  CRKL N+ ++TQ+A+VA+TP ++  K V I  +E+AKF QYQESL    S P
Subjt:  STNANRGNTYQGNSTTSNAQRSSAPKSN-SGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVASTPDNA-GKLVTIPAEEFAKFQQYQESLMASSSNP

Query:  ITAIAES-----------------------GSYDEGDYW
        +TA+AES                       GSYDE DYW
Subjt:  ITAIAES-----------------------GSYDEGDYW

XP_034673007.1 uncharacterized protein LOC117904490 [Vitis riparia] (e-value 2.1e-80, identity 49.84%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY
        +EHKLNGSNY  W   ++++LRS+  DDH+TE+PP D ++K WL+DDA+L LQ+KNSI  DI+GL++ CE+VKEL+ YL+FLYSGKGNVSR++++   F+
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY

Query:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR
         PE G +SLT YFM+ K++Y E NAL+P S D +V+ AQREQ+ V+SFL GLP  F+  K  +LS S+I +L+E ++ VL+ E    V SSQ  + LV +
Subjt:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR

Query:  STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLMASS
          NA   NT + N+   N  R+   + N    I C YCH+  HTKR CRKL N+ ++ Q+A+VA++      D++ K++T+ AEEFAK+ QYQ++L AS+
Subjt:  STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLMASS

Query:  SNPITAIAESGSYDEGDYW
          P++A+AESGSYDE D W
Subjt:  SNPITAIAESGSYDEGDYW

XP_038882618.1 uncharacterized protein LOC120073824 [Benincasa hispida] (e-value 1.5e-78, identity 56.49%)
Query:  EHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFYQ
        EHKLNGSNYSSW+ NVR F+RS++MDDHITE+ P D +KK W RDD++++LQIKNSI+ +I+ LVN CE VK+LL+YL+FLYSGK N++R+F++CK  YQ
Subjt:  EHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFYQ

Query:  PEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGRS
        P+ G++SLT+YFME K    EFNAL+P S D KV +A+ E+L ++SFL+GL P++++ KD +LS   I +LEEAYT +L+ EK+Q V S  S+S L+GR 
Subjt:  PEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGRS

Query:  TNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQS--AHVASTPDNAGKLVTIPAEEFAKF
        TN  RGN  +G    S+    +  +SN G  + C Y     HTKRECR+LLNKGQ+  S  AHVASTPDN  K +TI AEEFAKF
Subjt:  TNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQS--AHVASTPDNAGKLVTIPAEEFAKF

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A438DE60 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value 1.7e-75, identity 47.27%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY
        +EHKLNGSNY  W   ++++LRS+  DDH+TE+PP D ++K W++DDA+L LQ+KNSI  DI+GL++ CE+VKEL+ YL+FLYSGKGNVSR++++   F+
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY

Query:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR
         PE G +SLT YFM+ K++Y E NAL+P S D +V+ AQREQ+ V+SFL GLP  F+  K  +LS S+I +L+E ++ VL   +++ V SSQ  + LV +
Subjt:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR

Query:  STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLMASS
          NA          T     R+   + N    I C YCH+  HTK+ CRKL N+ ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L AS+
Subjt:  STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLMASS

Query:  SNPITAIAESG
          P++A+AESG
Subjt:  SNPITAIAESG

A0A438DT29 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value 1.3e-75, identity 47.27%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY
        +EHKLNGSNY  W   ++++LRS+  DDH+TE+PP D ++K W++DDA+L LQ+KNSI  DI+GL++ CE+VKEL+ YL+FLYSGKGNVSR++++   F+
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY

Query:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR
         PE G +SLT YFM+ K++Y E NAL+P S D +V+ AQREQ+ V+SFL GLP  F+  K  +LS S+I +L+E ++ VL+ E    V SSQ  + LV +
Subjt:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR

Query:  STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLMASS
          NA          T     R+   + N    I C YCH+  HTK+ C+KL N+ ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L AS+
Subjt:  STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLMASS

Query:  SNPITAIAESG
          P++A+AESG
Subjt:  SNPITAIAESG

A0A438HPS2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value 5.8e-76, identity 48.41%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY
        +EHKLNGSNY  W   ++++LRS+  DDH+TE+PP D ++K W++DDA+LILQ+KNSI  DI+GL + CE+VKEL+ YL+FLYSGKGNVSR++++   F+
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY

Query:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR
         PE G +SLT YFM+ K++Y E NAL+P S D +V+ AQREQ+ V+SFL GLP  F+  K  +LS S+I +L+E ++ VL+ E    V SSQ  + LV +
Subjt:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR

Query:  ---STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLM
           + NA R N   GN    N       + N    I C YCH+  HTK+ CRKL N+ ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L 
Subjt:  ---STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLM

Query:  ASSSNPITAIAESG
        AS+  P++A+AESG
Subjt:  ASSSNPITAIAESG

A5AWD0 Uncharacterized protein (e-value 2.2e-75, identity 48.09%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY
        +EHKLNGSNY  W   ++++LRS+  DDH+TE+PP D ++K W++DDA+L LQ+KNSI  DI+GL++ CE+VKEL+ YL+FLYSGKGNVSR++++   F+
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFY

Query:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR
         PE G +SLT YFM+ K++Y E NAL+P S D +V+ AQREQ+ V+SFL GLP  F+  K  +LS S+I +L+E ++ VL+ E    V SSQ  + LV +
Subjt:  QPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGR

Query:  ---STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLM
           + NA R N   GN    N        ++S   I C YCH+  HTK+ CRKL N+ ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L 
Subjt:  ---STNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLM

Query:  ASSSNPITAIAESG
        AS+  P++A+AESG
Subjt:  ASSSNPITAIAESG

U5CZW1 Uncharacterized protein (Fragment) (e-value 9.0e-77, identity 51.47%)
Query:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKD--ESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKR
        ++HKL GSNY  W   +RL+LRSID DDH+T+DPP++  +S+K WLR+DA+L LQI+NSI+ ++I L+N CE VKEL+ YLEFLYSGK N+SRI+++CK 
Subjt:  SEHKLNGSNYSSWKINVRLFLRSIDMDDHITEDPPKD--ESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKR

Query:  FYQPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALV
        FY+ E  D+SL NYFM  K+ Y E N LLP S D KV+  QREQ+ ++SFL GL   FD  K  +LS S++ +L++ +T VL+ E +    S+  NSALV
Subjt:  FYQPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDTKVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALV

Query:  GRSTNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLMASSSNP
         R+ +A   N+        N+Q       NSG I+ CNYC KP HTK ECRKL  K ++  SA++AST D + K V I A+EFAKF +YQE+L +SSS+ 
Subjt:  GRSTNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSHTKRECRKLLNKGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLMASSSNP

Query:  ITAIAES
        +TAIA+S
Subjt:  ITAIAES

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGAGAGTCGGGCCCCTTGTTCAAGTCCCGGAGTCAGCATTTAAGGGAACACTCATCTACTCCCCTAAAGTTGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTC
CCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGCCGGCGACTCGGGCCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGATAC
CCCCACTCGCATGTCTCCACACGAACGACCTAGATCAAGTCGTCTGTAACCTTTACAGCGTGGGCCGTATCCATAGTGTTGCCAGGATAAGCAGGATAAGGGTTTTGATG
CCCATTAGCGTTTTGTGCCCATTAGGGTTTTGTCCCATAGGGTTTTGTTTACTGCTATCCAACACTACGACCACTGCCGACCACCACCGCCACCGTCGCCGCCGCCGTCG
ATCATCGCCGTCGTTTGCTGATTTGCCAGCTGTTTTCCAATTGCTATTCGTGGGTTTCGTCTCGCCGTCCTTCGCACCTTCGTCGCCGTGCGCTTCCGTCACCGCCAATC
GCCTTCGCCCCTCTGGTTGTCGTGGGGTACCACTTGGGGATTCTGAACACAAGTTGAATGGATCAAACTATTCTTCATGGAAAATCAATGTTCGTCTTTTTTTGCGAAGT
ATTGATATGGACGATCACATAACTGAGGATCCGCCTAAGGATGAGAGCAAGAAAGCTTGGTTGCGGGATGATGCTCAATTGATTTTGCAGATCAAGAATTCGATCGAGGG
TGACATTATTGGCTTGGTCAATGAATGTGAGTATGTTAAAGAGTTGCTTAAATATTTAGAATTCCTTTATTCTGGAAAAGGAAATGTTAGTCGAATATTTGAAATCTGCA
AGCGCTTCTACCAACCTGAGTTTGGTGACCAATCTCTTACAAATTACTTTATGGAACACAAACGAATTTATACAGAGTTTAATGCATTACTCCCAGATAGTAATGATACA
AAAGTTCGACTTGCCCAACGCGAACAACTAGTAGTTATCAGTTTTCTTCTTGGTCTTCCACCTAGATTTGATGTGACCAAAGACCTATTACTCTCTGATTCGGAAATTCC
AGCTTTAGAGGAGGCATACACTCTAGTACTTCAAGTTGAGAAGTCACAACCCGTCTTGTCATCTCAGTCTAATAGTGCATTGGTTGGACGTAGTACAAATGCAAACAGAG
GTAATACCTACCAAGGGAATTCCACGACTTCGAATGCTCAACGTTCTAGTGCTCCAAAGTCAAATTCAGGAGATATTATTTTTTGCAATTATTGCCATAAGCCTAGTCAT
ACAAAACGAGAATGTAGAAAGCTATTGAATAAAGGTCAGAAAACACAGTCTGCACATGTTGCATCTACTCCTGATAATGCTGGCAAGTTAGTTACAATTCCTGCGGAAGA
ATTTGCTAAGTTCCAACAGTATCAAGAGTCATTGATGGCATCGTCCTCTAATCCGATTACCGCCATCGCTGAGTCAGGATCTTACGACGAAGGAGACTATTGGTAA
mRNA sequence
ATGAGAGAGTCGGGCCCCTTGTTCAAGTCCCGGAGTCAGCATTTAAGGGAACACTCATCTACTCCCCTAAAGTTGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTC
CCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGCCGGCGACTCGGGCCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGATAC
CCCCACTCGCATGTCTCCACACGAACGACCTAGATCAAGTCGTCTGTAACCTTTACAGCGTGGGCCGTATCCATAGTGTTGCCAGGATAAGCAGGATAAGGGTTTTGATG
CCCATTAGCGTTTTGTGCCCATTAGGGTTTTGTCCCATAGGGTTTTGTTTACTGCTATCCAACACTACGACCACTGCCGACCACCACCGCCACCGTCGCCGCCGCCGTCG
ATCATCGCCGTCGTTTGCTGATTTGCCAGCTGTTTTCCAATTGCTATTCGTGGGTTTCGTCTCGCCGTCCTTCGCACCTTCGTCGCCGTGCGCTTCCGTCACCGCCAATC
GCCTTCGCCCCTCTGGTTGTCGTGGGGTACCACTTGGGGATTCTGAACACAAGTTGAATGGATCAAACTATTCTTCATGGAAAATCAATGTTCGTCTTTTTTTGCGAAGT
ATTGATATGGACGATCACATAACTGAGGATCCGCCTAAGGATGAGAGCAAGAAAGCTTGGTTGCGGGATGATGCTCAATTGATTTTGCAGATCAAGAATTCGATCGAGGG
TGACATTATTGGCTTGGTCAATGAATGTGAGTATGTTAAAGAGTTGCTTAAATATTTAGAATTCCTTTATTCTGGAAAAGGAAATGTTAGTCGAATATTTGAAATCTGCA
AGCGCTTCTACCAACCTGAGTTTGGTGACCAATCTCTTACAAATTACTTTATGGAACACAAACGAATTTATACAGAGTTTAATGCATTACTCCCAGATAGTAATGATACA
AAAGTTCGACTTGCCCAACGCGAACAACTAGTAGTTATCAGTTTTCTTCTTGGTCTTCCACCTAGATTTGATGTGACCAAAGACCTATTACTCTCTGATTCGGAAATTCC
AGCTTTAGAGGAGGCATACACTCTAGTACTTCAAGTTGAGAAGTCACAACCCGTCTTGTCATCTCAGTCTAATAGTGCATTGGTTGGACGTAGTACAAATGCAAACAGAG
GTAATACCTACCAAGGGAATTCCACGACTTCGAATGCTCAACGTTCTAGTGCTCCAAAGTCAAATTCAGGAGATATTATTTTTTGCAATTATTGCCATAAGCCTAGTCAT
ACAAAACGAGAATGTAGAAAGCTATTGAATAAAGGTCAGAAAACACAGTCTGCACATGTTGCATCTACTCCTGATAATGCTGGCAAGTTAGTTACAATTCCTGCGGAAGA
ATTTGCTAAGTTCCAACAGTATCAAGAGTCATTGATGGCATCGTCCTCTAATCCGATTACCGCCATCGCTGAGTCAGGATCTTACGACGAAGGAGACTATTGGTAA
Protein sequence
MRESGPLFKSRSQHLREHSSTPLKLGRSEFHLVKLCSQLPTRSRPQNGRNVEPATRATLTHTDQRTSPHGQEIPPLACLHTNDLDQVVCNLYSVGRIHSVARISRIRVLM
PISVLCPLGFCPIGFCLLLSNTTTTADHHRHRRRRRRSSPSFADLPAVFQLLFVGFVSPSFAPSSPCASVTANRLRPSGCRGVPLGDSEHKLNGSNYSSWKINVRLFLRS
IDMDDHITEDPPKDESKKAWLRDDAQLILQIKNSIEGDIIGLVNECEYVKELLKYLEFLYSGKGNVSRIFEICKRFYQPEFGDQSLTNYFMEHKRIYTEFNALLPDSNDT
KVRLAQREQLVVISFLLGLPPRFDVTKDLLLSDSEIPALEEAYTLVLQVEKSQPVLSSQSNSALVGRSTNANRGNTYQGNSTTSNAQRSSAPKSNSGDIIFCNYCHKPSH
TKRECRKLLNKGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLMASSSNPITAIAESGSYDEGDYW