; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g30650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g30650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr8:21958396..21959417
RNA-Seq ExpressionMoc08g30650
SyntenyMoc08g30650
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]1.8e-9860.06Show/hide
Query:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ
        M+   A   +E  + EME+K+N+L K+V+E D++IA+L   ++ ++     ESS+  V KV+DK KN++Q+ Q Q+ S S+ASLSV QLQDMITNSI+AQ
Subjt:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ

Query:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR
        YGGPSQ+S  YSK YTKRID+LR+P+GYQPPKFQQFD KGNPKQH+AHFVET                             LEPESI  WE+LE+EFLNR
Subjt:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR

Query:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK
        FY+TRRTVSMMELTNTKQRK EPV+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIA++G KD LVP++KK
Subjt:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK

Query:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE
          K         + TSKESMV+NTT LK  SK KE +VE + +
Subjt:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]1.5e-9759.48Show/hide
Query:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ
        M+   A   +E  + EME+K+N+L K+V+E D++IA+L   ++ ++     ESS+  V KV+DK KN++Q+ Q Q+ S S+ASLSV QLQDMIT+SI+AQ
Subjt:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ

Query:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR
        YGGPSQ+S  YSK YTKRID+LR+P+GYQPPKFQQFD KGNPKQH+AHFVET                             LEPESI  WE+LE+EFLNR
Subjt:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR

Query:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK
        FY+TRRTVSMMELTNTKQRK EPV+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIA++G KD LVP++KK
Subjt:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK

Query:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE
          K         + T+KESMV+NTT LK  SK KE +VE + +
Subjt:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]6.8e-9859.77Show/hide
Query:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ
        M+   A   +E  + EME+K+N+L K+V+E D++IA+L   ++ ++     ESS+  V KV+DK KN++Q+ Q Q+ S S+ASLSV QLQDMIT+SI+AQ
Subjt:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ

Query:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR
        YGGPSQ+S  YSK YTKRID+LR+P+GYQPPKFQQFD KGNPKQH+AHFVET                             LEPESI  WE+LE+EFLNR
Subjt:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR

Query:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK
        FY+TRRTVSMMELTNTKQRK EPV+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIA++G KD LVP++KK
Subjt:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK

Query:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE
          K         + TSKESMV+NTT LK  SK KE +VE + +
Subjt:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]3.4e-9759.48Show/hide
Query:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ
        M+   A   +E  + EME+K+N+L K+V+E D++IA+L   ++ ++     ESS+  V KV+DK KN++Q+ Q Q+ S S+ASLSV QLQDMIT+SI+AQ
Subjt:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ

Query:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR
        YGGPSQ+S  YSK YTKRID+LR+P+GYQPPKFQQFD KGNPKQH+AHFVET                             LEPESI  WE+LE+EFLNR
Subjt:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR

Query:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK
        FY+TRRTVSMMELTNTKQRK EPV+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIA++G KD LVP++KK
Subjt:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK

Query:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE
          K         + T KESMV+NTT LK  SK KE +VE + +
Subjt:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]1.8e-9860.06Show/hide
Query:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ
        M+   A   +E  + EME+K+N+L K+V+E D++IA+L   ++ ++     ESS+  V KV+DK KN++Q+ Q Q+ S S+ASLSV QLQDMITNSI+AQ
Subjt:  MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQ-QEHSVSIASLSVLQLQDMITNSIKAQ

Query:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR
        YGGPSQ+S  YSK YTKRID+LR+P+GYQPPKFQQFD KGNPKQH+AHFVET                             LEPESI  WE+LE+EFLNR
Subjt:  YGGPSQSSLTYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNR

Query:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK
        FY+TRRTVSMMELTNTKQRK EPV+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIA++G KD LVP++KK
Subjt:  FYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKK

Query:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE
          K         + TSKESMV+NTT LK  SK KE +VE + +
Subjt:  GSKS-------AEMTSKESMVINTTYLKSSSKRKEKKVENRQE

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H6.9e-9659.58Show/hide
Query:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL
        T E+++ E+EKK+NML K VEE D++IA L +HIE +D     ESS  H  K  +K K +MQ+ Q ++S SIASLSV QLQ+MI NSIK QYGGP+Q+  
Subjt:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL

Query:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS
         YSK YTKRID++R+P GYQPPKFQQFD KGNPKQH+AHF+ET                             LEPESI+ WE+LER+FLNRFY+TRR VS
Subjt:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS

Query:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKKGSKSAEMT-
        M+ELT TKQRK EPV+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIAN+G+ DLLVP+++K  K  + T 
Subjt:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKKGSKSAEMT-

Query:  ------SKESMVINTTYLKSSSKRKEKKVENRQE
              +KE+MV++TT LK  S  KEKK+E RQ+
Subjt:  ------SKESMVINTTYLKSSSKRKEKKVENRQE

A0A5A7UUI7 Ty3-gypsy retrotransposon protein5.3e-9659.28Show/hide
Query:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL
        T E ++ E+EKK+NML K+VEE DY+IA L +HIE +D     ESS  H  K  DK K +MQ+ Q ++S SIASLSV QLQ+MI +SIK QYGGP+Q+  
Subjt:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL

Query:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS
         YSK YTKRID+LR+P GYQPPKFQQFD KGNPKQH+AHF+ET                             LEPESI++WE+LER+FLNRFY+TRR VS
Subjt:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS

Query:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGS
        MMELTNT+Q+K E V+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIAN+G KD L+P       ++    
Subjt:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGS

Query:  KSAEMTSKESMVINTTYLKSSSKRKEKKVENRQE
        K A    KESMV++ T LKS SKRKE K+E + +
Subjt:  KSAEMTSKESMVINTTYLKSSSKRKEKKVENRQE

A0A5A7UXF0 Ty3-gypsy retrotransposon protein9.0e-9658.98Show/hide
Query:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL
        T E ++ E+EKK+NML K+VEE DY+IA L +HIE +D     ESS  H+ K  DK K +MQ+ Q ++S SIASLSV QLQ+MI +SIK QYGGP+Q+  
Subjt:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL

Query:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS
         YSK YTKRID+LR+P GYQPPKFQQFD KGNPKQH+AHF+ET                             LEPESI++WE+LER+FLNRFY+TRR VS
Subjt:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS

Query:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGS
        MMELTNT+Q+K E V++YINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIAN+G KD L+P       ++    
Subjt:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGS

Query:  KSAEMTSKESMVINTTYLKSSSKRKEKKVENRQE
        K A    KESMV++ T LKS SKRKE K+E + +
Subjt:  KSAEMTSKESMVINTTYLKSSSKRKEKKVENRQE

A0A5D3BX77 Retrotransposon gag protein1.5e-9558.98Show/hide
Query:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL
        T E ++ E+EKK+NML K+VEE DY+IA L +HIE +D     ESS  H  K  DK K +MQ+ Q ++S SIASLSV QLQ+MI +SIK QYGGP+Q+  
Subjt:  TMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSL

Query:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS
         YSK YTKRID+LR+P GYQPPKFQQFD KGNPKQH+AHF+ET                             LEPESI++WE+LER+FLNRFY+TRR VS
Subjt:  TYSKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVS

Query:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGS
        MMELTNT+Q+K E V+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSI N+G KD L+P       ++    
Subjt:  MMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGS

Query:  KSAEMTSKESMVINTTYLKSSSKRKEKKVENRQE
        K A    KESMV++ T LKS SKRKE K+E + +
Subjt:  KSAEMTSKESMVINTTYLKSSSKRKEKKVENRQE

A0A5D3D4X3 Ty3-gypsy retrotransposon protein6.9e-9659.34Show/hide
Query:  EEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSLTY
        E+++ E+EKK+NML K+VEE DY+IA L +HIE +D     ESS  H  K  DK K +MQ+ Q ++S SIASLSV QLQ+MI +SIK QYGGP+Q+   Y
Subjt:  EEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSLTY

Query:  SKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVSMM
        SK YTKRID+LR+P GYQPPKFQQFD KGNPKQH+AHF+ET                             LEPESI++WE+LER+FLNRFY+TRR VSMM
Subjt:  SKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETY----------------------------LEPESINDWEKLEREFLNRFYNTRRTVSMM

Query:  ELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGSKS
        ELTNT+Q+K E V+DYINRWRALSLD KDRL E+SA+E+C QG+HWGLLYI QGIKP TFEEL TRAHDMELSIAN+G KD L+P       ++    K 
Subjt:  ELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIELCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVP-------DMKKGSKS

Query:  AEMTSKESMVINTTYLKSSSKRKEKKVENRQE
        A    KESMV++ T LKS SKRKE K+E + +
Subjt:  AEMTSKESMVINTTYLKSSSKRKEKKVENRQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATCCAAGGAAGCTTCATCCACCATGGAAGAAAAGTTGGTGGAAATGGAGAAGAAACTTAACATGCTGACAAAAATGGTCGAAGAGATGGACTATAAGATTGCTTC
CCTTATCAGCCACATTGAACTTCAAGATGTTGTGAATGTTGTTGAGTCAAGCAAAAATCATGTTGCGAAAGTGAATGACAAAAGAAAGAACATGATGCAAAAGGAACAAC
AAGAACACTCTGTTTCAATTGCTTCATTGTCTGTCCTGCAGCTCCAAGATATGATCACAAATTCGATCAAAGCTCAATATGGCGGACCTTCTCAAAGCTCCCTAACATAC
TCCAAGTCGTACACCAAGAGGATTGATGACTTGAGAGTGCCAGTTGGATATCAGCCTCCGAAGTTTCAACAATTCGACGAAAAGGGCAATCCTAAGCAACATATTGCACA
TTTTGTTGAGACATATCTGGAGCCTGAATCTATCAATGATTGGGAGAAGTTGGAAAGAGAGTTCTTAAATCGCTTCTACAACACAAGGCGAACTGTAAGCATGATGGAAC
TCACCAACACCAAGCAACGAAAAGATGAACCTGTTGTTGACTACATCAATAGGTGGAGAGCTTTGAGTCTCGACTACAAAGATCGACTGATTGAAATATCTGCTATCGAG
TTGTGCATTCAAGGCATACATTGGGGACTTCTATACATTCCTCAAGGAATAAAACCTTGCACATTTGAAGAATTAGTAACTCGTGCCCACGACATGGAGCTAAGCATTGC
TAATCAAGGAGATAAAGATCTTTTAGTCCCTGATATGAAGAAAGGAAGCAAGAGTGCCGAGATGACCTCGAAAGAATCAATGGTTATCAACACGACCTATCTCAAATCTT
CTTCAAAAAGAAAGGAGAAGAAGGTTGAAAACCGACAAGAAATGAAAGGCATCGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACATCCAAGGAAGCTTCATCCACCATGGAAGAAAAGTTGGTGGAAATGGAGAAGAAACTTAACATGCTGACAAAAATGGTCGAAGAGATGGACTATAAGATTGCTTC
CCTTATCAGCCACATTGAACTTCAAGATGTTGTGAATGTTGTTGAGTCAAGCAAAAATCATGTTGCGAAAGTGAATGACAAAAGAAAGAACATGATGCAAAAGGAACAAC
AAGAACACTCTGTTTCAATTGCTTCATTGTCTGTCCTGCAGCTCCAAGATATGATCACAAATTCGATCAAAGCTCAATATGGCGGACCTTCTCAAAGCTCCCTAACATAC
TCCAAGTCGTACACCAAGAGGATTGATGACTTGAGAGTGCCAGTTGGATATCAGCCTCCGAAGTTTCAACAATTCGACGAAAAGGGCAATCCTAAGCAACATATTGCACA
TTTTGTTGAGACATATCTGGAGCCTGAATCTATCAATGATTGGGAGAAGTTGGAAAGAGAGTTCTTAAATCGCTTCTACAACACAAGGCGAACTGTAAGCATGATGGAAC
TCACCAACACCAAGCAACGAAAAGATGAACCTGTTGTTGACTACATCAATAGGTGGAGAGCTTTGAGTCTCGACTACAAAGATCGACTGATTGAAATATCTGCTATCGAG
TTGTGCATTCAAGGCATACATTGGGGACTTCTATACATTCCTCAAGGAATAAAACCTTGCACATTTGAAGAATTAGTAACTCGTGCCCACGACATGGAGCTAAGCATTGC
TAATCAAGGAGATAAAGATCTTTTAGTCCCTGATATGAAGAAAGGAAGCAAGAGTGCCGAGATGACCTCGAAAGAATCAATGGTTATCAACACGACCTATCTCAAATCTT
CTTCAAAAAGAAAGGAGAAGAAGGTTGAAAACCGACAAGAAATGAAAGGCATCGTCTAA
Protein sequenceShow/hide protein sequence
MTSKEASSTMEEKLVEMEKKLNMLTKMVEEMDYKIASLISHIELQDVVNVVESSKNHVAKVNDKRKNMMQKEQQEHSVSIASLSVLQLQDMITNSIKAQYGGPSQSSLTY
SKSYTKRIDDLRVPVGYQPPKFQQFDEKGNPKQHIAHFVETYLEPESINDWEKLEREFLNRFYNTRRTVSMMELTNTKQRKDEPVVDYINRWRALSLDYKDRLIEISAIE
LCIQGIHWGLLYIPQGIKPCTFEELVTRAHDMELSIANQGDKDLLVPDMKKGSKSAEMTSKESMVINTTYLKSSSKRKEKKVENRQEMKGIV