; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g12380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g12380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr5:9621245..9631487
RNA-Seq ExpressionMoc05g12380
SyntenyMoc05g12380
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]3.1e-8752.63Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL
        M  KK ++ +   S  +T  IT   S  +   ++Q S I +  +  L+E+  +   ++  NPL+            + + D +SVMMAD + ++  M E+
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL

Query:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR
        ERKI  LMK+++E D +IA+LK +++++  + A+SSQTPV                                DMI NSIRAQYGG SQ + +YSKPY+KR
Subjt:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR

Query:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ
        IDNLRM +GYQPPKFQ FDGKGNPKQH+AHFVETCENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE+IESWEQLE+EFLNRFYSTRR VS+M+LTN KQ
Subjt:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ

Query:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII
        RKG+ +++YINRWRA+SLDCKDRL ELSAV++CTQG   G++
Subjt:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII

XP_031737053.1 uncharacterized protein LOC116402138 [Cucumis sativus]1.2e-8652.34Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL
        M  KK ++ +   S  +T  IT   S  +   ++Q S I +  +  L+E+  +   ++  NPL+            + + D +SVMMAD + ++  M E+
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL

Query:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR
        ERKI  LMK+++E D +IA+LK +++++  + A+SSQTPV                                DMI +SIRAQYGG SQ + +YSKPY+KR
Subjt:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR

Query:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ
        IDNLRM +GYQPPKFQ FDGKGNPKQH+AHFVETCENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE+IESWEQLE+EFLNRFYSTRR VS+M+LTN KQ
Subjt:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ

Query:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII
        RKG+ +++YINRWRA+SLDCKDRL ELSAV++CTQG   G++
Subjt:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]1.2e-8652.34Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL
        M  KK ++ +   S  +T  IT   S  +   ++Q S I +  +  L+E+  +   ++  NPL+            + + D +SVMMAD + ++  M E+
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL

Query:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR
        ERKI  LMK+++E D +IA+LK +++++  + A+SSQTPV                                DMI +SIRAQYGG SQ + +YSKPY+KR
Subjt:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR

Query:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ
        IDNLRM +GYQPPKFQ FDGKGNPKQH+AHFVETCENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE+IESWEQLE+EFLNRFYSTRR VS+M+LTN KQ
Subjt:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ

Query:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII
        RKG+ +++YINRWRA+SLDCKDRL ELSAV++CTQG   G++
Subjt:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]1.2e-8652.34Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL
        M  KK ++ +   S  +T  IT   S  +   ++Q S I +  +  L+E+  +   ++  NPL+            + + D +SVMMAD + ++  M E+
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL

Query:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR
        ERKI  LMK+++E D +IA+LK +++++  + A+SSQTPV                                DMI +SIRAQYGG SQ + +YSKPY+KR
Subjt:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR

Query:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ
        IDNLRM +GYQPPKFQ FDGKGNPKQH+AHFVETCENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE+IESWEQLE+EFLNRFYSTRR VS+M+LTN KQ
Subjt:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ

Query:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII
        RKG+ +++YINRWRA+SLDCKDRL ELSAV++CTQG   G++
Subjt:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]3.1e-8752.63Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL
        M  KK ++ +   S  +T  IT   S  +   ++Q S I +  +  L+E+  +   ++  NPL+            + + D +SVMMAD + ++  M E+
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL

Query:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR
        ERKI  LMK+++E D +IA+LK +++++  + A+SSQTPV                                DMI NSIRAQYGG SQ + +YSKPY+KR
Subjt:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR

Query:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ
        IDNLRM +GYQPPKFQ FDGKGNPKQH+AHFVETCENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE+IESWEQLE+EFLNRFYSTRR VS+M+LTN KQ
Subjt:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ

Query:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII
        RKG+ +++YINRWRA+SLDCKDRL ELSAV++CTQG   G++
Subjt:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGII

TrEMBL top hitse value%identityAlignment
A0A5A7SU65 Ty3-gypsy retrotransposon protein1.3e-8354.25Show/hide
Query:  LRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVELERKIRNLMKIIEEEDSKIASLKNRIESQDV
        ++E +   +++KK +  L  + + E ++  NPLF            +++++ +SVMM D  T +  M ++ERKI  LMK++EE D +IA+LK+++++   
Subjt:  LRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVELERKIRNLMKIIEEEDSKIASLKNRIESQDV

Query:  DAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNPKQHIAH
        + A+SSQTPV                                DMI NSIRAQYGG  Q + +YSKPY+KRIDNLRM +GYQPPKFQ FDGKGNPKQHIAH
Subjt:  DAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNPKQHIAH

Query:  FVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRLIELSAV
        FVETCENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE I+SWEQLE EFLNRFYSTR ++S+M+LTN KQ+KG+ +++YINRWRA+SLDCKD+L ELSAV
Subjt:  FVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRLIELSAV

Query:  QLCTQG
        ++CTQG
Subjt:  QLCTQG

A0A5A7U7T2 Ty3-gypsy retrotransposon protein9.1e-8554.81Show/hide
Query:  FKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVELER
        FK ++ +   +  +  SH+      +  ++E +   +++KK +  L  + +   ++  NPLF            +++++ +SVMM D  T +  M E+ER
Subjt:  FKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVELER

Query:  KIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTP----------VDMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNP
        KI  LMK++EE D +IA+LK+++++ + D  +SSQTP          VDMIANSIRAQYGG  Q T +YSKPY+KRIDNLRM +GYQPPKFQ FDGKGNP
Subjt:  KIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTP----------VDMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNP

Query:  KQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRL
        KQHIA FVETCENA +RGDQLVKQFVRTLKGNAF+WY DLEPE+I+ WEQLER FLNRFYSTRRI S+M+LTN +Q+KG+ +++YINRWRA+SLDCKDRL
Subjt:  KQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRL

Query:  IELSAVQLCTQG
         ELSA+++CTQG
Subjt:  IELSAVQLCTQG

A0A5A7U8E4 Retrotransposon gag protein1.2e-8455.52Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLFDTNIDTISVMMADASTMDEKMVELERKIRNLMKIIE
        M  KK ++ + + S  +T  IT   S E+   ++Q                             + + D +SVMMAD  T +  M E+ERKI  LMK +E
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLFDTNIDTISVMMADASTMDEKMVELERKIRNLMKIIE

Query:  EEDSKIASLKNRIESQDVDAAKSSQTP----------VDMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVET
        E D +I +L+ ++ ++  + A+SSQTP          VDMIANSIRAQYGG  Q T +YSKPY+KRIDNLRM +GYQPPKFQ FD KGNPKQHIAHFVET
Subjt:  EEDSKIASLKNRIESQDVDAAKSSQTP----------VDMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVET

Query:  CENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCT
        CENA +RGDQLV+QF+R+LKGN FEWYTDLEPE I+SW+QLE+EFLNRFYSTRR VS+M+LTN KQRKG+ +++YINRWRA+SLDCKDRL ELSAV++CT
Subjt:  CENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCT

Query:  QGAQQGII
        QG   G++
Subjt:  QGAQQGII

A0A5D3DHB7 Ty3-gypsy retrotransposon protein1.8e-8553.87Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLFD------------TNIDTISVMMADASTMDEKMVEL
        M  KK ++ + +    +T  IT   S  +   ++Q S + +  +  L+E+  +   ++  NPL+D             + + +SVMMAD +  +  M E+
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLFD------------TNIDTISVMMADASTMDEKMVEL

Query:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPVDMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFV
        ERKI  LMK++EE D +I +L+ ++ ++++  A+S Q   DMI NSIR QYGG  Q T +YSKPY+KR DNLR+ +GYQPPKFQ FDGKGNPKQHIAHFV
Subjt:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPVDMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFV

Query:  ETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRLIELSAVQL
        ETCENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE I+SWEQL++EFLNRFY TRR VS+M+LTN KQ+KGK +++YINRWRA+SLDCKDRL ELSA+++
Subjt:  ETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRLIELSAVQL

Query:  CTQGAQQGII
        CTQG   G++
Subjt:  CTQGAQQGII

A0A5D3DIN4 Ty3-gypsy retrotransposon protein1.0e-8351.19Show/hide
Query:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL
        M  KK ++ +   S  +   +T  +  +  ++E +   +++KK +  L  + +   ++  NPLF            +++++ +SVMM D +T +  MVE+
Subjt:  MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLF------------DTNIDTISVMMADASTMDEKMVEL

Query:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR
        E+KI  LMK++EE D +IA+LK+++++   + A+SS+TPV                                DMI NSIRAQYGG SQ + +YSKPY+KR
Subjt:  ERKIRNLMKIIEEEDSKIASLKNRIESQDVDAAKSSQTPV--------------------------------DMIANSIRAQYGGSSQNTLLYSKPYSKR

Query:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ
        IDNLRM +GYQPPKFQ FDGKGNPKQHIAHFVE CENA +RGDQLV+QFVR+LKGNAFEWYTDLEPE I+SWEQLE EFLN FYSTRR+VS+M+LTN KQ
Subjt:  IDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDLEPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQ

Query:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQG
        RKG+ +++YINRWRA+SLDCKD+L ELSAV++CTQG
Subjt:  RKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTCAAAAAGACTTCTACAACAGCTATTATCATGAGTAAACTTCATACGAGTCATATCACCCATGGTTACTCTGATGAACTCCAACTACGGGAAAATCAAACTTC
TCCTATTGTTGAGAAAAAAGTAATGATGCTACTAGAAACAACTATTGAAGATGAGTTTCTTGTTAATCATAACCCCTTATTTGATACTAACATTGACACGATCTCTGTTA
TGATGGCTGATGCAAGCACTATGGATGAAAAAATGGTAGAATTAGAGAGAAAAATCAGAAATTTGATGAAGATAATCGAAGAAGAAGATTCTAAAATCGCTTCTCTGAAG
AATAGGATTGAGAGCCAAGATGTTGATGCTGCTAAGTCAAGTCAGACTCCAGTTGACATGATAGCAAACTCAATTAGGGCTCAATACGGTGGATCTTCTCAAAATACTCT
CTTGTATTCAAAACCATACTCCAAGAGAATTGACAACTTGAGAATGCGTGTTGGATATCAGCCACCTAAGTTCCAACATTTTGATGGGAAAGGCAATCCCAAACAACATA
TTGCTCACTTCGTCGAAACTTGTGAGAATGCTAGTACTAGAGGCGATCAACTGGTCAAGCAATTCGTCCGAACGTTGAAGGGAAACGCCTTTGAATGGTATACAGATCTA
GAGCCTGAAACGATCGAGAGCTGGGAACAGCTTGAAAGAGAGTTCCTAAATCGCTTCTATAGTACGAGGAGAATTGTTAGCATAATGGATCTCACCAACCGTAAACAAAG
AAAAGGCAAATCAATTGTCGAATATATCAACCGATGGAGAGCTGTAAGTCTTGATTGTAAAGATAGGCTTATTGAACTATCCGCCGTCCAATTATGCACTCAAGGGGCGC
AACAAGGCATCATCGTCGAAATGATGTTAGCCGCAAAAATACTTAACGGCGAAAACTACAAACAATGGAAGTCGAGCCTAAACACTATACTAGTGATAGATGATCTTAGG
TTTGTCTTGCAAGCGGATTGTCCTCAAGCTCCTGCGCCTGACGCCACTGTGGCGGAGCGCAACGTCTATGACTGTTGGATCAAGGTCAAGATCTACATCTTGGCGAGCAT
ATATGATGTGCTTGCTAAGAAGCACGAGGACACGGTCACCGCTAAAGAGATCGTGGACTCGCTGCAGAGCATGCTTGGACAACCGTCCTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATTCAAAAAGACTTCTACAACAGCTATTATCATGAGTAAACTTCATACGAGTCATATCACCCATGGTTACTCTGATGAACTCCAACTACGGGAAAATCAAACTTC
TCCTATTGTTGAGAAAAAAGTAATGATGCTACTAGAAACAACTATTGAAGATGAGTTTCTTGTTAATCATAACCCCTTATTTGATACTAACATTGACACGATCTCTGTTA
TGATGGCTGATGCAAGCACTATGGATGAAAAAATGGTAGAATTAGAGAGAAAAATCAGAAATTTGATGAAGATAATCGAAGAAGAAGATTCTAAAATCGCTTCTCTGAAG
AATAGGATTGAGAGCCAAGATGTTGATGCTGCTAAGTCAAGTCAGACTCCAGTTGACATGATAGCAAACTCAATTAGGGCTCAATACGGTGGATCTTCTCAAAATACTCT
CTTGTATTCAAAACCATACTCCAAGAGAATTGACAACTTGAGAATGCGTGTTGGATATCAGCCACCTAAGTTCCAACATTTTGATGGGAAAGGCAATCCCAAACAACATA
TTGCTCACTTCGTCGAAACTTGTGAGAATGCTAGTACTAGAGGCGATCAACTGGTCAAGCAATTCGTCCGAACGTTGAAGGGAAACGCCTTTGAATGGTATACAGATCTA
GAGCCTGAAACGATCGAGAGCTGGGAACAGCTTGAAAGAGAGTTCCTAAATCGCTTCTATAGTACGAGGAGAATTGTTAGCATAATGGATCTCACCAACCGTAAACAAAG
AAAAGGCAAATCAATTGTCGAATATATCAACCGATGGAGAGCTGTAAGTCTTGATTGTAAAGATAGGCTTATTGAACTATCCGCCGTCCAATTATGCACTCAAGGGGCGC
AACAAGGCATCATCGTCGAAATGATGTTAGCCGCAAAAATACTTAACGGCGAAAACTACAAACAATGGAAGTCGAGCCTAAACACTATACTAGTGATAGATGATCTTAGG
TTTGTCTTGCAAGCGGATTGTCCTCAAGCTCCTGCGCCTGACGCCACTGTGGCGGAGCGCAACGTCTATGACTGTTGGATCAAGGTCAAGATCTACATCTTGGCGAGCAT
ATATGATGTGCTTGCTAAGAAGCACGAGGACACGGTCACCGCTAAAGAGATCGTGGACTCGCTGCAGAGCATGCTTGGACAACCGTCCTCATAG
Protein sequenceShow/hide protein sequence
MEFKKTSTTAIIMSKLHTSHITHGYSDELQLRENQTSPIVEKKVMMLLETTIEDEFLVNHNPLFDTNIDTISVMMADASTMDEKMVELERKIRNLMKIIEEEDSKIASLK
NRIESQDVDAAKSSQTPVDMIANSIRAQYGGSSQNTLLYSKPYSKRIDNLRMRVGYQPPKFQHFDGKGNPKQHIAHFVETCENASTRGDQLVKQFVRTLKGNAFEWYTDL
EPETIESWEQLEREFLNRFYSTRRIVSIMDLTNRKQRKGKSIVEYINRWRAVSLDCKDRLIELSAVQLCTQGAQQGIIVEMMLAAKILNGENYKQWKSSLNTILVIDDLR
FVLQADCPQAPAPDATVAERNVYDCWIKVKIYILASIYDVLAKKHEDTVTAKEIVDSLQSMLGQPSS