CuGenDBv2

Moc04g09250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g09250
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Transposon, En/Spm-like protein
Genome location: chr4:6833174..6840402
RNA-Seq Expression: Moc04g09250
Synteny: Moc04g09250
Gene Ontology terms: NA
InterPro domains:
IPR025312 - Domain of unknown function DUF4216
IPR025452 - Domain of unknown function DUF4218
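
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the `Location` type and `parse_location` helper are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive (GFF-style),
        # so both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'chr4:6833174..6840402'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("chr4:6833174..6840402")
print(loc.seqid, loc.start, loc.end, loc.span)  # chr4 6833174 6840402 7229
```

Under that assumption the gene region spans 7229 bp. This is the genomic extent of the locus and is not expected to equal the length of the coding sequence.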


Homology
GenBank top hits | e value | % identity | Alignment
KAA0035742.1 uncharacterized protein E6C27_scaffold403G00130 [Cucumis melo var. makuwa] | 7.1e-106 | 61.72
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSLRYLK+YVRNKARPEG IAEAY+INESL FCSMYL GIETRFNR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HWYIL+NC
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN
        D+V+PYL+EHM +L+ SNG+GDL++RQQLEFPTWFK KAQ L+ +K ISD LY LA GPNDC RSY+G                          RK RIN
Subjt:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN

Query:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--
        + DG+TS+NTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIVQ++QPRHVWDIPE E+NE+ EV +DE+I TCYP+V+ NLDSQ F+REDVEP  I  
Subjt:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--

Query:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
         EDI+   +++ +T     DEE FD   SSD+S S +
Subjt:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

KAA0064140.1 uncharacterized protein E6C27_scaffold548G00590 [Cucumis melo var. makuwa] | 8.4e-99 | 59.52
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSLRYLKQYVRNKARPEG I EAY+INESL FCSMYL GIET  NR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HW      
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV
         ++  + E    L  SNG+GDL++RQ+LEF TWFK KAQ L+ +K ISD LY LA GP+DC RSY+ CIANG+RFH K+H         D   RK RIN+
Subjt:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV

Query:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---
         DG+TSINTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIV QIQPRHVWDIP+ E+NE+ EV ++E+I TCYP+V+ NLDSQTF+ +DVEP  I   
Subjt:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---

Query:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
        EDI+   +++ +T     DEE+FD   SSD+S S +
Subjt:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

TYK21879.1 uncharacterized protein E5676_scaffold494G00120 [Cucumis melo var. makuwa] | 1.2e-105 | 61.72
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSL YLK+YVRNKARPEG IAEAY+INESL FCSMYL GIETRFNR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HWYIL+NC
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN
        D+V+PYL+EHM +L+ SNG+GDL++RQQLEFPTWFK KAQ L+ +K ISD LY LA GPNDC RSY+G                          RK RIN
Subjt:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN

Query:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--
        + DG+TSINTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIVQ++QPRHVWDIPE E+NE+ EV +DE+I TCYP+V+ NLDSQ F+REDVEP  I  
Subjt:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--

Query:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
         EDI+   +++ +T     DEE FD   SSD+S S +
Subjt:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

TYK22071.1 uncharacterized protein E5676_scaffold318G00540 [Cucumis melo var. makuwa] | 1.4e-101 | 58.77
Query:  MYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNCDDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFK
        MYL GIETR+NR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HWYIL+NCD+VEPYL+EHM +L+ SNG+GDL++RQQLEFPTWFK
Subjt:  MYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNCDDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFK

Query:  KKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEH-----------------------------------------VILLKCEWYDTSSR
         KAQ L+ +K ISD LY LA GPNDC RSY+GCIANG+RFH+K+                                          V+L KCEWYDTS R
Subjt:  KKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEH-----------------------------------------VILLKCEWYDTSSR

Query:  KGRINVSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEP
        K RIN+ DG+TSINTR RWYKDE FIL SQATQVFY+DDYKLGQ WKIVQQIQPRHVWDIPE E+NE+ EV +DE+I TCYP+V+ NLDSQTF+REDVEP
Subjt:  KGRINVSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEP

Query:  MPI---EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
          I   EDI+    ++ +T     DEE+FD   SSD+S S +
Subjt:  MPI---EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

TYK28599.1 uncharacterized protein E5676_scaffold2030G00110 [Cucumis melo var. makuwa] | 8.4e-99 | 59.52
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSLRYLKQYVRNKARPEG I EAY+INESL FCSMYL GIET  NR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HW      
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV
         ++  + E    L  SNG+GDL++RQ+LEF TWFK KAQ L+ +K ISD LY LA GP+DC RSY+ CIANG+RFH K+H         D   RK RIN+
Subjt:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV

Query:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---
         DG+TSINTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIV QIQPRHVWDIP+ E+NE+ EV ++E+I TCYP+V+ NLDSQTF+ +DVEP  I   
Subjt:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---

Query:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
        EDI+   +++ +T     DEE+FD   SSD+S S +
Subjt:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SWV9 Uncharacterized protein | 3.4e-106 | 61.72
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSLRYLK+YVRNKARPEG IAEAY+INESL FCSMYL GIETRFNR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HWYIL+NC
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN
        D+V+PYL+EHM +L+ SNG+GDL++RQQLEFPTWFK KAQ L+ +K ISD LY LA GPNDC RSY+G                          RK RIN
Subjt:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN

Query:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--
        + DG+TS+NTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIVQ++QPRHVWDIPE E+NE+ EV +DE+I TCYP+V+ NLDSQ F+REDVEP  I  
Subjt:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--

Query:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
         EDI+   +++ +T     DEE FD   SSD+S S +
Subjt:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

A0A5A7VA83 Uncharacterized protein | 4.1e-99 | 59.52
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSLRYLKQYVRNKARPEG I EAY+INESL FCSMYL GIET  NR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HW      
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV
         ++  + E    L  SNG+GDL++RQ+LEF TWFK KAQ L+ +K ISD LY LA GP+DC RSY+ CIANG+RFH K+H         D   RK RIN+
Subjt:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV

Query:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---
         DG+TSINTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIV QIQPRHVWDIP+ E+NE+ EV ++E+I TCYP+V+ NLDSQTF+ +DVEP  I   
Subjt:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---

Query:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
        EDI+   +++ +T     DEE+FD   SSD+S S +
Subjt:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

A0A5D3DEG0 DUF4216 domain-containing protein | 6.7e-102 | 58.77
Query:  MYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNCDDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFK
        MYL GIETR+NR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HWYIL+NCD+VEPYL+EHM +L+ SNG+GDL++RQQLEFPTWFK
Subjt:  MYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNCDDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFK

Query:  KKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEH-----------------------------------------VILLKCEWYDTSSR
         KAQ L+ +K ISD LY LA GPNDC RSY+GCIANG+RFH+K+                                          V+L KCEWYDTS R
Subjt:  KKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEH-----------------------------------------VILLKCEWYDTSSR

Query:  KGRINVSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEP
        K RIN+ DG+TSINTR RWYKDE FIL SQATQVFY+DDYKLGQ WKIVQQIQPRHVWDIPE E+NE+ EV +DE+I TCYP+V+ NLDSQTF+REDVEP
Subjt:  KGRINVSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEP

Query:  MPI---EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
          I   EDI+    ++ +T     DEE+FD   SSD+S S +
Subjt:  MPI---EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

A0A5D3DEV9 Uncharacterized protein | 5.9e-106 | 61.72
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSL YLK+YVRNKARPEG IAEAY+INESL FCSMYL GIETRFNR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HWYIL+NC
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN
        D+V+PYL+EHM +L+ SNG+GDL++RQQLEFPTWFK KAQ L+ +K ISD LY LA GPNDC RSY+G                          RK RIN
Subjt:  DDVEPYLEEHMSLLQ-SNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRIN

Query:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--
        + DG+TSINTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIVQ++QPRHVWDIPE E+NE+ EV +DE+I TCYP+V+ NLDSQ F+REDVEP  I  
Subjt:  VSDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI--

Query:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
         EDI+   +++ +T     DEE FD   SSD+S S +
Subjt:  -EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

A0A5D3DYY6 Uncharacterized protein | 4.1e-99 | 59.52
Query:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC
        I RSLRYLKQYVRNKARPEG I EAY+INESL FCSMYL GIET  NR +RN D++++ E NN  LSIFS  +R LGG+  + LT DELEK+HW      
Subjt:  IPRSLRYLKQYVRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVE-NNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNC

Query:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV
         ++  + E    L  SNG+GDL++RQ+LEF TWFK KAQ L+ +K ISD LY LA GP+DC RSY+ CIANG+RFH K+H         D   RK RIN+
Subjt:  DDVEPYLEEHMSLLQSNGHGDLHRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINV

Query:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---
         DG+TSINTR RWYKDE FIL SQA QVFY+DDYKLGQ+WKIV QIQPRHVWDIP+ E+NE+ EV ++E+I TCYP+V+ NLDSQTF+ +DVEP  I   
Subjt:  SDGYTSINTRYRWYKDESFILASQATQVFYLDDYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPI---

Query:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD
        EDI+   +++ +T     DEE+FD   SSD+S S +
Subjt:  EDIVYIDSDKKTTS----DEEEFDFEYSSDDSGSTD

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found
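
The alignments in this section use a BLAST-style layout: a query line, a middle line, and a subject line. In the middle line a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how identity and positive counts could be tallied from one query/middle-line pair. The `alignment_stats` function is a hypothetical helper and the input is a made-up toy example, not data from this record. The % identity values listed above are presumably computed by the search tool over the full alignment and are not reproduced here.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Tally identities and positives from a BLAST-style middle line.

    The middle line is padded to the query length because trailing
    spaces (mismatches at the end) are often stripped when text is copied.
    """
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identical + midline.count("+")        # '+' = conservative substitution
    length = len(query)                               # aligned columns, gaps included
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100.0 * identical / length, 2),
    }


# Toy example (hypothetical sequences, for illustration only).
stats = alignment_stats("MKTAYIAK", "MK+A IAK")
print(stats)
# {'length': 8, 'identical': 6, 'positives': 7, 'percent_identity': 75.0}
```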

Sequences
CDS sequence
ATGTCGCTGAAAAAGCACTTTCAGCGACGCTGTCAATTGACGGCGTCGCTGAAGAGATTTTCAGCAACACATAAGTTGTATGTGTCGCTGAAATCTCTGATTTTTATCCC
AAATCAGTTCGCGATTTCACAAACCCTCGCCGCCAACGCAATTTCTTCTCCCATCCGTCGCCGCGTCACCGATTTCAGCACCTCCCCTCTCCGTTCATCTCCTCCGGTTG
TCGTTTTGGTTCGTCGTGCCTTGCTCCGTGAGTTTGGTGGCTACTGTCCATCCGTTTCGACTGCTGTAAGCATCACCATCCCGAGGAGTCTGCGATATTTGAAGCAATAT
GTTAGAAACAAAGCTCGACCTGAAGGTCCAATTGCAGAAGCTTACATCATAAATGAATCTTTAACTTTTTGTTCTATGTATTTACATGGGATTGAAACAAGATTCAATAG
GAGTGAGCGTAATTATGATAATGTCGAGAATGTGGAGAACAATAGTGAGCTATCGATTTTTTCTGGACGTGTACGACTATTGGGTGGTAGCCAATTGAAATGCTTGACTT
CTGATGAGTTAGAAAAAACACATTGGTATATACTAAGCAACTGTGATGATGTGGAACCTTACCTCGAGGAGCATATGAGTCTTCTCCAATCAAACGGTCATGGTGATTTG
CACAGGAGACAACAATTAGAATTTCCTACTTGGTTTAAGAAAAAAGCTCAAGTATTGTATCGTGAAAAAGGGATTTCTGATGCATTATATGTCCTAGCAATTGGCCCGAA
CGACTGTGTTCGATCTTACACTGGGTGTATAGCCAATGGAATTCGATTCCATTCAAAAGAGCATGTGATTCTTTTGAAATGTGAATGGTACGATACAAGTTCAAGAAAGG
GTAGAATTAATGTATCTGATGGTTATACATCGATTAATACTCGTTATAGATGGTACAAAGACGAGTCATTCATACTTGCAAGTCAAGCTACCCAAGTATTCTACTTAGAT
GATTACAAACTTGGACAAGAATGGAAGATTGTTCAACAAATTCAACCAAGACATGTGTGGGATATTCCAGAGGCTGAAAATAATGAAATAGAAGAAGTAGTGGCGGATGA
AAGCATTCGCACATGTTACCCACAAGTTGATGATAACTTAGATTCACAAACCTTCCATAGAGAAGATGTTGAGCCGATGCCTATTGAAGATATAGTATATATAGATAGTG
ACAAGAAAACAACTAGTGATGAAGAAGAGTTTGATTTTGAGTATTCTAGTGATGATTCTGGAAGCACAGATTCCAATATGATGTCGTCTTCAAATCTTGGACCAGGGAAC
CGACGGAGTTACACAAGTAACGTCGATAGAATAGAAAGCACTCAACCGGGAGAGGCTAATCAAACAGCACAGAGTTCTGGCACGACTAAAAGAAAAGGTAGACGCAACAC
TAGAGGGCTGAATGTTGATAAACATGTCCAAAATCATGGTCTGATAGAAATTAATATCGAAGAAGAAGATGGCAAGCTAGTTTGTAGCCATAGTTCGAAACTGGTCTCTC
AAATTGGGGTATGGGTGCGGTCAATCGTTCCTCTGAATTGTGAACATTGGTCAGACGTCAACCAAGACGATAAAAATAGTATCATAGATAGACTTAGCAACCAATTCATT
CTTGATCTCAATGATCCAATTGTTTATCGTTATTTGGAGCATGAGATGAGTAGTCGGCATAGAGATTTTCGATACAGGTTGCATAAGCATTACATGCAATATTCTCCAGA
AGAAGCACGACATCACAAACACAAAGATGTTGCACGAGATGAAGATTGGCATCGCTTATGCGATCGTTGGGAAACAAAGAAGTTTAAGGAACAAATGGTTGAACTGAAAA
ATGCACCAGTGGAAGAAGGTGCAGAACCTCCTTCAGCTCGACAAATCAATGTTCAAGTACTAGGCAGGCGAGGTGCACTTGGGTGGGGACCACCGACGGCGACGACTCAG
AATGAACGAAGTTCTCAAAGAAATAGAGAGTCAAGAAATCAAATCGCTCAATTACAATCCGTTGTACAGACGCAACAAAGCCTGCTTGAATCAGCAATGACGCAATTGAG
TCAGTTACAAGAAACTGTAAACAGGCTAAGCAATGCTGGAGAAGGATCGTCACAACAACCTAATTCTCAACCGTCGAATGATGCTTAA
mRNA sequence
ATGTCGCTGAAAAAGCACTTTCAGCGACGCTGTCAATTGACGGCGTCGCTGAAGAGATTTTCAGCAACACATAAGTTGTATGTGTCGCTGAAATCTCTGATTTTTATCCC
AAATCAGTTCGCGATTTCACAAACCCTCGCCGCCAACGCAATTTCTTCTCCCATCCGTCGCCGCGTCACCGATTTCAGCACCTCCCCTCTCCGTTCATCTCCTCCGGTTG
TCGTTTTGGTTCGTCGTGCCTTGCTCCGTGAGTTTGGTGGCTACTGTCCATCCGTTTCGACTGCTGTAAGCATCACCATCCCGAGGAGTCTGCGATATTTGAAGCAATAT
GTTAGAAACAAAGCTCGACCTGAAGGTCCAATTGCAGAAGCTTACATCATAAATGAATCTTTAACTTTTTGTTCTATGTATTTACATGGGATTGAAACAAGATTCAATAG
GAGTGAGCGTAATTATGATAATGTCGAGAATGTGGAGAACAATAGTGAGCTATCGATTTTTTCTGGACGTGTACGACTATTGGGTGGTAGCCAATTGAAATGCTTGACTT
CTGATGAGTTAGAAAAAACACATTGGTATATACTAAGCAACTGTGATGATGTGGAACCTTACCTCGAGGAGCATATGAGTCTTCTCCAATCAAACGGTCATGGTGATTTG
CACAGGAGACAACAATTAGAATTTCCTACTTGGTTTAAGAAAAAAGCTCAAGTATTGTATCGTGAAAAAGGGATTTCTGATGCATTATATGTCCTAGCAATTGGCCCGAA
CGACTGTGTTCGATCTTACACTGGGTGTATAGCCAATGGAATTCGATTCCATTCAAAAGAGCATGTGATTCTTTTGAAATGTGAATGGTACGATACAAGTTCAAGAAAGG
GTAGAATTAATGTATCTGATGGTTATACATCGATTAATACTCGTTATAGATGGTACAAAGACGAGTCATTCATACTTGCAAGTCAAGCTACCCAAGTATTCTACTTAGAT
GATTACAAACTTGGACAAGAATGGAAGATTGTTCAACAAATTCAACCAAGACATGTGTGGGATATTCCAGAGGCTGAAAATAATGAAATAGAAGAAGTAGTGGCGGATGA
AAGCATTCGCACATGTTACCCACAAGTTGATGATAACTTAGATTCACAAACCTTCCATAGAGAAGATGTTGAGCCGATGCCTATTGAAGATATAGTATATATAGATAGTG
ACAAGAAAACAACTAGTGATGAAGAAGAGTTTGATTTTGAGTATTCTAGTGATGATTCTGGAAGCACAGATTCCAATATGATGTCGTCTTCAAATCTTGGACCAGGGAAC
CGACGGAGTTACACAAGTAACGTCGATAGAATAGAAAGCACTCAACCGGGAGAGGCTAATCAAACAGCACAGAGTTCTGGCACGACTAAAAGAAAAGGTAGACGCAACAC
TAGAGGGCTGAATGTTGATAAACATGTCCAAAATCATGGTCTGATAGAAATTAATATCGAAGAAGAAGATGGCAAGCTAGTTTGTAGCCATAGTTCGAAACTGGTCTCTC
AAATTGGGGTATGGGTGCGGTCAATCGTTCCTCTGAATTGTGAACATTGGTCAGACGTCAACCAAGACGATAAAAATAGTATCATAGATAGACTTAGCAACCAATTCATT
CTTGATCTCAATGATCCAATTGTTTATCGTTATTTGGAGCATGAGATGAGTAGTCGGCATAGAGATTTTCGATACAGGTTGCATAAGCATTACATGCAATATTCTCCAGA
AGAAGCACGACATCACAAACACAAAGATGTTGCACGAGATGAAGATTGGCATCGCTTATGCGATCGTTGGGAAACAAAGAAGTTTAAGGAACAAATGGTTGAACTGAAAA
ATGCACCAGTGGAAGAAGGTGCAGAACCTCCTTCAGCTCGACAAATCAATGTTCAAGTACTAGGCAGGCGAGGTGCACTTGGGTGGGGACCACCGACGGCGACGACTCAG
AATGAACGAAGTTCTCAAAGAAATAGAGAGTCAAGAAATCAAATCGCTCAATTACAATCCGTTGTACAGACGCAACAAAGCCTGCTTGAATCAGCAATGACGCAATTGAG
TCAGTTACAAGAAACTGTAAACAGGCTAAGCAATGCTGGAGAAGGATCGTCACAACAACCTAATTCTCAACCGTCGAATGATGCTTAA
Protein sequence
MSLKKHFQRRCQLTASLKRFSATHKLYVSLKSLIFIPNQFAISQTLAANAISSPIRRRVTDFSTSPLRSSPPVVVLVRRALLREFGGYCPSVSTAVSITIPRSLRYLKQY
VRNKARPEGPIAEAYIINESLTFCSMYLHGIETRFNRSERNYDNVENVENNSELSIFSGRVRLLGGSQLKCLTSDELEKTHWYILSNCDDVEPYLEEHMSLLQSNGHGDL
HRRQQLEFPTWFKKKAQVLYREKGISDALYVLAIGPNDCVRSYTGCIANGIRFHSKEHVILLKCEWYDTSSRKGRINVSDGYTSINTRYRWYKDESFILASQATQVFYLD
DYKLGQEWKIVQQIQPRHVWDIPEAENNEIEEVVADESIRTCYPQVDDNLDSQTFHREDVEPMPIEDIVYIDSDKKTTSDEEEFDFEYSSDDSGSTDSNMMSSSNLGPGN
RRSYTSNVDRIESTQPGEANQTAQSSGTTKRKGRRNTRGLNVDKHVQNHGLIEINIEEEDGKLVCSHSSKLVSQIGVWVRSIVPLNCEHWSDVNQDDKNSIIDRLSNQFI
LDLNDPIVYRYLEHEMSSRHRDFRYRLHKHYMQYSPEEARHHKHKDVARDEDWHRLCDRWETKKFKEQMVELKNAPVEEGAEPPSARQINVQVLGRRGALGWGPPTATTQ
NERSSQRNRESRNQIAQLQSVVQTQQSLLESAMTQLSQLQETVNRLSNAGEGSSQQPNSQPSNDA
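
The protein sequence can be checked against the CDS by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code and reading frame 1. The `translate` helper is a hypothetical name and is not part of CuGenDB. It is exercised here only on the first and last few codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code in the conventional compact encoding:
# codons enumerated with base order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as wrapped lines can be passed in directly. Codons containing
    characters outside T/C/A/G are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS shown above.
print(translate("ATGTCGCTGAAAAAGCAC"))  # MSLKKH
print(translate("AATGATGCTTAA"))        # NDA
```

The two fragments match the start (`MSLKKH`) and end (`NDA`) of the protein sequence listed above. Translating the full CDS should be done on the complete, unwrapped sequence.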