; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G04690 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G04690
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr04:16955396..16956361
RNA-Seq ExpressionClc04G04690
SyntenyClc04G04690
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida]6.1e-8470.71Show/hide
Query:  AKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDLWVRSHLSAKGM
        AKL+E LLYLV+IGWRS    F       L       +   ALNQN IECKVRSLKKQYN +SEML QS F WNEEFKCVQVERE+F+LWVRSH +AKGM
Subjt:  AKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDLWVRSHLSAKGM

Query:  WNKPFPHYDDFSTVFGKDRA--VESPLNQDGMDEEPTKQSTGRATL-AESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKE
        WNKPFPHYDD ST         +ES LNQD +DEEPT+QSTGR ++  ESS+GSKRKR SFQ EMIDIMRSTVEM +THMG+LASWQK+KYELEFGRQKE
Subjt:  WNKPFPHYDDFSTVFGKDRA--VESPLNQDGMDEEPTKQSTGRATL-AESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKE

Query:  VVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH
        VVN IYNI+GLDE  QVTLIDL+VTDIQKT+CFL V EH
Subjt:  VVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]4.3e-10676.81Show/hide
Query:  MAGNS-RTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVE
        M GNS R+KHVWSKVEDA+LVEALLYLVE GWRSDN TFRPGYLQ+LE+ILHEKV G ALN+N IECKVRSLKKQYN VSEML QS F WNEEFKCVQVE
Subjt:  MAGNS-RTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVE

Query:  REVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRA---------VESPLNQDGMDEEPTKQSTGRATL-AESSQGSKRKRPSFQYEMIDIMRSTVEMQ
        RE+FDLWVRSH +AKGMW KPFPHYDD S VFGKDRA          ESPLNQD +DEEP +QSTGRA++  ESS+GSKRKR SFQ EMIDI++STVEMQ
Subjt:  REVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRA---------VESPLNQDGMDEEPTKQSTGRATL-AESSQGSKRKRPSFQYEMIDIMRSTVEMQ

Query:  NTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH
        +THMG+LASWQ EKYELE    KEVVN IYNI+ L+E DQVTLIDL+VTDIQKT+CFL V EH
Subjt:  NTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH

XP_038892629.1 uncharacterized protein At2g29880-like [Benincasa hispida]5.2e-7572.25Show/hide
Query:  IMAGN-SRTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQV
        IMAGN  ++KHVWSKVEDAKLVEALLYLVE GWR DN TFRPGYLQ+LE+ILHEKV G ALN N IECKVRSLKKQYN VSEML QS  GWNEEFKCV V
Subjt:  IMAGN-SRTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQV

Query:  EREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAV--------------------ESPLNQDGMDEEPTKQSTGRATL-AESSQGSKRKRPSFQYE
        ERE+FDLWV SH +AK MWNKPFPHYDD ST+FGKDRAV                    ESPLNQD +DEEP +QSTGRA++ AESS+ +KR R SFQ E
Subjt:  EREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAV--------------------ESPLNQDGMDEEPTKQSTGRATL-AESSQGSKRKRPSFQYE

Query:  MIDIMRSTV
        MIDIMRSTV
Subjt:  MIDIMRSTV

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida]1.8e-8368.02Show/hide
Query:  RTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDL
        R+KHVWSKVEDAK VEALLYLV+ GWRSDN TFR  YLQ+LERI HEKV G ALNQN IECKVRSLKKQ N VSEML QS F WNEEFKCVQVERE+FD 
Subjt:  RTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDL

Query:  WVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVESPLNQDGMDEEPTKQSTGRATLAESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYE
        WVRSH +AKGMWNKPFPHYDD STVFGK +AV          E+P   +T                  F+ E+    +     ++THMG+LASWQKEKYE
Subjt:  WVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVESPLNQDGMDEEPTKQSTGRATLAESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYE

Query:  LEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH
        LEFGR+KEVVN IYNI+GLDE DQVTLIDLLVTDIQKTNCFL V EH
Subjt:  LEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]2.8e-11381.37Show/hide
Query:  MAGN-SRTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVE
        MAG+  R+KHVWSKVED KLVEALLYLVE GWRSDN TFR GYLQYLERILHEKV G ALNQN IECKVRSLKKQYN VSEML QS FGWNEEFKCVQVE
Subjt:  MAGN-SRTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVE

Query:  REVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRA---------VESPLNQDGMDEEPTKQSTGRAT-LAESSQGSKRKRPSFQYEMIDIMRSTVEMQ
        +E+FDLWVRSHL+AKGMWNK F HYDD STVFGKDRA          ESPLNQD +DEEP +QSTGRA+ LAESS+GSKRKRPSFQ EMIDIMRSTVEMQ
Subjt:  REVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRA---------VESPLNQDGMDEEPTKQSTGRAT-LAESSQGSKRKRPSFQYEMIDIMRSTVEMQ

Query:  NTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH
        +THMG+LASWQKEKYELEFGR+KEVVN IY+I+GLDE DQVT IDLLVTDIQKT+CFL V EH
Subjt:  NTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859538.7e-4439.1Show/hide
Query:  MAGNSRT-KHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNN-IECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKC
        MA  SR  KH W+K E+ K VE L+ LV   GWRSDN TF+PGYL  L+R++ EK+ G+ + +++ I+C V+SLKK Y+ ++EM G   S FGWNEEF+C
Subjt:  MAGNSRT-KHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNN-IECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKC

Query:  VQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVES--------------------PLNQDGMDEEPTKQSTG-----------RATLAES-
        +  ER++FD W++SH +AKG+ +K FP+YDD S VFGKDRA  +                    PL     ++ PT  S G           RA  A   
Subjt:  VQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVES--------------------PLNQDGMDEEPTKQSTG-----------RATLAES-

Query:  ---SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTV
           S  SKRKR S +YE ++++RS +E  N  +  +A W KEK  +E   + +VV  + +I  L   D+  L+ +L   ++    FL++
Subjt:  ---SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTV

A0A5A7U0H7 Retrotransposon protein8.7e-4439.1Show/hide
Query:  MAGNSRT-KHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNN-IECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKC
        MA  SR  KH W+K E+ K VE L+ LV   GWRSDN TF+PGYL  L+R++ EK+ G+ + +++ I+C V+SLKK Y+ ++EM G   S FGWNEEF+C
Subjt:  MAGNSRT-KHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNN-IECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKC

Query:  VQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVES--------------------PLNQDGMDEEPTKQSTG-----------RATLAES-
        +  ER++FD W++SH +AKG+ +K FP+YDD S VFGKDRA  +                    PL     ++ PT  S G           RA  A   
Subjt:  VQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVES--------------------PLNQDGMDEEPTKQSTG-----------RATLAES-

Query:  ---SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTV
           S  SKRKR S +YE ++++RS +E  N  +  +A W KEK  +E   + +VV  + +I  L   D+  L+ +L   ++    FL++
Subjt:  ---SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTV

A0A5A7U9V6 Retrotransposon protein1.3e-3636.21Show/hide
Query:  MAGNSR-TKHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKCV
        M  +SR  KH W+K E+A LVE L+ LV   GWRSDN TFRPGYL  L R++  K+ GS ++ + I+ +++ +K+ ++ ++EM G   S FGWN+E KC+
Subjt:  MAGNSR-TKHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKCV

Query:  QVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAV----ES--------------------------PLNQDGMDEEPTK-QSTGRATLAES--
          E+EVFD W  SH +AKG+ NK F HYD+ S VFGKDRA+    ES                          P+   G++  P     T  A ++E   
Subjt:  QVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAV----ES--------------------------PLNQDGMDEEPTK-QSTGRATLAES--

Query:  -SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH
         S GSKRKRP    +  DI+R+ +E  N  + ++A W   + +     ++E+V  +  I  L  +D+  L+ +L+ ++     FL V +H
Subjt:  -SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH

A0A5A7VFF1 Retrotransposon protein9.3e-3837.31Show/hide
Query:  MAGNSR-TKHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKCV
        MA +SR  KH W+K E+A LVE L+ LV   GWRSDNETFRPGYL  L R++  K+ GS ++ + I+ +++ LK+ ++ +  M G   S FGWN+E KC+
Subjt:  MAGNSR-TKHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKCV

Query:  QVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAV------------ESPLNQDGMDEEPTKQSTGRATLAESSQGSKRKRPSFQYEMIDIMRS
          E+EV D WV+SH +AKG+ NK F HYD+ S VFGKDRA              +P   +    + T  +  +      S GSKRKR     +  DI+R+
Subjt:  QVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAV------------ESPLNQDGMDEEPTKQSTGRATLAESSQGSKRKRPSFQYEMIDIMRS

Query:  TVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH
         +E  N  + ++A W   + +     ++EVV  +  I  L  +D+  L+ +L+ ++     FL V ++
Subjt:  TVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH

E5GCB5 Retrotransposon protein7.9e-3736.43Show/hide
Query:  IMAGNSR-TKHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKC
        IM  +SR  KH W+K E+A LVE L+ LV   GWRSDN TFRPGYL  L R++  K+ GS ++ + I+ +++ +K+ ++ ++EM G   S FGWN+E KC
Subjt:  IMAGNSR-TKHVWSKVEDAKLVEALLYLVEI-GWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQ--SEFGWNEEFKC

Query:  VQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRA----VES--------------------------PLNQDGMDEEPTK-QSTGRATLAES-
        +  E+EVFD W  SH +AKG+ NK F HYD+ S VFGKDRA     ES                          P+   G++  P     T  A ++E  
Subjt:  VQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRA----VES--------------------------PLNQDGMDEEPTK-QSTGRATLAES-

Query:  --SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH
          S GSKRKRP    +  DI+R+ +E  N  + ++A W   + +     ++E+V  +  I  L  +D+  L+ +L+ ++     FL V +H
Subjt:  --SQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02210.1 unknown protein1.3e-0421.05Show/hide
Query:  KVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVESP--LNQDGMDEEPTKQ---STGRATLA
        + +SL++Q+N +  +L    F W+ E + V  +  V+  ++++H  A+    +P P+Y D   + G     E+   +  D  D E   Q   S+G   L+
Subjt:  KVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVESP--LNQDGMDEEPTKQ---STGRATLA

Query:  ESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDL--LVTDIQKTNCFLTV
         S++        F     D      ++ NT    +     +K  ++  +   + + +  I  L ++D   ++D   L+ D  K   FL +
Subjt:  ESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDL--LVTDIQKTNCFLTV

AT4G02210.2 unknown protein1.3e-0421.05Show/hide
Query:  KVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVESP--LNQDGMDEEPTKQ---STGRATLA
        + +SL++Q+N +  +L    F W+ E + V  +  V+  ++++H  A+    +P P+Y D   + G     E+   +  D  D E   Q   S+G   L+
Subjt:  KVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFDLWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVESP--LNQDGMDEEPTKQ---STGRATLA

Query:  ESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDL--LVTDIQKTNCFLTV
         S++        F     D      ++ NT    +     +K  ++  +   + + +  I  L ++D   ++D   L+ D  K   FL +
Subjt:  ESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEVVNVIYNINGLDEVDQVTLIDL--LVTDIQKTNCFLTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGCATGCAGTATAATGGCAGGTAATAGTAGGACTAAACACGTATGGTCGAAGGTGGAGGACGCTAAGTTGGTGGAAGCCCTACTGTATTTGGTGGAGATCGGTTG
GAGGTCTGACAATGAGACGTTTCGACCAGGATATCTACAGTACTTGGAGCGAATTCTGCATGAAAAGGTGTCTGGGAGCGCACTGAATCAGAACAACATCGAGTGCAAGG
TGAGGAGTCTGAAGAAACAATACAATGAAGTATCAGAGATGTTAGGTCAGTCAGAGTTCGGCTGGAACGAGGAGTTCAAATGTGTCCAGGTAGAGAGGGAGGTTTTTGAT
CTTTGGGTTCGGAGTCATCTCAGTGCGAAGGGGATGTGGAACAAGCCATTCCCCCATTATGATGACTTCTCCACCGTATTTGGGAAAGATAGAGCAGTAGAATCACCATT
AAATCAAGATGGAATGGATGAAGAGCCAACAAAGCAATCTACAGGTAGAGCGACACTTGCGGAGTCATCTCAAGGCAGCAAGAGGAAGAGACCATCATTCCAATATGAAA
TGATCGACATCATGAGATCGACTGTTGAAATGCAGAACACACACATGGGTAAACTTGCATCGTGGCAGAAGGAGAAGTATGAGCTGGAGTTCGGGCGTCAGAAGGAAGTA
GTAAACGTCATATACAACATTAACGGCTTGGATGAGGTTGACCAGGTCACTCTTATTGACCTCCTTGTCACGGACATTCAGAAGACGAATTGCTTCCTTACAGTATCAGA
ACACACATAG
mRNA sequenceShow/hide mRNA sequence
ATGTATGCATGCAGTATAATGGCAGGTAATAGTAGGACTAAACACGTATGGTCGAAGGTGGAGGACGCTAAGTTGGTGGAAGCCCTACTGTATTTGGTGGAGATCGGTTG
GAGGTCTGACAATGAGACGTTTCGACCAGGATATCTACAGTACTTGGAGCGAATTCTGCATGAAAAGGTGTCTGGGAGCGCACTGAATCAGAACAACATCGAGTGCAAGG
TGAGGAGTCTGAAGAAACAATACAATGAAGTATCAGAGATGTTAGGTCAGTCAGAGTTCGGCTGGAACGAGGAGTTCAAATGTGTCCAGGTAGAGAGGGAGGTTTTTGAT
CTTTGGGTTCGGAGTCATCTCAGTGCGAAGGGGATGTGGAACAAGCCATTCCCCCATTATGATGACTTCTCCACCGTATTTGGGAAAGATAGAGCAGTAGAATCACCATT
AAATCAAGATGGAATGGATGAAGAGCCAACAAAGCAATCTACAGGTAGAGCGACACTTGCGGAGTCATCTCAAGGCAGCAAGAGGAAGAGACCATCATTCCAATATGAAA
TGATCGACATCATGAGATCGACTGTTGAAATGCAGAACACACACATGGGTAAACTTGCATCGTGGCAGAAGGAGAAGTATGAGCTGGAGTTCGGGCGTCAGAAGGAAGTA
GTAAACGTCATATACAACATTAACGGCTTGGATGAGGTTGACCAGGTCACTCTTATTGACCTCCTTGTCACGGACATTCAGAAGACGAATTGCTTCCTTACAGTATCAGA
ACACACATAG
Protein sequenceShow/hide protein sequence
MYACSIMAGNSRTKHVWSKVEDAKLVEALLYLVEIGWRSDNETFRPGYLQYLERILHEKVSGSALNQNNIECKVRSLKKQYNEVSEMLGQSEFGWNEEFKCVQVEREVFD
LWVRSHLSAKGMWNKPFPHYDDFSTVFGKDRAVESPLNQDGMDEEPTKQSTGRATLAESSQGSKRKRPSFQYEMIDIMRSTVEMQNTHMGKLASWQKEKYELEFGRQKEV
VNVIYNINGLDEVDQVTLIDLLVTDIQKTNCFLTVSEHT