; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G190010 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G190010
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionmyosin-4-like isoform X1
Genome locationCiama_Chr10:24025363..24028773
RNA-Seq ExpressionCaUC10G190010
SyntenyCaUC10G190010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443448.1 PREDICTED: uncharacterized protein LOC103487040 isoform X1 [Cucumis melo]9.7e-8157.48Show/hide
Query:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD
        S+ + EN  LF+SLINEY AEKM+DE+RIV+LK+R+E+LR++LEATN E+EN KRA+ETT+QELKGCEVELS+N+ S+QTLE RISVLQ EIA+  SEL+
Subjt:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD

Query:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        SLK E     DQ INHLFALN KIR               KFQ++LYMKNV  ++NATEESH+PEEDN K SS SVEERLI+++T+I+HG +D  +T+EQ
Subjt:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI
         + ++R+  IYLE+R+A M+MM KG+ ELE   K                               NSG ELTY HISEELLKSCICP CFKDNT+ALDNI
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI

Query:  P
        P
Subjt:  P

XP_008443449.1 PREDICTED: uncharacterized protein LOC103487040 isoform X2 [Cucumis melo]9.7e-8157.48Show/hide
Query:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD
        S+ + EN  LF+SLINEY AEKM+DE+RIV+LK+R+E+LR++LEATN E+EN KRA+ETT+QELKGCEVELS+N+ S+QTLE RISVLQ EIA+  SEL+
Subjt:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD

Query:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        SLK E     DQ INHLFALN KIR               KFQ++LYMKNV  ++NATEESH+PEEDN K SS SVEERLI+++T+I+HG +D  +T+EQ
Subjt:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI
         + ++R+  IYLE+R+A M+MM KG+ ELE   K                               NSG ELTY HISEELLKSCICP CFKDNT+ALDNI
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI

Query:  P
        P
Subjt:  P

XP_023528774.1 myosin-4-like isoform X2 [Cucurbita pepo subsp. pepo]4.1e-7961.13Show/hide
Query:  MVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDS
        M ++ENQKLFLSLINEYAAEK + E+ +VVLKKR EELR+ELE  N ELENVKR KETTEQELKGCEVELSLNET+IQTLEARISVLQGEIAS+GSELDS
Subjt:  MVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDS

Query:  LKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKN-VASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        LK E    +DQFINHL  LN KIR               KFQDQL  KN + S+ NATE SH+ E DN  TSSQS+EERLIK++T++++  ++  L+ EQ
Subjt:  LKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKN-VASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI
          SQNR+ LI LE+RKA MVMM KG KELE LT                              K+ SGLE++YG +SEELLKSCICP CF+DNTEALDNI
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI

Query:  P
        P
Subjt:  P

XP_038905386.1 uncharacterized protein LOC120091435 isoform X1 [Benincasa hispida]7.7e-9465.57Show/hide
Query:  MSARSMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIG
        MS RSMV+SENQKLFLSLI EY  EK ++E+RIVVLKKR+EEL++ELEATN ELENVK AKETTEQELKGCEVELSLN+TSIQTLEARISVLQGEIAS+G
Subjt:  MSARSMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIG

Query:  SELDSLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHL
        SELDSLK E  + +DQ IN LF LN KIR               KFQ+QLY KNV S+ NATEESH+PEEDN KTSSQS+EERLIKL+T+ISHG+DD  L
Subjt:  SELDSLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHL

Query:  TDEQFISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEA
        T+E+F+ +NR+ +IYLE+R+A M MM KG +ELE L                               K+NSG E+TYGHISEELLKSCICPYCFKDN EA
Subjt:  TDEQFISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEA

Query:  LDNIP
        LDNIP
Subjt:  LDNIP

XP_038905387.1 uncharacterized protein LOC120091435 isoform X2 [Benincasa hispida]7.7e-9465.57Show/hide
Query:  MSARSMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIG
        MS RSMV+SENQKLFLSLI EY  EK ++E+RIVVLKKR+EEL++ELEATN ELENVK AKETTEQELKGCEVELSLN+TSIQTLEARISVLQGEIAS+G
Subjt:  MSARSMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIG

Query:  SELDSLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHL
        SELDSLK E  + +DQ IN LF LN KIR               KFQ+QLY KNV S+ NATEESH+PEEDN KTSSQS+EERLIKL+T+ISHG+DD  L
Subjt:  SELDSLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHL

Query:  TDEQFISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEA
        T+E+F+ +NR+ +IYLE+R+A M MM KG +ELE L                               K+NSG E+TYGHISEELLKSCICPYCFKDN EA
Subjt:  TDEQFISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEA

Query:  LDNIP
        LDNIP
Subjt:  LDNIP

TrEMBL top hitse value%identityAlignment
A0A1S3B8T2 uncharacterized protein LOC103487040 isoform X14.7e-8157.48Show/hide
Query:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD
        S+ + EN  LF+SLINEY AEKM+DE+RIV+LK+R+E+LR++LEATN E+EN KRA+ETT+QELKGCEVELS+N+ S+QTLE RISVLQ EIA+  SEL+
Subjt:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD

Query:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        SLK E     DQ INHLFALN KIR               KFQ++LYMKNV  ++NATEESH+PEEDN K SS SVEERLI+++T+I+HG +D  +T+EQ
Subjt:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI
         + ++R+  IYLE+R+A M+MM KG+ ELE   K                               NSG ELTY HISEELLKSCICP CFKDNT+ALDNI
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI

Query:  P
        P
Subjt:  P

A0A1S3B8U9 uncharacterized protein LOC103487040 isoform X24.7e-8157.48Show/hide
Query:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD
        S+ + EN  LF+SLINEY AEKM+DE+RIV+LK+R+E+LR++LEATN E+EN KRA+ETT+QELKGCEVELS+N+ S+QTLE RISVLQ EIA+  SEL+
Subjt:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD

Query:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        SLK E     DQ INHLFALN KIR               KFQ++LYMKNV  ++NATEESH+PEEDN K SS SVEERLI+++T+I+HG +D  +T+EQ
Subjt:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI
         + ++R+  IYLE+R+A M+MM KG+ ELE   K                               NSG ELTY HISEELLKSCICP CFKDNT+ALDNI
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI

Query:  P
        P
Subjt:  P

A0A5A7UJR5 Cingulin-like1.5e-6660.59Show/hide
Query:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD
        S+ + EN  LF+SLINEY AEKM+DE+RIV+LK+R+E+LR++LEATN E+EN KRA+ETT+QELKGCEVELS+N+ S+QTLE RISVLQ EIA+  SEL+
Subjt:  SMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELD

Query:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        SLK E     DQ INHLFALN KIR               KFQ++LYMKNV  ++NATEESH+PEEDN K SS SVEERLI+++T+I+HG +D  +T+EQ
Subjt:  SLKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYP
         + ++R+  IYLE+R+A M+MM KG+ ELE   KYP
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYP

A0A6J1F7L8 myosin-4-like isoform X12.9e-7860.8Show/hide
Query:  MVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDS
        M +SENQKLFLSLINEYAAEK + E+ +VVLKKR EELR+ELE  N ELENVKR KETTEQELKGCEVELSLNET+IQTLEARISVLQGEIAS+GSELDS
Subjt:  MVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDS

Query:  LKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKN-VASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        LK      +DQFINHL  LN KIR               KFQDQL  KN + S+ NATE SH+ E DN  TSSQS+EERLIK++ ++++  ++  L+ EQ
Subjt:  LKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKN-VASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI
          SQNR+ LI LE+RKA MVMM KG KELE LT                              K+ SGLE++YG +SEELLKSCICP CF+DNTEALDNI
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI

Query:  P
        P
Subjt:  P

A0A6J1J729 uncharacterized protein LOC111482344 isoform X11.4e-7760.13Show/hide
Query:  MVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDS
        M ++ENQKLFLSLINEYAAEK + E+ +VVLKKR+EELR+ELE  N ELENVKR KETTEQELKGCEVELSLNET+IQTLEARISVLQGEIAS+GSELDS
Subjt:  MVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDS

Query:  LKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKN-VASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ
        LK E    +DQFINHL  LN KIR               KFQDQL  K  + S+ NA E SH+ E DN   SSQS+EERLIK++T++++  ++  L+ EQ
Subjt:  LKNEILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKN-VASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQ

Query:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI
          SQNR+ LI LE+RKA MVMM KG KELE LT                              K+ SGLE++YG +SEELLKSCICP CF+DNTEALDNI
Subjt:  FISQNRKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNI

Query:  P
        P
Subjt:  P

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G28370.1 unknown protein1.2e-2835.32Show/hide
Query:  ENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDSLKNE
        + QK  LSLI ++ +E+ + E+R+V LKKR+E L++E+EA N+E+E  KR KE  E+EL G EVELSLN+ +IQ+LEARIS+LQ E+ +IGSE+D+LKN+
Subjt:  ENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDSLKNE

Query:  ILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQFISQN
          L +DQFI+ +  LN +IR               +FQ  +     +        + K  ED +    ++++  L ++ ++++   ++ +L +++   Q 
Subjt:  ILLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQFISQN

Query:  RKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLII
        +K L   E++ + M  +      ++VLT+YPL+++
Subjt:  RKVLIYLEERKAEMVMMAKGRKELEVLTKYPLLII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGCAAGATCAATGGTGATATCCGAAAATCAAAAGCTCTTTCTCTCCCTAATCAACGAATACGCGGCTGAAAAAATGAAAGATGAGGAAAGAATTGTTGTTTTGAA
GAAGAGAGTTGAAGAACTGCGAAATGAGCTTGAAGCCACCAATGCGGAACTCGAAAATGTCAAGCGCGCCAAAGAAACCACTGAACAGGAACTTAAAGGCTGTGAAGTCG
AATTGTCTTTGAATGAAACTTCAATTCAGACTCTCGAGGCAAGGATTTCTGTGCTACAAGGTGAAATTGCATCTATTGGATCTGAGTTGGACTCTCTTAAGAACGAAATT
CTTCTATTCCAGGACCAGTTTATCAATCACTTGTTTGCCCTTAACACAAAAATCAGGTTCGTTTTCTATATTAATGCAAGCAACTTGCACTGTGAATGCTTGAAATTTCA
AGATCAACTGTACATGAAAAATGTTGCGTCTATTGAAAATGCTACAGAAGAATCCCATAAACCTGAGGAAGATAATGCCAAAACTTCTTCCCAATCTGTTGAGGAAAGGC
TTATCAAGTTAGTAACTAAAATATCCCATGGAAATGATGATCACCACCTAACTGACGAACAATTTATAAGTCAGAATCGGAAAGTGTTGATTTATCTAGAGGAAAGGAAG
GCTGAAATGGTGATGATGGCGAAAGGAAGAAAAGAATTGGAAGTTTTGACTAAATATCCTTTACTGATTATTTTAATTTATGATATATGGCTACTAACATTCAATAATCT
TTCACCAACTTTGGGCTCTTTGACCGTCAATAAGCGGAATTCTGGACTGGAACTGACATATGGTCATATCAGTGAAGAGTTGCTGAAAAGTTGCATTTGTCCTTACTGTT
TCAAAGACAATACGGAGGCCTTGGACAACATTCCT
mRNA sequenceShow/hide mRNA sequence
AGATATTCTCAATTTCCATATACCACTACAGCAAAATTAGTAGACAGCAAGTCATGAGCGCAAGATCAATGGTGATATCCGAAAATCAAAAGCTCTTTCTCTCCCTAATC
AACGAATACGCGGCTGAAAAAATGAAAGATGAGGAAAGAATTGTTGTTTTGAAGAAGAGAGTTGAAGAACTGCGAAATGAGCTTGAAGCCACCAATGCGGAACTCGAAAA
TGTCAAGCGCGCCAAAGAAACCACTGAACAGGAACTTAAAGGCTGTGAAGTCGAATTGTCTTTGAATGAAACTTCAATTCAGACTCTCGAGGCAAGGATTTCTGTGCTAC
AAGGTGAAATTGCATCTATTGGATCTGAGTTGGACTCTCTTAAGAACGAAATTCTTCTATTCCAGGACCAGTTTATCAATCACTTGTTTGCCCTTAACACAAAAATCAGG
TTCGTTTTCTATATTAATGCAAGCAACTTGCACTGTGAATGCTTGAAATTTCAAGATCAACTGTACATGAAAAATGTTGCGTCTATTGAAAATGCTACAGAAGAATCCCA
TAAACCTGAGGAAGATAATGCCAAAACTTCTTCCCAATCTGTTGAGGAAAGGCTTATCAAGTTAGTAACTAAAATATCCCATGGAAATGATGATCACCACCTAACTGACG
AACAATTTATAAGTCAGAATCGGAAAGTGTTGATTTATCTAGAGGAAAGGAAGGCTGAAATGGTGATGATGGCGAAAGGAAGAAAAGAATTGGAAGTTTTGACTAAATAT
CCTTTACTGATTATTTTAATTTATGATATATGGCTACTAACATTCAATAATCTTTCACCAACTTTGGGCTCTTTGACCGTCAATAAGCGGAATTCTGGACTGGAACTGAC
ATATGGTCATATCAGTGAAGAGTTGCTGAAAAGTTGCATTTGTCCTTACTGTTTCAAAGACAATACGGAGGCCTTGGACAACATTCCT
Protein sequenceShow/hide protein sequence
MSARSMVISENQKLFLSLINEYAAEKMKDEERIVVLKKRVEELRNELEATNAELENVKRAKETTEQELKGCEVELSLNETSIQTLEARISVLQGEIASIGSELDSLKNEI
LLFQDQFINHLFALNTKIRFVFYINASNLHCECLKFQDQLYMKNVASIENATEESHKPEEDNAKTSSQSVEERLIKLVTKISHGNDDHHLTDEQFISQNRKVLIYLEERK
AEMVMMAKGRKELEVLTKYPLLIILIYDIWLLTFNNLSPTLGSLTVNKRNSGLELTYGHISEELLKSCICPYCFKDNTEALDNIP