; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025001 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025001
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon protein
Genome locationscaffold12:7618847..7621777
RNA-Seq ExpressionSpg025001
SyntenySpg025001
Gene Ontology termsNA
InterPro domainsIPR009027 - Ribosomal protein L9/RNase H1, N-terminal
IPR011320 - Ribonuclease H1, N-terminal
IPR024752 - Myb/SANT-like domain
IPR037056 - Ribonuclease H1, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050106.1 retrotransposon protein [Cucumis melo var. makuwa]1.1e-6645.03Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT   DEVLV+CLL LV+ GGWRADNGTF+ GY  Q+ K+MKE++ G NI V+PN++SRVK LKKQY+ IAEMMGPACSGFGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+                          +AR +                  ++DM+++ ED  +P+P  ++P  GE++  TPT     A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LY +LQ+IPG+ ++D L+VA +LL D  ML  F+D+P EWK+  CM IL
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

Query:  GR
        GR
Subjt:  GR

TYJ96933.1 retrotransposon protein [Cucumis melo var. makuwa]1.8e-5948.52Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY  IA+MMGPACS FGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T    T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
        G SR   KRR   G++ +      Q T+++I KIA W
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]3.9e-6745.83Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  EDEVLV+CLL LV+ GGWRADNGTF+ GY                               KQY  IAEMMGPACSGFGWN+ +KC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IE EK +FD WV+GHP A+GL N+PFP+F +L +VFG+D A G R +TP+EM+ +     + ++DM+++ ED  +P+P  ++P  GE++  TPT  T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LYAELQ+IPG+ ++D L+VA +LL D  ML  F+D+P
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa]3.4e-5552.43Show/hide
Query:  SKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKCIEAE
        SK  KH WT  EDE LV+CLL LV+ G WR DNGTF+ GY  Q+ K+MKE++   NI V+PN+ES VK LKKQY  IAEMMGP CSGF WN ERKCIEAE
Subjt:  SKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKCIEAE

Query:  KEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGAGPSR
        K + + WV+GH  A+ L N+PFP+F +L +VFG+D A G + +TP+EM  +     + ++DM ++ ED  +P+P  ++P  GE++  TPT     AG SR
Subjt:  KEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGAGPSR

Query:  AVNKRR
           KRR
Subjt:  AVNKRR

XP_008455678.1 PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]2.7e-6047.95Show/hide
Query:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFG
        F+ T   +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY  IA+MMGPACS FG
Subjt:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFG

Query:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP
        WN+ERKCIEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T 
Subjt:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP

Query:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
           T  AG SR   KRR   G++ +      Q T+++I KIA W
Subjt:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

TrEMBL top hitse value%identityAlignment
A0A1S3C252 uncharacterized protein At2g29880-like1.3e-6047.95Show/hide
Query:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFG
        F+ T   +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY  IA+MMGPACS FG
Subjt:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFG

Query:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP
        WN+ERKCIEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T 
Subjt:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP

Query:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
           T  AG SR   KRR   G++ +      Q T+++I KIA W
Subjt:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

A0A5A7U7F7 Retrotransposon protein5.5e-6745.03Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT   DEVLV+CLL LV+ GGWRADNGTF+ GY  Q+ K+MKE++ G NI V+PN++SRVK LKKQY+ IAEMMGPACSGFGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+                          +AR +                  ++DM+++ ED  +P+P  ++P  GE++  TPT     A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LY +LQ+IPG+ ++D L+VA +LL D  ML  F+D+P EWK+  CM IL
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

Query:  GR
        GR
Subjt:  GR

A0A5D3BC95 Retrotransposon protein8.5e-6048.52Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY  IA+MMGPACS FGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T    T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
        G SR   KRR   G++ +      Q T+++I KIA W
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

A0A5D3C7T4 Uncharacterized protein1.9e-6745.83Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  EDEVLV+CLL LV+ GGWRADNGTF+ GY                               KQY  IAEMMGPACSGFGWN+ +KC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IE EK +FD WV+GHP A+GL N+PFP+F +L +VFG+D A G R +TP+EM+ +     + ++DM+++ ED  +P+P  ++P  GE++  TPT  T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LYAELQ+IPG+ ++D L+VA +LL D  ML  F+D+P
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein1.7e-5552.43Show/hide
Query:  SKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKCIEAE
        SK  KH WT  EDE LV+CLL LV+ G WR DNGTF+ GY  Q+ K+MKE++   NI V+PN+ES VK LKKQY  IAEMMGP CSGF WN ERKCIEAE
Subjt:  SKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKCIEAE

Query:  KEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGAGPSR
        K + + WV+GH  A+ L N+PFP+F +L +VFG+D A G + +TP+EM  +     + ++DM ++ ED  +P+P  ++P  GE++  TPT     AG SR
Subjt:  KEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGAGPSR

Query:  AVNKRR
           KRR
Subjt:  AVNKRR

SwissProt top hitse value%identityAlignment
Q07762 Ribonuclease H3.8e-0444Show/hide
Query:  GVYMTWYECARQVHGFRGAVYQSYETLDDAEATYIAYTMEVDNHSSRHGD
        G+Y TW +C+ QV GF GAVY+S+ TL +A A   A+        S  GD
Subjt:  GVYMTWYECARQVHGFRGAVYQSYETLDDAEATYIAYTMEVDNHSSRHGD

Arabidopsis top hitse value%identityAlignment
AT2G24960.1 unknown protein5.4e-0618.82Show/hide
Query:  MAASASASKKEKHIWTPEEDEVLVQCLL----------HLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGP
        M+   + + + +  WTP  +   +  +L          H      W      F + + +Q  K +              ++SR   L KQY  +  ++  
Subjt:  MAASASASKKEKHIWTPEEDEVLVQCLL----------HLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGP

Query:  ACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPE
           GF W+   + +  +  ++ L+++ HP+A+  + +P   F++L L++G   A G  + +  ++  E E
Subjt:  ACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPE

AT2G24960.2 unknown protein5.5e-1123.53Show/hide
Query:  TMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVG----------GWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMG
        T A+    S + +  WTP  D  L+  L+  V  G           W      F A + +Q  K +              +++R K L++ Y  I  ++ 
Subjt:  TMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVG----------GWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMG

Query:  PACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVD-FED
           +GF W+  R  + A+ +I++ +++ HP+A+  R +  P +  L  +FGK+++ G   R      P P     ++E  + D F+D
Subjt:  PACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVD-FED

AT5G27260.1 unknown protein4.5e-1331.51Show/hide
Query:  KKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTF-----RAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC
        K + + W+PEE ++LVQ L+  +    WR  NGT         +  +I K        C      +  SR+K LK QY    ++     SGFGW+   K 
Subjt:  KKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTF-----RAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRA
          A  E++  +++ HP  K LR   F +F+EL ++FG+  A G  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGGTGTGTACATGACGTGGTACGAATGTGCCCGCCAAGTACATGGTTTTCGTGGTGCCGTTTACCAGTCGTATGAGACGTTGGACGACGCAGAAGCAACTTATAT
CGCTTACACCATGGAAGTGGACAATCATTCCAGTCGTCATGGAGACCTTCACCTCCACAATAGCTATCCTAACCGTGAACTAGATAGAAATGTAGTCAATAGGACTGTTA
GCACATGTAGTGGAACATACCATGTCCTCCTCTTCATAGTAGGGATTGTCCGTAGAAGCTATCGAATGGATGCCCAAGATAATGAAATTCAAGAATTGATCGCAATTCTT
ACTATCATTTGTGCGACACAGTATCAGTTCATCGCGACAATGGCCGCATCTGCCTCAGCTTCAAAGAAAGAAAAACACATATGGACGCCAGAGGAAGACGAAGTGCTTGT
GCAGTGCCTACTGCACCTAGTCCAAGTGGGGGGATGGCGTGCAGACAATGGGACGTTTCGGGCTGGTTACCAGAATCAGATTGGAAAGATGATGAAAGAAAGACTACCAG
GATGCAACATAGTCGTGAGCCCGAACATTGAATCTAGAGTGAAGACCCTAAAAAAACAGTACATGGTGATAGCTGAGATGATGGGACCGGCCTGCAGTGGATTTGGTTGG
AATGATGAGAGAAAATGCATAGAGGCGGAGAAGGAGATATTCGATTTGTGGGTCGAGGGACATCCACAGGCCAAGGGCCTCCGTAACCGGCCATTCCCATGGTTCAATGA
GTTGGCGCTCGTTTTCGGGAAGGACAGCGCTAGAGGAGTGAGAGCCCGAACCCCAATTGAGATGACACCTGAACCAGAGCCGGTAGCCGATTTGGATGAAGACATGAACG
TGGATTTTGAGGATTGCTACGTCCCAAGTCCCCCAGTTATTGATCCCACACTTGGAGAAGAATTATGTGGGACACCGACTGGTAGGACAGCTGGTGCAGGACCTTCTAGG
GCAGTAAATAAGAGACGATTATCCATTGGGAACGTGGCGGAGGTACTGGAGAATGGATTCCAGATGACCGCACAACAAATCGAGAAGATTGCGCTCTGGCCGACCAAGAG
GGACGAGCTCGAGAGGAGTCGGCGGAAGGAGTTATATGCGGAGCTACAATCTATTCCTGGGGTGTCTATGGAGGACGGCTTGGTGGTCGCACGGGCACTACTATCAGATG
AGAGGATGTTGACTCACTTTATGGACTTCCCTCCAGAATGGAAGTTCGACTACTGTATGGAGATCCTCGGTAGGGCATCGAGACAGCCCCCGCAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCGGTGTGTACATGACGTGGTACGAATGTGCCCGCCAAGTACATGGTTTTCGTGGTGCCGTTTACCAGTCGTATGAGACGTTGGACGACGCAGAAGCAACTTATAT
CGCTTACACCATGGAAGTGGACAATCATTCCAGTCGTCATGGAGACCTTCACCTCCACAATAGCTATCCTAACCGTGAACTAGATAGAAATGTAGTCAATAGGACTGTTA
GCACATGTAGTGGAACATACCATGTCCTCCTCTTCATAGTAGGGATTGTCCGTAGAAGCTATCGAATGGATGCCCAAGATAATGAAATTCAAGAATTGATCGCAATTCTT
ACTATCATTTGTGCGACACAGTATCAGTTCATCGCGACAATGGCCGCATCTGCCTCAGCTTCAAAGAAAGAAAAACACATATGGACGCCAGAGGAAGACGAAGTGCTTGT
GCAGTGCCTACTGCACCTAGTCCAAGTGGGGGGATGGCGTGCAGACAATGGGACGTTTCGGGCTGGTTACCAGAATCAGATTGGAAAGATGATGAAAGAAAGACTACCAG
GATGCAACATAGTCGTGAGCCCGAACATTGAATCTAGAGTGAAGACCCTAAAAAAACAGTACATGGTGATAGCTGAGATGATGGGACCGGCCTGCAGTGGATTTGGTTGG
AATGATGAGAGAAAATGCATAGAGGCGGAGAAGGAGATATTCGATTTGTGGGTCGAGGGACATCCACAGGCCAAGGGCCTCCGTAACCGGCCATTCCCATGGTTCAATGA
GTTGGCGCTCGTTTTCGGGAAGGACAGCGCTAGAGGAGTGAGAGCCCGAACCCCAATTGAGATGACACCTGAACCAGAGCCGGTAGCCGATTTGGATGAAGACATGAACG
TGGATTTTGAGGATTGCTACGTCCCAAGTCCCCCAGTTATTGATCCCACACTTGGAGAAGAATTATGTGGGACACCGACTGGTAGGACAGCTGGTGCAGGACCTTCTAGG
GCAGTAAATAAGAGACGATTATCCATTGGGAACGTGGCGGAGGTACTGGAGAATGGATTCCAGATGACCGCACAACAAATCGAGAAGATTGCGCTCTGGCCGACCAAGAG
GGACGAGCTCGAGAGGAGTCGGCGGAAGGAGTTATATGCGGAGCTACAATCTATTCCTGGGGTGTCTATGGAGGACGGCTTGGTGGTCGCACGGGCACTACTATCAGATG
AGAGGATGTTGACTCACTTTATGGACTTCCCTCCAGAATGGAAGTTCGACTACTGTATGGAGATCCTCGGTAGGGCATCGAGACAGCCCCCGCAGCCATGA
Protein sequenceShow/hide protein sequence
MPGVYMTWYECARQVHGFRGAVYQSYETLDDAEATYIAYTMEVDNHSSRHGDLHLHNSYPNRELDRNVVNRTVSTCSGTYHVLLFIVGIVRRSYRMDAQDNEIQELIAIL
TIICATQYQFIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMVIAEMMGPACSGFGW
NDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGAGPSR
AVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEILGRASRQPPQP