; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg008665 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg008665
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon protein
Genome locationscaffold10:33526824..33529750
RNA-Seq ExpressionSpg008665
SyntenySpg008665
Gene Ontology termsNA
InterPro domainsIPR009027 - Ribosomal protein L9/RNase H1, N-terminal
IPR011320 - Ribonuclease H1, N-terminal
IPR024752 - Myb/SANT-like domain
IPR037056 - Ribonuclease H1, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038975.1 retrotransposon protein [Cucumis melo var. makuwa]1.5e-5539.8Show/hide
Query:  ASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKCI
        +++++  +H+WT EE+  LV+CL+ LV +GGW++DNGTFR GY  Q+ +MM E+LPGC +  +  I+ R+KTLK+ + AIAEM GPACSGFGWNDE KCI
Subjt:  ASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKCI

Query:  EAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDE----DMNVDFEDCYVPSPPVI-DPTLGEELCGTPTGR
         AEKE+FD WV  HP AKGL N+PFP+++EL  VFG+D A G  A T  ++    EP    D     D N DF   Y     +  D            GR
Subjt:  EAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDE----DMNVDFEDCYVPSPPVI-DPTLGEELCGTPTGR

Query:  TAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYC
        T  +G  R   KR        E +      T +Q+ +IA WP +    +   R E +  L+ +P ++  D  ++ R LLS    L  F+  P + +  +C
Subjt:  TAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYC

Query:  MEIL
          +L
Subjt:  MEIL

KAA0050106.1 retrotransposon protein [Cucumis melo var. makuwa]3.8e-6745.36Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT   DEVLV+CLL LV+ GGWRADNGTF+ GY  Q+ K+MKE++ G NI V+PN++SRVK LKKQY+AIAEMMGPACSGFGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+                          +AR +                  ++DM+++ ED  +P+P  ++P  GE++  TPT     A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LY +LQ+IPG+ ++D L+VA +LL D  ML  F+D+P EWK+  CM IL
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

Query:  GR
        GR
Subjt:  GR

TYJ96933.1 retrotransposon protein [Cucumis melo var. makuwa]6.0e-6048.95Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY AIA+MMGPACS FGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T    T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
        G SR   KRR   G++ +      Q T+++I KIA W
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]1.3e-6746.18Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  EDEVLV+CLL LV+ GGWRADNGTF+ GY                               KQY AIAEMMGPACSGFGWN+ +KC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IE EK +FD WV+GHP A+GL N+PFP+F +L +VFG+D A G R +TP+EM+ +     + ++DM+++ ED  +P+P  ++P  GE++  TPT  T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LYAELQ+IPG+ ++D L+VA +LL D  ML  F+D+P
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP

XP_008455678.1 PREDICTED: uncharacterized protein At2g29880-like [Cucumis melo]9.2e-6148.36Show/hide
Query:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFG
        F+ T   +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY AIA+MMGPACS FG
Subjt:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFG

Query:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP
        WN+ERKCIEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T 
Subjt:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP

Query:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
           T  AG SR   KRR   G++ +      Q T+++I KIA W
Subjt:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

TrEMBL top hitse value%identityAlignment
A0A1S3C252 uncharacterized protein At2g29880-like4.4e-6148.36Show/hide
Query:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFG
        F+ T   +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY AIA+MMGPACS FG
Subjt:  FIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFG

Query:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP
        WN+ERKCIEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T 
Subjt:  WNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTP

Query:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
           T  AG SR   KRR   G++ +      Q T+++I KIA W
Subjt:  TGRTAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

A0A5A7U7F7 Retrotransposon protein1.9e-6745.36Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT   DEVLV+CLL LV+ GGWRADNGTF+ GY  Q+ K+MKE++ G NI V+PN++SRVK LKKQY+AIAEMMGPACSGFGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+                          +AR +                  ++DM+++ ED  +P+P  ++P  GE++  TPT     A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LY +LQ+IPG+ ++D L+VA +LL D  ML  F+D+P EWK+  CM IL
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

Query:  GR
        GR
Subjt:  GR

A0A5D3BC95 Retrotransposon protein2.9e-6048.95Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  +D+ LV+CLL LV+ GGWRA+N TF+  Y  Q+ K+MKE++P  NI V+ N+ESRVK LKKQY AIA+MMGPACS FGWN+ERKC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IEAEK +FD WV+GHP A+GL N+PF +F +L +VFG+D A G R +  +EM  +     + ++DM+++ ED  +P+P  ++P  GE++  T    T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW
        G SR   KRR   G++ +      Q T+++I KIA W
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALW

A0A5D3C7T4 Uncharacterized protein6.4e-6846.18Show/hide
Query:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC
        +++ SK  KH WT  EDEVLV+CLL LV+ GGWRADNGTF+ GY                               KQY AIAEMMGPACSGFGWN+ +KC
Subjt:  SASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA
        IE EK +FD WV+GHP A+GL N+PFP+F +L +VFG+D A G R +TP+EM+ +     + ++DM+++ ED  +P+P  ++P  GE++  TPT  T  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGA

Query:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP
        G SR   KRR   G++ +      + T+++I KIA W  ++ E+E S  K LYAELQ+IPG+ ++D L+VA +LL D  ML  F+D+P
Subjt:  GPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFP

A0A5D3CWL2 Retrotransposon protein7.3e-5639.8Show/hide
Query:  ASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKCI
        +++++  +H+WT EE+  LV+CL+ LV +GGW++DNGTFR GY  Q+ +MM E+LPGC +  +  I+ R+KTLK+ + AIAEM GPACSGFGWNDE KCI
Subjt:  ASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKCI

Query:  EAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDE----DMNVDFEDCYVPSPPVI-DPTLGEELCGTPTGR
         AEKE+FD WV  HP AKGL N+PFP+++EL  VFG+D A G  A T  ++    EP    D     D N DF   Y     +  D            GR
Subjt:  EAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDE----DMNVDFEDCYVPSPPVI-DPTLGEELCGTPTGR

Query:  TAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYC
        T  +G  R   KR        E +      T +Q+ +IA WP +    +   R E +  L+ +P ++  D  ++ R LLS    L  F+  P + +  +C
Subjt:  TAGAGPSRAVNKRRLSIGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYC

Query:  MEIL
          +L
Subjt:  MEIL

SwissProt top hitse value%identityAlignment
Q07762 Ribonuclease H2.8e-0444Show/hide
Query:  GVYMTWYECARQVHGFRGAVYQSYETLDDAEATYIAYTMEVDNHSSRHGD
        G+Y TW +C+ QV GF GAVY+S+ TL +A A   A+        S  GD
Subjt:  GVYMTWYECARQVHGFRGAVYQSYETLDDAEATYIAYTMEVDNHSSRHGD

Arabidopsis top hitse value%identityAlignment
AT2G24960.1 unknown protein4.0e-0618.82Show/hide
Query:  MAASASASKKEKHIWTPEEDEVLVQCLL----------HLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGP
        M+   + + + +  WTP  +   +  +L          H      W      F + + +Q  K +              ++SR   L KQY  +  ++  
Subjt:  MAASASASKKEKHIWTPEEDEVLVQCLL----------HLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGP

Query:  ACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPE
           GF W+   + +  +  ++ L+++ HP+A+  + +P   F++L L++G   A G  + +  ++  E E
Subjt:  ACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPE

AT2G24960.2 unknown protein4.2e-1123.53Show/hide
Query:  TMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVG----------GWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMG
        T A+    S + +  WTP  D  L+  L+  V  G           W      F A + +Q  K +              +++R K L++ Y  I  ++ 
Subjt:  TMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVG----------GWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMG

Query:  PACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVD-FED
           +GF W+  R  + A+ +I++ +++ HP+A+  R +  P +  L  +FGK+++ G   R      P P     ++E  + D F+D
Subjt:  PACSGFGWNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVD-FED

AT5G27260.1 unknown protein2.0e-1331.51Show/hide
Query:  KKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTF-----RAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC
        K + + W+PEE ++LVQ L+  +    WR  NGT         +  +I K        C      +  SR+K LK QY +  ++     SGFGW+   K 
Subjt:  KKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTF-----RAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKC

Query:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRA
          A  E++  +++ HP  K LR   F +F+EL ++FG+  A G  A
Subjt:  IEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGGTGTGTACATGACGTGGTACGAATGTGCCCGCCAAGTACATGGTTTTCGTGGTGCCGTTTACCAGTCGTATGAGACGTTGGACGACGCAGAAGCAACTTATAT
CGCTTACACCATGGAAGTGGACAATCATTCCAGTCGTCATGGAGACCTTCACCTTCACAATAGCTATCCTAACCGTGAACTAGATAGAAATGTAGTCAGATTGTTAGCAC
ATGTAGTGGAACATACCATGATTGTCCGTAGAAGCTATCGAATGGATGCCCAAGATAATGAAATTCAAGAATTGATCGCAATTCTTACTATCATTTGTGCGACACAGTAT
CAGTTCATCGCGACAATGGCCGCATCTGCCTCAGCTTCAAAGAAAGAAAAACACATATGGACGCCGGAGGAAGACGAAGTGCTTGTGCAGTGCCTACTGCACCTAGTCCA
AGTGGGGGGATGGCGTGCAGACAATGGGACGTTTCGGGCTGGTTACCAGAATCAGATTGGAAAGATGATGAAAGAAAGACTACCAGGATGCAACATAGTCGTGAGCCCGA
ACATTGAATCTAGAGTGAAGACCCTAAAAAAACAGTACATGGCGATAGCTGAGATGATGGGACCGGCCTGCAGTGGATTTGGTTGGAATGATGAGAGAAAATGCATAGAG
GCGGAGAAGGAGATATTCGATTTGTGGGTCGAGGGACATCCACAGGCCAAGGGCCTCCGTAACCGGCCATTCCCATGGTTCAATGAGTTGGCGCTCGTTTTCGGGAAGGA
CAGCGCTAGAGGAGTGAGAGCCCGAACCCCAATTGAGATGACACCTGAACCAGAGCCGGTAGCCGATTTGGATGAAGACATGAACGTGGATTTTGAGGATTGCTACGTCC
CAAGTCCCCCAGTTATTGATCCCACACTTGGAGAAGAATTATGTGGGACACCGACTGGTAGGACAGCTGGTGCAGGACCTTCTAGGGCAGTAAATAAGAGACGATTATCC
ATTGGGAACGTGGCGGAGGTACTGGAGAATGGATTCCAGATGACCGCACAACAAATCGAGAAGATTGCGCTCTGGCCGACCAAGAGGGACGAGCTCGAGAGGAGTCGGCG
GAAGGAGTTATATGCGGAGCTACAATCTATTCCTGGGGTGTCTATGGAGGACGGCTTGGTGGTCGCACGGGCACTACTATCAGATGAGAGGATGTTGACTCACTTTATGG
ACTTCCCTCCAGAATGGAAGTTCGACTACTGTATGGAGATCCTCGGTAGGGCATCGAGACAGCCCCCGCAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCGGTGTGTACATGACGTGGTACGAATGTGCCCGCCAAGTACATGGTTTTCGTGGTGCCGTTTACCAGTCGTATGAGACGTTGGACGACGCAGAAGCAACTTATAT
CGCTTACACCATGGAAGTGGACAATCATTCCAGTCGTCATGGAGACCTTCACCTTCACAATAGCTATCCTAACCGTGAACTAGATAGAAATGTAGTCAGATTGTTAGCAC
ATGTAGTGGAACATACCATGATTGTCCGTAGAAGCTATCGAATGGATGCCCAAGATAATGAAATTCAAGAATTGATCGCAATTCTTACTATCATTTGTGCGACACAGTAT
CAGTTCATCGCGACAATGGCCGCATCTGCCTCAGCTTCAAAGAAAGAAAAACACATATGGACGCCGGAGGAAGACGAAGTGCTTGTGCAGTGCCTACTGCACCTAGTCCA
AGTGGGGGGATGGCGTGCAGACAATGGGACGTTTCGGGCTGGTTACCAGAATCAGATTGGAAAGATGATGAAAGAAAGACTACCAGGATGCAACATAGTCGTGAGCCCGA
ACATTGAATCTAGAGTGAAGACCCTAAAAAAACAGTACATGGCGATAGCTGAGATGATGGGACCGGCCTGCAGTGGATTTGGTTGGAATGATGAGAGAAAATGCATAGAG
GCGGAGAAGGAGATATTCGATTTGTGGGTCGAGGGACATCCACAGGCCAAGGGCCTCCGTAACCGGCCATTCCCATGGTTCAATGAGTTGGCGCTCGTTTTCGGGAAGGA
CAGCGCTAGAGGAGTGAGAGCCCGAACCCCAATTGAGATGACACCTGAACCAGAGCCGGTAGCCGATTTGGATGAAGACATGAACGTGGATTTTGAGGATTGCTACGTCC
CAAGTCCCCCAGTTATTGATCCCACACTTGGAGAAGAATTATGTGGGACACCGACTGGTAGGACAGCTGGTGCAGGACCTTCTAGGGCAGTAAATAAGAGACGATTATCC
ATTGGGAACGTGGCGGAGGTACTGGAGAATGGATTCCAGATGACCGCACAACAAATCGAGAAGATTGCGCTCTGGCCGACCAAGAGGGACGAGCTCGAGAGGAGTCGGCG
GAAGGAGTTATATGCGGAGCTACAATCTATTCCTGGGGTGTCTATGGAGGACGGCTTGGTGGTCGCACGGGCACTACTATCAGATGAGAGGATGTTGACTCACTTTATGG
ACTTCCCTCCAGAATGGAAGTTCGACTACTGTATGGAGATCCTCGGTAGGGCATCGAGACAGCCCCCGCAGCCATGA
Protein sequenceShow/hide protein sequence
MPGVYMTWYECARQVHGFRGAVYQSYETLDDAEATYIAYTMEVDNHSSRHGDLHLHNSYPNRELDRNVVRLLAHVVEHTMIVRRSYRMDAQDNEIQELIAILTIICATQY
QFIATMAASASASKKEKHIWTPEEDEVLVQCLLHLVQVGGWRADNGTFRAGYQNQIGKMMKERLPGCNIVVSPNIESRVKTLKKQYMAIAEMMGPACSGFGWNDERKCIE
AEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPIEMTPEPEPVADLDEDMNVDFEDCYVPSPPVIDPTLGEELCGTPTGRTAGAGPSRAVNKRRLS
IGNVAEVLENGFQMTAQQIEKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEILGRASRQPPQP