; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028143 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028143
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold2:45230643..45237345
RNA-Seq ExpressionSpg028143
SyntenySpg028143
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CCA66040.1 hypothetical protein [Beta vulgaris subsp. vulgaris]1.0e-4335.57Show/hide
Query:  FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDK
        F GDFNEI   +EK+GG PR    M  FRE +D C ++D+GYVGN FT +RG+  S  I+ERLDR + N E+   F + +V HL  + SDH P+L+    
Subjt:  FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDK

Query:  STQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLK-AEKD
        +   ++ NK    +FE  W++  EC  IV + WN   G+D+     +++     L+ W  +   G + K   +    +  +++ DPD  +    +    D
Subjt:  STQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLK-AEKD

Query:  LEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDL
        L+ +   EE +W  R+R + +R GD+NTK+FH+KASQR++RN I  +L E+G W++ +E++  +   YF+ LF++ +P    +++ALEG+S  ++ D+
Subjt:  LEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDL

XP_010670096.1 PREDICTED: uncharacterized protein LOC104887198 [Beta vulgaris subsp. vulgaris]3.5e-4436.28Show/hide
Query:  WKRRARMYRPESPKERQTRMEKRK--FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQ
        W  R   Y+      R   M +      GDFNEIL   EK+GG PR   +M  FR +VD C L D+GY G  FT +RG+  S  ++ERLDRF+ + ++  
Subjt:  WKRRARMYRPESPKERQTRMEKRK--FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQ

Query:  VFHNLKVHHLHFHASDHIPILVSIDKSTQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKK
        +F  + V H+  + SDH PIL+S   S   + RNKK   RFE  W++ PEC ++V   W    G++V     ++  C E L+ W     G    K  D +
Subjt:  VFHNLKVHHLHFHASDHIPILVSIDKSTQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKK

Query:  SQEIKNMERNDPDYPSQALL----KAEKDLEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKN
            + +  N   YP  A+L    +  K+L+ L ++EE +W  R+R + LR GD+NT +FH KASQRR  N I G+  E+  W + +E + ++ S YF N
Subjt:  SQEIKNMERNDPDYPSQALL----KAEKDLEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKN

Query:  LFSSSAPQKEILDIALEGISLTITIDLKGRPVSEILNEQ
        LFS+  P    ++ ALEG+   IT D+     +E  +E+
Subjt:  LFSSSAPQKEILDIALEGISLTITIDLKGRPVSEILNEQ

XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]3.1e-4535.29Show/hide
Query:  FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDK
        F GDFNE+L   E +GG+  +   M  FRE VD+  LRD+G+ G  +T  RG   +  I+ERLDRF+ + ++   F  + V H+  + SDH PI+V +  
Subjt:  FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDK

Query:  STQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL
          +++K+ KK+  RF  +W+    C+ +V   W+   G   EA   +I A  + L  W+ + L   +G+ +    +EIK ++ +      + L++    L
Subjt:  STQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL

Query:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGI
        + LLE++E +W +RSR   ++ GD+NTK+FH+KASQR++RN I G+  E   W +D E + ++   Y+KNLF+SS P  E L   L+ +
Subjt:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGI

XP_010688579.1 PREDICTED: uncharacterized protein LOC104902491 [Beta vulgaris subsp. vulgaris]2.0e-4436.22Show/hide
Query:  FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDK
        F GDFNEI+   EK+GG PR    M  FRE++D C+++D+GY G  FT +RG+  +  I+ERLDR + N E+  +F + ++ HL  + SDH P+L+    
Subjt:  FRGDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDK

Query:  STQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLK----A
        +    +  K    +FE  W++  EC  IV D W   +G+D+   G ++      L+ W       T G    +K + +  + R     P    L+     
Subjt:  STQKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLK----A

Query:  EKDLEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTIT--
          DL+ + + EE +W  R+R + LR GD+NTK+FH+KASQR+ RN IKG+L E+G W++ K+++G+I S+YF+ LFSS  P    ++ ALEG+   +T  
Subjt:  EKDLEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTIT--

Query:  --IDLKGRPVSE
          ++L   P  E
Subjt:  --IDLKGRPVSE

XP_021845434.1 uncharacterized protein LOC110785307 [Spinacia oleracea]2.0e-4437.12Show/hide
Query:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST
        GDFNEIL   EK+GG  R   QM  FRE++D+C++RD+G+ G+ FT +RG+     ++ERLDRF+ N  +   F   +V HL  + SDH+P+L + D   
Subjt:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST

Query:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDLEA
         + +R ++R  +FE  W++  EC ++V   WN   G  + A   +I  C   L+ W      G+I + +    +E++ ++   PD     +L+   +L +
Subjt:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDLEA

Query:  LLEE----EEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDL
         L+E    EE +W  R+R + LR GD+NTK+FH+KASQR+KRN IKG+  ++G W    EKV  I  DYF+ LF++  P     + AL G+S  +T ++
Subjt:  LLEE----EEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDL

TrEMBL top hitse value%identityAlignment
A0A2N9ERX7 CCHC-type domain-containing protein1.0e-4635.16Show/hide
Query:  GDFNEILLDKEKKGGKPREASQMIR-FRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKS
        GDFNE LLD+ ++ G  R     IR FRE++D C+L D+G+VGN FT  +G + S+ I ERLDR + +  +   F   KV HL   +SDH P+L+ I ++
Subjt:  GDFNEILLDKEKKGGKPREASQMIR-FRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKS

Query:  TQKKKRNKKRPARFEESWIAFPECKDIVMDRWNR--DQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKD
        T K+K  KKR   F+  WI   +CK ++   W      G  +     K+ +C + L  W+ +R  G++   + +K ++++ +    P      ++  + +
Subjt:  TQKKKRNKKRPARFEESWIAFPECKDIVMDRWNR--DQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKD

Query:  LEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKG
        L  LLE+EE FW  RSR  W+R GDRN K+FH + +QRRK N IK +L  +G W+ ++ ++  +A DY +N+F+S  P  E++D  L G+   +T D+  
Subjt:  LEALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKG

Query:  RPVSEILNEQ
          + E   ++
Subjt:  RPVSEILNEQ

A0A2N9GPZ7 Reverse transcriptase domain-containing protein6.8e-4634.95Show/hide
Query:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST
        GDFNEIL + E+ G   R   Q+  FRE+V  C L D+GYVGNS+T RR    +  +  RLDR M +  +   +    V HL    SDH PIL+ I    
Subjt:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST

Query:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNR--DQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL
          K+  KK+  RFE  WI   +C++++   W     +G  +     K+  C   L  W+ ER  G++  ++ +K ++++++    P   S  +L+ + DL
Subjt:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNR--DQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL

Query:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR
          LLE+EE FW+ RSR  W+  GD+NTK+FH + ++RR+ N I G+   DG W+ +K K+ +IA DYF+ +F+SS P  E +   L+G+   +T  +  +
Subjt:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR

Query:  PVSEILNEQ
          +E   ++
Subjt:  PVSEILNEQ

A0A2N9HE04 Reverse transcriptase domain-containing protein2.9e-4434.67Show/hide
Query:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST
        GDFNEIL   EK+G + R   +M  FRE V++C+  D+GY G  FT          +KERLDR +    +  +F+ + V HL    SDHIPILV    + 
Subjt:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST

Query:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAF--GRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL
        + + RNK+R  RFEE W   P+C+ ++   W  + G+    F    KI  C  GLA W+ +  GG+    +  + + ++ +  +D       +   ++++
Subjt:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAF--GRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL

Query:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR
         +LL  +E  WK RSR  WL+ GD NTK+FHN A+QR++ N+I+G+L+E G W  +  ++  I+  YFK++F+SS P +  ++ A+E +   +  ++  R
Subjt:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR

A0A2N9IJF6 Uncharacterized protein2.9e-4434.67Show/hide
Query:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST
        GDFNEIL   EK+G + R   +M  FRE V++C+  D+GY G  FT          +KERLDR +    +  +F+ + V HL    SDHIPILV    + 
Subjt:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST

Query:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAF--GRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL
        + + RNK+R  RFEE W   P+C+ ++   W  + G+    F    KI  C  GLA W+ +  GG+    +  + + ++ +  +D       +   ++++
Subjt:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNRDQGKDVEAF--GRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL

Query:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR
         +LL  +E  WK RSR  WL+ GD NTK+FHN A+QR++ N+I+G+L+E G W  +  ++  I+  YFK++F+SS P +  ++ A+E +   +  ++  R
Subjt:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR

A0A2N9IPS8 Reverse transcriptase domain-containing protein6.8e-4634.95Show/hide
Query:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST
        GDFNEIL + E+ G   R   Q+  FRE+V  C L D+GYVGNS+T RR    +  +  RLDR M +  +   +    V HL    SDH PIL+ I    
Subjt:  GDFNEILLDKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKST

Query:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNR--DQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL
          K+  KK+  RFE  WI   +C++++   W     +G  +     K+  C   L  W+ ER  G++  ++ +K ++++++    P   S  +L+ + DL
Subjt:  QKKKRNKKRPARFEESWIAFPECKDIVMDRWNR--DQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDL

Query:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR
          LLE+EE FW+ RSR  W+  GD+NTK+FH + ++RR+ N I G+   DG W+ +K K+ +IA DYF+ +F+SS P  E +   L+G+   +T  +  +
Subjt:  EALLEEEEKFWKIRSREDWLRWGDRNTKWFHNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGR

Query:  PVSEILNEQ
          +E   ++
Subjt:  PVSEILNEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.6e-0723.75Show/hide
Query:  ILWSIWQHKNPVLQNEAIPDANAIIRTIERVSISGNESEDFYQEKPIKKPSVRPK-SLTSHEEWIPPKDRCWKINVDASKDSKRDCGGIGWILHDSSGST
        +LW +W+ +N ++      DA  ++R       +  + E++   + ++  +  P+       +W  P  +  K N DA+   +    GIGWIL + SG  
Subjt:  ILWSIWQHKNPVLQNEAIPDANAIIRTIERVSISGNESEDFYQEKPIKKPSVRPK-SLTSHEEWIPPKDRCWKINVDASKDSKRDCGGIGWILHDSSGST

Query:  ISLGFKKINKDWPVKTLELKAIQEGLLNLPSLKAMHPDLVFPAIVIESDAAGVVELLNKE
        + +G + + +   V   EL+A++  +L +           +  I+ ESDA  +V LLN +
Subjt:  ISLGFKKINKDWPVKTLELKAIQEGLLNLPSLKAMHPDLVFPAIVIESDAAGVVELLNKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGTACTCCCAAGACAATCATTGAGCCACCGAAAGGAAAGTCTGAGATTAAAATTAAAGAGAGTTCAGGAAAAGACAAAAATGGACCATCATGGTCGTATGGGCC
TGAAAGAGAAATGGCAAAAGCAATGGAAATTGACCATGAAGTCGAATATGGAGAAAGAGATATAGAGGAACAAAAAAGCACTCAGAAAAAAGAAGACGAGGGGAGCACCA
TGAGAAAATGGAAAAGGAGAGCTCGAATGTACAGACCTGAATCACCAAAAGAAAGGCAAACTAGAATGGAAAAAAGAAAATTCAGGGGAGACTTTAATGAAATTCTGCTA
GATAAAGAAAAGAAAGGAGGCAAGCCCAGAGAAGCATCCCAAATGATCAGGTTCAGGGAATCCGTGGACAAATGCAAGCTGAGGGATATCGGGTATGTGGGAAATAGTTT
CACTAGGAGAAGAGGAAGTCAAGATAGCAAATGCATTAAAGAGAGACTCGACAGGTTCATGGTCAATAGTGAGTTTGACCAAGTCTTCCACAACCTGAAGGTCCATCACC
TTCACTTCCATGCTTCTGACCATATACCGATATTGGTTTCTATTGACAAGAGCACTCAGAAAAAGAAAAGAAATAAAAAGAGACCAGCTAGGTTTGAGGAATCTTGGATA
GCCTTCCCGGAATGCAAAGATATAGTTATGGATAGGTGGAACAGAGACCAAGGTAAAGATGTGGAAGCTTTTGGTAGAAAAATCAATGCTTGTCTTGAAGGGTTGGCCCA
CTGGAATTATGAAAGACTTGGTGGAACTATTGGAAAGGCTGTTGATAAGAAAAGTCAGGAAATTAAGAACATGGAAAGAAATGACCCTGATTACCCCTCTCAAGCTTTAT
TAAAAGCGGAAAAAGATCTGGAGGCTTTGCTAGAAGAAGAAGAAAAGTTTTGGAAAATTAGATCTAGAGAGGATTGGCTTAGATGGGGTGATAGAAACACCAAATGGTTC
CACAATAAAGCAAGTCAAAGAAGAAAGAGGAATGAAATCAAAGGGGTCCTTAGTGAAGATGGCAGTTGGGAAGAAGATAAAGAGAAAGTTGGCAAAATAGCCTCGGACTA
TTTCAAAAACCTTTTTTCCTCATCAGCTCCCCAAAAAGAAATCCTTGACATTGCTTTGGAAGGGATCAGTCTGACCATCACAATCGATCTTAAAGGAAGACCTGTTAGCG
AGATCTTAAACGAGCAAGGGAATTGGAAAGAGGATCTAGTCATGGAGAATTTCTCAGAGAGGGATGCATCCACTATCTTAAACACTCGTTCTAGCGGTTTCAAGGCTCAG
GAAGAATCAGCCATTCATGTGTTCTGGAATTGCAAATTAGCTAAAAAGTTCAAGGACAACATGGGAAACGAAGAACTGGAAAAAGCAATTATAATTCTATGGAGTATATG
GCAGCACAAAAACCCAGTTTTACAAAATGAAGCCATTCCAGATGCGAATGCAATTATCAGAACCATAGAAAGAGTTAGCATCTCAGGAAATGAGAGCGAGGATTTTTACC
AGGAGAAGCCGATCAAGAAGCCCTCCGTTCGACCGAAGAGCCTCACGAGTCACGAAGAGTGGATTCCGCCGAAGGATCGCTGTTGGAAGATCAATGTGGACGCCTCAAAA
GACTCCAAGCGTGACTGTGGAGGTATTGGTTGGATCCTCCATGACTCTTCAGGATCTACGATCAGCTTGGGGTTCAAGAAAATTAACAAAGATTGGCCTGTGAAGACGTT
AGAGCTAAAAGCGATTCAAGAAGGTCTGCTGAACTTACCTTCCTTGAAAGCAATGCATCCAGACTTGGTCTTTCCCGCCATTGTTATCGAGTCGGATGCGGCTGGTGTAG
TCGAACTCCTCAACAAAGAAAGTGTGGATCTTATAGAAAACTCCTTGCAGAGGAAATCGTCCAACTGTGTCGTAATTTGGGAGGAATTTCCATTGATTTTTGCCCAAGAT
CGAAGAATGGCGCTGCTCATGTTTTGGCGCACGCAGCGATTTCCCCTCCCCCTGATTTTTATTCTGCTAATTTTTTTGGTGCTTCTTCTACTTCAGAAGAAGTCGACTTC
AGTTGGGTCCCCCCCCCCATGCTGGGTTATTGGGCTGCTAAGTGATGTTTGTAAGTCTCGTTTCGTGAGCCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGTACTCCCAAGACAATCATTGAGCCACCGAAAGGAAAGTCTGAGATTAAAATTAAAGAGAGTTCAGGAAAAGACAAAAATGGACCATCATGGTCGTATGGGCC
TGAAAGAGAAATGGCAAAAGCAATGGAAATTGACCATGAAGTCGAATATGGAGAAAGAGATATAGAGGAACAAAAAAGCACTCAGAAAAAAGAAGACGAGGGGAGCACCA
TGAGAAAATGGAAAAGGAGAGCTCGAATGTACAGACCTGAATCACCAAAAGAAAGGCAAACTAGAATGGAAAAAAGAAAATTCAGGGGAGACTTTAATGAAATTCTGCTA
GATAAAGAAAAGAAAGGAGGCAAGCCCAGAGAAGCATCCCAAATGATCAGGTTCAGGGAATCCGTGGACAAATGCAAGCTGAGGGATATCGGGTATGTGGGAAATAGTTT
CACTAGGAGAAGAGGAAGTCAAGATAGCAAATGCATTAAAGAGAGACTCGACAGGTTCATGGTCAATAGTGAGTTTGACCAAGTCTTCCACAACCTGAAGGTCCATCACC
TTCACTTCCATGCTTCTGACCATATACCGATATTGGTTTCTATTGACAAGAGCACTCAGAAAAAGAAAAGAAATAAAAAGAGACCAGCTAGGTTTGAGGAATCTTGGATA
GCCTTCCCGGAATGCAAAGATATAGTTATGGATAGGTGGAACAGAGACCAAGGTAAAGATGTGGAAGCTTTTGGTAGAAAAATCAATGCTTGTCTTGAAGGGTTGGCCCA
CTGGAATTATGAAAGACTTGGTGGAACTATTGGAAAGGCTGTTGATAAGAAAAGTCAGGAAATTAAGAACATGGAAAGAAATGACCCTGATTACCCCTCTCAAGCTTTAT
TAAAAGCGGAAAAAGATCTGGAGGCTTTGCTAGAAGAAGAAGAAAAGTTTTGGAAAATTAGATCTAGAGAGGATTGGCTTAGATGGGGTGATAGAAACACCAAATGGTTC
CACAATAAAGCAAGTCAAAGAAGAAAGAGGAATGAAATCAAAGGGGTCCTTAGTGAAGATGGCAGTTGGGAAGAAGATAAAGAGAAAGTTGGCAAAATAGCCTCGGACTA
TTTCAAAAACCTTTTTTCCTCATCAGCTCCCCAAAAAGAAATCCTTGACATTGCTTTGGAAGGGATCAGTCTGACCATCACAATCGATCTTAAAGGAAGACCTGTTAGCG
AGATCTTAAACGAGCAAGGGAATTGGAAAGAGGATCTAGTCATGGAGAATTTCTCAGAGAGGGATGCATCCACTATCTTAAACACTCGTTCTAGCGGTTTCAAGGCTCAG
GAAGAATCAGCCATTCATGTGTTCTGGAATTGCAAATTAGCTAAAAAGTTCAAGGACAACATGGGAAACGAAGAACTGGAAAAAGCAATTATAATTCTATGGAGTATATG
GCAGCACAAAAACCCAGTTTTACAAAATGAAGCCATTCCAGATGCGAATGCAATTATCAGAACCATAGAAAGAGTTAGCATCTCAGGAAATGAGAGCGAGGATTTTTACC
AGGAGAAGCCGATCAAGAAGCCCTCCGTTCGACCGAAGAGCCTCACGAGTCACGAAGAGTGGATTCCGCCGAAGGATCGCTGTTGGAAGATCAATGTGGACGCCTCAAAA
GACTCCAAGCGTGACTGTGGAGGTATTGGTTGGATCCTCCATGACTCTTCAGGATCTACGATCAGCTTGGGGTTCAAGAAAATTAACAAAGATTGGCCTGTGAAGACGTT
AGAGCTAAAAGCGATTCAAGAAGGTCTGCTGAACTTACCTTCCTTGAAAGCAATGCATCCAGACTTGGTCTTTCCCGCCATTGTTATCGAGTCGGATGCGGCTGGTGTAG
TCGAACTCCTCAACAAAGAAAGTGTGGATCTTATAGAAAACTCCTTGCAGAGGAAATCGTCCAACTGTGTCGTAATTTGGGAGGAATTTCCATTGATTTTTGCCCAAGAT
CGAAGAATGGCGCTGCTCATGTTTTGGCGCACGCAGCGATTTCCCCTCCCCCTGATTTTTATTCTGCTAATTTTTTTGGTGCTTCTTCTACTTCAGAAGAAGTCGACTTC
AGTTGGGTCCCCCCCCCCATGCTGGGTTATTGGGCTGCTAAGTGATGTTTGTAAGTCTCGTTTCGTGAGCCTTTAA
Protein sequenceShow/hide protein sequence
MVCTPKTIIEPPKGKSEIKIKESSGKDKNGPSWSYGPEREMAKAMEIDHEVEYGERDIEEQKSTQKKEDEGSTMRKWKRRARMYRPESPKERQTRMEKRKFRGDFNEILL
DKEKKGGKPREASQMIRFRESVDKCKLRDIGYVGNSFTRRRGSQDSKCIKERLDRFMVNSEFDQVFHNLKVHHLHFHASDHIPILVSIDKSTQKKKRNKKRPARFEESWI
AFPECKDIVMDRWNRDQGKDVEAFGRKINACLEGLAHWNYERLGGTIGKAVDKKSQEIKNMERNDPDYPSQALLKAEKDLEALLEEEEKFWKIRSREDWLRWGDRNTKWF
HNKASQRRKRNEIKGVLSEDGSWEEDKEKVGKIASDYFKNLFSSSAPQKEILDIALEGISLTITIDLKGRPVSEILNEQGNWKEDLVMENFSERDASTILNTRSSGFKAQ
EESAIHVFWNCKLAKKFKDNMGNEELEKAIIILWSIWQHKNPVLQNEAIPDANAIIRTIERVSISGNESEDFYQEKPIKKPSVRPKSLTSHEEWIPPKDRCWKINVDASK
DSKRDCGGIGWILHDSSGSTISLGFKKINKDWPVKTLELKAIQEGLLNLPSLKAMHPDLVFPAIVIESDAAGVVELLNKESVDLIENSLQRKSSNCVVIWEEFPLIFAQD
RRMALLMFWRTQRFPLPLIFILLIFLVLLLLQKKSTSVGSPPPCWVIGLLSDVCKSRFVSL