CuGenDBv2

Spg035826 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg035826
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold5:40440545..40441832
RNA-Seq Expression: Spg035826
Synteny: Spg035826
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
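The genome location in this record has the form `scaffold:start..end`. A minimal sketch of how such a string could be parsed is shown below. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'scaffold:start..end' location (hypothetical helper type)."""
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Scaffold name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<scaffold>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold5:40440545..40441832'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["scaffold"], int(match["start"]), int(match["end"]))


loc = parse_location("scaffold5:40440545..40441832")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span is simply `end - start + 1`; if the database uses a different convention the property would need adjusting.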


Homology
GenBank top hits | e value | % identity
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis] | 1.8e-16 | 25.25
Query:  LQPSAAAAWKPKVSGKGIFTVKSAYNQAVNIGAKEKLLLRIVNDPK-PWEAWSQPSTFFGLDNWSAMDYWSFM---WKKMSKEELSKAAILMWTLWNSRN
        L P+A+  W+ K+ G  I             G K + ++  + D K   + W        ++     D  + +    +K  K+EL    +L WT+W S+N
Subjt:  LQPSAAAAWKPKVSGKGIFTVKSAYNQAVNIGAKEKLLLRIVNDPK-PWEAWSQPSTFFGLDNWSAMDYWSFM---WKKMSKEELSKAAILMWTLWNSRN

Query:  RCRINNQQPDSN-QIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKA
             N++ DS   IAR+ AN  + F ++ +P+ +    + ++   +W PP  GW+K+N DAA +   +  G+G ++ +S G +V A   R         
Subjt:  RCRINNQQPDSN-QIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKA

Query:  LEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILE-MSRGERIDFEFCSRSTNVLGHQLAHVATEWGDFSVFFCSSS
        +E EA+ +G+ +  R         P+ +E+D+  V  +   ++  + E+  I  +I E +    +   ++  R  NV  H LA  A E+ + SVF+  S 
Subjt:  LEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILE-MSRGERIDFEFCSRSTNVLGHQLAHVATEWGDFSVFFCSSS

Query:  P
        P
Subjt:  P

XP_015388122.1 uncharacterized protein LOC107178012 [Citrus sinensis] | 1.4e-16 | 27.05
Query:  KKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSN-QIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVL
        +K  K+EL    +L WT+W S+N     N++ DS   IAR+ AN  + F ++ +P+ +    + ++   +W PP  GW+K+N DAA +   +  G+G ++
Subjt:  KKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSN-QIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVL

Query:  CDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILE-MSRGERIDFEFCSRSTNV
         +S G +V A   +         +E EA+ +G+ +  R  +E  L   + +E+D+  V  +   ++  ++E+  I  +I E +    +   ++  R  NV
Subjt:  CDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILE-MSRGERIDFEFCSRSTNV

Query:  LGHQLAHVATEWGDFSVFFCSSSPVLEDREVGWDESIPHWFVSL
          H LA  A E+               +  V WD S P   VSL
Subjt:  LGHQLAHVATEWGDFSVFFCSSSPVLEDREVGWDESIPHWFVSL

XP_021838584.1 uncharacterized protein LOC110778326 [Spinacia oleracea] | 2.3e-16 | 25.7
Query:  KEKLLLRIVNDPKPWEAWSQPSTFFGLDNWSAMDYWSFMWK---KMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLE
        +E     +V  P   E W +      ++  + +D+  ++ +   K S++ + + A+L W +W  RN+ R+  +   S ++  S     + + ++   ++ 
Subjt:  KEKLLLRIVNDPKPWEAWSQPSTFFGLDNWSAMDYWSFMWK---KMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLE

Query:  PSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSV
         SS   +     W PPD GW K+NSDAA        G+G V+ D  G +  A C ++   WP+  +E +AI +GL   L+   +  +     VESD L +
Subjt:  PSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSV

Query:  GKVLAGEEEDLSESGLIFTEILEM-SRGERIDFEFCSRSTNVLGHQLAH
           L     + S  G+   EIL + S  + + F    R  N   H +AH
Subjt:  GKVLAGEEEDLSESGLIFTEILEM-SRGERIDFEFCSRSTNVLGHQLAH

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia] | 4.7e-17 | 32.12
Query:  NWSAMDYWSFMWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFF-EDFDK-----------VNRPYLEPSSLKSQSSHPLWRPPDRG
        +W+  D W+++   +S EE++ + ++ W +W SRNR     +  D  Q+ RS+  F   + DK            N  YL             W  P   
Subjt:  NWSAMDYWSFMWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFF-EDFDK-----------VNRPYLEPSSLKSQSSHPLWRPPDRG

Query:  WWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLS
         WKLN+DA+WS     GG+GW+LCD  G +V AG  +I     I ALE   I  GL  F+   S    R P+ +ESD++ V +++  E+ DL+
Subjt:  WWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLS

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] | 1.2e-17 | 30.61
Query:  MWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSL----KSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGG
        M  K S E+L    I  W +WN RN      +    + + + L  F  +       Y   +SL    K+ ++   W PP    W LN+DA+WS+S  RGG
Subjt:  MWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSL----KSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGG

Query:  VGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG-ERIDFEFCS
        +GW++    G +V AG   +     +K LE  AI  GL      T+ G LR PL +E+D+  V  +L  + EDL+++G +  EIL +    E + F    
Subjt:  VGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG-ERIDFEFCS

Query:  RSTNVLGHQLAHVATEWGDFSVFFCSSSPVLEDREVGWDESIPHW
        R TN   H LA  A+              VL +  + W +  P+W
Subjt:  RSTNVLGHQLAHVATEWGDFSVFFCSSSPVLEDREVGWDESIPHW

TrEMBL top hits | e value | % identity
A0A1R3GIN3 Reverse transcriptase | 4.0e-14 | 27.94
Query:  PKPWEAWSQPSTFFG--LDNWSAM-DYWSFMWKKMSK-EELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLKSQSSH
        P  +  W Q   +    L  WS+  D+W  + +K S+   L   A L+W +WN+RN+   +      + +     N   DF+  NR       L+     
Subjt:  PKPWEAWSQPSTFFG--LDNWSAM-DYWSFMWKKMSK-EELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLKSQSSH

Query:  PLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGT-SEGHLRPPLKVESDALSVGKVLAGEEE
          WRPP  G +KLNSDAA+    +  G+G V+ DSTG ++    +R+  +W + +L  E   +     +  T   GH    +  ESD+L     L     
Subjt:  PLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGT-SEGHLRPPLKVESDALSVGKVLAGEEE

Query:  DLSESGLIFTEILEM-SRGERIDFEFCSRSTNVLGHQLAHVATEWGD
         L E G +  +I ++ S  +   F    RS N   H LAH+  + G+
Subjt:  DLSESGLIFTEILEM-SRGERIDFEFCSRSTNVLGHQLAHVATEWGD

A0A6J1CP26 uncharacterized protein LOC111013412 | 6.2e-15 | 29.03
Query:  KMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLK--------SQSSHPLWRPPDRGWWKLNSDAAWSNSARRG
        K  +EE  ++ I+ W +W  RN+       P++  I  ++  +  +    N      S+ K          ++   W+PP    WKLN++AAW      G
Subjt:  KMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLK--------SQSSHPLWRPPDRGWWKLNSDAAWSNSARRG

Query:  GVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG-ERIDFEFC
        G+GW+L D  G ++ A C  I     I  LE  AI  G    LR   + H R P+ +ESD+L    +L  + +D +E   +  EI +M +  E +     
Subjt:  GVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG-ERIDFEFC

Query:  SRSTNVLGHQLAHVATE
        SR  N + H LA  A E
Subjt:  SRSTNVLGHQLAHVATE

A0A6J1CQG0 uncharacterized protein LOC111013216 | 2.3e-17 | 32.12
Query:  NWSAMDYWSFMWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFF-EDFDK-----------VNRPYLEPSSLKSQSSHPLWRPPDRG
        +W+  D W+++   +S EE++ + ++ W +W SRNR     +  D  Q+ RS+  F   + DK            N  YL             W  P   
Subjt:  NWSAMDYWSFMWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFF-EDFDK-----------VNRPYLEPSSLKSQSSHPLWRPPDRG

Query:  WWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLS
         WKLN+DA+WS     GG+GW+LCD  G +V AG  +I     I ALE   I  GL  F+   S    R P+ +ESD++ V +++  E+ DL+
Subjt:  WWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLS

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 | 1.2e-13 | 29.41
Query:  STFFGLD--NWSAMDYWSFMWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPS-SLKSQSSHPL---------
        + FF +D  NW+  +YW ++  K  +EE  ++ I+   +W  RN+        ++  I  ++  +      +N    + +   KS+  HP+         
Subjt:  STFFGLD--NWSAMDYWSFMWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPS-SLKSQSSHPL---------

Query:  -WRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDAL
         W+PP    WKLN+DAAW       G+GW+L D  G ++  GC  I     I  LE  AI  G    LR   + H R P+ +ESD+L
Subjt:  -WRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDAL

A0A6J1DNV9 uncharacterized protein LOC111022403 | 6.0e-18 | 30.61
Query:  MWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSL----KSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGG
        M  K S E+L    I  W +WN RN      +    + + + L  F  +       Y   +SL    K+ ++   W PP    W LN+DA+WS+S  RGG
Subjt:  MWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSL----KSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGG

Query:  VGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG-ERIDFEFCS
        +GW++    G +V AG   +     +K LE  AI  GL      T+ G LR PL +E+D+  V  +L  + EDL+++G +  EIL +    E + F    
Subjt:  VGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG-ERIDFEFCS

Query:  RSTNVLGHQLAHVATEWGDFSVFFCSSSPVLEDREVGWDESIPHW
        R TN   H LA  A+              VL +  + W +  P+W
Subjt:  RSTNVLGHQLAHVATEWGDFSVFFCSSSPVLEDREVGWDESIPHW

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 1.5e-05 | 30.67
Query:  QIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAG
        ++A++ A+ + + + V +       ++ + +H  WR P+RGW K N D ++ N   +   GWV+ DS GS + AG
Subjt:  QIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAG

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 2.4e-11 | 24.06
Query:  LMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPL---WRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAG
        L+W LW SRN      ++ D+ ++ R     FE++    R  LE  +   Q    L   W+ P   W K N+DA W     R G+GW+L + +G ++  G
Subjt:  LMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPL---WRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAG

Query:  CSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEE---------EDLSESGLIFTEILEMSRGERIDFEFCSRSTNVLGH
           + R+  +   E EA+R  +    R   +      +  ESDA ++  +L  ++         ED+ +          +   E + FEF  R  N +  
Subjt:  CSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEE---------EDLSESGLIFTEILEMSRGERIDFEFCSRSTNVLGH

Query:  QLAHVATEWGDF
        ++A  +  + ++
Subjt:  QLAHVATEWGDF

AT4G29090.1 Ribonuclease H-like superfamily protein | 3.5e-10 | 25.91
Query:  YWSF-------MWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLKSQ---SSHPLWRPPDRGWWKLNSDA
        YW F        W+K S+        L+W LW +RN      ++ ++ ++ R   +  E++    R   E    K Q   SS   WRPP   W K N+DA
Subjt:  YWSF-------MWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSNQIARSLANFFEDFDKVNRPYLEPSSLKSQ---SSHPLWRPPDRGWWKLNSDA

Query:  AWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG
         W+    R G+GWVL +  G +   G   + +   +   E EA+R  + +  R      +      ESD+  + ++L  +E   S    I      +S+ 
Subjt:  AWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLRPPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRG

Query:  ERIDFEFCSRSTNVLGHQLA
          + F F  R  N L  ++A
Subjt:  ERIDFEFCSRSTNVLGHQLA


Sequences
CDS sequence
ATGTTACTACAACCGTCTGCCGCTGCCGCTTGGAAACCAAAAGTTAGTGGGAAAGGCATTTTTACGGTGAAAAGTGCTTATAACCAGGCCGTTAACATTGGTGCTAAGGA
AAAGCTACTTCTTCGAATTGTGAATGATCCGAAGCCATGGGAGGCGTGGAGTCAACCGAGCACCTTTTTTGGGTTGGACAATTGGAGTGCTATGGATTATTGGAGTTTCA
TGTGGAAAAAGATGAGTAAAGAGGAGCTCAGTAAAGCTGCCATTCTTATGTGGACTCTTTGGAATTCAAGAAACAGATGTAGAATTAACAACCAACAGCCAGATTCCAAT
CAGATTGCAAGATCGCTAGCAAATTTCTTCGAGGATTTCGATAAAGTGAATAGACCGTACCTGGAGCCATCCAGTTTGAAGAGCCAGTCGAGTCACCCTTTATGGAGGCC
GCCGGACCGTGGATGGTGGAAGCTCAACTCCGATGCCGCCTGGAGCAATTCCGCAAGGAGAGGAGGCGTGGGATGGGTGCTTTGCGACTCCACCGGATCTTTAGTTGGAG
CAGGTTGCAGCCGAATTACCCGGAGCTGGCCAATCAAGGCCCTTGAAGGCGAAGCTATTCGCGTTGGCTTGAGCGCGTTTCTCCGTGGGACGAGTGAAGGCCACCTCAGA
CCTCCGTTAAAGGTCGAATCAGATGCCCTCAGTGTCGGGAAAGTGTTAGCGGGAGAGGAAGAAGATCTTTCGGAATCAGGCCTGATCTTCACTGAGATTTTAGAGATGTC
TAGGGGAGAAAGAATCGATTTTGAGTTCTGTAGTAGAAGTACCAATGTTTTGGGCCATCAGCTTGCGCATGTAGCAACTGAATGGGGGGATTTTTCTGTGTTTTTTTGCT
CTTCTTCCCCTGTTCTGGAAGATAGAGAGGTTGGTTGGGATGAGAGTATCCCTCATTGGTTTGTTTCTTTATTATCTGTGGACGTTGGTAGTTGA
mRNA sequence
ATGTTACTACAACCGTCTGCCGCTGCCGCTTGGAAACCAAAAGTTAGTGGGAAAGGCATTTTTACGGTGAAAAGTGCTTATAACCAGGCCGTTAACATTGGTGCTAAGGA
AAAGCTACTTCTTCGAATTGTGAATGATCCGAAGCCATGGGAGGCGTGGAGTCAACCGAGCACCTTTTTTGGGTTGGACAATTGGAGTGCTATGGATTATTGGAGTTTCA
TGTGGAAAAAGATGAGTAAAGAGGAGCTCAGTAAAGCTGCCATTCTTATGTGGACTCTTTGGAATTCAAGAAACAGATGTAGAATTAACAACCAACAGCCAGATTCCAAT
CAGATTGCAAGATCGCTAGCAAATTTCTTCGAGGATTTCGATAAAGTGAATAGACCGTACCTGGAGCCATCCAGTTTGAAGAGCCAGTCGAGTCACCCTTTATGGAGGCC
GCCGGACCGTGGATGGTGGAAGCTCAACTCCGATGCCGCCTGGAGCAATTCCGCAAGGAGAGGAGGCGTGGGATGGGTGCTTTGCGACTCCACCGGATCTTTAGTTGGAG
CAGGTTGCAGCCGAATTACCCGGAGCTGGCCAATCAAGGCCCTTGAAGGCGAAGCTATTCGCGTTGGCTTGAGCGCGTTTCTCCGTGGGACGAGTGAAGGCCACCTCAGA
CCTCCGTTAAAGGTCGAATCAGATGCCCTCAGTGTCGGGAAAGTGTTAGCGGGAGAGGAAGAAGATCTTTCGGAATCAGGCCTGATCTTCACTGAGATTTTAGAGATGTC
TAGGGGAGAAAGAATCGATTTTGAGTTCTGTAGTAGAAGTACCAATGTTTTGGGCCATCAGCTTGCGCATGTAGCAACTGAATGGGGGGATTTTTCTGTGTTTTTTTGCT
CTTCTTCCCCTGTTCTGGAAGATAGAGAGGTTGGTTGGGATGAGAGTATCCCTCATTGGTTTGTTTCTTTATTATCTGTGGACGTTGGTAGTTGA
Protein sequence
MLLQPSAAAAWKPKVSGKGIFTVKSAYNQAVNIGAKEKLLLRIVNDPKPWEAWSQPSTFFGLDNWSAMDYWSFMWKKMSKEELSKAAILMWTLWNSRNRCRINNQQPDSN
QIARSLANFFEDFDKVNRPYLEPSSLKSQSSHPLWRPPDRGWWKLNSDAAWSNSARRGGVGWVLCDSTGSLVGAGCSRITRSWPIKALEGEAIRVGLSAFLRGTSEGHLR
PPLKVESDALSVGKVLAGEEEDLSESGLIFTEILEMSRGERIDFEFCSRSTNVLGHQLAHVATEWGDFSVFFCSSSPVLEDREVGWDESIPHWFVSLLSVDVGS
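When copying the wrapped sequences above out of the record, a quick length check can catch lost or duplicated lines. The sketch below assumes that the CDS includes a terminal stop codon and that the protein sequence omits it, so the CDS should be three nucleotides per residue plus three for the stop. The function names are hypothetical and the check does not translate the sequence.

```python
def join_wrapped(lines: list[str]) -> str:
    """Join a line-wrapped sequence into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def cds_matches_protein_length(cds: str, protein: str) -> bool:
    """Return True if the CDS length is consistent with the protein length.

    Assumption: the CDS ends with one stop codon that is not represented
    in the protein string, so len(cds) == 3 * (len(protein) + 1).
    """
    return len(cds) % 3 == 0 and len(cds) == 3 * (len(protein) + 1)


# Toy example (not taken from the record): ATG GCT TGA encodes M, A, stop.
toy_cds = join_wrapped(["ATGG", "CTTGA"])
toy_protein = "MA"
print(cds_matches_protein_length(toy_cds, toy_protein))
```

A length check of this kind only detects truncation or duplication; confirming that the CDS actually encodes the listed protein would require translating it with a codon table.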