; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018048 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018048
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold9:27841995..27844544
RNA-Seq ExpressionSpg018048
SyntenySpg018048
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015384077.1 uncharacterized protein LOC107176301 [Citrus sinensis]9.5e-1725.94Show/hide
Query:  HH---EDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGI
        HH   ED+  +L I L  +  +D ++W  D+KG++TVKS YQ+ L       A+ S  S     W +LW +N + + KI +W+   ++LP A NL ++  
Subjt:  HH---EDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGI

Query:  DINPTCR-----IPPGTPDLLECVST----------------------------PMKCDR-------SSTRQLAIPSSVT------AVHQYTMIVVWSNW
          +P C+     +      L+EC +T                            P  C         S+ +  A+  +        A H +T   V  + 
Subjt:  DINPTCR-----IPPGTPDLLECVST----------------------------PMKCDR-------SSTRQLAIPSSVT------AVHQYTMIVVWSNW

Query:  STSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLN
           W PPP++ +K+N DA+ N+     GLG ++RDS  + + T +     +  ++  EA+A+  GL+     +     +++ESDCLE++  +N
Subjt:  STSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLN

XP_023907861.1 uncharacterized protein LOC112019574 [Quercus suber]1.9e-1726.16Show/hide
Query:  ILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVL-INVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTCRIP
        ILN+PL  +   D+++W  ++KG FTVKSAY + L I   S     S  +     W+ +W +N + + +I  W+   + LP   NL+ +G++ +  C   
Subjt:  ILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVL-INVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTCRIP

Query:  PGTPDLLECVSTPM-KCD-----------------------RSSTRQLAIPSSVTAVHQYTMIVVWSNW--------------STSWNPPPQSWWKINSD
        P     LEC S  + +CD                        S   Q+    S   +  + +I  W  W              ++SW PPP  ++KIN D
Subjt:  PGTPDLLECVSTPM-KCD-----------------------RSSTRQLAIPSSVTAVHQYTMIVVWSNW--------------STSWNPPPQSWWKINSD

Query:  ASWNAAEGR-EGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKE
         + ++ +GR   +G I+ DS+G  +    K ++  +PI  +EA A+  G+  +L    +   I++E D L +++++N +
Subjt:  ASWNAAEGR-EGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKE

XP_024044510.1 uncharacterized protein LOC112100177 [Citrus clementina]1.6e-1624.92Show/hide
Query:  RKTCQRALHHEDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNL
        R+   ++     S +IL +PL      D ++W  D+ GK++ KS YQ+ +     +  + SD SK  + W  +W      + KI +W+ + ++LP   NL
Subjt:  RKTCQRALHHEDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNL

Query:  QRKGIDINPT-----CRIPPGTPDLLEC-------------------------------VSTPMKCDRSSTRQ---LAIPSSVTAVHQYTMIVVWSNWST
         ++ I + P      CR       LLEC                               V    K  R   +Q   L+  +   A+ +    + +SN   
Subjt:  QRKGIDINPT-----CRIPPGTPDLLEC-------------------------------VSTPMKCDRSSTRQ---LAIPSSVTAVHQYTMIVVWSNWST

Query:  ----------SWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELI
                  +W PP + W+K+N DA+ N +  + GLG ++R+S G  I  ++K V  R  +  +EA+A+  G+Q    ++ RP  +++ESD  E +
Subjt:  ----------SWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELI

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]2.7e-1924.35Show/hide
Query:  DSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTC
        D+ +IL IPL    ++DE++W  D++G ++VKS YQL L +      + ++ S     W +LW +    + KI +W+  N++LP+A NL ++ +   PTC
Subjt:  DSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTC

Query:  R-----IPPGTPDLLEC-------VSTPMKCDRSSTRQLAIPSSVTAVH--------QYTMIVVWSNW--------------------------------
        +     +   +  LLEC       + +P    R       I S++  +         +  + + WS W                                
Subjt:  R-----IPPGTPDLLEC-------VSTPMKCDRSSTRQLAIPSSVTAVH--------QYTMIVVWSNW--------------------------------

Query:  ---------------STSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDC
                          W PPPQ+ +K+N DA++N+     G+G ++RDS+G  +   +     +   ++ EA+A+  GLQ   N      ++++ESDC
Subjt:  ---------------STSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDC

Query:  LELIRNLN
        LE+++ +N
Subjt:  LELIRNLN

XP_030930729.1 uncharacterized protein LOC115956517 [Quercus lobata]1.6e-1625.49Show/hide
Query:  ILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTA-WKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTCRIP
        ILNIP+     +D I+W  ++KG F VKSAY +    +  +D   S    +  + WK +W +N   + +I  W++  + +P   NL +KGI ++ TC I 
Subjt:  ILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTA-WKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTCRIP

Query:  PGTPDLLE---------CVSTPMKCDRSSTRQLAIPSSVTAVHQY-----TMIVVWSNWSTSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPI
           P+ +E          V+  +  +     +L    ++  +  +      +I        +W PPP+  + IN D +  A EG  G+G I+RD +   +
Subjt:  PGTPDLLE---------CVSTPMKCDRSSTRQLAIPSSVTAVHQY-----TMIVVWSNWSTSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPI

Query:  CTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKED
            K +  R+ +   EA AM +G+     L      I++E D ++ ++ +  +D
Subjt:  CTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKED

TrEMBL top hitse value%identityAlignment
A0A1R3GC81 Reverse transcriptase1.3e-1628.87Show/hide
Query:  EDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPT
        ED + I  IPL   + +  ++W+ D  G+++V+S Y +    +G +D  +   S +   WK +W  N  P+ K  IW+++  ILP    LQ++GI+I   
Subjt:  EDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPT

Query:  CRIPPGTPDLLECVSTPMKCDRSSTRQLAIPSSVTAVHQYTMIVVWSNWST--------SWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICT
        C       ++L         D      +A+ SSVT  H   M+   +  ST        +W+PPP    K+N+DA+++ + G+ GLG ++RD   + +C 
Subjt:  CRIPPGTPDLLECVSTPMKCDRSSTRQLAIPSSVTAVHQYTMIVVWSNWST--------SWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICT

Query:  S---MKSVKDRWPINILEAKAM-WEGLQFVLNLSERPQN
        +   M+ V D     I + + M WEG   +L + E  +N
Subjt:  S---MKSVKDRWPINILEAKAM-WEGLQFVLNLSERPQN

A0A484LY53 CCHC-type domain-containing protein1.5e-1529.63Show/hide
Query:  IPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGI---DINPTCRIPP
        IPL  +SS D ++W+ D+ G +TVKSAY+ +     + DATV         WK LW +  +P+ +  IW+  N+ILP   NL  K +   D+ P C    
Subjt:  IPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGI---DINPTCRIPP

Query:  GTPDLLECVSTPMKCDRSSTRQLA-IPSSVTAVHQYTMIV---------------VWSNWST---------------SWNPPPQSWWKINSDASWNAAEG
         T  L   V  P          L     +V + H + + V               +W  WS                 W  PP ++ K+N DAS    + 
Subjt:  GTPDLLECVSTPMKCDRSSTRQLA-IPSSVTAVHQYTMIV---------------VWSNWST---------------SWNPPPQSWWKINSDASWNAAEG

Query:  REGLGWIVRDSSGSPICTSMKSVKDRWPINILEAK--AMWEGLQFVLNLSERPQNIVVESDCLELIRNLN
          GLG+IVRD  G  +  + KS+K R      EA+  A+ E L ++   S+  + ++VE+DC E++ +LN
Subjt:  REGLGWIVRDSSGSPICTSMKSVKDRWPINILEAK--AMWEGLQFVLNLSERPQNIVVESDCLELIRNLN

B8BN96 Reverse transcriptase domain-containing protein1.1e-1527.8Show/hide
Query:  HHEDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPN------------
        H+ D+  ILNI +   S +D I W  D+ G F+V+SAY+L    V  ++++ S  + I  AW+ +WK     + KI  W++ ++ L              
Subjt:  HHEDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPN------------

Query:  -APNLQRKGIDINPT---CRIPPGTPDLLECVSTPMKCDRSSTRQLAIPSSVTAVHQYTMIVVWSNWSTSWNPPPQSWWKINSDASWNAAEGREGLGWIV
           N    G    PT    R      DLL  +    + D    + +     +    +Y ++   +N    W  P   W K+N D S++A+ G+ GLG I+
Subjt:  -APNLQRKGIDINPT---CRIPPGTPDLLECVSTPMKCDRSSTRQLAIPSSVTAVHQYTMIVVWSNWSTSWNPPPQSWWKINSDASWNAAEGREGLGWIV

Query:  RDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNL
        R+S+G  I TS K ++        E +A  EGL+  ++ +  P  I VE+DC  +++ L
Subjt:  RDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNL

M5VU98 Reverse transcriptase domain-containing protein5.1e-1626.67Show/hide
Query:  DSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVL-INVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPT
        D +DI+ IPL   +  D IVW+ D+ G FTVKSAY++ L +  G +D + S  S     W+ +W      + KI  W++ +DILP   NL +KG+D+   
Subjt:  DSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVL-INVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPT

Query:  CRIPPGTPD-----LLECVSTPMKCDRSSTRQLAIPSSVTAVH-------QYTMIVVWSNWSTS-----------WNPPPQSWWKINSDASWNAAEGREG
        C       +     L  C       + S   + A      + H       QY    + +N + S           W  PP    K N D +++   GR  
Subjt:  CRIPPGTPD-----LLECVSTPMKCDRSSTRQLAIPSSVTAVH-------QYTMIVVWSNWSTS-----------WNPPPQSWWKINSDASWNAAEGREG

Query:  LGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKEDDD
        +G + RD+ G  +    KSV +       E  A  EG+   L+L     + + E D   ++  + +   D
Subjt:  LGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKEDDD

M5XHI9 Reverse transcriptase domain-containing protein2.3e-1626.67Show/hide
Query:  DSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVL-INVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPT
        D +DI+ IPL   +  D IVW+ D+ G FTVKSAY++ L +  G +D + S  S     W+ +W      + KI  W++ +DILP   NL +KG+D+   
Subjt:  DSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVL-INVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPT

Query:  CRIPPGTPD-----LLECVSTPMKCDRSSTRQLAIPSSVTAVH-------QYTMIVVWSNWSTS-----------WNPPPQSWWKINSDASWNAAEGREG
        C       +     L  C       + S   + A      + H       QY    + +N + S           W  PP    K N D +++   GRE 
Subjt:  CRIPPGTPD-----LLECVSTPMKCDRSSTRQLAIPSSVTAVH-------QYTMIVVWSNWSTS-----------WNPPPQSWWKINSDASWNAAEGREG

Query:  LGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKEDDD
        +G + RD+ G  +    KSV +       E     EG+   L+L     + + E D   ++  + +   D
Subjt:  LGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKEDDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.7e-0936Show/hide
Query:  NWSTSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQF-VLNLSE-RPQNIVVESDCLELIRNLNKED
        N S  W  PP  W K N+DA+W     R G+GWI+R+ SG  +    +++      N+LEA+   E L++ VL +S    + I+ ESD   L+  LN +D
Subjt:  NWSTSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQF-VLNLSE-RPQNIVVESDCLELIRNLNKED

AT3G09510.1 Ribonuclease H-like superfamily protein3.8e-0818.79Show/hide
Query:  ILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTC----
        I  I L ++   D+I+W+ +  G++TV+S Y L+  +  +    ++ P   +     +W +  +P+ K  +W+ L+  L     L  +G+ I+P+C    
Subjt:  ILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDINPTC----

Query:  --------------------RIPPGTPDLLECVSTPMKCDRSSTRQLAIPSSVTAVHQYTMI-VVWSNWST-----------------------------
                            R+   +    + +S   + + S+       ++++  H+   + ++W  W                               
Subjt:  --------------------RIPPGTPDLLECVSTPMKCDRSSTRQLAIPSSVTAVHQYTMI-VVWSNWST-----------------------------

Query:  --------------------SWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPIC-TSMKSVKDRWPINILEAKAMWEGLQ--FVLNLSERPQNI
                             W  PP ++ K N DA ++  +     GWI+R+  G+PI   SMK      P+   E KA+   LQ  ++   ++    +
Subjt:  --------------------SWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPIC-TSMKSVKDRWPINILEAKAMWEGLQ--FVLNLSERPQNI

Query:  VVESDCLELIRNLN
         +E DC  LI  +N
Subjt:  VVESDCLELIRNLN

AT4G29090.1 Ribonuclease H-like superfamily protein7.7e-0936.63Show/hide
Query:  WNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAM-WEGLQFVLNLSERPQNIVV-ESDCLELIRNLNKEDDDCWV
        W PPP  W K N+DA+WN    R G+GW++R+  G       +++     +   E +AM W     VL+LS    N V+ ESD   LI  LN  +D+ W 
Subjt:  WNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAM-WEGLQFVLNLSERPQNIVV-ESDCLELIRNLNKEDDDCWV

Query:  S
        S
Subjt:  S

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.4e-0426.53Show/hide
Query:  STSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKEDDD
        +T W+PP +   K N DAS +      GLGWI+R+S G+ I   M   + R      E   +   +Q       +   ++ E D   + R +N +  +
Subjt:  STSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEAKAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKEDDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACGAAAGACTTGTCAAAGAGCACTTCACCATGAAGATTCAATCGATATCCTTAACATTCCCCTCGGAGAAGCTAGCTCTAAGGATGAAATAGTTTGGAGCCTAGA
TAGAAAAGGGAAATTCACGGTGAAGAGTGCCTATCAGCTGGTGTTGATCAATGTTGGATCCAAGGATGCTACTGTCTCGGATCCTAGTAAAATTATGACAGCTTGGAAGA
GTTTATGGAAAATCAATTCTATCCCCAGGGCGAAAATTTGCATATGGAAAATTCTCAATGACATCCTCCCTAATGCTCCAAATTTACAAAGGAAAGGCATTGATATTAAC
CCTACTTGTCGGATTCCTCCCGGAACTCCGGATCTTTTGGAATGTGTTTCAACCCCAATGAAGTGCGACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCAC
TGCCGTTCACCAATACACAATGATCGTAGTGTGGAGCAATTGGTCGACTTCGTGGAACCCCCCTCCTCAATCGTGGTGGAAGATCAACTCAGATGCCTCCTGGAATGCAG
CGGAAGGAAGAGAAGGTCTGGGATGGATTGTGCGTGACTCCAGCGGATCTCCCATCTGCACCAGCATGAAATCAGTTAAAGATAGATGGCCGATCAACATTCTTGAAGCA
AAAGCTATGTGGGAAGGCCTGCAATTCGTCCTCAATTTATCAGAAAGACCGCAAAACATTGTGGTTGAATCTGATTGCTTGGAACTCATCAGGAATTTGAATAAGGAAGA
TGATGACTGTTGGGTTTCCAGAGTGGTCTTATTAGCAGGATCCTCCCGGAACTCTGGATTTTTTGGAATGTGTTTCAACTCCAATGATGCTCTACCGATCAAGCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACGAAAGACTTGTCAAAGAGCACTTCACCATGAAGATTCAATCGATATCCTTAACATTCCCCTCGGAGAAGCTAGCTCTAAGGATGAAATAGTTTGGAGCCTAGA
TAGAAAAGGGAAATTCACGGTGAAGAGTGCCTATCAGCTGGTGTTGATCAATGTTGGATCCAAGGATGCTACTGTCTCGGATCCTAGTAAAATTATGACAGCTTGGAAGA
GTTTATGGAAAATCAATTCTATCCCCAGGGCGAAAATTTGCATATGGAAAATTCTCAATGACATCCTCCCTAATGCTCCAAATTTACAAAGGAAAGGCATTGATATTAAC
CCTACTTGTCGGATTCCTCCCGGAACTCCGGATCTTTTGGAATGTGTTTCAACCCCAATGAAGTGCGACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCAC
TGCCGTTCACCAATACACAATGATCGTAGTGTGGAGCAATTGGTCGACTTCGTGGAACCCCCCTCCTCAATCGTGGTGGAAGATCAACTCAGATGCCTCCTGGAATGCAG
CGGAAGGAAGAGAAGGTCTGGGATGGATTGTGCGTGACTCCAGCGGATCTCCCATCTGCACCAGCATGAAATCAGTTAAAGATAGATGGCCGATCAACATTCTTGAAGCA
AAAGCTATGTGGGAAGGCCTGCAATTCGTCCTCAATTTATCAGAAAGACCGCAAAACATTGTGGTTGAATCTGATTGCTTGGAACTCATCAGGAATTTGAATAAGGAAGA
TGATGACTGTTGGGTTTCCAGAGTGGTCTTATTAGCAGGATCCTCCCGGAACTCTGGATTTTTTGGAATGTGTTTCAACTCCAATGATGCTCTACCGATCAAGCACTAG
Protein sequenceShow/hide protein sequence
MERKTCQRALHHEDSIDILNIPLGEASSKDEIVWSLDRKGKFTVKSAYQLVLINVGSKDATVSDPSKIMTAWKSLWKINSIPRAKICIWKILNDILPNAPNLQRKGIDIN
PTCRIPPGTPDLLECVSTPMKCDRSSTRQLAIPSSVTAVHQYTMIVVWSNWSTSWNPPPQSWWKINSDASWNAAEGREGLGWIVRDSSGSPICTSMKSVKDRWPINILEA
KAMWEGLQFVLNLSERPQNIVVESDCLELIRNLNKEDDDCWVSRVVLLAGSSRNSGFFGMCFNSNDALPIKH