CuGenDBv2

Spg008630 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008630
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold10:37490994..37497365
RNA-Seq Expression: Spg008630
Synteny: Spg008630
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
                  IPR026960 - Reverse transcriptase zinc-binding domain
                  IPR036397 - Ribonuclease H superfamily
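
The Genome location field uses a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention), the string can be split and the genomic span computed as follows. The function name `parse_location` is hypothetical and not part of CuGenDBv2.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into its parts and the span length.

    Assumes 1-based, inclusive coordinates (not stated in the record).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return seqid, start, end, end - start + 1

# Location string copied from the record above.
print(parse_location("scaffold10:37490994..37497365"))
```

Under that assumption the gene spans 6372 bp on scaffold10.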


Homology
GenBank top hits (e value, % identity, alignment)
TXG72826.1 hypothetical protein EZV62_001405 [Acer yangbiense] (e value: 2.4e-25; % identity: 26.37)
Query:  GRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAIQNILPT
        G W ES+IR+SF+  +A+ II++P  S  SKD ++W     G +++RS Y+ + +       + L  S+S  WWK FW L   PK K+  W+A  N +P 
Subjt:  GRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAIQNILPT

Query:  QINIINRGIDTNPNCFLCRSKGEFVDHVIWECK---------------ISRDSGDPNERWNELMDLLIEEDRSKAINIIWGIWNHRN--------TPPPR
        + N++  G+D    C +C    E   H +W C                 S D     E        +   D  +   I W +W+  N         P   
Subjt:  QINIINRGIDTNPNCFLCRSKGEFVDHVIWECK---------------ISRDSGDPNERWNELMDLLIEEDRSKAINIIWGIWNHRN--------TPPPR

Query:  HCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESI-ISLLVLMGGPFPRITIKSDSSEEDIAILNDEYSM
         C  ++   + F+D  V G+G +ARDS   ++ +  R V+  +  +  E  AI E + +++        PR++ K       + ++ND+ S+
Subjt:  HCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESI-ISLLVLMGGPFPRITIKSDSSEEDIAILNDEYSM

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 9.1e-25; % identity: 32.8)
Query:  LID-DEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLA--QNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWR
        L+D +EG W+  ++R+ F P +A+GI++IP+G    +D +IW  +  G ++VRS Y +A   NP   +  S   S + + WW GFWK+    K K+  WR
Subjt:  LID-DEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLA--QNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWR

Query:  AIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNERWNELMDLLIEEDRSKAIN---------IIWGIWNHRN
           + LPT  N+  RG++    C+ C   GE   H+ W CK + ++   N ++ +L   LI  +  ++++         +IWG+WN RN
Subjt:  AIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNERWNELMDLLIEEDRSKAIN---------IIWGIWNHRN

XP_023905045.1 uncharacterized protein LOC112016795 [Quercus suber] (e value: 1.8e-25; % identity: 29.15)
Query:  KLIDD-EGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAE-KASPLDSSKSKAWWKGFWKLKSTPKEKICAWR
        +LID+  G W   +++  F+P DA  I+ IP  S+ ++D +IW   PKGTF V SAY +A + + ++ K    D+S    +W+  W L+   K K  AWR
Subjt:  KLIDD-EGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAE-KASPLDSSKSKAWWKGFWKLKSTPKEKICAWR

Query:  AIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRD----SGDPNE----RWNELMDLLI---------EEDRSKAINIIWGIWNHRN---
        A +NILPT+ N+ +RG+  +P C  C    E   H+ W+C+ + +    +G P +     + + +DLL          ++     I I W +W +RN   
Subjt:  AIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRD----SGDPNE----RWNELMDLLI---------EEDRSKAINIIWGIWNHRN---

Query:  ----------------------------------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARD
                                                 PP    +K+NTDAA F++ G  GIG + RD
Subjt:  ----------------------------------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARD

XP_023906648.1 uncharacterized protein LOC112018352 [Quercus suber] (e value: 9.1e-25; % identity: 32.19)
Query:  LID-DEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPT-NAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRA
        LID D  RW+  +I+  F+P +A+ I+NIP+     +D IIW  + KG F+V+SAYY+A N   N+E+         +  WK  W L  + K KI  WRA
Subjt:  LID-DEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPT-NAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRA

Query:  IQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNE--------RWNELMDLLIE-------EDRSKAINIIWGIW----------
          + LPT +N+  RGI  N  C  C  + E + H I +C++++   D  E        R  ++ D+ ++       +D      + W IW          
Subjt:  IQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNE--------RWNELMDLLIE-------EDRSKAINIIWGIW----------

Query:  ---NHRNTPPPRHCWKINTDAAWFEDRGVGGIG
           N R TPPP   +K+N D A  ED     +G
Subjt:  ---NHRNTPPPRHCWKINTDAAWFEDRGVGGIG

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata] (e value: 2.8e-26; % identity: 28.2)
Query:  DKLIDDEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTN-AEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWR
        D L  ++G W+  +I   F+P +A+ I +IP+ +R   D +IW   P G F VRSAY LA N  +   K +P D+SK +++W+  W +    K +   WR
Subjt:  DKLIDDEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTN-AEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWR

Query:  AIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISR-----------DSGDPNERWNELMDLLI------EEDRSKAINIIWGIWNHRN---
        A +N LPT+ N++ R I  +  C  C+   E V HV+WEC+ +R           D G  +  + ++M  LI      EE  ++A    W IW++RN   
Subjt:  AIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISR-----------DSGDPNERWNELMDLLI------EEDRSKAINIIWGIWNHRN---

Query:  ------------------------------------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAI
                                                  TPP    +KIN D A F  +   G+G + RDS+  L     R+++       +E KA+
Subjt:  ------------------------------------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAI

Query:  KESII
        +  ++
Subjt:  KESII

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EYC3 Reverse transcriptase domain-containing protein (e value: 2.0e-25; % identity: 24.86)
Query:  IDENDKLID-DEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKI
        I   + LID D   WK  +++E F+P +A  I+ IPL  R   D ++WG   +G + VRS Y+L  N  N ++  P D++K    WK  W L+   K + 
Subjt:  IDENDKLID-DEGRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKI

Query:  CAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECK--------ISRDSGDPNERWNELMDLLIEEDRSKAIN-------IIWGIWNHRN-
          WRA  + LPT+ N+ +R I  +P C  C  + E   H +W+CK        I          + + +DL+ +   + + N       I W IW  RN 
Subjt:  CAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECK--------ISRDSGDPNERWNELMDLLIEEDRSKAIN-------IIWGIWNHRN-

Query:  ----------------------------------------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLE
                                                       PP    +K+N D A F DR   GIG + R+    ++    +R+    +++ +E
Subjt:  ----------------------------------------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLE

Query:  MKAIKESIISLLVLMGGPFPRITIKSDSSEEDIAIL-----NDEYSMDLTEVRTLAKEAKSL
          A + +I          F  I ++ DS+    AIL     +  Y   + ++R +A+  +S+
Subjt:  MKAIKESIISLLVLMGGPFPRITIKSDSSEEDIAIL-----NDEYSMDLTEVRTLAKEAKSL

A0A2N9F2A9 RNase H domain-containing protein (e value: 3.0e-26; % identity: 27.11)
Query:  LIDDEGR-WKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAI
        LID + R WK  ++ E F+P +A  I+ IPL  R   D ++WG    G ++VRS Y    N ++ E     D ++    WK  W L   PK +   WRA 
Subjt:  LIDDEGR-WKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAI

Query:  QNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECK--------ISRDSGDPNERWNELMDLL------IEEDRSKAINII-WGIWNHRN-------
         N LPT+ N+ +R I  +P+C  C  + E   H +W+CK        IS  S      +   +DLL      +     +  ++I W IW  RN       
Subjt:  QNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECK--------ISRDSGDPNERWNELMDLL------IEEDRSKAINII-WGIWNHRN-------

Query:  ----------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESIISLLVLMGGPFPRITIKSDSSE
                         PP +  +K+N D A F D    GIG + R+    ++ +   R+    +++ LE  A + S I     +G  F +  ++ DS  
Subjt:  ----------------TPPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESIISLLVLMGGPFPRITIKSDSSE

Query:  EDIAILNDE-----YSMDLTEVRTLAKEAKSL
           A+L  E     Y   + +++ +A+  +S+
Subjt:  EDIAILNDE-----YSMDLTEVRTLAKEAKSL

A0A2N9IWN7 Uncharacterized protein (e value: 1.5e-25; % identity: 30.34)
Query:  RWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAIQNILPTQ
        +WKE +I   F+P +A  I+ IPL  + ++D IIW   P G F++RSAY+        +  S  ++S     W   W L   PK +   WRA ++ LPT+
Subjt:  RWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAIQNILPTQ

Query:  INIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNERWNE---------------LMDLLIEEDRSKAINIIWGIWNHRN-------------T
         N+  R +  +P C  C S  E + HVIW C ++    + +  +++               +MD   EE +       W +W HRN             +
Subjt:  INIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNERWNE---------------LMDLLIEEDRSKAINIIWGIWNHRN-------------T

Query:  PPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRS
        PP R  WKIN + A+ E   + GIG M  D   S
Subjt:  PPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRS

A0A2N9IYL5 RNase H domain-containing protein (e value: 6.8e-26; % identity: 27.55)
Query:  LIDDEGR-WKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAI
        LID + R WK  ++ E F+P +A  I+ IPL  R   D ++WG    G ++VRS Y    N ++ E     D ++    WK  W L   PK +   WRA 
Subjt:  LIDDEGR-WKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAI

Query:  QNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNERWNELMDLLIEEDRSKAINIIWGIWNHRNTPPPRHCWKINTDAAWFEDRGV
         N LPT+ N+ +R I  +P+C  C  + E   H +W+CK+ +        W  +      ++   ++ ++   WN    PP +  +K+N D A F D   
Subjt:  QNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISRDSGDPNERWNELMDLLIEEDRSKAINIIWGIWNHRNTPPPRHCWKINTDAAWFEDRGV

Query:  GGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESIISLLVLMGGPFPRITIKSDSSEEDIAILNDE-----YSMDLTEVRTLAKEAKSL
         GIG + R+    ++ +   R+    +++ LE  A + S I     +G  F +  ++ DS     A+L  E     Y   + +++ +A+  +S+
Subjt:  GGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESIISLLVLMGGPFPRITIKSDSSEEDIAILNDE-----YSMDLTEVRTLAKEAKSL

A0A5C7IWD5 Uncharacterized protein (e value: 1.2e-25; % identity: 26.37)
Query:  GRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAIQNILPT
        G W ES+IR+SF+  +A+ II++P  S  SKD ++W     G +++RS Y+ + +       + L  S+S  WWK FW L   PK K+  W+A  N +P 
Subjt:  GRWKESMIRESFIPADAEGIINIPLGSRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAIQNILPT

Query:  QINIINRGIDTNPNCFLCRSKGEFVDHVIWECK---------------ISRDSGDPNERWNELMDLLIEEDRSKAINIIWGIWNHRN--------TPPPR
        + N++  G+D    C +C    E   H +W C                 S D     E        +   D  +   I W +W+  N         P   
Subjt:  QINIINRGIDTNPNCFLCRSKGEFVDHVIWECK---------------ISRDSGDPNERWNELMDLLIEEDRSKAINIIWGIWNHRN--------TPPPR

Query:  HCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESI-ISLLVLMGGPFPRITIKSDSSEEDIAILNDEYSM
         C  ++   + F+D  V G+G +ARDS   ++ +  R V+  +  +  E  AI E + +++        PR++ K       + ++ND+ S+
Subjt:  HCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESI-ISLLVLMGGPFPRITIKSDSSEEDIAILNDEYSM

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G25270.1 Ribonuclease H-like superfamily protein (e value: 2.4e-07; % identity: 28.95)
Query:  WKLKSTPKEKICAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWEC----KISRDSGDPNER-------WNELMDLLIEEDRSK-------
        WKLK+ PK K   W+ +   L T  N+  R I  +P C  C  + E   H+ ++C    ++ R SG P++            M+LL+    +        
Subjt:  WKLKSTPKEKICAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWEC----KISRDSGDPNER-------WNELMDLLIEEDRSK-------

Query:  -AINIIWGIWNHRN
         AI I+W +W  RN
Subjt:  -AINIIWGIWNHRN

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.2e-06; % identity: 32.81)
Query:  WWKGFWKLKSTPKEKICAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISR
        W    W LK +PK K+  W+A+ N LP    +++R I   P C  CR   E + H+++ C  ++
Subjt:  WWKGFWKLKSTPKEKICAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISR
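
In the alignment blocks above, the unlabeled middle row is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the % identity of a single-block alignment can be recomputed from that row. The helper name `percent_identity` is hypothetical, and using the aligned query length as the denominator is an assumption about how the reported value was derived.

```python
def percent_identity(query, midline):
    """Percent identity of one alignment block from its match line.

    Letters in the match line mark identical residues; '+' and spaces
    do not count. The aligned query length (gaps included) is used as
    the denominator, which is an assumption about the reported value.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(query), 2)

# Query and match line copied from the AT3G26855.1 alignment above.
query   = "WWKGFWKLKSTPKEKICAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISR"
midline = "W    W LK +PK K+  W+A+ N LP    +++R I   P C  CR   E + H+++ C  ++"
print(percent_identity(query, midline))
```

For this block the match line holds 21 identical residues over 64 aligned positions, which agrees with the 32.81 listed for AT3G26855.1. For hits whose alignment is split across several blocks, the counts would need to be summed over all blocks before dividing.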


Sequences
CDS sequence
ATGAATAACAGAGAAATATGGGCTAGATGTGATCACAAAAAGAAATCATTTTATTCAAGGGCAACCATAGAGTTTTCCATTAGGGATCCTTTTGTGATCAAGGGGGACAT
CGGCAAGATGGAGAGACAGGACAAGGCTGAAGAAGTGCAAAAGAAGATGGAGAAGCTCGGACTAGAAGAAGAAGAGAGGGGAAGAGTCGTGGACATTGAAGATTACGATA
TAGACGAAAATGACAAGCTCATTGATGATGAGGGGAGGTGGAAAGAATCGATGATTAGGGAGTCTTTTATCCCTGCGGATGCGGAAGGCATCATCAATATTCCTCTGGGA
AGTAGGGAATCAAAAGATGATATCATATGGGGGCCTGATCCTAAAGGGACTTTTAATGTTAGAAGTGCCTACTATTTAGCTCAAAACCCTACTAACGCTGAAAAGGCCTC
CCCATTGGATTCTTCAAAGTCTAAAGCTTGGTGGAAAGGCTTCTGGAAGCTAAAATCAACCCCTAAAGAAAAGATTTGTGCTTGGAGAGCTATCCAAAACATCCTCCCTA
CCCAAATTAACATCATCAACAGAGGAATTGATACTAATCCTAATTGTTTTCTATGCAGGAGCAAAGGGGAGTTCGTGGATCATGTTATTTGGGAGTGCAAGATTTCGAGG
GATAGCGGGGATCCAAATGAAAGGTGGAACGAGCTCATGGATTTGCTCATCGAGGAGGACAGAAGCAAAGCTATCAACATAATTTGGGGAATTTGGAATCACAGAAATAC
CCCTCCCCCTCGGCATTGTTGGAAAATCAACACCGATGCCGCTTGGTTTGAAGACAGAGGAGTTGGAGGCATTGGGTGGATGGCTCGTGACTCAGACAGATCTTTAATTT
GCACCAGATTTCGAAGAGTTGAGAGGAGATGGACGATTAAATGTCTGGAAATGAAAGCCATAAAGGAAAGTATCATAAGCTTACTCGTCTTGATGGGTGGCCCCTTTCCA
CGGATAACAATCAAATCTGATTCTTCTGAAGAAGACATTGCTATCCTCAACGACGAGTACTCAATGGATCTCACCGAAGTCAGGACGCTAGCAAAGGAGGCTAAATCCTT
GGCAAATTGCTTCGGGGACATTTCTTTCTCAGGACATGCATCTGATATATTCTTTATGCTAAGAGACGCGAGGGATGAGTTGGAGTGGAGACATTTTGAGGACCTAGTGG
TTTTGTTGTGGTCCATATGCAATTGCTTAAACCAGCGTAGAGCAGCAGGATTAGGTGTAGTGGTGGGGAATCATGTAAGGGAGGTAATGGTTGCTGCTATAGTGTTTCAC
GGTTATGTTGGGTGCTCGGACATGGCTGAAGGTTGGGTTGTGGCTGACGGTTTAAGACTAACTATGGAGATGGGTTTTTTGTCTAATTGTCTTAGAAACAGACTTGAAGC
GAGTCTTTGA
mRNA sequence
ATGAATAACAGAGAAATATGGGCTAGATGTGATCACAAAAAGAAATCATTTTATTCAAGGGCAACCATAGAGTTTTCCATTAGGGATCCTTTTGTGATCAAGGGGGACAT
CGGCAAGATGGAGAGACAGGACAAGGCTGAAGAAGTGCAAAAGAAGATGGAGAAGCTCGGACTAGAAGAAGAAGAGAGGGGAAGAGTCGTGGACATTGAAGATTACGATA
TAGACGAAAATGACAAGCTCATTGATGATGAGGGGAGGTGGAAAGAATCGATGATTAGGGAGTCTTTTATCCCTGCGGATGCGGAAGGCATCATCAATATTCCTCTGGGA
AGTAGGGAATCAAAAGATGATATCATATGGGGGCCTGATCCTAAAGGGACTTTTAATGTTAGAAGTGCCTACTATTTAGCTCAAAACCCTACTAACGCTGAAAAGGCCTC
CCCATTGGATTCTTCAAAGTCTAAAGCTTGGTGGAAAGGCTTCTGGAAGCTAAAATCAACCCCTAAAGAAAAGATTTGTGCTTGGAGAGCTATCCAAAACATCCTCCCTA
CCCAAATTAACATCATCAACAGAGGAATTGATACTAATCCTAATTGTTTTCTATGCAGGAGCAAAGGGGAGTTCGTGGATCATGTTATTTGGGAGTGCAAGATTTCGAGG
GATAGCGGGGATCCAAATGAAAGGTGGAACGAGCTCATGGATTTGCTCATCGAGGAGGACAGAAGCAAAGCTATCAACATAATTTGGGGAATTTGGAATCACAGAAATAC
CCCTCCCCCTCGGCATTGTTGGAAAATCAACACCGATGCCGCTTGGTTTGAAGACAGAGGAGTTGGAGGCATTGGGTGGATGGCTCGTGACTCAGACAGATCTTTAATTT
GCACCAGATTTCGAAGAGTTGAGAGGAGATGGACGATTAAATGTCTGGAAATGAAAGCCATAAAGGAAAGTATCATAAGCTTACTCGTCTTGATGGGTGGCCCCTTTCCA
CGGATAACAATCAAATCTGATTCTTCTGAAGAAGACATTGCTATCCTCAACGACGAGTACTCAATGGATCTCACCGAAGTCAGGACGCTAGCAAAGGAGGCTAAATCCTT
GGCAAATTGCTTCGGGGACATTTCTTTCTCAGGACATGCATCTGATATATTCTTTATGCTAAGAGACGCGAGGGATGAGTTGGAGTGGAGACATTTTGAGGACCTAGTGG
TTTTGTTGTGGTCCATATGCAATTGCTTAAACCAGCGTAGAGCAGCAGGATTAGGTGTAGTGGTGGGGAATCATGTAAGGGAGGTAATGGTTGCTGCTATAGTGTTTCAC
GGTTATGTTGGGTGCTCGGACATGGCTGAAGGTTGGGTTGTGGCTGACGGTTTAAGACTAACTATGGAGATGGGTTTTTTGTCTAATTGTCTTAGAAACAGACTTGAAGC
GAGTCTTTGA
Protein sequence
MNNREIWARCDHKKKSFYSRATIEFSIRDPFVIKGDIGKMERQDKAEEVQKKMEKLGLEEEERGRVVDIEDYDIDENDKLIDDEGRWKESMIRESFIPADAEGIINIPLG
SRESKDDIIWGPDPKGTFNVRSAYYLAQNPTNAEKASPLDSSKSKAWWKGFWKLKSTPKEKICAWRAIQNILPTQINIINRGIDTNPNCFLCRSKGEFVDHVIWECKISR
DSGDPNERWNELMDLLIEEDRSKAINIIWGIWNHRNTPPPRHCWKINTDAAWFEDRGVGGIGWMARDSDRSLICTRFRRVERRWTIKCLEMKAIKESIISLLVLMGGPFP
RITIKSDSSEEDIAILNDEYSMDLTEVRTLAKEAKSLANCFGDISFSGHASDIFFMLRDARDELEWRHFEDLVVLLWSICNCLNQRRAAGLGVVVGNHVREVMVAAIVFH
GYVGCSDMAEGWVVADGLRLTMEMGFLSNCLRNRLEASL
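
As a consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. The sketch below assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDBv2. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence above.
cds_start = "ATGAATAACAGAGAAATATGGGCTAGATGT"
print(translate(cds_start))  # MNNREIWARC
```

The result matches the first ten residues of the protein sequence. Passing the full CDS, with line breaks removed, would allow the same comparison over the whole protein.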