; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001904 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001904
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold8:32619829..32624392
RNA-Seq ExpressionSpg001904
SyntenySpg001904
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]5.6e-2330.74Show/hide
Query:  TDSRQWNVGLIQQHFSPHKWLSCRPDAYIGAKLVFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSD
        T S  WN  L     S  KW      A      +FI    L  LP   NL KR + V N+C +CG +GE+ LHV   C + + V   +  G       +D
Subjt:  TDSRQWNVGLIQQHFSPHKWLSCRPDAYIGAKLVFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSD

Query:  CLFLLLRD-AKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGLS-WELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGE
         L   + +  K + E G   + ++  W+IW  RN    +G  ++ +     A  +L  F  AN  R+    I   R    W+APP   +K+N+D +   E
Subjt:  CLFLLLRD-AKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGLS-WELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGE

Query:  RMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISD
           AG G+VVR+H GD++AA  +   + +++ + EA A  +GL FA E+G+  +++E+DS+    AL  ++++ S F  L+ D
Subjt:  RMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISD

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]5.1e-2426.01Show/hide
Query:  SPPSLGPDARVADLRTDSRQWNVGLIQQHF-----------------SPHKWL---------SCRPDAYIGAKL--------------------------
        SPP+L     VADL  +++ W   +I QHF                  P ++L         S +    I  KL                          
Subjt:  SPPSLGPDARVADLRTDSRQWNVGLIQQHF-----------------SPHKWL---------SCRPDAYIGAKL--------------------------

Query:  ---VFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWG
           +F+       LPT  NL +R +    IC  CG + E  +H    CK  K V  +AG    + +     L  +L + +        E +VVL W IW 
Subjt:  ---VFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWG

Query:  CRNRVKMNGDGLSWELP-TWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNS
         +N           +L    A   +DSF+R     S   E +    +  W  PP GW+K+NVDA+ + +  +AG G+++RN  G+V+AA I+      +S
Subjt:  CRNRVKMNGDGLSWELP-TWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNS

Query:  DMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN
           EA AV+ G+  A   G  P+++ETDS    +    +K  + +   +I+D  ES   +    + +  RE N
Subjt:  DMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN

XP_021847414.1 uncharacterized protein LOC110787151 [Spinacia oleracea]5.5e-2631.18Show/hide
Query:  LPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGLS
        LPT   L KR      +C +C C+ ES +H    CK  + + C++ F  ++       +    R  K+ L+    E  + L WA+WG RN+  M  +GL+
Subjt:  LPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGLS

Query:  WELPT----WAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVD
         + PT    +A  V +  R+  V  S G          +W  P  GWYK+NVDA   GE + +G G V+R+  G V++  +R       +++AEA AV+ 
Subjt:  WELPT----WAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVD

Query:  GLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN
        G+   AE+G+  ++VE+D +   +AL+RE    S+F++++ D M        +   FV R GN
Subjt:  GLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]2.4e-3434.67Show/hide
Query:  VFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRN
        VF+    L  LPT  NL KRGV++ N C  CG  GE ++H+FW+CKF + +  ++ FG L         FL+LR++ + L    FE L V++W +W  RN
Subjt:  VFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRN

Query:  RVKMNGD-----GLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRN
            N        +  EL  WA      FR A     + G +     E  W  P  G YK+N DASF      AG G+++ N  G VMAA  +Y E++++
Subjt:  RVKMNGD-----GLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRN

Query:  SDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN
         DMAEA A V+GL  A+E+G+ P +                +DLS+   ++  A   W  +      FV REGN
Subjt:  SDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN

XP_024046732.1 uncharacterized protein LOC112101057 [Citrus clementina]3.7e-2225.24Show/hide
Query:  PSLGPDARVADLRTDSRQWNVGLIQQHFS-------------------PHKWLSCRPDAYI---GAKLVFILLGFLTWLPTIDNLIKRGVDVLNICSLCG
        PSL  +++VADL     QWN  LI+QHF+                      W   +   Y    G ++   L        + ++L +R +    IC +C 
Subjt:  PSLGPDARVADLRTDSRQWNVGLIQQHFS-------------------PHKWLSCRPDAYI---GAKLVFILLGFLTWLPTIDNLIKRGVDVLNICSLCG

Query:  CQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGLS-WELPTWAAGVLDSFRRANVLR
           E A H F  CK  + V   +     M +   + +  +++     +     E++ V+ W IW  RN++   G  ++   L   A   +++++R +   
Subjt:  CQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGLS-WELPTWAAGVLDSFRRANVLR

Query:  SVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEA
                  ++ +W  PPVG+YK+NVDA+   E+ L G G+V+RN  G V+   ++  +   N   AEA AV  GL  A E  L  +++ETD +     
Subjt:  SVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEA

Query:  LQREKDDLSDFFMLISD
           +     +    ISD
Subjt:  LQREKDDLSDFFMLISD

TrEMBL top hitse value%identityAlignment
A0A6J1CDQ4 uncharacterized protein LOC1110105338.8e-2231.75Show/hide
Query:  LLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGL-SWELPTWAAGVLDSFRRANVLRSV-----------GGEIRPCREEAKWVAPPVGWYKLNVD
        +LRD +D L W  FE LVV LW++W  RN    N   +   +L  W +  + +F+  N   +              +I   +    W     G +KL  D
Subjt:  LLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGL-SWELPTWAAGVLDSFRRANVLRSV-----------GGEIRPCREEAKWVAPPVGWYKLNVD

Query:  ASFDGERMLAGAG-LVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWP
        ASF      AG G +++R+H G V+A+  +Y E V + D AEA A V+GL  A E G+ P+++ETDS+R +    R+K+ LS    +I            
Subjt:  ASFDGERMLAGAG-LVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWP

Query:  LKVGFVYREGN
        +   F  R GN
Subjt:  LKVGFVYREGN

A0A6J1DAR4 uncharacterized protein LOC1110189541.2e-3434.67Show/hide
Query:  VFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRN
        VF+    L  LPT  NL KRGV++ N C  CG  GE ++H+FW+CKF + +  ++ FG L         FL+LR++ + L    FE L V++W +W  RN
Subjt:  VFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRN

Query:  RVKMNGD-----GLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRN
            N        +  EL  WA      FR A     + G +     E  W  P  G YK+N DASF      AG G+++ N  G VMAA  +Y E++++
Subjt:  RVKMNGD-----GLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRN

Query:  SDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN
         DMAEA A V+GL  A+E+G+ P +                +DLS+   ++  A   W  +      FV REGN
Subjt:  SDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN

A0A803PR93 Uncharacterized protein8.8e-2225.66Show/hide
Query:  WLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRV-----KM
        WLP   NL  RG+DV   C LCG Q E+  H  W C   K +     +       ++  +F ++   KD+L    FE  + ++WAIW  RN+       M
Subjt:  WLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRV-----KM

Query:  NGDGLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAV
        NG     +L  W +      R  N  + +  +++  ++  KW+ PP G   +N DA+ +      G G + R++ G+++ A + YH+S  + +MAEAWA+
Subjt:  NGDGLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAV

Query:  VDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN
        ++ L +          +++D  +  + +Q   + LS    ++    +    +    +  V+R  N
Subjt:  VDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN

A0A803PV25 Uncharacterized protein6.1e-2330.68Show/hide
Query:  TWLPTIDNLIKRGVDVLNICSLCGCQG-ESALHVFWLCKFTKDVLCDAGF-GPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNG
        +W+PT   L  R V +   C+ C     E+  H  W C+   DV   +GF   +  Q + D L  L+R +   L    FEY +VL W +W  RN V   G
Subjt:  TWLPTIDNLIKRGVDVLNICSLCGCQG-ESALHVFWLCKFTKDVLCDAGF-GPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNG

Query:  -DGLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVV
           ++  +  W +  L  FR +NV +  G      R  A+WVAP  G Y +NVDA       LA    V+R+H G V  A +R  E   +   AE  A+ 
Subjt:  -DGLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVV

Query:  DGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN
        DG+    +  L    VETD ++A   + ++     D   L++             + FVYRE N
Subjt:  DGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN

A0A803PWX1 Uncharacterized protein6.9e-2732.58Show/hide
Query:  TWLPTIDNLIKRGVDVLNICSLCGCQG-ESALHVFWLCKFTKDVLCDAGF-GPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNG
        +W+PT   L  R + V   C  C     E+  H  W C+  ++V   AGF G L  Q R D L  L+R +  + +   FE+ ++L W +W  RN V   G
Subjt:  TWLPTIDNLIKRGVDVLNICSLCGCQG-ESALHVFWLCKFTKDVLCDAGF-GPLMLQCRSDCLFLLLRDAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNG

Query:  -DGLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVV
            +  +  W +  L  FR  NV   VG      REEAKW  P  G +K+NVDA       LAG   VVR+H G VM A  R+ E   +    E  A++
Subjt:  -DGLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVV

Query:  DGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN
         G+    +  L    VE+D ++A   + +E++   D   LI+   E         V FV+RE N
Subjt:  DGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G04420.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.1e-0824.68Show/hide
Query:  LWAIWGCRNRVKMNGDGLSW---------ELPTW--AAGVLDSFRRANVLRSVGGEIRPCREEA--KWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNH
        +W +W  RN++      LSW         ++  W  A   + S     V           RE    KW  PP+GW K N D SF+       +G ++R+ 
Subjt:  LWAIWGCRNRVKMNGDGLSW---------ELPTW--AAGVLDSFRRANVLRSVGGEIRPCREEA--KWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNH

Query:  GGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREK
         G    A      ++ N+  +E  A+V  +      G   ++ E DS +  E L R++
Subjt:  GGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREK

AT2G46460.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.7e-0428.7Show/hide
Query:  WVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFML
        W  PP GW K N D S++ E M + AG ++R+  G  ++A         N+  +E  A++  +      G   +  E D+    E L R K    D F  
Subjt:  WVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMAATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFML

Query:  ISDAMESW
        I D +++W
Subjt:  ISDAMESW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCCCACAATTGGAGATAAAGGTTGACGGAGCAAAGAGTAGAAGTTGCAGATTTGCATCCACATTTGGTCCTGAGAATGTGAAATTTGCGTTTGAATTGCGCCCGCA
AACTGATGAGGAAAAAACCTTTTGCGTGAGCATTCCTTCTAACCTAGTCGTTGTTGCGGCAAGAAGTTCTGAGGATGCTCAGGTGAAGGTTGAAGGTAGTGTTGGATTAT
TTGTCTTAATTGAAAAAATTAATGTTGATCAGATTGAGAGAGCAAAGTCTGTTGATCCAGCAAAGCTGGTAGTATCTCCTCCCTCTTTGGGACCGGATGCTCGTGTAGCT
GATTTACGCACTGATTCAAGGCAATGGAATGTGGGGCTGATTCAACAACACTTTAGCCCCCATAAGTGGCTATCGTGTAGGCCGGATGCATATATTGGCGCAAAGCTCGT
CTTCATCCTCCTCGGTTTCCTTACATGGCTCCCAACTATAGATAATTTAATTAAGCGAGGGGTTGATGTGTTGAATATTTGCTCCCTGTGTGGCTGCCAAGGGGAGTCTG
CACTGCACGTTTTTTGGCTTTGTAAGTTTACTAAGGATGTGTTGTGTGATGCTGGTTTCGGCCCCCTTATGCTCCAATGTCGGTCAGATTGTTTATTCCTATTGTTGAGA
GATGCGAAGGATTACTTGGAATGGGGCCGTTTTGAATATTTAGTCGTGTTGCTTTGGGCTATTTGGGGCTGTAGGAACCGTGTGAAAATGAATGGGGATGGGTTGTCTTG
GGAGTTACCCACCTGGGCTGCTGGTGTGTTGGATTCCTTCCGGCGTGCGAATGTGTTACGGTCGGTTGGGGGAGAAATAAGGCCTTGCCGTGAGGAAGCGAAGTGGGTTG
CGCCGCCGGTGGGTTGGTACAAGTTGAATGTTGATGCTTCTTTTGATGGTGAGAGGATGCTTGCTGGAGCGGGCTTGGTTGTGCGGAACCATGGGGGAGATGTAATGGCG
GCTACGATAAGATACCATGAGTCTGTCAGAAATTCTGATATGGCGGAAGCTTGGGCGGTGGTTGATGGCCTAAGTTTTGCCGCTGAGATGGGGCTTTTCCCACTGATGGT
TGAGACCGATTCCATGAGGGCCTTCGAGGCGTTGCAGAGAGAGAAAGATGATTTGTCGGATTTTTTTATGTTGATTTCAGATGCAATGGAGTCCTGGCCATTGGCGTGGC
CACTGAAGGTTGGATTTGTCTACCGTGAAGGCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCCCACAATTGGAGATAAAGGTTGACGGAGCAAAGAGTAGAAGTTGCAGATTTGCATCCACATTTGGTCCTGAGAATGTGAAATTTGCGTTTGAATTGCGCCCGCA
AACTGATGAGGAAAAAACCTTTTGCGTGAGCATTCCTTCTAACCTAGTCGTTGTTGCGGCAAGAAGTTCTGAGGATGCTCAGGTGAAGGTTGAAGGTAGTGTTGGATTAT
TTGTCTTAATTGAAAAAATTAATGTTGATCAGATTGAGAGAGCAAAGTCTGTTGATCCAGCAAAGCTGGTAGTATCTCCTCCCTCTTTGGGACCGGATGCTCGTGTAGCT
GATTTACGCACTGATTCAAGGCAATGGAATGTGGGGCTGATTCAACAACACTTTAGCCCCCATAAGTGGCTATCGTGTAGGCCGGATGCATATATTGGCGCAAAGCTCGT
CTTCATCCTCCTCGGTTTCCTTACATGGCTCCCAACTATAGATAATTTAATTAAGCGAGGGGTTGATGTGTTGAATATTTGCTCCCTGTGTGGCTGCCAAGGGGAGTCTG
CACTGCACGTTTTTTGGCTTTGTAAGTTTACTAAGGATGTGTTGTGTGATGCTGGTTTCGGCCCCCTTATGCTCCAATGTCGGTCAGATTGTTTATTCCTATTGTTGAGA
GATGCGAAGGATTACTTGGAATGGGGCCGTTTTGAATATTTAGTCGTGTTGCTTTGGGCTATTTGGGGCTGTAGGAACCGTGTGAAAATGAATGGGGATGGGTTGTCTTG
GGAGTTACCCACCTGGGCTGCTGGTGTGTTGGATTCCTTCCGGCGTGCGAATGTGTTACGGTCGGTTGGGGGAGAAATAAGGCCTTGCCGTGAGGAAGCGAAGTGGGTTG
CGCCGCCGGTGGGTTGGTACAAGTTGAATGTTGATGCTTCTTTTGATGGTGAGAGGATGCTTGCTGGAGCGGGCTTGGTTGTGCGGAACCATGGGGGAGATGTAATGGCG
GCTACGATAAGATACCATGAGTCTGTCAGAAATTCTGATATGGCGGAAGCTTGGGCGGTGGTTGATGGCCTAAGTTTTGCCGCTGAGATGGGGCTTTTCCCACTGATGGT
TGAGACCGATTCCATGAGGGCCTTCGAGGCGTTGCAGAGAGAGAAAGATGATTTGTCGGATTTTTTTATGTTGATTTCAGATGCAATGGAGTCCTGGCCATTGGCGTGGC
CACTGAAGGTTGGATTTGTCTACCGTGAAGGCAACTGA
Protein sequenceShow/hide protein sequence
MCPQLEIKVDGAKSRSCRFASTFGPENVKFAFELRPQTDEEKTFCVSIPSNLVVVAARSSEDAQVKVEGSVGLFVLIEKINVDQIERAKSVDPAKLVVSPPSLGPDARVA
DLRTDSRQWNVGLIQQHFSPHKWLSCRPDAYIGAKLVFILLGFLTWLPTIDNLIKRGVDVLNICSLCGCQGESALHVFWLCKFTKDVLCDAGFGPLMLQCRSDCLFLLLR
DAKDYLEWGRFEYLVVLLWAIWGCRNRVKMNGDGLSWELPTWAAGVLDSFRRANVLRSVGGEIRPCREEAKWVAPPVGWYKLNVDASFDGERMLAGAGLVVRNHGGDVMA
ATIRYHESVRNSDMAEAWAVVDGLSFAAEMGLFPLMVETDSMRAFEALQREKDDLSDFFMLISDAMESWPLAWPLKVGFVYREGN