; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031630 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031630
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold11:44819333..44826593
RNA-Seq ExpressionSpg031630
SyntenySpg031630
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74795.1 hypothetical protein VITISV_041690 [Vitis vinifera]1.8e-4134.15Show/hide
Query:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR
        +GNGE   FWED W G++ LCA +  LY + S++N +V+ VL  +   LS++F F R+L+D +   +  L+S I  +    S  D R WS + S  FS +
Subjt:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR

Query:  SFFHCL-LDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH
        SFF+ L  D +P        LW  K+P KV+   W V HG+ NT D+L  + P   + P  CI C+   E +DHL   C     +W   F   G+ +   
Subjt:  SFFHCL-LDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH

Query:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDW
        R + +M+           +G  LWQ     +IW +W ERNN+ F    R    VW  +R+  SLWAS T  F    + ++ ++W
Subjt:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDW

RVW16200.1 putative ribonuclease H protein [Vitis vinifera]2.4e-4133.8Show/hide
Query:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR
        +GNGE   FWED W G++ LCA +  LY + S++N +V+ VL  +   LS++F F R+L+D +   +  L+S +  +    S  D R WS + S  FS +
Subjt:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR

Query:  SFFHCL-LDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH
        SFF+ L  D +P        LW  K+P KV+   W V HG+ NT D+L  + P   + P  CI C+   E +DHL   C     +W   F   G+ +   
Subjt:  SFFHCL-LDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH

Query:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDW
        R + +M+           +G  LWQ     +IW +W ERNN+ F    R    VW  +R+  SLWAS T  F    + ++ ++W
Subjt:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDW

RVW19678.1 Structural maintenance of chromosomes protein 3, partial [Vitis vinifera]1.1e-4133.8Show/hide
Query:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR
        +GNGE   FWED W G++ LCA +  LY + S++N +V+ VL  +   LS++F F R+L+D +   +  L+S +  +    S  D R WS + S  FS +
Subjt:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR

Query:  SFFHCL-LDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH
        SFF+ L  D +P        LW  K+P KV+   W V HG+ NT D+L  + P   + P  CI C+   E +DHL   C     +W   F   G+ +   
Subjt:  SFFHCL-LDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH

Query:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDWSPF
        R + +M+           +G  LWQ     +IW +W ERNN+ F    R    VW  +R+  SLWAS T  F    + ++ ++W  F
Subjt:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDWSPF

RVX17758.1 putative ribonuclease H protein [Vitis vinifera]8.2e-4234.86Show/hide
Query:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR
        +GNGE   FWED W G++ LC+ +  LY + SMKN +V+ VL  +   LS++F F R+L+D +   +  L+S +  +    S  D R+WS + S  FS +
Subjt:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR

Query:  SFFHCLLDPS-PTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH
        SFF+ L   S P        LW  K+P KV+   W V HG+ NT D+L  + P   + P  CI C+   E +DHL   C     +W   F   G+ +   
Subjt:  SFFHCLLDPS-PTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPC-LIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARH

Query:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDW
        R L +M+           +G  LWQ     +IW +W ERNN+ F    R    VW  +R+  SLWAS T+ F    + ++ L+W
Subjt:  RGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDW

TYK09969.1 calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa]1.9e-4643.64Show/hide
Query:  GDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCRSFFHCLLDPSPTDLS
        G+RPLC  +P LYHLSS+KN  +A+ L  +G+S S+SFGF R+LSDR+T+++++L+SL+E  +F   RRD  +WSP P  GF C+SFF CL++ +PT  S
Subjt:  GDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCRSFFHCLLDPSPTDLS

Query:  IFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARHRGLREMIEKFLPHPPFR
        + S++W++K+P+K   F WQ                              VEEDLDHLLW C     VWD F  +FGL +ARHR +R  +E+FL + P  
Subjt:  IFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARHRGLREMIEKFLPHPPFR

Query:  EQGNFLWQAGICAIIWGLWG
        E+G FLW A + A + G  G
Subjt:  EQGNFLWQAGICAIIWGLWG

TrEMBL top hitse value%identityAlignment
A0A5D3CI74 Calpain-type cysteine protease DEK19.1e-4743.64Show/hide
Query:  GDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCRSFFHCLLDPSPTDLS
        G+RPLC  +P LYHLSS+KN  +A+ L  +G+S S+SFGF R+LSDR+T+++++L+SL+E  +F   RRD  +WSP P  GF C+SFF CL++ +PT  S
Subjt:  GDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCRSFFHCLLDPSPTDLS

Query:  IFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARHRGLREMIEKFLPHPPFR
        + S++W++K+P+K   F WQ                              VEEDLDHLLW C     VWD F  +FGL +ARHR +R  +E+FL + P  
Subjt:  IFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARHRGLREMIEKFLPHPPFR

Query:  EQGNFLWQAGICAIIWGLWG
        E+G FLW A + A + G  G
Subjt:  EQGNFLWQAGICAIIWGLWG

A0A5H2XQW2 TatD related DNase2.8e-4032.17Show/hide
Query:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF
        ++S+GNGE   FWED W+ +  L   +PRLY LS  KN+ +A   N     L++ F F R+LS+ +  +++ LL ++  +    SR D R W       F
Subjt:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF

Query:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA
        SC+SF   L+  +      +  +WK K P K+QFF+W   +GR NT D + R+ P + + P  C+ C+   E++DHL   C ++  +W     + G ++ 
Subjt:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA

Query:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSA-EVWSFVRYNVSLWASVTRLFCNYSIGLIILD
          +G  E++   L      ++   L    I AI W +W ERN + F+G       E+W  +++  SLWASV+  F +Y    I+ D
Subjt:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSA-EVWSFVRYNVSLWASVTRLFCNYSIGLIILD

M5VH03 zf-RVT domain-containing protein (Fragment)5.2e-4233.57Show/hide
Query:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF
        ++S+GNGE   FWED W+ +  L   +PRL  LS  KN+S+A   N     L++ F F R+LS+ +  +++ LL ++  +    SR D R W       F
Subjt:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF

Query:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA
        SC+SF   LL  +      FS +WK K P K+QFF+W   +GR NT D + R+ P + + P  C+ C+   E++DHL   C ++  +W     + G+++ 
Subjt:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA

Query:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRG-FERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILD
          +G  E++   L      ++   L    + AI W +W ERN K F+G  E    E+W  +++  SLWASV+  F +Y    I+ D
Subjt:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRG-FERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILD

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)9.7e-4132.87Show/hide
Query:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF
        ++S+GNGE   FWED W+ +  L   +PRL  LS  KN+S+A   N     L++ F F R+LS+ +  +++ LL ++  +    SR D R W       F
Subjt:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF

Query:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA
        SC+SF   LL  +      FS +WK K P K+QFF+W   +GR NT D + R+ P + + P  C+ C+   E++DHL   C ++  +W     + G+++ 
Subjt:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA

Query:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRG-FERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILD
          +G  E++   L      ++   L    + AI W +W ERN + F+G       E+W  +++  SLWASV+  F +Y    I+ D
Subjt:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRG-FERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILD

M5XV38 zf-RVT domain-containing protein9.7e-4132.87Show/hide
Query:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF
        ++S+GNGE   FWED W+ +  L   +PRL  LS  KN+S+A   N     L++ F F R+LS+ +  +++ LL ++  +    SR D R W       F
Subjt:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGF

Query:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA
        SC+SF   LL  +      FS +WK K P K+QFF+W   +GR NT D + R+ P + + P  C+ C+   E++DHL   C ++  +W     + G+++ 
Subjt:  SCRSFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCL-IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFA

Query:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRG-FERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILD
          +G  E++   L      ++   L    + AI W +W ERN + F+G       E+W  +++  SLWASV+  F +Y    I+ D
Subjt:  RHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNNKTFRG-FERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILD

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.1e-0824.68Show/hide
Query:  GFWQPGWAALASIGIREVPVEQTPMNHLVEDKEAKTPPKKKGDPKKYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYS
        G W   W ++A IG+R+V      ++H V                 +  G+G+   FW DRWV  +PL      L +     +       +L      + 
Subjt:  GFWQPGWAALASIGIREVPVEQTPMNHLVEDKEAKTPPKKKGDPKKYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYS

Query:  FGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCRSFFHCLL---DPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSR
        F  +   +  +T   L L +++  +   T  RD   W  +    FS RS +  L     P P   S F+ LWKV++P++V+ F+W V  G    +    R
Subjt:  FGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCRSFFHCLL---DPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSR

Query:  KIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVW
            L     C  C+   E + H+L  C     +W
Subjt:  KIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVW

Arabidopsis top hitse value%identityAlignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.2e-0826.4Show/hide
Query:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMK---NRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPS
        K ++GNG   +FW D W    PL       Y   S++   N  V E L ++G  L  S    RS   +   D +S ++     T      D   W     
Subjt:  KYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMK---NRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPS

Query:  IGFSCRSFFHC----LLDPSPTDLSIFSMLW-KVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFA
         G  C+ F        + P   +L     +W K  +PK   F +W     R  T  RL+      I  F C  C +  E  DHLL+SC FA  VW + F+
Subjt:  IGFSCRSFFHC----LLDPSPTDLSIFSMLW-KVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFA

Query:  SFGLQFARHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNN
            +        E++             + L +    AII+ +W +RNN
Subjt:  SFGLQFARHRGLREMIEKFLPHPPFREQGNFLWQAGICAIIWGLWGERNN

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-0426.13Show/hide
Query:  CITCRMVEEDLDHLLWSCGFARAVWDM----FFASFGLQFARHRGLREMIEKFLPHPPFREQGNFL-WQAGICAIIWGLWGERNNKTFRGFERDSAEVWS
        C+ C    E ++HLL+ C FAR VW +     +       + +  L  ++   +  P   + GN + W      ++W LW  RN   F+G E D+ EV  
Subjt:  CITCRMVEEDLDHLLWSCGFARAVWDM----FFASFGLQFARHRGLREMIEKFLPHPPFREQGNFL-WQAGICAIIWGLWGERNNKTFRGFERDSAEVWS

Query:  FVRYNVSLWAS
            +   W++
Subjt:  FVRYNVSLWAS

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.8e-0532.08Show/hide
Query:  NTLDRLSRKIPCL-----IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARHR---GLREMIEKFLPHPPFREQGNFLWQAGICAIIWGL
        + LDRL  +   +     I P CC+ C    E  DHL+ +CGF+ ++W+M  A   LQ  + R    L   I+      P   +   + QA +CAI    
Subjt:  NTLDRLSRKIPCL-----IGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARHR---GLREMIEKFLPHPPFREQGNFLWQAGICAIIWGL

Query:  WGERNN
        W +RNN
Subjt:  WGERNN

AT4G29090.1 Ribonuclease H-like superfamily protein3.2e-1223.24Show/hide
Query:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLL-----SLIEGITFCTSRR--DFRLWSPNP
        +GNGED   W  +W+  +P  A   R+  +   +  SV+ +L +S          +         D++ +L       + G      RR  D   W    
Subjt:  MGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLL-----SLIEGITFCTSRR--DFRLWSPNP

Query:  SIGFSCRSFFHCLLD-----PSPTDLS------IFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARA
        S  ++ +S +  L        SP ++S      I+  +WK +   K+Q F+W+ +         L+ +   L     CI C   +E ++HLL+ C FAR 
Subjt:  SIGFSCRSFFHCLLD-----PSPTDLS------IFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARA

Query:  VWDMFFASFGLQFARHRGLREMIEKFLPHPPFREQGNFLWQAG---ICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLW
         W    A   +         + I   L        GN  W+     +  ++W LW  RN   FRG E ++ EV      ++  W
Subjt:  VWDMFFASFGLQFARHRGLREMIEKFLPHPPFREQGNFLWQAG---ICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCAAGAAAGTTTCGGCCAGCATTGTGGGCGATCCTTCTTTGGTTGGAAAAGAGACGGAAACTACCGCCGTCCTCTCTCCACGATCAACAACCGTACGCTTGCT
GTCGGTTGAACAAGACACAAAGGAACTAAGAAGTGATGTGGGTGAGATAAAGAAAATTTTTGAAATGATTTGTGAAAAACTGGGTTGCAAATCTGACCAGCATAATGTCG
ATCCAAGAAGTCATATAAATCCCGAAAGGAACCAGCAAGAACAACCACGAGAGGATTTCAGAACGAAGCAATGGCAAGGAAGACAAATCCCAGAGCAAAAGACAGTCCAA
GGAATGAAACCAGCTCCAAGATACTATCAAGAACACCACACGGGACAACAAGAAGTCAAGAAGGTGAACATCGATTGGAAGGTGTTTCATCTCCGAAAGAATGAGACGGG
GAGGAAAGTGGTGCTGGAGGAACGGAGGGGAGCAAGAACGAGGCAAATGGATTTAGACCTAGGAACGTTCGCTTGGATCCTGAGGAATTCAAAAGGAAGATATGGCCTGG
TATCCTTGGAACCCTTTCGTGGACGCAGAAATACGATCTTCATTCCAAAAGGCTCTAACTTGAAGGGATGGGGGGCTTTGGAAGCAACTATGACGAGTCACCATCTCAAA
TCGTTTGCTGAGGCGGTGAAAGAGGGCCCAAGTCGGAAACCTGAAGAGGAAGTGAAGCTCTCGGAGGACGTCGACTTTGCTTTTGTCTGTTCGGAGGCGGTGATCGTAGA
AAGGACATCTATTAAGTCGAGCTGGGCAGAAGTCAGGGAAGTTTGCTTAAGGTTAGTCAATATTGGTTTTTCTCTAAAACCTTTCTGTGCTAATATGGCAGTGGGTCTTT
GTGGAAGCAACAAGGAAGCTGAAGACTTGATTCGTGCGGGTTCTTCTTTGAATTCCGAGCACTTTTCAGTTCACGTGTGGGACTTCAAGGTGATAGTTGAAAGAAAATGT
CAGAAATTTGAAGGTGGATGGATCACGGCCCTGGACGTTCCGCCATTTTTGAGGACCAAGCAAATGATATCCAAGCTTGCTGATTTCTGTGGGGGATTAAAGGATGATGA
TGAGATTTTTGGCGAGACATGCTGGACTGATGAGATATGCTTCAAAGTCAAAGGGAAGGACAGTGGTCTCATACCCACATCGTGCTATGTTAATCACGGTCAGACATACT
TTCCGGTGAGGTTTGTGATCGGAAAGGGGATGGGTCTTGTGAAGTCTGGCGAGGAGACGGTAAGCCCAAGACATCCTGAAAAAATCGTACTCCAGACGTCTGTTGCAGCA
GATTTGGCAGAGCAGTCGGCGGATCAGAGGGCTCCTACTCCCAAAAGTCACGTGAAGGCTGGAGAAAAGGAAGATGGGCCTGGGTCTATTAATGGGTCGTGGGCTGATGG
GCCAAGCAATGATATTTTAAAAGCAAAAGCCCAGAACATTATTATCAAAGATTTGCTTCAGCCGAATATGGTAGAAGATGGGCCGTCGAAGATGGGAAGTGAGCCGGACC
AGTTTCAGAATTTAGATTCATCCATAGAGTATCTAGGCAGCAACGAGCATGATTTGGAGTGGGATGACAGATTTAATCTCACGGAGGATTTTATTATTGAGGAAGAGGCT
GCGTCCGACCCCTCTGTCAAGGAAGACAAGCATATATCGCCTTCCGAAGTTTCGACGAGGGATTCTTTTGCTTTGACGACAGGGGTGTATCAAGATGTCTCGACTTCAGG
GGATTCGAATCAGCCGCTGATCAAGGTAAATTCTGATGGGATCCCTATCCAGGTACGCTCTCCTTCTTTGGGTATTTCACCTTCTAATTCGATTGGGAGGGGAGGGTGGT
TGTATTCAGGGGGGTTAGCCTATCCGTCTTCTTATTCTGTAGCCTTACCGTTGGGAGAGGGAAGGGGAGAGTTCTTGGGGTTCAACCATAACTTTGGGCATCTTGTTTTC
TTGAATTTTCTCCAACATTCTGGAAGTGCCTTGTCCTACCCGTCGACTTATTCTGTAGGAGAGAGAGTGCCGCCATTTGCTTCCGTATCTAGTGTCCCGGGTAATGGAAT
TGGGGGAGGCGTTCAACCTTTTCCCAATGGGCTTGTGAGCAACATTCCCTTACCCATCTATAGCCCCACGGTCATTGAAGGCCTCTCAATTAAAGAAGGAAACCATGGCT
TTTGGCAACCGGGGTGGGCTGCGTTGGCGTCGATTGGGATAAGAGAAGTCCCAGTTGAGCAAACCCCTATGAATCACTTAGTTGAGGATAAAGAAGCTAAAACTCCTCCC
AAAAAGAAAGGGGACCCGAAAAAATACTCTATGGGGAATGGGGAGGATACATATTTTTGGGAAGACAGGTGGGTGGGGGATAGACCCCTTTGTGCTACTTATCCTCGGCT
TTATCATTTATCTTCTATGAAAAATCGCTCTGTGGCCGAGGTTTTGAACCTTTCAGGGAGCTCTCTATCCTATTCCTTTGGCTTCGTGCGTTCTCTGTCTGATAGAGATA
CTACAGACATCCTGTCTCTCCTGTCTTTGATTGAGGGGATCACCTTTTGTACGTCGAGGAGGGATTTTCGTTTGTGGAGTCCCAACCCCTCCATTGGCTTCTCTTGTCGG
TCCTTCTTTCATTGTTTACTGGACCCTTCTCCTACCGATTTGTCCATTTTTTCCATGTTATGGAAGGTGAAAATCCCAAAAAAGGTGCAGTTCTTTATTTGGCAGGTTAT
CCACGGAAGAGCTAATACTCTTGATCGGCTCTCAAGAAAGATCCCTTGCTTGATCGGGCCTTTTTGTTGCATTACTTGCCGGATGGTAGAGGAAGACCTTGACCATTTGT
TGTGGAGTTGCGGTTTCGCTAGGGCCGTGTGGGACATGTTTTTTGCTTCGTTTGGTTTGCAGTTTGCCAGACATAGGGGCCTTAGAGAGATGATCGAGAAGTTCCTTCCC
CATCCCCCTTTTCGAGAACAAGGGAATTTCTTATGGCAAGCGGGGATTTGCGCTATTATTTGGGGGCTTTGGGGTGAGAGAAACAATAAGACATTTAGAGGGTTTGAGAG
GGATTCGGCTGAGGTTTGGTCCTTTGTTAGATATAACGTTTCTCTTTGGGCGTCTGTGACGCGTTTATTTTGTAATTATTCTATAGGTCTTATTATTTTGGATTGGAGCC
CCTTTATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAACCAAGAAAGTTTCGGCCAGCATTGTGGGCGATCCTTCTTTGGTTGGAAAAGAGACGGAAACTACCGCCGTCCTCTCTCCACGATCAACAACCGTACGCTTGCT
GTCGGTTGAACAAGACACAAAGGAACTAAGAAGTGATGTGGGTGAGATAAAGAAAATTTTTGAAATGATTTGTGAAAAACTGGGTTGCAAATCTGACCAGCATAATGTCG
ATCCAAGAAGTCATATAAATCCCGAAAGGAACCAGCAAGAACAACCACGAGAGGATTTCAGAACGAAGCAATGGCAAGGAAGACAAATCCCAGAGCAAAAGACAGTCCAA
GGAATGAAACCAGCTCCAAGATACTATCAAGAACACCACACGGGACAACAAGAAGTCAAGAAGGTGAACATCGATTGGAAGGTGTTTCATCTCCGAAAGAATGAGACGGG
GAGGAAAGTGGTGCTGGAGGAACGGAGGGGAGCAAGAACGAGGCAAATGGATTTAGACCTAGGAACGTTCGCTTGGATCCTGAGGAATTCAAAAGGAAGATATGGCCTGG
TATCCTTGGAACCCTTTCGTGGACGCAGAAATACGATCTTCATTCCAAAAGGCTCTAACTTGAAGGGATGGGGGGCTTTGGAAGCAACTATGACGAGTCACCATCTCAAA
TCGTTTGCTGAGGCGGTGAAAGAGGGCCCAAGTCGGAAACCTGAAGAGGAAGTGAAGCTCTCGGAGGACGTCGACTTTGCTTTTGTCTGTTCGGAGGCGGTGATCGTAGA
AAGGACATCTATTAAGTCGAGCTGGGCAGAAGTCAGGGAAGTTTGCTTAAGGTTAGTCAATATTGGTTTTTCTCTAAAACCTTTCTGTGCTAATATGGCAGTGGGTCTTT
GTGGAAGCAACAAGGAAGCTGAAGACTTGATTCGTGCGGGTTCTTCTTTGAATTCCGAGCACTTTTCAGTTCACGTGTGGGACTTCAAGGTGATAGTTGAAAGAAAATGT
CAGAAATTTGAAGGTGGATGGATCACGGCCCTGGACGTTCCGCCATTTTTGAGGACCAAGCAAATGATATCCAAGCTTGCTGATTTCTGTGGGGGATTAAAGGATGATGA
TGAGATTTTTGGCGAGACATGCTGGACTGATGAGATATGCTTCAAAGTCAAAGGGAAGGACAGTGGTCTCATACCCACATCGTGCTATGTTAATCACGGTCAGACATACT
TTCCGGTGAGGTTTGTGATCGGAAAGGGGATGGGTCTTGTGAAGTCTGGCGAGGAGACGGTAAGCCCAAGACATCCTGAAAAAATCGTACTCCAGACGTCTGTTGCAGCA
GATTTGGCAGAGCAGTCGGCGGATCAGAGGGCTCCTACTCCCAAAAGTCACGTGAAGGCTGGAGAAAAGGAAGATGGGCCTGGGTCTATTAATGGGTCGTGGGCTGATGG
GCCAAGCAATGATATTTTAAAAGCAAAAGCCCAGAACATTATTATCAAAGATTTGCTTCAGCCGAATATGGTAGAAGATGGGCCGTCGAAGATGGGAAGTGAGCCGGACC
AGTTTCAGAATTTAGATTCATCCATAGAGTATCTAGGCAGCAACGAGCATGATTTGGAGTGGGATGACAGATTTAATCTCACGGAGGATTTTATTATTGAGGAAGAGGCT
GCGTCCGACCCCTCTGTCAAGGAAGACAAGCATATATCGCCTTCCGAAGTTTCGACGAGGGATTCTTTTGCTTTGACGACAGGGGTGTATCAAGATGTCTCGACTTCAGG
GGATTCGAATCAGCCGCTGATCAAGGTAAATTCTGATGGGATCCCTATCCAGGTACGCTCTCCTTCTTTGGGTATTTCACCTTCTAATTCGATTGGGAGGGGAGGGTGGT
TGTATTCAGGGGGGTTAGCCTATCCGTCTTCTTATTCTGTAGCCTTACCGTTGGGAGAGGGAAGGGGAGAGTTCTTGGGGTTCAACCATAACTTTGGGCATCTTGTTTTC
TTGAATTTTCTCCAACATTCTGGAAGTGCCTTGTCCTACCCGTCGACTTATTCTGTAGGAGAGAGAGTGCCGCCATTTGCTTCCGTATCTAGTGTCCCGGGTAATGGAAT
TGGGGGAGGCGTTCAACCTTTTCCCAATGGGCTTGTGAGCAACATTCCCTTACCCATCTATAGCCCCACGGTCATTGAAGGCCTCTCAATTAAAGAAGGAAACCATGGCT
TTTGGCAACCGGGGTGGGCTGCGTTGGCGTCGATTGGGATAAGAGAAGTCCCAGTTGAGCAAACCCCTATGAATCACTTAGTTGAGGATAAAGAAGCTAAAACTCCTCCC
AAAAAGAAAGGGGACCCGAAAAAATACTCTATGGGGAATGGGGAGGATACATATTTTTGGGAAGACAGGTGGGTGGGGGATAGACCCCTTTGTGCTACTTATCCTCGGCT
TTATCATTTATCTTCTATGAAAAATCGCTCTGTGGCCGAGGTTTTGAACCTTTCAGGGAGCTCTCTATCCTATTCCTTTGGCTTCGTGCGTTCTCTGTCTGATAGAGATA
CTACAGACATCCTGTCTCTCCTGTCTTTGATTGAGGGGATCACCTTTTGTACGTCGAGGAGGGATTTTCGTTTGTGGAGTCCCAACCCCTCCATTGGCTTCTCTTGTCGG
TCCTTCTTTCATTGTTTACTGGACCCTTCTCCTACCGATTTGTCCATTTTTTCCATGTTATGGAAGGTGAAAATCCCAAAAAAGGTGCAGTTCTTTATTTGGCAGGTTAT
CCACGGAAGAGCTAATACTCTTGATCGGCTCTCAAGAAAGATCCCTTGCTTGATCGGGCCTTTTTGTTGCATTACTTGCCGGATGGTAGAGGAAGACCTTGACCATTTGT
TGTGGAGTTGCGGTTTCGCTAGGGCCGTGTGGGACATGTTTTTTGCTTCGTTTGGTTTGCAGTTTGCCAGACATAGGGGCCTTAGAGAGATGATCGAGAAGTTCCTTCCC
CATCCCCCTTTTCGAGAACAAGGGAATTTCTTATGGCAAGCGGGGATTTGCGCTATTATTTGGGGGCTTTGGGGTGAGAGAAACAATAAGACATTTAGAGGGTTTGAGAG
GGATTCGGCTGAGGTTTGGTCCTTTGTTAGATATAACGTTTCTCTTTGGGCGTCTGTGACGCGTTTATTTTGTAATTATTCTATAGGTCTTATTATTTTGGATTGGAGCC
CCTTTATCTAA
Protein sequenceShow/hide protein sequence
MATKKVSASIVGDPSLVGKETETTAVLSPRSTTVRLLSVEQDTKELRSDVGEIKKIFEMICEKLGCKSDQHNVDPRSHINPERNQQEQPREDFRTKQWQGRQIPEQKTVQ
GMKPAPRYYQEHHTGQQEVKKVNIDWKVFHLRKNETGRKVVLEERRGARTRQMDLDLGTFAWILRNSKGRYGLVSLEPFRGRRNTIFIPKGSNLKGWGALEATMTSHHLK
SFAEAVKEGPSRKPEEEVKLSEDVDFAFVCSEAVIVERTSIKSSWAEVREVCLRLVNIGFSLKPFCANMAVGLCGSNKEAEDLIRAGSSLNSEHFSVHVWDFKVIVERKC
QKFEGGWITALDVPPFLRTKQMISKLADFCGGLKDDDEIFGETCWTDEICFKVKGKDSGLIPTSCYVNHGQTYFPVRFVIGKGMGLVKSGEETVSPRHPEKIVLQTSVAA
DLAEQSADQRAPTPKSHVKAGEKEDGPGSINGSWADGPSNDILKAKAQNIIIKDLLQPNMVEDGPSKMGSEPDQFQNLDSSIEYLGSNEHDLEWDDRFNLTEDFIIEEEA
ASDPSVKEDKHISPSEVSTRDSFALTTGVYQDVSTSGDSNQPLIKVNSDGIPIQVRSPSLGISPSNSIGRGGWLYSGGLAYPSSYSVALPLGEGRGEFLGFNHNFGHLVF
LNFLQHSGSALSYPSTYSVGERVPPFASVSSVPGNGIGGGVQPFPNGLVSNIPLPIYSPTVIEGLSIKEGNHGFWQPGWAALASIGIREVPVEQTPMNHLVEDKEAKTPP
KKKGDPKKYSMGNGEDTYFWEDRWVGDRPLCATYPRLYHLSSMKNRSVAEVLNLSGSSLSYSFGFVRSLSDRDTTDILSLLSLIEGITFCTSRRDFRLWSPNPSIGFSCR
SFFHCLLDPSPTDLSIFSMLWKVKIPKKVQFFIWQVIHGRANTLDRLSRKIPCLIGPFCCITCRMVEEDLDHLLWSCGFARAVWDMFFASFGLQFARHRGLREMIEKFLP
HPPFREQGNFLWQAGICAIIWGLWGERNNKTFRGFERDSAEVWSFVRYNVSLWASVTRLFCNYSIGLIILDWSPFI