CuGenDBv2

Spg018272 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg018272
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold3:19256811..19262449
RNA-Seq Expression: Spg018272
Synteny: Spg018272
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
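
The genome location field above follows a "sequence:start..end" pattern. As a minimal sketch, it could be split into its parts and a span length derived as shown below. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

# Hypothetical pattern for the "seqid:start..end" form shown in the record.
_LOCATION = re.compile(r"^(?P<seqid>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold3:19256811..19262449")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 5639 positions on scaffold3.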


Homology
GenBank top hits    e value    % identity    Alignment
KAF4383622.1 hypothetical protein F8388_014122 [Cannabis sativa]    7.9e-64    39.43
Query:  SGGLVLMWKDELNLTITSFSIGHIDTVIK-SSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLV
        SGGL+L+W D+  +++ SF+ GHID ++K    + WRFTGFYGNP  + R ESW L+ RL  + +LPW+ GGDFNEIL   EK  G++R+   M  F+  
Subjt:  SGGLVLMWKDELNLTITSFSIGHIDTVIK-SSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLV

Query:  LDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIK
        LDRC+L DLG     FTW+ K +G + + ERLDR+  N     LF  + + + +   S+H PI+A++      +   ++    RFE  WL   EC++II 
Subjt:  LDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIK

Query:  NHWNSL--PRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEI-KMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNT
          W SL  P +   S  +   LC  +L  WNK +  GS+   ++  + ++  ++  S   +   ++R  E KL+ LL  EE YWK RSR +WL  GDRNT
Subjt:  NHWNSL--PRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEI-KMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNT

Query:  KWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNP
        K+FH+KA+ RK+ N I      +G  ++ EE I  EI  Y   +FSS +P
Subjt:  KWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNP

KAF7825238.1 ribonuclease H [Senna tora]    1.2e-67    27.43
Query:  ISSRGRGRGSVGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSSSD--WWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDG
        +  RGRG+    ++GGL L+WK E+ + + SFS+ HID  IK  S    WRFTGFYG P    +K SW+L++ L+  SNLPW+  GDFNEI+   EK  G
Subjt:  ISSRGRGRGSVGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSSSD--WWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDG

Query:  AERNQRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPI-RF
        A +N R M  FR  LD C L D+G     +TW     G   I ERLD+  A+ E  SLF  + + H  +  S H  ++ +  ++  + + +R+A  + RF
Subjt:  AERNQRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPI-RF

Query:  EGSWLAHEECKDIIKNHWNSLPRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKE------NEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEI
        E SW   + C +++   W +   S       K+A C  + +  N     GSI+  IK  E       +I++  A   N+ D     A+ +LD LL+ EEI
Subjt:  EGSWLAHEECKDIIKNHWNSLPRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKE------NEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEI

Query:  YWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSN---------------------------PNREAK
         W+ RSR  WL  GD NTK+FH KA+QR+  N I+   + +    +D   I   IT Y  NLF +SN                            N E K
Subjt:  YWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSN---------------------------PNREAK

Query:  A---------------------------------------------------------NTIEGMSMKISEAQKD---------LHNEQGNWDDNTIKEVF
        A                                                         +++ G S  IS++  D         +H++   W+   I E+F
Subjt:  A---------------------------------------------------------NTIEGMSMKISEAQKD---------LHNEQGNWDDNTIKEVF

Query:  LPEDAKAIREIPRARLADGDEIIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDAN
        LP +AK I+ IP +     D ++W  +  G + V+SAYH   N    D+   S++S   + W  +W      + K+ +W++ +  I + +N+  +G+  +
Subjt:  LPEDAKAIREIPRARLADGDEIIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDAN

Query:  NLCCLCRANQGTITHVFGHSR-------GDPIK--------------VWGQLQTSLSEEEMNGAILTLWNLWNARNLSKINNK----QPDLNQIIRAIQN
          C  C     + TH+F             P+                W +L    S E +    L  W++W  RN      K    +  L++  R I++
Subjt:  NLCCLCRANQGTITHVFGHSR-------GDPIK--------------VWGQLQTSLSEEEMNGAILTLWNLWNARNLSKINNK----QPDLNQIIRAIQN

Query:  NIEESERFLKKAQPPPPHFENHSSHLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAGCKKVHKKWPIKCLEARAMIEGL-LAYEKFENFE
          + + R  +   P        ++ +W P  PN  K+N+DA     D   G+    RDS G+LI     K          EA A+   + +AY       
Subjt:  NIEESERFLKKAQPPPPHFENHSSHLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAGCKKVHKKWPIKCLEARAMIEGL-LAYEKFENFE

Query:  GRRCKLPLVVESDSVEVVGAVNREIEDQSELCFFVGEIQALS
             L +V ESD + V+   ++  +D S L   + +   LS
Subjt:  GRRCKLPLVVESDSVEVVGAVNREIEDQSELCFFVGEIQALS

PWA36168.1 hypothetical protein CTI12_AA602590 [Artemisia annua]    5.1e-63    26.91
Query:  DTVIKSSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGS
        D V+K   ++WR TG YG P   ++  +W L+  L       W+  GDFNEI++A EK      N   M AFR     CNL D  A     TW    +G+
Subjt:  DTVIKSSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGS

Query:  SIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIKNHW----NSLPRSVPCSFGNKMALC
          + +RLDRFL N+    L+ +    +L +  S+H PI+       +S  V ++    RFE  WL  +    ++++ W     +  +  PC     ++ C
Subjt:  SIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIKNHW----NSLPRSVPCSFGNKMALC

Query:  MIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEG
          +LS WNKR   G ++ +IK K+  ++ +Q+  D     + +   +++  LL  EE+ WK RSR +WL  GD+NT++FH++AS R++ N I      +G
Subjt:  MIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEG

Query:  AWVNDEEQIGVEITRYISNLFSSSNP----------NREAKANTIEGMSMKISEAQ-KDLHNEQGN-WDDNTIKEVFLPEDAKAIR--EIPRARLADGDE
         WV +  ++   ++ Y S+LFSSS+P          +R    N  + +   ++ ++ +DL N +G+ W+   +  +F    A  I    I ++R    D 
Subjt:  AWVNDEEQIGVEITRYISNLFSSSNP----------NREAKANTIEGMSMKISEAQ-KDLHNEQGN-WDDNTIKEVFLPEDAKAIR--EIPRARLADGDE

Query:  IIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHV-----
        + W+++P G F  KSAY LA         + + ++     W+ +WKA+  S+ K+ +W+  NN +PT  N+ ++G++  + C  C      + HV     
Subjt:  IIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHV-----

Query:  ----------FG--HSRGDPIKVWGQLQTSLSE--EEMNGAILTLWNLWNARNL---SKINNKQPDLNQIIRAIQNNIEESERFLKKAQPPPPHFENHSS
                  FG  +     I      Q  L +   E    ++ LW LW  RN     ++N ++ ++  I +++ ++  ++ +    +     H  N  +
Subjt:  ----------FG--HSRGDPIKVWGQLQTSLSE--EEMNGAILTLWNLWNARNL---SKINNKQPDLNQIIRAIQNNIEESERFLKKAQPPPPHFENHSS

Query:  HLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAG
         +W   D    K+N DA W  + G  G+ +  R+  G ++ +G
Subjt:  HLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAG

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]    4.8e-61    34.31
Query:  GSVGKSGGLVLMWKDELNLTITSFSIGHIDTVIKS-SSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLME
        G  G  GGL L+W +E+++ I S+S  HID VI       WR +G YG+P   +++ +W L+ RL+G+   PW+  GDFNEIL   EK+ G +RN  ++ 
Subjt:  GSVGKSGGLVLMWKDELNLTITSFSIGHIDTVIKS-SSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLME

Query:  AFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATP-IRFEGSWLAHEE
        AFR  ++ CNL+DLG     FTW  +  G  +I ERLDRFL + +      N+ + +L+   S+H P++  + E+       + ++P + +E  W  +E 
Subjt:  AFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATP-IRFEGSWLAHEE

Query:  CKDIIKNHW---NSLPRSVPCSFGNKMA-LCMIKLSKWNKRRLNGSIK--SAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGD
        CK+I+K  W    S  +  P +   K +  C+ +L  W++   +G  +    +K+K +E+K      + +++IK    E++++ +L +EE+YWK RSR D
Subjt:  CKDIIKNHW---NSLPRSVPCSFGNKMA-LCMIKLSKWNKRRLNGSIK--SAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGD

Query:  WLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKIS
        WL  GD+NTK+FHSKAS RK+ N+I G  +    WV+D E +  +   Y ++LF++S+P+ +     + G+  +++
Subjt:  WLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKIS

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]    3.1e-15    31.51
Query:  IEGMSMKISEAQKDLHNEQGNWDDNTIKEVFLPEDAKAIREIPRARLADGDEIIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAK
        I   SM       +L +E+  W ++ I + F PEDA+AI +IP  +    D++IW++D KG + VKS Y +A  +   ++PS SN+ ++  +W+ IWK  
Subjt:  IEGMSMKISEAQKDLHNEQGNWDDNTIKEVFLPEDAKAIREIPRARLADGDEIIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAK

Query:  CSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITH
           + KI +W+  ++++PT  N+  K +    +C  C  +  T++H
Subjt:  CSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITH

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]    2.0e-59    37.46
Query:  VGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSS-SDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAF
        V   GGL L+WK   ++ + SFS  HID ++     D WRFTGFYG+P    R+ SW L+  L+   +LPW+  GDFNEI   EEK+   +R +R M+ F
Subjt:  VGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSS-SDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAF

Query:  RLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKD
        R  LD C L DLG     FTW  +  G+  +  RLDR +A  E    F    I HL+   S+H PIL     +   K   R   P RFE  W+  + C+ 
Subjt:  RLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKD

Query:  IIKNHWNS--LPRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLND-IKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGD
        +I++ W +  L  S+   F NK+      L  WN++   G +++++ +K  E+K+++ S   ++D  ++ L   +++ L  +EE  WK RSR  WL  GD
Subjt:  IIKNHWNS--LPRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLND-IKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGD

Query:  RNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNR
        RNT +FH +A+QR K N I G  +  G WV+ EE +G  +  Y  N+F+SSNP++
Subjt:  RNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNR

TrEMBL top hits    e value    % identity    Alignment
A0A2U1KHJ0 CCHC-type domain-containing protein    2.5e-63    26.91
Query:  DTVIKSSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGS
        D V+K   ++WR TG YG P   ++  +W L+  L       W+  GDFNEI++A EK      N   M AFR     CNL D  A     TW    +G+
Subjt:  DTVIKSSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGS

Query:  SIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIKNHW----NSLPRSVPCSFGNKMALC
          + +RLDRFL N+    L+ +    +L +  S+H PI+       +S  V ++    RFE  WL  +    ++++ W     +  +  PC     ++ C
Subjt:  SIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIKNHW----NSLPRSVPCSFGNKMALC

Query:  MIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEG
          +LS WNKR   G ++ +IK K+  ++ +Q+  D     + +   +++  LL  EE+ WK RSR +WL  GD+NT++FH++AS R++ N I      +G
Subjt:  MIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEG

Query:  AWVNDEEQIGVEITRYISNLFSSSNP----------NREAKANTIEGMSMKISEAQ-KDLHNEQGN-WDDNTIKEVFLPEDAKAIR--EIPRARLADGDE
         WV +  ++   ++ Y S+LFSSS+P          +R    N  + +   ++ ++ +DL N +G+ W+   +  +F    A  I    I ++R    D 
Subjt:  AWVNDEEQIGVEITRYISNLFSSSNP----------NREAKANTIEGMSMKISEAQ-KDLHNEQGN-WDDNTIKEVFLPEDAKAIR--EIPRARLADGDE

Query:  IIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHV-----
        + W+++P G F  KSAY LA         + + ++     W+ +WKA+  S+ K+ +W+  NN +PT  N+ ++G++  + C  C      + HV     
Subjt:  IIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHV-----

Query:  ----------FG--HSRGDPIKVWGQLQTSLSE--EEMNGAILTLWNLWNARNL---SKINNKQPDLNQIIRAIQNNIEESERFLKKAQPPPPHFENHSS
                  FG  +     I      Q  L +   E    ++ LW LW  RN     ++N ++ ++  I +++ ++  ++ +    +     H  N  +
Subjt:  ----------FG--HSRGDPIKVWGQLQTSLSE--EEMNGAILTLWNLWNARNL---SKINNKQPDLNQIIRAIQNNIEESERFLKKAQPPPPHFENHSS

Query:  HLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAG
         +W   D    K+N DA W  + G  G+ +  R+  G ++ +G
Subjt:  HLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAG

A0A7J6GL46 CCHC-type domain-containing protein    3.8e-64    39.43
Query:  SGGLVLMWKDELNLTITSFSIGHIDTVIK-SSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLV
        SGGL+L+W D+  +++ SF+ GHID ++K    + WRFTGFYGNP  + R ESW L+ RL  + +LPW+ GGDFNEIL   EK  G++R+   M  F+  
Subjt:  SGGLVLMWKDELNLTITSFSIGHIDTVIK-SSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFRLV

Query:  LDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIK
        LDRC+L DLG     FTW+ K +G + + ERLDR+  N     LF  + + + +   S+H PI+A++      +   ++    RFE  WL   EC++II 
Subjt:  LDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECKDIIK

Query:  NHWNSL--PRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEI-KMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNT
          W SL  P +   S  +   LC  +L  WNK +  GS+   ++  + ++  ++  S   +   ++R  E KL+ LL  EE YWK RSR +WL  GDRNT
Subjt:  NHWNSL--PRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEI-KMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNT

Query:  KWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNP
        K+FH+KA+ RK+ N I      +G  ++ EE I  EI  Y   +FSS +P
Subjt:  KWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNP

A0A803NMB3 Uncharacterized protein    1.2e-60    35.08
Query:  VGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSSSD-WWRFTGFYGNPIPNKRKESWELMERLSGIS-NLPWMIGGDFNEILFAEEKVDGAERNQRLMEA
        VG SGGL+L+W+D +++T+  + + + D  +K+ SD    FT FYG+P    ++ SW L++RL  ++  LPW+  GDFNEI     K  G+ RN++ ME 
Subjt:  VGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSSSD-WWRFTGFYGNPIPNKRKESWELMERLSGIS-NLPWMIGGDFNEILFAEEKVDGAERNQRLMEA

Query:  FRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECK
        FR VLD C+L +       +TWIK     + + ERLD    NN     FN     HL+ + S+H  I  +V   G + +  +R +  RFE  WL+  ECK
Subjt:  FRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECK

Query:  DIIKNHWNSLPRSVPC-SFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASN----DNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLN
        ++I   WN    + P     + +  C   L +W++ +  GS+K  I   ++++ ++   N    +N+N++K   AE  LD LLE+EEIYW+ RSR DWL+
Subjt:  DIIKNHWNSLPRSVPC-SFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASN----DNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLN

Query:  WGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKISEAQKDLHNE
         GDRNTK+FH+KAS RK +NKIK   N  G  V+ +  +   +  Y +++F++ + + ++ + T   +   ++    D++NE
Subjt:  WGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKISEAQKDLHNE

A0A803NQ77 Uncharacterized protein    9.5e-63    23.65
Query:  VGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSSS-DWWRFTGFYGNPIPNKRKESWELMERL-SGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEA
        +G  GGL+L+W   +++T+ ++S+ H D ++ S +   + FTGFYG P  + R  SW  +  L S   N+PW++ GDFNE+L  ++K  G  RN  LM  
Subjt:  VGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSSS-DWWRFTGFYGNPIPNKRKESWELMERL-SGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEA

Query:  FRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECK
        FR  +D C+L  +    + FTW  K    +II ERLDR   N+    +F   +I+HL+ + S+H  I  +V+         +R +  RFE  WL   +C 
Subjt:  FRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSWLAHEECK

Query:  DIIKNHWNSLPRSVPC-SFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDI-KLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGD
         II  HWNS     P  S    +A C   L  W++ +     +      ++  K+    +   + I ++  A++ LD LLE+EE+YW  R+R DWL  GD
Subjt:  DIIKNHWNSLPRSVPC-SFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDI-KLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGD

Query:  RNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKISEAQK--------------------------
         NTK+FHS+A  R   NKIK   +S G +V+ EE+I  EI +Y S LFSS+  + EA  + I  +   I+  Q                           
Subjt:  RNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKISEAQK--------------------------

Query:  ------------------------------------------------------------DLHNEQGN--WDDNT-------------------------
                                                                     L +   N  W  NT                         
Subjt:  ------------------------------------------------------------DLHNEQGN--WDDNT-------------------------

Query:  -----------IKEVFLPE----------------------------------------------------------------------------DAKAI
                    K+  LP                                                                             D + I
Subjt:  -----------IKEVFLPE----------------------------------------------------------------------------DAKAI

Query:  REIPRARLADGDEIIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRA
          IP +     D++IW H   GI+ VKS Y LA +L ++   S S  S ++  W  +W  +   + KI +W+ IN  +PT +N+ ++ I ++N C LC+ 
Subjt:  REIPRARLADGDEIIWNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRA

Query:  NQGTITH-VFGHSRGDPIKVWGQLQTSL---------------------SEEEMNGAILTLWNLWNARNLSKINNKQPDLNQIIRAIQNNIEESERFLKK
        +  T  H +F   R     VW Q Q  +                     ++ E+   +  +WN+W+ RN     +K   ++ +     + +++  +    
Subjt:  NQGTITH-VFGHSRGDPIKVWGQLQTSL---------------------SEEEMNGAILTLWNLWNARNLSKINNKQPDLNQIIRAIQNNIEESERFLKK

Query:  AQ-------------PPPPHFENHSSHLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAGCKKVHKKWPIKCLEARAMIEGLL
         Q             PP     + +   W+      +KLN DA      G  G    IRD  G ++    K     +  K +EA A+   L+
Subjt:  AQ-------------PPPPHFENHSSHLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAGCKKVHKKWPIKCLEARAMIEGLL

A0A803PRV5 Uncharacterized protein    5.2e-61    36.07
Query:  GKSGGLVLMWKDELNLTITSFSIGHIDTVIK-SSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFR
        GKSGGL+L+W  ++   + S+S  HID+ I+  +  WWRFTGFYG+P P++R  SW+L++RL+ +   PW++GGDFNEIL  +EK+ G  +   L+  FR
Subjt:  GKSGGLVLMWKDELNLTITSFSIGHIDTVIK-SSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILFAEEKVDGAERNQRLMEAFR

Query:  LVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGIS-KEVSRRATPIRFEGSWLAHEECKD
          LDRC L D+G   N +TW    K + +I ERLDR   N++   +F ++ +RHL++  S+H P+L    +  +  +  +R  T   FE +W   EEC  
Subjt:  LVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGIS-KEVSRRATPIRFEGSWLAHEECKD

Query:  IIKNHWNSLPRSVPCSFGNKMALCMI--KLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDR
        I+++ W    ++     G K  L  I   L +WNK +    +   +K  E++I ++  S +  +   L   E+K ++ L++EE +WK RSR  WL  GD+
Subjt:  IIKNHWNSLPRSVPCSFGNKMALCMI--KLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDR

Query:  NTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKISEAQKDLHNEQGNWDDNTIKEVFLPEDAKAIR
        NTK+FH KAS RK  N IKG  +    W+ + + +G     Y   LF+S  PN+E        +  ++S+A  D   E       T +EVF     KA+R
Subjt:  NTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKISEAQKDLHNEQGNWDDNTIKEVFLPEDAKAIR

Query:  EI
        +I
Subjt:  EI

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT1G40390.1 DNAse I-like superfamily protein    1.2e-04    30.53
Query:  KRKESWELMERLSGIS---NLPWMIGGDFNEILFAEEKVDGAERNQRL--MEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLAN
        +R+  W+ + RLS  S   N PW++ GDFN+I    E       N  L  +E  +  +   +LVDL     ++TW    + + I+  +LDR + N
Subjt:  KRKESWELMERLSGIS---NLPWMIGGDFNEILFAEEKVDGAERNQRL--MEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLAN

AT1G43760.1 DNAse I-like superfamily protein    6.8e-05    23.34
Query:  GDFNEILFAEEKVDGAERN--QRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASV--
        GDF++I    +     + +   R +E F+  L   +LVD+ +    +TW      + II  +LDR +AN +  S F +          S+H P +  +  
Subjt:  GDFNEILFAEEKVDGAERN--QRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASV--

Query:  ---AEKGISKEVSRRATPIRFEGSWLAHEECKDIIKNHWNSLPRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQAS-----NDNLN
             K   +  S  +T   F  S     E +  + +H  SL   +  +          K  K   R+  G+I+   K   + ++ IQ+      +D+L 
Subjt:  ---AEKGISKEVSRRATPIRFEGSWLAHEECKDIIKNHWNSLPRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQAS-----NDNLN

Query:  DIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSN
         ++  +A KK +      E +++ +SR  WL  GD NT++FH      +  N IK     +   V +  Q+   I  Y ++L  S +
Subjt:  DIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNTKWFHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSN

AT3G09510.1 Ribonuclease H-like superfamily protein    7.0e-10    21.94
Query:  WDDNTIKEVFLPEDAKAIREIPRARLADGDEIIWNHDPKGIFLVKSAYHL-----AKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNI
        WDD+ I +     D   I  I  A+    D+IIWN++  G + V+S Y L     + N+ + + P GS + K++     IW      + K  +W+ ++  
Subjt:  WDDNTIKEVFLPEDAKAIREIPRARLADGDEIIWNHDPKGIFLVKSAYHL-----AKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNI

Query:  IPTKINIINKGIDANNLCCLCRANQGTITH--------VFGHSRGDPIKVWGQLQTSLSEEEMNG-----------------AILTLWNLWNARNLSKIN
        + T   +  +G+  +  C  C     +I H               D   +  QL ++  EE ++                   +  +W +W ARN    N
Subjt:  IPTKINIINKGIDANNLCCLCRANQGTITH--------VFGHSRGDPIKVWGQLQTSLSEEEMNG-----------------AILTLWNLWNARNLSKIN

Query:  NKQPDLNQIIRAIQNNIEE--SERFLKKAQPPPPHFENHSSHLWKPSDPNFWKLNSDATWFD---KDGVGGIAWSIRDSNGSLIGAGCKKV-HKKWPIKC
          +   ++ + + +    +  +     K  P P      +   W+     + K N DA  FD    +  GG  W IR+  G+ I  G  K+ H   P++ 
Subjt:  NKQPDLNQIIRAIQNNIEE--SERFLKKAQPPPPHFENHSSHLWKPSDPNFWKLNSDATWFD---KDGVGGIAWSIRDSNGSLIGAGCKKV-HKKWPIKC

Query:  LEARAMIEGL
         E +A++  L
Subjt:  LEARAMIEGL

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    7.5e-04    34.62
Query:  IWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHV
        IW  K S + K+++WK +NN +P    ++++ I     C  CR  + TITH+
Subjt:  IWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHV
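
The % identity value of a hit can be cross-checked against its alignment block. The sketch below assumes the usual BLAST-style convention for the unlabeled middle line: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical, and the aligned length is taken to be the length of the Query line including any gap characters.

```python
def percent_identity(query, midline):
    """Percent identity from a pairwise alignment's query line and midline.

    ASSUMPTION: letters in the midline mark identical residues; "+" and
    spaces do not count as identities. The aligned length is the length
    of the query line, including any "-" gap characters.
    """
    aligned_length = len(query)
    if aligned_length == 0:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Query line and midline of the AT3G26855.1 alignment above,
# with the "Query:" label and leading padding removed.
query = "IWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHV"
midline = "IW  K S + K+++WK +NN +P    ++++ I     C  CR  + TITH+"

print(percent_identity(query, midline))
```

For this block the midline contains 18 letters over 52 aligned positions, which gives 34.62 and agrees with the value listed for AT3G26855.1.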


Sequences
CDS sequence
ATGAATAGTGAAGGGCAACCACCTGGTTTGCCATTGACGGATCCTCGGGTGATCTTGCGGGTCGATTTCCGACGTGAAGCTGAGAAGATGGATAACTCAATCGAAGTGGA
AGAAGTTCAAAGACAGCTAGAGAACCTTGGGCTGGAGGAGGAAGAAAAGGGAAGAGTGCGCTTGGAAATTCTATCGGAAAGTTCGAAGAAGCGGAAACTGATGAATATGG
AAAAATGGAGGGAGATACACTCAGAGTTAAAAAGGCTGGGGCATGTGATCCATGAATGCGACGAAGAAGAAGAAGAGGCGAGTGAGAAACTGAAATATGGAGTCGAATTA
AGAAATACCCAGGGAAGTAAAGGCTTTTATAGGAACAAGAAGACCGAGGTGAATGAAATCAGCTCTAGAGGGAGGGGAAGAGGCAGCGTTGGTAAGAGTGGTGGCTTAGT
TCTGATGTGGAAAGATGAGTTAAATCTCACTATTACCTCTTTCTCTATTGGTCATATAGACACTGTGATTAAAAGTTCTAGTGATTGGTGGCGGTTTACCGGTTTCTATG
GAAATCCTATCCCGAACAAGAGAAAAGAATCTTGGGAGCTCATGGAGAGATTGAGTGGTATCTCTAATCTCCCCTGGATGATAGGAGGGGATTTCAATGAGATTCTCTTC
GCTGAGGAGAAGGTAGATGGAGCGGAGAGAAACCAAAGGCTCATGGAGGCATTTCGGTTGGTGTTGGATAGGTGCAATCTCGTTGACTTGGGAGCCCCTAGGAATATGTT
CACTTGGATTAAGAAGGTGAAAGGCAGTTCGATCATTGCTGAAAGACTCGACCGCTTCCTAGCTAACAATGAGATGAGATCCTTGTTTAACAATATTGATATCCGTCACC
TGAACAAGCATGGTTCGAATCACCACCCTATTTTGGCCTCGGTGGCAGAGAAGGGGATCTCTAAAGAGGTGTCTAGAAGAGCAACCCCTATTAGATTTGAAGGTAGTTGG
CTGGCTCATGAAGAGTGCAAAGATATTATTAAGAATCATTGGAACTCCCTTCCTAGATCTGTCCCCTGTAGTTTCGGTAACAAAATGGCCTTGTGTATGATCAAGCTTAG
CAAATGGAACAAGCGTAGGCTGAATGGCTCCATCAAATCGGCCATCAAAAGAAAAGAGAATGAGATTAAAATGATTCAGGCCAGCAATGATAATCTTAATGACATCAAGC
TTAGACTTGCAGAGAAAAAGCTTGATATGCTCCTGGAGGAGGAAGAAATCTACTGGAAATTCAGATCCCGAGGGGACTGGCTGAATTGGGGGGACAGAAACACTAAATGG
TTCCACTCTAAGGCTAGCCAGAGAAAGAAGCACAATAAAATCAAAGGCTTCTATAATAGTGAAGGTGCGTGGGTGAACGATGAGGAACAGATAGGGGTAGAAATAACTCG
ATACATTTCAAACCTCTTCTCTTCTTCAAATCCTAACAGGGAAGCCAAAGCAAATACCATAGAGGGGATGTCTATGAAAATTTCGGAGGCCCAAAAGGACCTTCATAATG
AGCAAGGCAACTGGGATGATAACACGATAAAGGAGGTTTTCCTCCCTGAAGATGCTAAAGCGATCAGAGAGATTCCTAGAGCTCGGCTTGCCGATGGGGATGAGATCATT
TGGAACCATGATCCAAAAGGGATTTTCTTGGTTAAAAGCGCTTATCATTTAGCCAAGAATCTCAGTTCTAGAGACAACCCCTCGGGATCGAATAATTCCAAGTCCAAAGC
CATTTGGAAGTCGATTTGGAAGGCTAAATGCTCTTCAAGAGCTAAAATAGTGGTCTGGAAGATCATTAACAATATTATCCCTACTAAAATCAATATCATTAACAAAGGGA
TCGATGCTAACAATTTGTGTTGTTTATGCAGGGCCAATCAGGGGACAATCACTCACGTTTTTGGACATAGTAGGGGAGATCCTATAAAGGTTTGGGGCCAGTTGCAAACC
TCTCTATCAGAGGAAGAAATGAATGGAGCAATCCTAACACTATGGAACCTGTGGAATGCGAGAAATTTGAGCAAAATCAACAACAAACAGCCAGATCTCAACCAAATTAT
CAGAGCAATCCAAAACAACATAGAAGAATCAGAAAGATTTTTAAAGAAAGCTCAGCCCCCCCCCCCCCATTTCGAGAACCATTCGAGTCACCTGTGGAAGCCGTCGGATC
CCAACTTTTGGAAGCTGAATTCAGACGCCACCTGGTTCGACAAAGATGGAGTGGGTGGAATTGCATGGTCCATTCGTGACTCCAACGGGTCTCTTATTGGAGCGGGTTGT
AAGAAAGTTCATAAGAAATGGCCGATCAAATGTCTGGAAGCGAGAGCTATGATCGAAGGGCTGTTAGCGTACGAGAAGTTTGAAAATTTCGAGGGAAGAAGATGTAAGCT
ACCTCTGGTTGTTGAATCAGATTCCGTCGAGGTCGTGGGTGCGGTTAATCGCGAGATTGAAGACCAATCGGAGCTGTGCTTCTTCGTCGGTGAGATTCAGGCGTTGTCGT
CGTCGATGGGTATCTCTTCCTTCTCCAAATGA
mRNA sequence
ATGAATAGTGAAGGGCAACCACCTGGTTTGCCATTGACGGATCCTCGGGTGATCTTGCGGGTCGATTTCCGACGTGAAGCTGAGAAGATGGATAACTCAATCGAAGTGGA
AGAAGTTCAAAGACAGCTAGAGAACCTTGGGCTGGAGGAGGAAGAAAAGGGAAGAGTGCGCTTGGAAATTCTATCGGAAAGTTCGAAGAAGCGGAAACTGATGAATATGG
AAAAATGGAGGGAGATACACTCAGAGTTAAAAAGGCTGGGGCATGTGATCCATGAATGCGACGAAGAAGAAGAAGAGGCGAGTGAGAAACTGAAATATGGAGTCGAATTA
AGAAATACCCAGGGAAGTAAAGGCTTTTATAGGAACAAGAAGACCGAGGTGAATGAAATCAGCTCTAGAGGGAGGGGAAGAGGCAGCGTTGGTAAGAGTGGTGGCTTAGT
TCTGATGTGGAAAGATGAGTTAAATCTCACTATTACCTCTTTCTCTATTGGTCATATAGACACTGTGATTAAAAGTTCTAGTGATTGGTGGCGGTTTACCGGTTTCTATG
GAAATCCTATCCCGAACAAGAGAAAAGAATCTTGGGAGCTCATGGAGAGATTGAGTGGTATCTCTAATCTCCCCTGGATGATAGGAGGGGATTTCAATGAGATTCTCTTC
GCTGAGGAGAAGGTAGATGGAGCGGAGAGAAACCAAAGGCTCATGGAGGCATTTCGGTTGGTGTTGGATAGGTGCAATCTCGTTGACTTGGGAGCCCCTAGGAATATGTT
CACTTGGATTAAGAAGGTGAAAGGCAGTTCGATCATTGCTGAAAGACTCGACCGCTTCCTAGCTAACAATGAGATGAGATCCTTGTTTAACAATATTGATATCCGTCACC
TGAACAAGCATGGTTCGAATCACCACCCTATTTTGGCCTCGGTGGCAGAGAAGGGGATCTCTAAAGAGGTGTCTAGAAGAGCAACCCCTATTAGATTTGAAGGTAGTTGG
CTGGCTCATGAAGAGTGCAAAGATATTATTAAGAATCATTGGAACTCCCTTCCTAGATCTGTCCCCTGTAGTTTCGGTAACAAAATGGCCTTGTGTATGATCAAGCTTAG
CAAATGGAACAAGCGTAGGCTGAATGGCTCCATCAAATCGGCCATCAAAAGAAAAGAGAATGAGATTAAAATGATTCAGGCCAGCAATGATAATCTTAATGACATCAAGC
TTAGACTTGCAGAGAAAAAGCTTGATATGCTCCTGGAGGAGGAAGAAATCTACTGGAAATTCAGATCCCGAGGGGACTGGCTGAATTGGGGGGACAGAAACACTAAATGG
TTCCACTCTAAGGCTAGCCAGAGAAAGAAGCACAATAAAATCAAAGGCTTCTATAATAGTGAAGGTGCGTGGGTGAACGATGAGGAACAGATAGGGGTAGAAATAACTCG
ATACATTTCAAACCTCTTCTCTTCTTCAAATCCTAACAGGGAAGCCAAAGCAAATACCATAGAGGGGATGTCTATGAAAATTTCGGAGGCCCAAAAGGACCTTCATAATG
AGCAAGGCAACTGGGATGATAACACGATAAAGGAGGTTTTCCTCCCTGAAGATGCTAAAGCGATCAGAGAGATTCCTAGAGCTCGGCTTGCCGATGGGGATGAGATCATT
TGGAACCATGATCCAAAAGGGATTTTCTTGGTTAAAAGCGCTTATCATTTAGCCAAGAATCTCAGTTCTAGAGACAACCCCTCGGGATCGAATAATTCCAAGTCCAAAGC
CATTTGGAAGTCGATTTGGAAGGCTAAATGCTCTTCAAGAGCTAAAATAGTGGTCTGGAAGATCATTAACAATATTATCCCTACTAAAATCAATATCATTAACAAAGGGA
TCGATGCTAACAATTTGTGTTGTTTATGCAGGGCCAATCAGGGGACAATCACTCACGTTTTTGGACATAGTAGGGGAGATCCTATAAAGGTTTGGGGCCAGTTGCAAACC
TCTCTATCAGAGGAAGAAATGAATGGAGCAATCCTAACACTATGGAACCTGTGGAATGCGAGAAATTTGAGCAAAATCAACAACAAACAGCCAGATCTCAACCAAATTAT
CAGAGCAATCCAAAACAACATAGAAGAATCAGAAAGATTTTTAAAGAAAGCTCAGCCCCCCCCCCCCCATTTCGAGAACCATTCGAGTCACCTGTGGAAGCCGTCGGATC
CCAACTTTTGGAAGCTGAATTCAGACGCCACCTGGTTCGACAAAGATGGAGTGGGTGGAATTGCATGGTCCATTCGTGACTCCAACGGGTCTCTTATTGGAGCGGGTTGT
AAGAAAGTTCATAAGAAATGGCCGATCAAATGTCTGGAAGCGAGAGCTATGATCGAAGGGCTGTTAGCGTACGAGAAGTTTGAAAATTTCGAGGGAAGAAGATGTAAGCT
ACCTCTGGTTGTTGAATCAGATTCCGTCGAGGTCGTGGGTGCGGTTAATCGCGAGATTGAAGACCAATCGGAGCTGTGCTTCTTCGTCGGTGAGATTCAGGCGTTGTCGT
CGTCGATGGGTATCTCTTCCTTCTCCAAATGA
Protein sequence
MNSEGQPPGLPLTDPRVILRVDFRREAEKMDNSIEVEEVQRQLENLGLEEEEKGRVRLEILSESSKKRKLMNMEKWREIHSELKRLGHVIHECDEEEEEASEKLKYGVEL
RNTQGSKGFYRNKKTEVNEISSRGRGRGSVGKSGGLVLMWKDELNLTITSFSIGHIDTVIKSSSDWWRFTGFYGNPIPNKRKESWELMERLSGISNLPWMIGGDFNEILF
AEEKVDGAERNQRLMEAFRLVLDRCNLVDLGAPRNMFTWIKKVKGSSIIAERLDRFLANNEMRSLFNNIDIRHLNKHGSNHHPILASVAEKGISKEVSRRATPIRFEGSW
LAHEECKDIIKNHWNSLPRSVPCSFGNKMALCMIKLSKWNKRRLNGSIKSAIKRKENEIKMIQASNDNLNDIKLRLAEKKLDMLLEEEEIYWKFRSRGDWLNWGDRNTKW
FHSKASQRKKHNKIKGFYNSEGAWVNDEEQIGVEITRYISNLFSSSNPNREAKANTIEGMSMKISEAQKDLHNEQGNWDDNTIKEVFLPEDAKAIREIPRARLADGDEII
WNHDPKGIFLVKSAYHLAKNLSSRDNPSGSNNSKSKAIWKSIWKAKCSSRAKIVVWKIINNIIPTKINIINKGIDANNLCCLCRANQGTITHVFGHSRGDPIKVWGQLQT
SLSEEEMNGAILTLWNLWNARNLSKINNKQPDLNQIIRAIQNNIEESERFLKKAQPPPPHFENHSSHLWKPSDPNFWKLNSDATWFDKDGVGGIAWSIRDSNGSLIGAGC
KKVHKKWPIKCLEARAMIEGLLAYEKFENFEGRRCKLPLVVESDSVEVVGAVNREIEDQSELCFFVGEIQALSSSMGISSFSK
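
The relationship between the CDS sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; the page does not say which translation table was used. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are reported as "X".
    """
    sequence = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(sequence) - len(sequence) % 3, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last three codons of the CDS listed above.
print(translate("ATGAATAGTGAAGGGCAACCACCT"))
print(translate("TCCAAATGA"))
```

The first call yields MNSEGQPP, the start of the protein sequence above. The second yields SK, its final two residues, with the trailing TGA read as the stop codon.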