; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033861 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033861
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold13:32740688..32753036
RNA-Seq ExpressionSpg033861
SyntenySpg033861
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2317147.1 hypothetical protein GH714_012179 [Hevea brasiliensis]4.6e-4624.97Show/hide
Query:  LGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERL
        LG WR TGFYG P  ++R  SWAL+  L + +  PW+  GDF  I++ +EK+GG       +  F +A+ +  L DV  +G K+T    R   + ++ +L
Subjt:  LGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERL

Query:  DRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTR
        DRF  N          +   L++ SSDH PI+     +  +   +   R+ RF   W  +   ++I+EE W+     +  ++  K+  C   L +W    
Subjt:  DRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTR

Query:  LKGSLKHAISAKEQEVQNLYIEDQDQNWEEILQAEIELDDLLDEEEEYWRIR--EDRAGKPTQRRNRPKGRANLARPKWSASALGRGRALWSASAQGRGR
        L+   K  I   ++ +  L  +    + E    A+ +  DLL  +E YWR R  E    +  Q       +A + + K     L      W   + G   
Subjt:  LKGSLKHAISAKEQEVQNLYIEDQDQNWEEILQAEIELDDLLDEEEEYWRIR--EDRAGKPTQRRNRPKGRANLARPKWSASALGRGRALWSASAQGRGR

Query:  PFGPLARAELGHLLSIPNVASRPGFAWFVPKRLRIPKNPRSMS----SIGGGVASTTPVCRFSLLQATS--SPSSTNLPLV-----AREGQNILAIPLGN
                +    +   N  S       VP  +   +N   M      +   + +TT    F +   +   +  + N+ LV      R+ + IL+IPL  
Subjt:  PFGPLARAELGHLLSIPNVASRPGFAWFVPKRLRIPKNPRSMS----SIGGGVASTTPVCRFSLLQATS--SPSSTNLPLV-----AREGQNILAIPLGN

Query:  SRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVLCRGAVESSGH
        S   D   W +D +G +SV S Y L       T+ S +   ++ K W+ +W +    + +   WR V   +P++  L ++ +  + +C LC    ES  H
Subjt:  SRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVLCRGAVESSGH

Query:  IFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKIGIMIWTIWSFRNQVSVNKIKADEQKLSQWIQRNFEDQRKRTHCHLAEIR
        +   C    HVW         L  F       + W        +  ++  +  + W+IW  RN V              W       Q     C L  +R
Subjt:  IFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKIGIMIWTIWSFRNQVSVNKIKADEQKLSQWIQRNFEDQRKRTHCHLAEIR

Query:  LESLWNH------------------ECWSPPSQNLLKLNSDASWNEVVNSG--GIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQ
        L+  W+                   + W  P  N +KLN D + N  VNSG  GIG V+R+  GS V      +   +     EI  ++E L+    ++ 
Subjt:  LESLWNH------------------ECWSPPSQNLLKLNSDASWNEVVNSG--GIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQ

Query:  NRSLNLIVESDASEVIKLLNHEETD-LSEDKALLIDIESL--AVKARVLAFVKCPRLGNRVTHSLARAAAGFP
        N   N+IVESD  +V+ +LN    +  S    ++ D  SL   +   +L F    R  N V H+LA+A   FP
Subjt:  NRSLNLIVESDASEVIKLLNHEETD-LSEDKALLIDIESL--AVKARVLAFVKCPRLGNRVTHSLARAAAGFP

KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa]4.9e-4841.26Show/hide
Query:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE
        I+  +   N F V   GKS GL+L WN D+ +++ S+SVGHID  V    L  WRFTGFYGNP+   R  SW L+ RLK  F+ PWI GGDF EI+S NE
Subjt:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE

Query:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRG-NR
        KKGG +R+++ I  F +A+++C+L+D+G+ G  FT    R G   ++ERLDR+F N     L P+++V + +F  SDHRPI AI E  + V  RQ    +
Subjt:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRG-NR

Query:  IMRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEV
          RF   W+K  + + I+ + W       DN  ++     RC   L  WNKT+  GS+   ++ +E  V
Subjt:  IMRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEV

KAF4372682.1 hypothetical protein F8388_000849, partial [Cannabis sativa]2.4e-4741.27Show/hide
Query:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE
        I+  +   N F V   GKS GL+L WN D+ +++ S+SVGHID  V    L  WRFTGFYGNP+   R  SW L+ RLK  F+ PWI GGDF E++S NE
Subjt:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE

Query:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRI
        KKGG +R+++ I  F +A+++C+L+D+G+ G  FT    R G   ++ERLDR+F N     L P+++V + +F  SDHRPI AI E     S++    + 
Subjt:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRI

Query:  MRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTR
         RF   W+K  + + I+ + W       DN  ++     RC   L  WNKT+
Subjt:  MRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTR

KAF7825238.1 ribonuclease H [Senna tora]1.7e-6126.45Show/hide
Query:  KIKIDLQCDNVFCVPSKGKSK----GLMLFWNSDFGININSYSVGHIDNFVNDSLG--RWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIE
        +IK  L  D +  VP +G+ K    GL L W  +  I++ S+S+ HID  + D      WRFTGFYG P+   +  SW L++ L      PW+  GDF E
Subjt:  KIKIDLQCDNVFCVPSKGKSK----GLMLFWNSDFGININSYSVGHIDNFVNDSLG--RWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIE

Query:  IMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKR
        I+  +EKKGGA +N   +  F EA++ C L D+G++G  +T   GR G + I+ERLD+   +     L P + V H     S H  +V  ++  +  + R
Subjt:  IMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKR

Query:  QRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNL-YIEDQDQNWEEILQAEIELDDLLD
         +  R+ RF   W  +    ++++  W  EAG +  +   K+  C    +  N     GS++  I   E+ +  L  I+  D   + I  A+ ELD LL 
Subjt:  QRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNL-YIEDQDQNWEEILQAEIELDDLLD

Query:  EEEEYWRIREDRA-------------GKPTQRRNRPK-GRANLARPKWSASALGRGRALWS--------ASAQG-------------------RGRPFG-
         EE  WR R                  K  QRRNR    R    + +  +   G    + S        ++ +G                     RP+  
Subjt:  EEEEYWRIREDRA-------------GKPTQRRNRPK-GRANLARPKWSASALGRGRALWS--------ASAQG-------------------RGRPFG-

Query:  --PLARAELGHLLSIP----NVASRPGFAWFVPKRLRIPKNPRSMSSIGGGVASTT------PVCRFSLLQATSSPSSTNLPL-----------------
            A  +  H    P     +   P F W          +  +   +G G              R S + + SS  ++NL +                 
Subjt:  --PLARAELGHLLSIP----NVASRPGFAWFVPKRLRIPKNPRSMSSIGGGVASTT------PVCRFSLLQATSSPSSTNLPL-----------------

Query:  ---VAREGQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGL
           +  E + I +IPL     +D ++W  +  G +SV+S YH  +N  +LT++S S  S    +W  +W L++  + K+  WR+    I S +N+  +G+
Subjt:  ---VAREGQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGL

Query:  DISPLCVLCRGAVESSGHIFWRCKKVSHVW----PKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKIGIMIWTIWSFRNQ-VSVNKIKADEQKL
             C  C   VES  H+F RC K   +W      FFP L D ++F         W+ +     S E    I ++ W+IW  RN  V   K+   E  L
Subjt:  DISPLCVLCRGAVESSGHIFWRCKKVSHVW----PKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKIGIMIWTIWSFRNQ-VSVNKIKADEQKL

Query:  SQWIQ--RNFEDQRKRTHCHLAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAY
        S+  +  ++F D     +C  +E  + +      W PP  N +K+N+DA+    +   G+G V RDS G+L+          +   S  +A     L   
Subjt:  SQWIQ--RNFEDQRKRTHCHLAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAY

Query:  LTLR-------QNRSLNLIVESDASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLARAAAGF
        L LR        N  L+++ ESD   VI   +    D+S  + ++ D ++L+            R GNRV H +AR    F
Subjt:  LTLR-------QNRSLNLIVESDASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLARAAAGF

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]1.7e-4539.6Show/hide
Query:  KTWKRLAKNEPMQQDSMASQSGKRHGSVKIKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDSLGRWRFTGFYGNPETDKRHFSW
        +T + L +    Q   ++   G      + K +L  D    V S GKS GLML WNSD  + I S S GHID+ + D  G WRFTGFYGNP T KR  SW
Subjt:  KTWKRLAKNEPMQQDSMASQSGKRHGSVKIKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDSLGRWRFTGFYGNPETDKRHFSW

Query:  ALIERLKACFEGPWIIGGDFIEIMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLN
         L+ERL    + PWIIGGDF EI+S  EK GG  RN +Q+                         RG      I ERLDRF IN  M     NL+V HL 
Subjt:  ALIERLKACFEGPWIIGGDFIEIMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLN

Query:  FFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIE
          SSDHRPI+A W+     +      R +RF   W++    + II   W    G   +    KI  C+  L++WNK RL  SLK AI+ KE+E++ L   
Subjt:  FFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIE

Query:  DQD
        D D
Subjt:  DQD

TrEMBL top hitse value%identityAlignment
A0A1R3J0D7 Reverse transcriptase8.4e-4623.88Show/hide
Query:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDS-LGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE
        I+  L  D  F V   G+S GL   W +D  +++ SYS  HID  +  S + +WRFTGFYG PET KRH SWAL+  L   +  PW+  GDF E++S  E
Subjt:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDS-LGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE

Query:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSV-SKRQRGNR
        K+GGA R+  Q+ LF   I+ C   ++   G   T  R   G   + ERLDR F          +    HL   +SDH P++     R+ V SKR+R   
Subjt:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSV-SKRQRGNR

Query:  IMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIEDQDQNWEEILQAEIELDDLLDEEEEYW
          +F   W   ++ +KI+++ W      ++Q +++K+  C   L +WNKT   G+L+  I+ K++E + LY +    +  +  + ++ELD L   EE  W
Subjt:  IMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIEDQDQNWEEILQAEIELDDLLDEEEEYW

Query:  R-------IREDRAGK-------------------------------------------------------PTQ----------------------RRNR
        R       ++ED   +                                                        TQ                       RN+
Subjt:  R-------IREDRAGK-------------------------------------------------------PTQ----------------------RRNR

Query:  PKGR-ANLA----------RPKWS------------------------------------ASALGRGRALWSA--------------------SAQGRGR
          GR A++A          R +W+                                           R LW                      SA G   
Subjt:  PKGR-ANLA----------RPKWS------------------------------------ASALGRGRALWSA--------------------SAQGRGR

Query:  PFGPLARAELGHLLSIPNVASRPGFAWFVPKRLRIPKNPRSMSSIGGGVASTTPVCRFSLLQA--------TSSPS------------STNLPLV-----
         F        GH +    + S P F W     +R      S   +G G       C    L +        T  PS            S N+ +V     
Subjt:  PFGPLARAELGHLLSIPNVASRPGFAWFVPKRLRIPKNPRSMSSIGGGVASTTPVCRFSLLQA--------TSSPS------------STNLPLV-----

Query:  AREGQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISP
          +   IL++ +      D +IWN++  G FSV+S Y++A    RL         S++  W+ IW  ++  + K   WR+V N++P+K  L ++GLD+S 
Subjt:  AREGQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISP

Query:  LCVLCRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEI-FTHLSKEEAEKIGIM---IWTIWSFRNQVSVNKIKADEQKLSQWIQ
         C +C     S  H+F+ C+    VW      +            +E W  E  F +   ++A  +G +   ++ +W+ R   + +   +     +  ++
Subjt:  LCVLCRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEI-FTHLSKEEAEKIGIM---IWTIWSFRNQVSVNKIKADEQKLSQWIQ

Query:  RNFEDQRKRTHCHLAEIRLESLWNHEC--WSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQ
             + +  +     I +++L  H+   W+PP   +LK+NSDAS        G+G VIR+S G +V +G + +         E+ AI  G    L    
Subjt:  RNFEDQRKRTHCHLAEIRLESLWNHEC--WSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQ

Query:  NRSLNLIVESDASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLA
        +   + + ESD+   I  +        E   L+ +I  LA       F    R  N + H +A
Subjt:  NRSLNLIVESDASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLA

A0A6J1DUG8 uncharacterized protein LOC1110241358.4e-4639.6Show/hide
Query:  KTWKRLAKNEPMQQDSMASQSGKRHGSVKIKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDSLGRWRFTGFYGNPETDKRHFSW
        +T + L +    Q   ++   G      + K +L  D    V S GKS GLML WNSD  + I S S GHID+ + D  G WRFTGFYGNP T KR  SW
Subjt:  KTWKRLAKNEPMQQDSMASQSGKRHGSVKIKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDSLGRWRFTGFYGNPETDKRHFSW

Query:  ALIERLKACFEGPWIIGGDFIEIMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLN
         L+ERL    + PWIIGGDF EI+S  EK GG  RN +Q+                         RG      I ERLDRF IN  M     NL+V HL 
Subjt:  ALIERLKACFEGPWIIGGDFIEIMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLN

Query:  FFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIE
          SSDHRPI+A W+     +      R +RF   W++    + II   W    G   +    KI  C+  L++WNK RL  SLK AI+ KE+E++ L   
Subjt:  FFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIE

Query:  DQD
        D D
Subjt:  DQD

A0A7J6DZ24 CCHC-type domain-containing protein2.4e-4841.26Show/hide
Query:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE
        I+  +   N F V   GKS GL+L WN D+ +++ S+SVGHID  V    L  WRFTGFYGNP+   R  SW L+ RLK  F+ PWI GGDF EI+S NE
Subjt:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE

Query:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRG-NR
        KKGG +R+++ I  F +A+++C+L+D+G+ G  FT    R G   ++ERLDR+F N     L P+++V + +F  SDHRPI AI E  + V  RQ    +
Subjt:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRG-NR

Query:  IMRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEV
          RF   W+K  + + I+ + W       DN  ++     RC   L  WNKT+  GS+   ++ +E  V
Subjt:  IMRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEV

A0A7J6FPV7 CCHC-type domain-containing protein1.2e-4741.27Show/hide
Query:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE
        I+  +   N F V   GKS GL+L WN D+ +++ S+SVGHID  V    L  WRFTGFYGNP+   R  SW L+ RLK  F+ PWI GGDF E++S NE
Subjt:  IKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVN-DSLGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNE

Query:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRI
        KKGG +R+++ I  F +A+++C+L+D+G+ G  FT    R G   ++ERLDR+F N     L P+++V + +F  SDHRPI AI E     S++    + 
Subjt:  KKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRI

Query:  MRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTR
         RF   W+K  + + I+ + W       DN  ++     RC   L  WNKT+
Subjt:  MRFGPGWIKQKDTKKIIEECW--NQEAGDNAQNLSTKIIRCIHNLHKWNKTR

A0A803QI56 Uncharacterized protein4.4e-6325.7Show/hide
Query:  LQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDS-LGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNEKKGG
        L+ + VF V ++G S GL L W +    ++  YS  HID  V  S  G W+ TGFYG PE + RH SW L+  L      PW + GD   I+   +K+GG
Subjt:  LQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDS-LGRWRFTGFYGNPETDKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNEKKGG

Query:  ANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFG
               I  F +A+N C L+D+   G+ FT  +GR  RN I+ RLDR  IN   S +     + +L   SSDH PI         ++     NR  +F 
Subjt:  ANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFSSDHRPIVAIWEQRSSVSKRQRGNRIMRFG

Query:  PGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIEDQDQNWEEILQAEIELDDLLDEEEEYWRIRED
          W+K+    +I+ +CW  E  DN  N   K+ RC   L  W K  + G+ K  I+  + E++ L  +    + +   + + EL  +LD+ E +W+ R  
Subjt:  PGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIEDQDQNWEEILQAEIELDDLLDEEEEYWRIRED

Query:  RAGKPTQRRNRPKGRANLARPKWSASALGRGRALWSASAQGRGRPFGPLARAELGHLLSIPNVASRPGFAWFVPKRLRIPKNPRSMSSIGGGVASTTPVC
        +        N        +  +W          L          P+       LG    + N+    G +W V                           
Subjt:  RAGKPTQRRNRPKGRANLARPKWSASALGRGRALWSASAQGRGRPFGPLARAELGHLLSIPNVASRPGFAWFVPKRLRIPKNPRSMSSIGGGVASTTPVC

Query:  RFSLLQATSSPSSTNLPLVAREGQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIV
                      N   + R+ + IL IPL  S   D++ W+ +S G +SVKS Y+L   IH   + +   +    K W   W   I  + K   WR  
Subjt:  RFSLLQATSSPSSTNLPLVAREGQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIV

Query:  KNLIPSKVNLINKGLDISPLCVLCRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKIGIMIWTIWSFRNQVSV
        +  +P+   L  K +D+   C +C    ES  H    C KV  VW +     + ++    E  +   W+   FT    E+   + ++ W IWS RN V  
Subjt:  KNLIPSKVNLINKGLDISPLCVLCRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKIGIMIWTIWSFRNQVSV

Query:  NKIKADEQKLSQWIQRNFEDQRKRTHCHLAEIRLESLWN-------HECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKI
         K       +   +   + +Q K         R+E+ W+        E W+ PS + +K+N DA+  +  N  G G V RD  G L++   K        
Subjt:  NKIKADEQKLSQWIQRNFEDQRKRTHCHLAEIRLESLWN-------HECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKI

Query:  KSLEIAAIKEGLTAYLTLRQNRSLNLIVESDASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLARAAAGFP
        +  E   I+E L+    ++++   ++ +E+D   V++ +  E   +S    ++ + ++L ++ + ++ +   R  N V H+ ARA+   P
Subjt:  KSLEIAAIKEGLTAYLTLRQNRSLNLIVESDASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLARAAAGFP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657507.9e-0921.65Show/hide
Query:  KDEIIWNNDSKGRFSVKSVYHLAT--NIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVLCRGAVESSGHI
        +D + W     G+FSV+S Y + T   + R   AS          +  +W + + +R K   W +    + ++     + L  S +C +C+G VES  H+
Subjt:  KDEIIWNNDSKGRFSVKSVYHLAT--NIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVLCRGAVESSGHI

Query:  FWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHL-SKEEAEKI------GIMIWTIWSFR--NQVSVNKIKADEQK-LSQWIQRNFEDQRKR
           C     +W +  P+        ++G   +S +E ++ +L  +   E I       ++IW  W +R  N    N    D  K + +W    +      
Subjt:  FWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHL-SKEEAEKI------GIMIWTIWSFR--NQVSVNKIKADEQK-LSQWIQRNFEDQRKR

Query:  THCHLAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGS
            + + R+E +     W  P    +K+N+D +          G V+RD  G+
Subjt:  THCHLAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGS

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.1e-0525Show/hide
Query:  GQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKW-RSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLC
        G  I A+ L + +  D  IW  D     ++ S    +  +H           + I  W +++W  N + +     W +  N + ++  L + GL I  +C
Subjt:  GQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKW-RSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLC

Query:  VLCRGAVESSGHIFWRCKKVSHVWPKFF
        +LC    ES  H+F+ C     VW +FF
Subjt:  VLCRGAVESSGHIFWRCKKVSHVWPKFF

AT3G09510.1 Ribonuclease H-like superfamily protein1.1e-1826.47Show/hide
Query:  IPLGNSRDKDEIIWNNDSKGRFSVKSVYHL-----ATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVL
        I L  S+  D+IIWN ++ G ++V+S Y L     +TNI  +    GS D         IW+L I+ + K   WR +   + +   L  +G+ I P C  
Subjt:  IPLGNSRDKDEIIWNNDSKGRFSVKSVYHL-----ATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVL

Query:  CRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYRE--GRDMESWWEEIF-----THLSKEEAEKIGIMIWTIWSFRNQVSVNKIKADEQKLSQWIQR
        C    ES  H  + C   +  W     +L D S    +    D E     I      T +S         +IW IW  RN V  NK +    K     + 
Subjt:  CRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYRE--GRDMESWWEEIF-----THLSKEEAEKIGIMIWTIWSFRNQVSVNKIKADEQKLSQWIQR

Query:  NFEDQRKRTHCH---------LAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTA
           D    T  H         +AE ++E       W  P    +K N DA ++        GW+IR+  G+ +  G    + K    S  + A  + L A
Subjt:  NFEDQRKRTHCH---------LAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTA

Query:  YLTLRQNRS-LNLIVESDASEVIKLLNHEETDLSEDKAL---LIDIESLAVKARVLAFVKCPRLGNRVTHSLAR
         L     R    + +E D   +I L+N     +S   +L   L DI   A K   + F    R GN++ H LA+
Subjt:  YLTLRQNRS-LNLIVESDASEVIKLLNHEETDLSEDKAL---LIDIESLAVKARVLAFVKCPRLGNRVTHSLAR

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.3e-0529Show/hide
Query:  IPSKVNLINKGLDISPLCVLCRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKI--GIMIWTIWSFRNQVSVN
        +P++  L++ GL ISPLC LC  +VE+  H+   C   S +W     +L      +R    + S W ++ T  S     K+     +  IW  RN +  N
Subjt:  IPSKVNLINKGLDISPLCVLCRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKI--GIMIWTIWSFRNQVSVN

AT4G29090.1 Ribonuclease H-like superfamily protein9.9e-2323.46Show/hide
Query:  GNSRDKDEIIWNNDSKGRFSVKSVYHLATN-IHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVLCRGAVES
        G  R  D   W+  S G ++VKS Y + T  I++ +      + S    ++ IW      + +   W+ + N +P    L  + L     C+ C    E+
Subjt:  GNSRDKDEIIWNNDSKGRFSVKSVYHLATN-IHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVKNLIPSKVNLINKGLDISPLCVLCRGAVES

Query:  SGHIFWRCKKVSHVWP-KFFPKLMDLSNFYREGRDMESWWEEIFTHLS---KEEAEKIGIMIWTIWSFRNQVSVNKIKADEQKLSQWIQRNFEDQRKRTH
          H+ ++C      W     P  + L   + +   +  +W     + +   ++ ++ +  ++W +W  RN++     + + Q++ +  + + E+ R RT 
Subjt:  SGHIFWRCKKVSHVWP-KFFPKLMDLSNFYREGRDMESWWEEIFTHLS---KEEAEKIGIMIWTIWSFRNQVSVNKIKADEQKLSQWIQRNFEDQRKRTH

Query:  CHLAEIRLESLWNHEC--WSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQNRSLNLIVESD
              +   +    C  W PP    +K N+DA+WN      GIGWV+R+  G +   G +++    K+KS+  A ++    A L+L + +   +I ESD
Subjt:  CHLAEIRLESLWNHEC--WSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQNRSLNLIVESD

Query:  ASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLARAAAGF
        +  +I++LN++E   S  K  + D++ L  +   + FV  PR GN +   +AR +  F
Subjt:  ASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLARAAAGF

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.9e-0620Show/hide
Query:  MIWTIWSFRNQVSVNKIKADEQ--------KLSQWIQRNFEDQRKRTHCHLAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGS
        ++W IW   N +  N  +   Q           +W+     ++++  + +    R      +  WSPP ++ LK N DAS +E     G+GW++R+S G+
Subjt:  MIWTIWSFRNQVSVNKIKADEQ--------KLSQWIQRNFEDQRKRTHCHLAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGS

Query:  LVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQNRSLNLIVESDASEVIKLLNHEETD
        +++ G      +   +  E + +   + A       +   +I E D   + +++N + ++
Subjt:  LVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQNRSLNLIVESDASEVIKLLNHEETD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGCAGCGTCGAGACGCTGTGGAGGTAGCGTCTCGACGCTGCCTCGATCTCGCACCTCTCCTGATTATTGGCTAGCTTTGTTGTTGTCTCAGGGTGGCAAGATTCG
TAGGCTACAAGATATTGAAACTCAGAGAACTACCTGCAGAATAGATACAATGGATGAAGGTGATGGACCCTCTGGCACACAAGCGAAGCCTACTACAGCAAGTGGTGATA
GTCAAAACAAGGAAGGCAGTTCGAGCGGGTCAAAGCAGAAGGGACAAGTTGGTAGGCCCCGTGGTCAGGGAAAAGCTTTTGCGAGGACGAAATTCCTAAAAGGAGAACAT
AGAGACCATGTCAGAAAAGACAAAACAGCAGCCGTACAAAGCCGGAAGGGGTCAGACAATTCTGAGAACATAGCCGTTGAATCCCAAATGCCGAGGAAAAAAACAGGAAA
TGAGAGCTACAACACCCAGAGGAAAGAAGAAACGAACTTTAAAAAAGAAACATCTATTCAAGTTAGCGGTGAGAAAAGCCTAGAAGCAAGCCAGGATCAGTGGGACCCAA
TGAACAAACAAGAAAAAGGGAAAAGTACAGAGGAGGGGGAAAAAGTCACCAGAACTCAATCAACTTCAAAGGAAATACAAATCATAACCACGAAAGATCATTCAGGGGAA
ACGGATGAACACCGAGGAAAGGGTGAAACAAACCAAAGGAAAAGGAACTTAAAAACGTGGAAGAGGCTGGCCAAGAACGAGCCCATGCAACAAGATTCCATGGCCAGCCA
AAGTGGAAAGAGACATGGGAGTGTCAAAATCAAGATAGATTTACAGTGTGACAACGTTTTTTGTGTCCCTAGCAAAGGTAAAAGCAAGGGGCTCATGCTATTCTGGAATT
CTGATTTTGGGATCAATATTAATTCCTATTCGGTTGGGCACATTGACAATTTTGTTAATGATAGTTTGGGAAGGTGGAGATTTACGGGCTTTTACGGCAACCCGGAAACT
GATAAAAGGCACTTTTCTTGGGCGCTCATTGAAAGGCTCAAGGCTTGCTTTGAGGGGCCGTGGATTATCGGGGGCGATTTTATTGAAATTATGTCCCCAAATGAGAAGAA
AGGGGGGGCGAATAGGAATATAAACCAAATAGGTCTGTTTGCGGAGGCTATTAACAGATGTGAGTTGATGGACGTGGGTTACTCGGGCAACAAATTCACGCGGAGAAGAG
GGAGAAGCGGTCGGAACCAGATCAAAGAGAGGCTGGACCGTTTCTTCATTAATTATCCCATGTCTCTCCTAGTGCCTAATCTGCAAGTAAATCACTTAAACTTCTTTAGT
TCAGACCATAGACCCATAGTGGCTATTTGGGAGCAGCGGTCTTCGGTTTCCAAGAGGCAGAGGGGAAATAGAATCATGAGATTCGGGCCTGGGTGGATCAAGCAGAAGGA
CACCAAGAAGATCATAGAAGAGTGTTGGAATCAGGAAGCTGGAGACAACGCTCAGAATCTAAGCACTAAAATTATCAGGTGTATCCATAATCTTCATAAGTGGAACAAAA
CTAGACTCAAAGGCAGCCTCAAACATGCCATTTCGGCAAAGGAGCAAGAAGTGCAAAACCTTTATATAGAGGATCAAGATCAAAACTGGGAGGAAATTCTGCAAGCTGAG
ATTGAATTAGACGACCTCCTTGACGAAGAAGAAGAGTATTGGCGGATTCGCGAAGATAGGGCTGGGAAACCGACCCAGAGGAGGAACCGACCAAAGGGCCGGGCCAACTT
GGCCCGACCCAAATGGTCGGCCTCGGCCTTAGGCCGAGGCCGAGCATTATGGTCGGCCTCGGCCCAAGGCCGAGGCCGACCATTCGGCCCGCTTGCGCGGGCCGAGCTCG
GTCACCTCCTCTCGATCCCCAATGTCGCTAGCCGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGCATCGGAGGC
GGTGTGGCTAGCACCACACCGGTGTGCAGGTTTTCTCTTTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGAACATCCTTGC
CATTCCCCTTGGGAACTCGAGGGACAAGGATGAGATTATATGGAATAACGATTCTAAGGGCAGGTTCAGTGTCAAAAGTGTTTATCACCTGGCTACTAATATTCATCGTT
TAACAGAAGCTTCCGGCTCGGGCGACTCCTCTCAGATCAAGAAATGGAGATCCATCTGGGATCTCAACATTATCCAGAGAGCTAAGATAGGCTTTTGGAGAATTGTGAAA
AATTTAATTCCTTCCAAAGTTAATCTTATCAACAAAGGCTTGGACATTTCCCCTCTATGTGTTTTGTGCAGGGGCGCAGTAGAATCCTCGGGTCATATCTTCTGGAGATG
TAAAAAGGTAAGTCATGTCTGGCCTAAGTTCTTCCCTAAACTAATGGACTTATCGAATTTCTACAGGGAAGGCAGAGACATGGAATCCTGGTGGGAAGAAATTTTCACTC
ATCTCAGTAAAGAGGAAGCTGAAAAGATTGGCATTATGATCTGGACAATTTGGAGCTTCAGGAACCAGGTTTCAGTCAACAAAATCAAAGCGGACGAACAGAAACTATCT
CAGTGGATTCAACGAAATTTTGAAGATCAAAGGAAGCGTACCCATTGTCATCTGGCAGAGATCAGGCTAGAGAGCCTTTGGAATCATGAATGTTGGTCCCCCCCTTCGCA
GAATCTTCTGAAGCTTAATTCTGACGCCTCCTGGAATGAAGTTGTGAATTCAGGGGGAATTGGTTGGGTAATCCGCGATTCCCCAGGATCTCTAGTCCAAGCGGGGTTCA
AAAGCATCAATCGCAAATGGAAGATAAAATCTTTGGAGATCGCTGCGATTAAGGAAGGGTTAACGGCTTACCTTACTCTTCGCCAGAATCGATCTCTGAATTTGATTGTA
GAATCGGACGCCTCTGAAGTGATCAAGTTGCTAAACCACGAGGAGACTGATCTCTCTGAAGACAAGGCTTTGTTGATCGATATAGAGTCCCTTGCGGTGAAAGCAAGAGT
TCTCGCCTTCGTCAAGTGCCCAAGATTGGGCAACCGTGTAACGCATTCTCTCGCGCGAGCTGCGGCGGGTTTCCCGCCGGTTTTCCCTCCATCGACCGATGTTGTTGACG
GTTTTTTTGTTCCTTCGCATTCTTCCACGCTGGAAGGAAAATTCTTTTACATCTCTAGTGATGGCCGGAATGCAACTGGAGGTCCGGCCAATTCTGCCATTTCGGTGGCC
GGCGGCCGTTTCTCTTGGGCTCATGGACTTTCCTCTGGTGGAGCTTTTCTTCAGAATCTCTTTAATGGCCTCTACCTGATACAACACCGGATACGCCCTCCCGATTCTGT
TGAACTTTGTGCAAGCGCTCATGTGCTCCTTCAAAGCCTCCTCCCTTTTCCAACCCCTCTATTAATAAATTCGAATAAGGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGCAGCGTCGAGACGCTGTGGAGGTAGCGTCTCGACGCTGCCTCGATCTCGCACCTCTCCTGATTATTGGCTAGCTTTGTTGTTGTCTCAGGGTGGCAAGATTCG
TAGGCTACAAGATATTGAAACTCAGAGAACTACCTGCAGAATAGATACAATGGATGAAGGTGATGGACCCTCTGGCACACAAGCGAAGCCTACTACAGCAAGTGGTGATA
GTCAAAACAAGGAAGGCAGTTCGAGCGGGTCAAAGCAGAAGGGACAAGTTGGTAGGCCCCGTGGTCAGGGAAAAGCTTTTGCGAGGACGAAATTCCTAAAAGGAGAACAT
AGAGACCATGTCAGAAAAGACAAAACAGCAGCCGTACAAAGCCGGAAGGGGTCAGACAATTCTGAGAACATAGCCGTTGAATCCCAAATGCCGAGGAAAAAAACAGGAAA
TGAGAGCTACAACACCCAGAGGAAAGAAGAAACGAACTTTAAAAAAGAAACATCTATTCAAGTTAGCGGTGAGAAAAGCCTAGAAGCAAGCCAGGATCAGTGGGACCCAA
TGAACAAACAAGAAAAAGGGAAAAGTACAGAGGAGGGGGAAAAAGTCACCAGAACTCAATCAACTTCAAAGGAAATACAAATCATAACCACGAAAGATCATTCAGGGGAA
ACGGATGAACACCGAGGAAAGGGTGAAACAAACCAAAGGAAAAGGAACTTAAAAACGTGGAAGAGGCTGGCCAAGAACGAGCCCATGCAACAAGATTCCATGGCCAGCCA
AAGTGGAAAGAGACATGGGAGTGTCAAAATCAAGATAGATTTACAGTGTGACAACGTTTTTTGTGTCCCTAGCAAAGGTAAAAGCAAGGGGCTCATGCTATTCTGGAATT
CTGATTTTGGGATCAATATTAATTCCTATTCGGTTGGGCACATTGACAATTTTGTTAATGATAGTTTGGGAAGGTGGAGATTTACGGGCTTTTACGGCAACCCGGAAACT
GATAAAAGGCACTTTTCTTGGGCGCTCATTGAAAGGCTCAAGGCTTGCTTTGAGGGGCCGTGGATTATCGGGGGCGATTTTATTGAAATTATGTCCCCAAATGAGAAGAA
AGGGGGGGCGAATAGGAATATAAACCAAATAGGTCTGTTTGCGGAGGCTATTAACAGATGTGAGTTGATGGACGTGGGTTACTCGGGCAACAAATTCACGCGGAGAAGAG
GGAGAAGCGGTCGGAACCAGATCAAAGAGAGGCTGGACCGTTTCTTCATTAATTATCCCATGTCTCTCCTAGTGCCTAATCTGCAAGTAAATCACTTAAACTTCTTTAGT
TCAGACCATAGACCCATAGTGGCTATTTGGGAGCAGCGGTCTTCGGTTTCCAAGAGGCAGAGGGGAAATAGAATCATGAGATTCGGGCCTGGGTGGATCAAGCAGAAGGA
CACCAAGAAGATCATAGAAGAGTGTTGGAATCAGGAAGCTGGAGACAACGCTCAGAATCTAAGCACTAAAATTATCAGGTGTATCCATAATCTTCATAAGTGGAACAAAA
CTAGACTCAAAGGCAGCCTCAAACATGCCATTTCGGCAAAGGAGCAAGAAGTGCAAAACCTTTATATAGAGGATCAAGATCAAAACTGGGAGGAAATTCTGCAAGCTGAG
ATTGAATTAGACGACCTCCTTGACGAAGAAGAAGAGTATTGGCGGATTCGCGAAGATAGGGCTGGGAAACCGACCCAGAGGAGGAACCGACCAAAGGGCCGGGCCAACTT
GGCCCGACCCAAATGGTCGGCCTCGGCCTTAGGCCGAGGCCGAGCATTATGGTCGGCCTCGGCCCAAGGCCGAGGCCGACCATTCGGCCCGCTTGCGCGGGCCGAGCTCG
GTCACCTCCTCTCGATCCCCAATGTCGCTAGCCGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGCATCGGAGGC
GGTGTGGCTAGCACCACACCGGTGTGCAGGTTTTCTCTTTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGAACATCCTTGC
CATTCCCCTTGGGAACTCGAGGGACAAGGATGAGATTATATGGAATAACGATTCTAAGGGCAGGTTCAGTGTCAAAAGTGTTTATCACCTGGCTACTAATATTCATCGTT
TAACAGAAGCTTCCGGCTCGGGCGACTCCTCTCAGATCAAGAAATGGAGATCCATCTGGGATCTCAACATTATCCAGAGAGCTAAGATAGGCTTTTGGAGAATTGTGAAA
AATTTAATTCCTTCCAAAGTTAATCTTATCAACAAAGGCTTGGACATTTCCCCTCTATGTGTTTTGTGCAGGGGCGCAGTAGAATCCTCGGGTCATATCTTCTGGAGATG
TAAAAAGGTAAGTCATGTCTGGCCTAAGTTCTTCCCTAAACTAATGGACTTATCGAATTTCTACAGGGAAGGCAGAGACATGGAATCCTGGTGGGAAGAAATTTTCACTC
ATCTCAGTAAAGAGGAAGCTGAAAAGATTGGCATTATGATCTGGACAATTTGGAGCTTCAGGAACCAGGTTTCAGTCAACAAAATCAAAGCGGACGAACAGAAACTATCT
CAGTGGATTCAACGAAATTTTGAAGATCAAAGGAAGCGTACCCATTGTCATCTGGCAGAGATCAGGCTAGAGAGCCTTTGGAATCATGAATGTTGGTCCCCCCCTTCGCA
GAATCTTCTGAAGCTTAATTCTGACGCCTCCTGGAATGAAGTTGTGAATTCAGGGGGAATTGGTTGGGTAATCCGCGATTCCCCAGGATCTCTAGTCCAAGCGGGGTTCA
AAAGCATCAATCGCAAATGGAAGATAAAATCTTTGGAGATCGCTGCGATTAAGGAAGGGTTAACGGCTTACCTTACTCTTCGCCAGAATCGATCTCTGAATTTGATTGTA
GAATCGGACGCCTCTGAAGTGATCAAGTTGCTAAACCACGAGGAGACTGATCTCTCTGAAGACAAGGCTTTGTTGATCGATATAGAGTCCCTTGCGGTGAAAGCAAGAGT
TCTCGCCTTCGTCAAGTGCCCAAGATTGGGCAACCGTGTAACGCATTCTCTCGCGCGAGCTGCGGCGGGTTTCCCGCCGGTTTTCCCTCCATCGACCGATGTTGTTGACG
GTTTTTTTGTTCCTTCGCATTCTTCCACGCTGGAAGGAAAATTCTTTTACATCTCTAGTGATGGCCGGAATGCAACTGGAGGTCCGGCCAATTCTGCCATTTCGGTGGCC
GGCGGCCGTTTCTCTTGGGCTCATGGACTTTCCTCTGGTGGAGCTTTTCTTCAGAATCTCTTTAATGGCCTCTACCTGATACAACACCGGATACGCCCTCCCGATTCTGT
TGAACTTTGTGCAAGCGCTCATGTGCTCCTTCAAAGCCTCCTCCCTTTTCCAACCCCTCTATTAATAAATTCGAATAAGGCTTAG
Protein sequenceShow/hide protein sequence
MFAASRRCGGSVSTLPRSRTSPDYWLALLLSQGGKIRRLQDIETQRTTCRIDTMDEGDGPSGTQAKPTTASGDSQNKEGSSSGSKQKGQVGRPRGQGKAFARTKFLKGEH
RDHVRKDKTAAVQSRKGSDNSENIAVESQMPRKKTGNESYNTQRKEETNFKKETSIQVSGEKSLEASQDQWDPMNKQEKGKSTEEGEKVTRTQSTSKEIQIITTKDHSGE
TDEHRGKGETNQRKRNLKTWKRLAKNEPMQQDSMASQSGKRHGSVKIKIDLQCDNVFCVPSKGKSKGLMLFWNSDFGININSYSVGHIDNFVNDSLGRWRFTGFYGNPET
DKRHFSWALIERLKACFEGPWIIGGDFIEIMSPNEKKGGANRNINQIGLFAEAINRCELMDVGYSGNKFTRRRGRSGRNQIKERLDRFFINYPMSLLVPNLQVNHLNFFS
SDHRPIVAIWEQRSSVSKRQRGNRIMRFGPGWIKQKDTKKIIEECWNQEAGDNAQNLSTKIIRCIHNLHKWNKTRLKGSLKHAISAKEQEVQNLYIEDQDQNWEEILQAE
IELDDLLDEEEEYWRIREDRAGKPTQRRNRPKGRANLARPKWSASALGRGRALWSASAQGRGRPFGPLARAELGHLLSIPNVASRPGFAWFVPKRLRIPKNPRSMSSIGG
GVASTTPVCRFSLLQATSSPSSTNLPLVAREGQNILAIPLGNSRDKDEIIWNNDSKGRFSVKSVYHLATNIHRLTEASGSGDSSQIKKWRSIWDLNIIQRAKIGFWRIVK
NLIPSKVNLINKGLDISPLCVLCRGAVESSGHIFWRCKKVSHVWPKFFPKLMDLSNFYREGRDMESWWEEIFTHLSKEEAEKIGIMIWTIWSFRNQVSVNKIKADEQKLS
QWIQRNFEDQRKRTHCHLAEIRLESLWNHECWSPPSQNLLKLNSDASWNEVVNSGGIGWVIRDSPGSLVQAGFKSINRKWKIKSLEIAAIKEGLTAYLTLRQNRSLNLIV
ESDASEVIKLLNHEETDLSEDKALLIDIESLAVKARVLAFVKCPRLGNRVTHSLARAAAGFPPVFPPSTDVVDGFFVPSHSSTLEGKFFYISSDGRNATGGPANSAISVA
GGRFSWAHGLSSGGAFLQNLFNGLYLIQHRIRPPDSVELCASAHVLLQSLLPFPTPLLINSNKA