; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040939 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040939
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:10034573..10036280
RNA-Seq ExpressionLag0040939
SyntenyLag0040939
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO54692.1 reverse transcriptase [Corchorus capsularis]6.3e-4730.02Show/hide
Query:  DINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGRD
        ++N + A+ WWGE  G+ K HW  W  LC SK  GGLGFRD   FN ++LAK  WR+++N +SL  + +K KYF+G++F+ A  G NPS  WRS++ GR 
Subjt:  DINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGRD

Query:  LFTKGYKWKVGNGRHIAIDQDPWISSRGS--------EVSTLGKRVKGRNHLELGE------KRVFSVKSAFHL-ATHLASSQKASS-----------ST
        +  +G +W+VG+G  I +  D WI+S  S        E   L  RV     ++L        +R+FS +  F +    L+S  +  S           + 
Subjt:  LFTKGYKWKVGNGRHIAIDQDPWISSRGS--------EVSTLGKRVKGRNHLELGE------KRVFSVKSAFHL-ATHLASSQKASS-----------ST

Query:  HSTPTLSGSVYGILESKGESGNHIF---WECKLSKK---------IWNTFIPLTIPLYDLYRGEWNPKENW-RWMSDNLQTEDLERAIIILWSLWEHRNS
         S   ++  + G  E    S + I+   W+ ++  K         +WNT  P    + +    E    E W R  S   Q   +ER +  LW++W +RN 
Subjt:  HSTPTLSGSVYGILESKGESGNHIF---WECKLSKK---------IWNTFIPLTIPLYDLYRGEWNPKENW-RWMSDNLQTEDLERAIIILWSLWEHRNS

Query:  --NRGFKGK--------SGHLK--LRQFPCDEE---PPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKA
          +    G         + HL+  ++ FP +           W+PP  GS+KLNVDA++D  R V GLG ++RD  G       K+++    +   E+ A
Subjt:  --NRGFKGK--------SGHLK--LRQFPCDEE---PPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKA

Query:  IVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMA
        I  GL+     G+          +VESD    I  IN       E + L++++  +   FE + F    R  N  A  LA+ A
Subjt:  IVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMA

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.2e-5328.78Show/hide
Query:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW
        +C+DI +  A  WWG  K K   HW  W+ +  +K RGGLGFRDL  FNQA++AK  WR+++ PNSL+++ +K +Y+K + F  A +GSNPS  WRSI+W
Subjt:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW

Query:  GRDLFTKGYKWKVGNGRHIAIDQDPWI-------------------------SSRGSEVSTLGKRV--------------KGRNHLEL----GEKRVFSV
        G  +  KG +W++G+G+ + + +D WI                         S     V  L +                 G+   E+     +K  +SV
Subjt:  GRDLFTKGYKWKVGNGRHIAIDQDPWI-------------------------SSRGSEVSTLGKRV--------------KGRNHLEL----GEKRVFSV

Query:  KSAFHLATHLASSQKASSSTHST---------------------------PT---------LSGSVYGILESKGESGNHIFWECKLSKKIWNTFIPLTIP
        KS + LA +     +  SS  S+                           PT         L   +    + + E+ +H+  ECK ++KIW+    +  P
Subjt:  KSAFHLATHLASSQKASSSTHST---------------------------PT---------LSGSVYGILESKGESGNHIFWECKLSKKIWNTFIPLTIP

Query:  LYDLYRGEWNP-KENWRWMSDNLQTEDLERAIIILWSLWEHRN--------SNRGFKGKSGHLKLRQFPCDEEPPS----------HKSWLPPLPGSWKL
          D  +  ++  +E W   S    T + E  I+  W +W  RN        S+  F        L+ +    +P +           + W PP     KL
Subjt:  LYDLYRGEWNP-KENWRWMSDNLQTEDLERAIIILWSLWEHRN--------SNRGFKGKSGHLKLRQFPCDEEPPS----------HKSWLPPLPGSWKL

Query:  NVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDL
        NVDA+  +K    GLG I+RD+EG  L +G K+   R  + L E +AI  GL+    +  I SS L    +VESD   V+ L+N      +EI +++ D+
Subjt:  NVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDL

Query:  NKVKKPFEDLSFVFCPRDQNLAADSLARMAISPPSLSVLVSS
         +  K F+ + F F PR  N  A +LA+ A+   S  V V +
Subjt:  NKVKKPFEDLSFVFCPRDQNLAADSLARMAISPPSLSVLVSS

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]1.2e-5028.89Show/hide
Query:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW
        +C+DI +  A  WWG  + +   HW  W ++  SK RGG+GFRDL  FNQA++AK  WRI++ P+SL+++ LK +YFK   F+ A +GS PS  WRSIVW
Subjt:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW

Query:  GRDLFTKGYKWKVGNGRHIAIDQDPWI-----------SSRGSEVST---LGKRVKGRNHLEL-----------------------------GEKRVFSV
        GR +  KG +W++GNG+++ +  + WI            S G++ +    + ++ + R  L L                              +K  +SV
Subjt:  GRDLFTKGYKWKVGNGRHIAIDQDPWI-----------SSRGSEVST---LGKRVKGRNHLEL-----------------------------GEKRVFSV

Query:  KSAFHLATHLASSQKASSSTHS---------------------------TPT---------LSGSVYGILESKGESGNHIFWECKLSKKIWNTFIPLTIP
        KS + +A  +   +  S S H                             PT         L   +        E+ +H   EC  ++KIW  +  L   
Subjt:  KSAFHLATHLASSQKASSSTHS---------------------------TPT---------LSGSVYGILESKGESGNHIFWECKLSKKIWNTFIPLTIP

Query:  LYDLYRGE--WNPKENWRWMSDNLQTEDLERAIIILWSLWEHRNSNRGFKGK---------------SGHLKLRQ----FPCDEEPPSHKSWLPPLPGSW
        L  +YR +  W  +    W   + + E  E A  +LW++W+ RN    F+GK                   K+RQ    +         K W PP  G  
Subjt:  LYDLYRGE--WNPKENWRWMSDNLQTEDLERAIIILWSLWEHRNSNRGFKGK---------------SGHLKLRQ----FPCDEEPPSHKSWLPPLPGSW

Query:  KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLME
        K+NVDA+ D +  + GLG ++RDS+G+      K +     + + E  A+  GLK      V E + +   I  ESD+  VI LIN++   L+EI +L+ 
Subjt:  KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLME

Query:  DLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI
        D+ +  + F++      PRD N AA SLA++A+
Subjt:  DLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]8.8e-4929.14Show/hide
Query:  CDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWG
        CDDI R  A  WWG    K   HW  W KL  +K RGGLGFR+   FNQA++AK +WR+++ PNSL+S+ L+ +YF+ + FL A  G+N S  WRSI+WG
Subjt:  CDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWG

Query:  RDLFTKGYKWKVGNGRHIAIDQDPWISSR-------------GSEVSTLGK------RVKGRNHL---------------ELGEKRV---------FSVK
        R +  KG +W++GNG+ IAI  D W+                 S V+ L K       +K R H                E  E  V         +SVK
Subjt:  RDLFTKGYKWKVGNGRHIAIDQDPWISSR-------------GSEVSTLGK------RVKGRNHL---------------ELGEKRV---------FSVK

Query:  SAFHLA--THLASSQKASSSTH---------------------STPTLSGSVYGILESK-------------GESGNHIFWECKLSKKIWNTFIPLTIPL
        S + LA  +    S   + ++H                     ++  L  S   + + K              E+ +H   ECK ++KIW    P + P 
Subjt:  SAFHLA--THLASSQKASSSTH---------------------STPTLSGSVYGILESK-------------GESGNHIFWECKLSKKIWNTFIPLTIPL

Query:  YDLYRGEWNPKE---NWRWMSDNLQTEDLERAIIILWSLWEHRNS--------NRGFKGKSGHLKLRQFPCDEEP-PSH---------KSWLPPLPGSWK
            R E N ++     + M+  L+  DLE  + + WS W  RN         N           L  F    +P  SH         + WLPP    +K
Subjt:  YDLYRGEWNPKE---NWRWMSDNLQTEDLERAIIILWSLWEHRNS--------NRGFKGKSGHLKLRQFPCDEEP-PSH---------KSWLPPLPGSWK

Query:  LNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMED
        +NVDA+++SK    G+G ++RDS G  +  G  +   +    L E +A++ GL+   ++ V         +++ESD   V++L+N      SEI + +  
Subjt:  LNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMED

Query:  LNKVKKPFEDLSFVFCPRDQNLAADSLARMAI
        +    K F+ +     PR  N  A  LA++A+
Subjt:  LNKVKKPFEDLSFVFCPRDQNLAADSLARMAI

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]1.2e-4527.65Show/hide
Query:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW
        +C ++  M A  WWG+   ++K HW  W KLC SK RGG+GFRDL LFN A+LAK  WR+++N +SL  K  K KYF  +    A +G+N S  W+ I  
Subjt:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW

Query:  GRDLFTKGYKWKVGNGRHIAIDQDPWISS--------RGSEVSTLGKRVKGRNHLELG------EKRVFSVKSAFHLATH---LASSQKASSSTHST---
          D   KG +W+VGNG+ + I +DPW+             E   +   + G    E        +   FSVKSA+    +   LA+ Q ++ S+ +    
Subjt:  GRDLFTKGYKWKVGNGRHIAIDQDPWISS--------RGSEVSTLGKRVKGRNHLELG------EKRVFSVKSAFHLATH---LASSQKASSSTHST---

Query:  -----------------------PT---------LSGSVYGILESKGESGNHIFWECKLSKKIWNTF------IPLTIPLYDL-----YRGEWNPKENWR
                               PT         L  +  G+     E   H  + C   + +W  F      I   +  +DL      RG         
Subjt:  -----------------------PT---------LSGSVYGILESKGESGNHIFWECKLSKKIWNTF------IPLTIPLYDL-----YRGEWNPKENWR

Query:  WMSDNLQTEDLERAIIILWSLWEHRNS--------------NRGFKGKSGHLKLRQFP-CDEEPPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILR
          SD+L    L R I I W LW  RN               N     +  + +++ F   +++      W PP     KLN+D +   + SV G+G +LR
Subjt:  WMSDNLQTEDLERAIIILWSLWEHRNS--------------NRGFKGKSGHLKLRQFP-CDEEPPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILR

Query:  DSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQN
        D  G  +    K   +    + +E  A++ GL+     GV       P I++E+D   ++  +NE    L++I+F+++D+ ++   F+++  V   R  N
Subjt:  DSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQN

Query:  LAADSLARMA
        L A  LAR A
Subjt:  LAADSLARMA

TrEMBL top hitse value%identityAlignment
A0A1R3G9C4 Reverse transcriptase3.0e-4730.02Show/hide
Query:  DINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGRD
        ++N + A+ WWGE  G+ K HW  W  LC SK  GGLGFRD   FN ++LAK  WR+++N +SL  + +K KYF+G++F+ A  G NPS  WRS++ GR 
Subjt:  DINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGRD

Query:  LFTKGYKWKVGNGRHIAIDQDPWISSRGS--------EVSTLGKRVKGRNHLELGE------KRVFSVKSAFHL-ATHLASSQKASS-----------ST
        +  +G +W+VG+G  I +  D WI+S  S        E   L  RV     ++L        +R+FS +  F +    L+S  +  S           + 
Subjt:  LFTKGYKWKVGNGRHIAIDQDPWISSRGS--------EVSTLGKRVKGRNHLELGE------KRVFSVKSAFHL-ATHLASSQKASS-----------ST

Query:  HSTPTLSGSVYGILESKGESGNHIF---WECKLSKK---------IWNTFIPLTIPLYDLYRGEWNPKENW-RWMSDNLQTEDLERAIIILWSLWEHRNS
         S   ++  + G  E    S + I+   W+ ++  K         +WNT  P    + +    E    E W R  S   Q   +ER +  LW++W +RN 
Subjt:  HSTPTLSGSVYGILESKGESGNHIF---WECKLSKK---------IWNTFIPLTIPLYDLYRGEWNPKENW-RWMSDNLQTEDLERAIIILWSLWEHRNS

Query:  --NRGFKGK--------SGHLK--LRQFPCDEE---PPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKA
          +    G         + HL+  ++ FP +           W+PP  GS+KLNVDA++D  R V GLG ++RD  G       K+++    +   E+ A
Subjt:  --NRGFKGK--------SGHLK--LRQFPCDEE---PPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKA

Query:  IVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMA
        I  GL+     G+          +VESD    I  IN       E + L++++  +   FE + F    R  N  A  LA+ A
Subjt:  IVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMA

A0A2N9FM05 Reverse transcriptase domain-containing protein3.4e-4629.18Show/hide
Query:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW
        +CD+IN M +N WWG+ +G+ K HW+GW KL  SK+ GG+GFRDLRLFN A+LA+  WRIIKNP+SLL + LK KYF  + FL A +  N S  WRSI+ 
Subjt:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW

Query:  GRDLFTKGYKWKVGNGRHIAIDQDPWISSRG-----SEVSTLGKRVKGRNHLELGEKR---------VFSVKSAFHLATHLASSQKASSSTHSTPTLSGS
         R++ + G +W+VGNG +I I +D WI +R      S +S L +     + L   + R         +F V  A  + + +  S+++S+ T         
Subjt:  GRDLFTKGYKWKVGNGRHIAIDQDPWISSRG-----SEVSTLGKRVKGRNHLELGEKR---------VFSVKSAFHLATHLASSQKASSSTHSTPTLSGS

Query:  VYGILESKGESGNHIFWECKLSKKIWNTFIPLTIPLYDLYRGEWNPKENWRWMSDNLQTEDLERAIIILWSLWEHRN----SNRGFKGKSGHLKLRQFPC
        VY +      S  H+  + K   +  N                          S +L + D+E    I W LW  RN     N+    +    +      
Subjt:  VYGILESKGESGNHIFWECKLSKKIWNTFIPLTIPLYDLYRGEWNPKENWRWMSDNLQTEDLERAIIILWSLWEHRN----SNRGFKGKSGHLKLRQFPC

Query:  D---EEPPSHK-------SWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGS---SLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLP
        D   +E  SH         W PP    +K+NV   W S  S GG G ++RDS GS   ++C    ++     +    ++  +   K +            
Subjt:  D---EEPPSHK-------SWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGS---SLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLP

Query:  PPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAISPPSLSVLV
          ++VE D + ++  +      L+    L++++ ++   F    F   PR  NL + SLA+ + S  S  V +
Subjt:  PPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAISPPSLSVLV

A0A803P119 Uncharacterized protein3.0e-4731.95Show/hide
Query:  HWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGRDLFTKGYKWKVGNGRHIAIDQ
        HW  WN LC SK  GGLGFR+   FNQA+LAK +WR++ N +SLL +TL  +YF  + FL A +G NPS TWRSI+WGRDL+ KG  WKVG G  I+   
Subjt:  HWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGRDLFTKGYKWKVGNGRHIAIDQ

Query:  DPWISSRGSEVSTLGKRVKGRNHLELGE--------------KRVFSVKSAFHLATHLASSQKASSST--------------------------HSTPTL
        DPW+    +    L   +   N L + +                 ++VKS +HLAT++   +  SSS                           H+   L
Subjt:  DPWISSRGSEVSTLGKRVKGRNHLELGE--------------KRVFSVKSAFHLATHLASSQKASSST--------------------------HSTPTL

Query:  SGSVY----------GILESKGESGNHIFWECKLSKKIWNTFIPLTIPL---YDLYRGEWNPKENWRWMSDNLQTEDLERAIIILWSLWEHRNSNRGFKG
        + +++                 ES NH F+ CK +K++W   +P  + L        GE+    +     DN Q   +E  + I+W LW  RN++    G
Subjt:  SGSVY----------GILESKGESGNHIFWECKLSKKIWNTFIPLTIPL---YDLYRGEWNPKENWRWMSDNLQTEDLERAIIILWSLWEHRNSNRGFKG

Query:  KSGHLKL------------------RQFPCD-EEPPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIV
        K  +L                    RQ P    +P  +KSW PP  G  KLNVDA+ DS   V G+G I+RDS G+ +    K I   + +K +E  AI 
Subjt:  KSGHLKL------------------RQFPCD-EEPPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIV

Query:  EGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI
          L+ L  SG +          +E+DA  V + +N      S    L+ D+      F  +S     R  N AA  LAR A+
Subjt:  EGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI

A0A803P614 Uncharacterized protein4.0e-4731.42Show/hide
Query:  DDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGR
        D +  M AN WWG  +   K HW  WN LC +K  GG+GFR    FNQA+LAK +WR+++ P+SLL K LK +YF  NDFL AP G +PSLTW+ I+WGR
Subjt:  DDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGR

Query:  DLFTKGYKWKVGNGRHIAIDQDPWISSRGSEVST--LGKRVKGRNHLELGEKRVFSVKSAFHLATHLASSQKASSSTHSTPTLSGSVYGILESKGESGNH
         L  KG +WK+G GRHI I  DPWI    +   T  LG      +HL + E+RV++V             Q+  S     P    +++G  ++K   G +
Subjt:  DLFTKGYKWKVGNGRHIAIDQDPWISSRGSEVST--LGKRVKGRNHLELGEKRVFSVKSAFHLATHLASSQKASSSTHSTPTLSGSVYGILESKGESGNH

Query:  IFWECKLSKKIWNTFIPLTIPLYDLYRGEWNPKENWRWMSDN---LQTEDLERAIIILWSLWEHRNSNRGFKGKSGHLKLRQFPCDEEPPSHKSWLPPLP
        +F             I   I  + L      PK+ W + S+    +     + A  +      + ++ R    K   +     P          W PP  
Subjt:  IFWECKLSKKIWNTFIPLTIPLYDLYRGEWNPKENWRWMSDN---LQTEDLERAIIILWSLWEHRNSNRGFKGKSGHLKLRQFPCDEEPPSHKSWLPPLP

Query:  GSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISF
         + KLNVDA+ D+ R++ G+G ++R+S+G  L    K I   +   ++E +A+   L     + VI+     P  +VE+DA  V          +S    
Subjt:  GSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISF

Query:  LMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI
        ++ D+  +   F  ++ V   R  N+ A SLA+ A+
Subjt:  LMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI

A0A803QH07 Uncharacterized protein2.7e-4829.28Show/hide
Query:  CDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWG
        C+ +  M AN WWG  +   K HW  W  LC SK  GG+GFR    FNQA+LAK +WRI   P+SLLS+ LK +YF    FL A IG +PS TW+SI WG
Subjt:  CDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWG

Query:  RDLFTKGYKWKVGNGRHIAIDQDPWISSRGS------------EVSTLGKRVKGRNHLELGE------------------------------KRVFSVKS
        R+L  KG ++KVGNG HI   +DPWI S  S             VS L    +  N   L +                                 ++VKS
Subjt:  RDLFTKGYKWKVGNGRHIAIDQDPWISSRGS------------EVSTLGKRVKGRNHLELGE------------------------------KRVFSVKS

Query:  AFHLATHLASSQKASSS---------------------------THSTP---------TLSGSVYGILESKGESGNHIFWECKLSKKIW--NTFIPLTIP
         FHLATHL    ++SSS                            H  P          +  +   +  S  ES  H  + C  +K IW  + FI     
Subjt:  AFHLATHLASSQKASSS---------------------------THSTP---------TLSGSVYGILESKGESGNHIFWECKLSKKIW--NTFIPLTIP

Query:  LYDLYRGEWNPKENWRWMSDNLQTEDLERAIIILWSLWEHRNS----------------NRGFK---GKSGHLKLRQFPCDEEPPS------------HK
           ++ G++       ++S     ED E  I +LW +W  RN                   GF     ++ HL  +      E  S            H 
Subjt:  LYDLYRGEWNPKENWRWMSDNLQTEDLERAIIILWSLWEHRNS----------------NRGFK---GKSGHLKLRQFPCDEEPPS------------HK

Query:  SWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDE
         W PP    +KLNVDA+ +S +   G+G I+R  +G  +    K +   +    +E KA+   L N  S   +  +       +E+DA  V   +N    
Subjt:  SWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDE

Query:  DLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI
        DLS  S L+ D+  +   F  +      R  N AA  LAR A+
Subjt:  DLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.1e-1736.15Show/hide
Query:  INRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKG---NDFLTAPIGSNPSLTWRSIVWG
        ++++     WG    K K H + W+K+C+ K  GGLG R  +  N+A+++KV WR+++  NSL +  L+ KY  G   +     P GS  S TWRSI  G
Subjt:  INRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKG---NDFLTAPIGSNPSLTWRSIVWG

Query:  -RDLFTKGYKWKVGNGRHIAIDQDPWISSR
         RD+ + G  W  G+G+ I    D W+S +
Subjt:  -RDLFTKGYKWKVGNGRHIAIDQDPWISSR

P93295 Uncharacterized mitochondrial protein AtMg003104.0e-2843.75Show/hide
Query:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASK-DRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIV
        +C  +       WW   + K K  W+ W KLC SK D GGLGFRDL  FNQA+LAK S+RII  P++LLS+ L+ +YF  +  +   +G+ PS  WRSI+
Subjt:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASK-DRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIV

Query:  WGRDLFTKGYKWKVGNGRHIAIDQDPWI
         GR+L ++G    +G+G H  +  D WI
Subjt:  WGRDLFTKGYKWKVGNGRHIAIDQDPWI

Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein2.7e-0828.57Show/hide
Query:  KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLME
        K N DAS      V GLGW++R+S+G+ L  G  +   R   +  E  A++  ++   + G  +       ++ E D + V RLIN +  D   +   ++
Subjt:  KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLME

Query:  DLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI
         +      F    F+F  R+QN  AD+L + AI
Subjt:  DLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.6e-1629.37Show/hide
Query:  ESGNHIFWECKLSKKIWNTFIPLTIPLYDLYRGEWNPK--ENWRWMSDNLQTEDLERAII------ILWSLWEHRNSNRGFKGKSGHLK--LRQFPCDEE
        E+ NH+ ++C  ++ +W       IP Y    GEW      N  W+  NL+ E  +   I      +LW LW+ RN    FKGK       LR+   D E
Subjt:  ESGNHIFWECKLSKKIWNTFIPLTIPLYDLYRGEWNPK--ENWRWMSDNLQTEDLERAII------ILWSLWEHRNSNRGFKGKSGHLK--LRQFPCDEE

Query:  PPSHK------------------SWLPPLPGSW-KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESS
          S +                   W  P P  W K N DA+W  +    G+GWILR+  G  L +G + + +   +   EL+A+           V+  S
Subjt:  PPSHK------------------SWLPPLPGSW-KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESS

Query:  RLP-PPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAIS
        R     I+ ESDA  ++ L+N  D+    +   +ED+ ++   FE++ F F PR  N  AD +AR +IS
Subjt:  RLP-PPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAIS

AT4G29090.1 Ribonuclease H-like superfamily protein6.5e-4225.14Show/hide
Query:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW
        +C  I  + A+ WW   +     HW  W+ L   K  GG+GF+D+  FN A+L K  WR++  P SL++K  K +YF  +D L AP+GS PS  W+SI  
Subjt:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVW

Query:  GRDLFTKGYKWKVGNGRHIAIDQDPWISSRGSEVSTLGKRV--------------------KGR-------------------NHLELGEKRV-------
         +++  +G +  VGNG  I I +  W+ S+ +  +   +RV                     GR                     L  G +R+       
Subjt:  GRDLFTKGYKWKVGNGRHIAIDQDPWISSRGSEVSTLGKRV--------------------KGR-------------------NHLELGEKRV-------

Query:  ------FSVKSAFHLATHL----ASSQKASSS--------------------------THSTPTLSGSVYGILESKG---------ESGNHIFWECKLSK
              ++VKS + + T +    +S Q+ S                            ++S P      Y  L  +          E+ NH+ ++C  ++
Subjt:  ------FSVKSAFHLATHL----ASSQKASSS--------------------------THSTPTLSGSVYGILESKG---------ESGNHIFWECKLSK

Query:  KIWNTFIPLTIPLYDLYRGEWNPK--ENWRWM----SDNLQTEDLERAI-IILWSLWEHRNS----NRGFKG-----------KSGHLKLRQFPCDEEPP
          W     + IPL     GEW      N  W+    + N Q E   + +  +LW LW++RN      R F             +   ++     C  +P 
Subjt:  KIWNTFIPLTIPLYDLYRGEWNPK--ENWRWM----SDNLQTEDLERAI-IILWSLWEHRNS----NRGFKG-----------KSGHLKLRQFPCDEEPP

Query:  SHKS----WLPPLPGSW-KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVV-ESDAATV
         ++S    W PP P  W K N DA+W+      G+GW+LR+ +G    +G + + K        LK+++E         V+  SR     V+ ESD+  +
Subjt:  SHKS----WLPPLPGSW-KLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVV-ESDAATV

Query:  IRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAIS
        I ++N  DE    +   ++DL ++   F ++ FVF PR+ N  A+ +AR ++S
Subjt:  IRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAIS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.5e-1128.19Show/hide
Query:  EPPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRL
        +P  +  W PP     K N DAS   + +V GLGWILR+S+G+ +  G  +   R   +  E   ++  ++     G          ++ E D  T+ R+
Subjt:  EPPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLCLGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRL

Query:  INEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI
        IN +  +   +   ++ +      FE + F F  R+QN  AD LA+ AI
Subjt:  INEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.8e-2943.75Show/hide
Query:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASK-DRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIV
        +C  +       WW   + K K  W+ W KLC SK D GGLGFRDL  FNQA+LAK S+RII  P++LLS+ L+ +YF  +  +   +G+ PS  WRSI+
Subjt:  MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASK-DRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIV

Query:  WGRDLFTKGYKWKVGNGRHIAIDQDPWI
         GR+L ++G    +G+G H  +  D WI
Subjt:  WGRDLFTKGYKWKVGNGRHIAIDQDPWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGACGATATTAATCGAATGTGCGCCAATTTATGGTGGGGTGAGTTTAAAGGAAAAGACAAAGCCCACTGGATGGGATGGAACAAGTTGTGTGCTAGCAAAGATAG
AGGAGGGCTCGGTTTTAGAGACCTTAGACTTTTCAACCAAGCGATGCTTGCTAAAGTGAGCTGGAGGATTATTAAAAATCCTAATAGTCTCCTTTCCAAAACTCTCAAAG
GAAAATACTTCAAAGGAAACGATTTCCTTACGGCTCCGATTGGCTCGAATCCATCTCTTACATGGAGAAGTATTGTATGGGGCCGAGATCTTTTCACAAAAGGATACAAA
TGGAAAGTAGGCAACGGTAGGCACATCGCTATAGATCAAGATCCATGGATTTCTTCTAGAGGCAGTGAAGTCTCAACCCTGGGGAAAAGAGTCAAAGGACGAAATCATTT
GGAATTGGGAGAAAAAAGAGTCTTCTCGGTGAAAAGTGCTTTCCACCTAGCTACCCATTTAGCTAGCAGCCAGAAAGCATCATCCTCCACCCATTCCACCCCGACCCTTT
CTGGAAGCGTTTATGGGATATTAGAGTCCAAAGGAGAATCTGGAAATCATATCTTCTGGGAGTGTAAGCTTTCGAAGAAAATCTGGAACACTTTCATCCCTCTTACAATT
CCTCTTTATGACTTGTACAGGGGTGAATGGAATCCTAAAGAGAACTGGAGATGGATGAGTGACAATTTGCAGACAGAGGATTTGGAGAGAGCTATTATCATCCTTTGGAG
CCTTTGGGAGCATAGAAATTCGAACCGAGGATTTAAAGGAAAATCTGGCCATCTCAAGCTCCGCCAGTTCCCCTGCGACGAAGAACCTCCGAGTCACAAGTCATGGCTCC
CCCCTCTTCCCGGATCGTGGAAACTCAATGTGGATGCTTCCTGGGACTCTAAGAGAAGCGTGGGAGGTTTAGGGTGGATACTTCGTGACTCTGAAGGATCTTCGTTATGC
CTGGGTTTTAAACGAATTACAAAACGTTGGCCCATAAAACTGCTGGAACTAAAAGCTATCGTCGAAGGTTTGAAGAATTTACCTTCCTCAGGTGTGATCGAATCCTCCCG
GCTCCCCCCTCCGATCGTGGTGGAGTCCGATGCAGCTACAGTTATTCGGCTGATAAACGAAGAGGACGAAGACTTGTCCGAAATCTCCTTTTTGATGGAGGATCTAAACA
AGGTGAAGAAGCCTTTTGAAGACTTATCTTTCGTCTTCTGCCCCAGAGATCAAAACTTGGCCGCTGATTCTTTGGCGCGCATGGCGATCTCCCCCCCTTCTCTTTCTGTT
TTGGTTTCCTCTTCCAATAGGGAAGAGGAAGAGGGCTTTTGGAGTGGGCACCCCCGTATTGTATTAAAAGCCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGACGATATTAATCGAATGTGCGCCAATTTATGGTGGGGTGAGTTTAAAGGAAAAGACAAAGCCCACTGGATGGGATGGAACAAGTTGTGTGCTAGCAAAGATAG
AGGAGGGCTCGGTTTTAGAGACCTTAGACTTTTCAACCAAGCGATGCTTGCTAAAGTGAGCTGGAGGATTATTAAAAATCCTAATAGTCTCCTTTCCAAAACTCTCAAAG
GAAAATACTTCAAAGGAAACGATTTCCTTACGGCTCCGATTGGCTCGAATCCATCTCTTACATGGAGAAGTATTGTATGGGGCCGAGATCTTTTCACAAAAGGATACAAA
TGGAAAGTAGGCAACGGTAGGCACATCGCTATAGATCAAGATCCATGGATTTCTTCTAGAGGCAGTGAAGTCTCAACCCTGGGGAAAAGAGTCAAAGGACGAAATCATTT
GGAATTGGGAGAAAAAAGAGTCTTCTCGGTGAAAAGTGCTTTCCACCTAGCTACCCATTTAGCTAGCAGCCAGAAAGCATCATCCTCCACCCATTCCACCCCGACCCTTT
CTGGAAGCGTTTATGGGATATTAGAGTCCAAAGGAGAATCTGGAAATCATATCTTCTGGGAGTGTAAGCTTTCGAAGAAAATCTGGAACACTTTCATCCCTCTTACAATT
CCTCTTTATGACTTGTACAGGGGTGAATGGAATCCTAAAGAGAACTGGAGATGGATGAGTGACAATTTGCAGACAGAGGATTTGGAGAGAGCTATTATCATCCTTTGGAG
CCTTTGGGAGCATAGAAATTCGAACCGAGGATTTAAAGGAAAATCTGGCCATCTCAAGCTCCGCCAGTTCCCCTGCGACGAAGAACCTCCGAGTCACAAGTCATGGCTCC
CCCCTCTTCCCGGATCGTGGAAACTCAATGTGGATGCTTCCTGGGACTCTAAGAGAAGCGTGGGAGGTTTAGGGTGGATACTTCGTGACTCTGAAGGATCTTCGTTATGC
CTGGGTTTTAAACGAATTACAAAACGTTGGCCCATAAAACTGCTGGAACTAAAAGCTATCGTCGAAGGTTTGAAGAATTTACCTTCCTCAGGTGTGATCGAATCCTCCCG
GCTCCCCCCTCCGATCGTGGTGGAGTCCGATGCAGCTACAGTTATTCGGCTGATAAACGAAGAGGACGAAGACTTGTCCGAAATCTCCTTTTTGATGGAGGATCTAAACA
AGGTGAAGAAGCCTTTTGAAGACTTATCTTTCGTCTTCTGCCCCAGAGATCAAAACTTGGCCGCTGATTCTTTGGCGCGCATGGCGATCTCCCCCCCTTCTCTTTCTGTT
TTGGTTTCCTCTTCCAATAGGGAAGAGGAAGAGGGCTTTTGGAGTGGGCACCCCCGTATTGTATTAAAAGCCTCTTAA
Protein sequenceShow/hide protein sequence
MCDDINRMCANLWWGEFKGKDKAHWMGWNKLCASKDRGGLGFRDLRLFNQAMLAKVSWRIIKNPNSLLSKTLKGKYFKGNDFLTAPIGSNPSLTWRSIVWGRDLFTKGYK
WKVGNGRHIAIDQDPWISSRGSEVSTLGKRVKGRNHLELGEKRVFSVKSAFHLATHLASSQKASSSTHSTPTLSGSVYGILESKGESGNHIFWECKLSKKIWNTFIPLTI
PLYDLYRGEWNPKENWRWMSDNLQTEDLERAIIILWSLWEHRNSNRGFKGKSGHLKLRQFPCDEEPPSHKSWLPPLPGSWKLNVDASWDSKRSVGGLGWILRDSEGSSLC
LGFKRITKRWPIKLLELKAIVEGLKNLPSSGVIESSRLPPPIVVESDAATVIRLINEEDEDLSEISFLMEDLNKVKKPFEDLSFVFCPRDQNLAADSLARMAISPPSLSV
LVSSSNREEEEGFWSGHPRIVLKAS