; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036080 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036080
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:38637654..38641117
RNA-Seq ExpressionLag0036080
SyntenyLag0036080
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015382226.1 uncharacterized protein LOC107175329, partial [Citrus sinensis]2.1e-9139.76Show/hide
Query:  QEVEKAVSQLFPTKAPGPDGYPALFYKKYWSLV--------------------------------GLKCSNAPEISHLLFADDSLIFCKVEEVELLALKN
        +E+ +A++Q+ PTKAPGPDG PA F++K+W  V                                GL+ +++  ISHLLFADDSL+F +  + +   LK 
Subjt:  QEVEKAVSQLFPTKAPGPDGYPALFYKKYWSLV--------------------------------GLKCSNAPEISHLLFADDSLIFCKVEEVELLALKN

Query:  LLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIP
        +   Y  ASG+  N+ KS+++FS  V      I+  +   N V    KYLG+PS+  R K   F+    +IWN + SW+   FS  G+E+LIK++ QA+P
Subjt:  LLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIP

Query:  TYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELG
         YAMS+F  P  +CE+I K+ ARFWWGS+  +R IHW KWEK C  K  GG+ FRD   FNQALIAKQ WRI+ +P SL++R LK  YFN S  +  +LG
Subjt:  TYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELG

Query:  RKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLKGNVLDMDIDIISSIPV-NLNLRDKL
         +PS++W+S++WGR+++ KG+R+++G+G+   +++  WLPR   FKP         + V + +  + NW  +    + +  D  +I  I + N    D+ 
Subjt:  RKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLKGNVLDMDIDIISSIPV-NLNLRDKL

Query:  IWHFDKTEKYSVKSG
        +WH+DK  KYSVKSG
Subjt:  IWHFDKTEKYSVKSG

XP_015388578.1 uncharacterized protein LOC107178209 [Citrus sinensis]4.3e-8941.25Show/hide
Query:  QEVEKAVSQLFPTKAPGPDGYPALFYKKYWSLVGLKCSNAPEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRS
        +E+ +A+SQ+ PTKAPGPDG  A   +K   + GLK      +SHLLFADDSL+F K    +   LK +  SY  ASG+  N+ KS++ +S  +  +  S
Subjt:  QEVEKAVSQLFPTKAPGPDGYPALFYKKYWSLVGLKCSNAPEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRS

Query:  ILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKK
         + ++     +  F KYLG+PS+  R K+  F+ V  K+ + + +W+  FFS  GKE+LIK++ QA+P YAMS+F  P  LCE+I K+ A+FWWG+  +K
Subjt:  ILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKK

Query:  RKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNR
          IHW   E+    K  GGL FRD+  FNQAL+AKQ WRI+ +  SL++R LK  YF +   L+ ++G  PS++W+S++WGR+++LKG R+R+G+G   +
Subjt:  RKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNR

Query:  MFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLKGNVLDMDIDIISSIPV-NLNLRDKLIWHFDKTEKYSVKSG
        ++   WLPR +TF+PI       ++ V + +     W  D ++   +  D DII  IP+   + RD+L WH+DK  +YS+KSG
Subjt:  MFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLKGNVLDMDIDIISSIPV-NLNLRDKLIWHFDKTEKYSVKSG

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]3.8e-9340.85Show/hide
Query:  IETNLENMGAGQEVEKAVSQLFPTKAPGPDGYPALFYKKYWSLV--------------------------------GLKCSNAPEISHLLFADDSLIFCK
        +   L++    +EV  A+SQ+ PTKAPGPDG PA F++K+W  V                                G+      ++SHLLFADDSLIF +
Subjt:  IETNLENMGAGQEVEKAVSQLFPTKAPGPDGYPALFYKKYWSLV--------------------------------GLKCSNAPEISHLLFADDSLIFCK

Query:  VEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKE
            +   LK L + Y +ASG+  NF KS++ FSKG   D  + +S +     V    KYLG+PS+  R     F+ V  ++ N + SW+  FF+  GKE
Subjt:  VEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKE

Query:  ILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYF
        +LIK++ QAIPTYAMS+F  P  LCE+I K+ ARFWWG+   ++ IHW +WE+    K  GG+ FRD+  FNQAL+AKQ WRI+  PSSLV+R LK  YF
Subjt:  ILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYF

Query:  NNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLKGNVLDMDIDIISSI
         ++  ++  LG KPS++W+S++WGR++L KG R+R+GNG +  ++ + W+PR +TFKPI       ++TV E +     W  D +  +    D + I  I
Subjt:  NNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLKGNVLDMDIDIISSI

Query:  PVNLNLR-DKLIWHFDKTEKYSVKSG
        P+    + D+LIWH+DK   YSVKSG
Subjt:  PVNLNLR-DKLIWHFDKTEKYSVKSG

XP_024156142.1 uncharacterized protein LOC112164137 [Rosa chinensis]2.1e-8345.38Show/hide
Query:  APEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKI
        AP I+HL FADDS +F K E  E   +K +LK Y+ ASG+ +NF KS I FSK V +  +  L+ V G   VD   KYLG+P+  S +K + F ++M+K 
Subjt:  APEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKI

Query:  WNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWR
         N +++WK    S+ GKE++IKS+ Q++PTY MS F  PK LC+E+ +  A FWWG ++K RKIHW  W+K C+PK  GGL FR++  FNQAL+AKQ WR
Subjt:  WNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWR

Query:  ILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI-CLNYNMFNSTVDEFLS-HSGNW
        IL +P SL+ + LK  YF N+D +   + +  SY W+SL+ G+ LL KG+RF+VG G+   ++ DPW+PR  +F+P   +   + + TV + +   S +W
Subjt:  ILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI-CLNYNMFNSTVDEFLS-HSGNW

Query:  DCDKLKGNVLDMDIDIISSIPVNL-NLRDKLIWHFDKTEKYSVKSG
          D L+      ++D+I  IP++L N  D+LIWHFDK   YSVKSG
Subjt:  DCDKLKGNVLDMDIDIISSIPVNL-NLRDKLIWHFDKTEKYSVKSG

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis]2.5e-8445.38Show/hide
Query:  APEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKI
        AP I+HL FADDS +F K E  E   +K +LK Y+ ASG+ +NF KS I FSK V +  +  L+ V G   VD   KYLG+P+  S +K++ F ++M+K 
Subjt:  APEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKI

Query:  WNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWR
         N +++WK    S+ GKE++IKS+ Q++PTY MS F  PK LC+E+ +  A FWWG ++K RKIHW  W+K C+PK  GGL FR++  FNQAL+AKQ WR
Subjt:  WNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWR

Query:  ILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI-CLNYNMFNSTVDEFLS-HSGNW
        IL +P SL+ + LK  YF N+D +   + +  SY W+SL+ G+ LL KG+RF+VG+G+   ++ DPW+PR  +F+P   +   + + TV + +   S +W
Subjt:  ILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI-CLNYNMFNSTVDEFLS-HSGNW

Query:  DCDKLKGNVLDMDIDIISSIPVNL-NLRDKLIWHFDKTEKYSVKSG
          D L+      ++D+I  IP++L N  D+LIWHFDK   YSVKSG
Subjt:  DCDKLKGNVLDMDIDIISSIPVNL-NLRDKLIWHFDKTEKYSVKSG

TrEMBL top hitse value%identityAlignment
A0A803NTN0 Uncharacterized protein1.7e-8341.53Show/hide
Query:  SLVGLKCS-NAPEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKS
        +L GL+ + NAP +SHLLFADDSL+FC+  +    A+K +L +Y +ASG+ +N NKS + FS       ++  ++ L     +   +YLG+PS   R+K 
Subjt:  SLVGLKCS-NAPEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKS

Query:  KDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFN
        + FS++ +K+W  + +W    FS  GKE+L+K++ Q+IPTYAMS F   KK C ++    A FWWG+N    KIHW +W+  C  K  GG+ FR    FN
Subjt:  KDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFN

Query:  QALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDE
        QAL+AKQ WRI   P+SL+SR LK  YF+N+  L   +G  PSY W+S+ WGR+LL+KG+RF+VGNG++     DPW+P  + FKP+       + +V  
Subjt:  QALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDE

Query:  FLSHSGNWDCDKLKGNVLDMDIDIISSIPVNLNL-RDKLIWHFDKTEKYSVKSG
        F++    W+ D L      +D++ I +IP++    +D+LIWH   +  Y+VKSG
Subjt:  FLSHSGNWDCDKLKGNVLDMDIDIISSIPVNLNL-RDKLIWHFDKTEKYSVKSG

A0A803PM68 Uncharacterized protein9.4e-9044.92Show/hide
Query:  WSLVGLKCSNAPEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKS
        +S  G       E+SHL FADDSL+F +  E E    + LL+ Y  ASG+ +NF+KS + F + V    +S L++++G   VDN+GKYLG+PS   R K 
Subjt:  WSLVGLKCSNAPEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKS

Query:  KDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFN
        + F ++  K+WN ++ WK SFFS  GKEILIK+I QAIPTY MS F  PKK    I    ARFWWGS++K  KIHWCKW   C  K  GGL FRD+G FN
Subjt:  KDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFN

Query:  QALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDE
        QAL+AKQ+WR +  P+SL S+ LK  YF N  +L  + G   S++W+SL+WG++++ KG R+R+GNG+S R+ +DPWLPR  TFK            V +
Subjt:  QALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDE

Query:  FLSHSGNWDCDKLKGNVLDMDIDIISSIPVN-LNLRDKLIWHFDKTEKYSVKSG
         +  +G WD + ++      D ++I  +P +   + DK++WH+ K  +YSV+SG
Subjt:  FLSHSGNWDCDKLKGNVLDMDIDIISSIPVN-LNLRDKLIWHFDKTEKYSVKSG

A0A803PV25 Uncharacterized protein1.1e-8544.87Show/hide
Query:  ISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNS
        +SHL FADDSL+F    E E    + LL+ Y  ASG+ +NF+KS + F + V    R+ L++ +G   VDN+GKYLG+PS   R K + F ++ +K+WN 
Subjt:  ISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNS

Query:  VQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILS
        ++ WK SFFS  GKE+LIK+I QAIPTY MS F  PKK    I    ARFWWGS++K  KIHWCKW   C  K  GGL FRD+G FNQAL+AKQ+WR + 
Subjt:  VQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILS

Query:  NPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKL
         P+SL S+ LK  Y+ N  +L  + G   S++W+SL+WG++++  G R+R+GNG+S R+  DPWLPR  TFK         N  V +    +G WD + +
Subjt:  NPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKL

Query:  KGNVLDMDIDIISSIPVN-LNLRDKLIWHFDKTEKYSVKSG
        +      D ++I  +  +  ++ DK++WH+ K  +YSV+SG
Subjt:  KGNVLDMDIDIISSIPVN-LNLRDKLIWHFDKTEKYSVKSG

A0A803Q0L5 Uncharacterized protein5.3e-8545.45Show/hide
Query:  ISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNS
        +SHL FADDSLIF   E       K+LL+ Y  ASG+ +N++KS + F + V  + R  L+  +G   VDN GKYLG+PS   RNK +    + +K+W  
Subjt:  ISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNS

Query:  VQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILS
        ++ WK S FS+ GKE+LIK++ QAIPTYAMS F   KK    I +  ARFWWGS++K +KIHWCKW   C PK  GGL FRD+G FNQA++AKQVWR + 
Subjt:  VQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILS

Query:  NPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKL
          ++L SR LK  YF +  IL  + G   S++W+SLIWG+++++ G R+RVGNG + R+ +DPWLPR  TFK         N  V +     G+WD   +
Subjt:  NPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKL

Query:  KGNVLDMDIDIISSIP-VNLNLRDKLIWHFDKTEKYSVKSG
        +      D ++I S+P     L DK++WH+ K  +Y+VKSG
Subjt:  KGNVLDMDIDIISSIP-VNLNLRDKLIWHFDKTEKYSVKSG

A0A803Q1K6 Uncharacterized protein3.7e-8646.33Show/hide
Query:  ISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNS
        ISHL FAD SLIF           + LL  Y  ASG+ +N++KS   F + V  + RS+L+ +LG   V+N GKYLG+PS   RNK +    + +K+W  
Subjt:  ISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFNKSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNS

Query:  VQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILS
        ++ WK S FS+ GKE+LIK+I QAIPTY MS F  PKK    + +  +RFWWGS+DK++KIHWCKW   C PK  GGL FRD+G FNQAL+AKQ+WR L 
Subjt:  VQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILS

Query:  NPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKL
        +P  L SR LK  YF    +L    G   S++ +SL+WG++L+LKG R+RVGNG S R+ +DPWLPR  TFK         N  V +     G WD   +
Subjt:  NPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKL

Query:  KGNVLDMDIDIISSIPV-NLNLRDKLIWHFDKTEKYSVKSG
        +      D D+I  IP  + +  DK++WH+ K  +YSVKSG
Subjt:  KGNVLDMDIDIISSIPV-NLNLRDKLIWHFDKTEKYSVKSG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.4e-2126.01Show/hide
Query:  VPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGG
        +P +  R     F  +++++ + +  W+    S  G+  L K++  ++P ++MS    P+ +   + +    F WGS  +K+K H  KW K C PK  GG
Subjt:  VPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGG

Query:  LNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSY--LWKSLIWG-RELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI
        L  R     N+ALI+K  WR+L   +SL +  L+  Y       S  L  K S+   W+S+  G R+++  G+ +  G+G   R + D W+    + KP+
Subjt:  LNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSY--LWKSLIWG-RELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI

Query:  CLNYNMFNST------VDEFLSHSGNWDCDKLKGNVLDMDIDIISSIPVNL--NLRDKLIWHFDKTEKYSVKS
            N    T        +       WD  K+     +     + ++ ++L    RD+L W F +  ++SV+S
Subjt:  CLNYNMFNST------VDEFLSHSGNWDCDKLKGNVLDMDIDIISSIPVNL--NLRDKLIWHFDKTEKYSVKS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-0436.84Show/hide
Query:  QVVHRTEKETAYALNCDEILLSHEPANFDEAMQSKESDMWLIAMNEEMASLMKNETWVLSEKPPDHKLVDCKWLFK
        +V  R    T Y L  D+     EP +  E +   E +  + AM EEM SL KN T+ L E P   + + CKW+FK
Subjt:  QVVHRTEKETAYALNCDEILLSHEPANFDEAMQSKESDMWLIAMNEEMASLMKNETWVLSEKPPDHKLVDCKWLFK

P93295 Uncharacterized mitochondrial protein AtMg003103.4e-3644.74Show/hide
Query:  AIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPK-GLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILS
        A+P YAMS F   K LC+++T +   FWW S + KRKI W  W+K C  K   GGL FRD+G FNQAL+AKQ +RI+  P +L+SR L+  YF +S ++ 
Subjt:  AIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPK-GLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILS

Query:  TELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI
          +G +PSY W+S+I GRELL +G+   +G+G   +++ D W+  E+   P+
Subjt:  TELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein4.3e-1031.43Show/hide
Query:  LKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRM----FQDPWLPR----ESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLK
        +K  YF +  IL  ++ ++ SY W SL+ G  LL KG R  +G+G + R+      D   PR    E T+K + +N N+F      +      WD  K+ 
Subjt:  LKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRM----FQDPWLPR----ESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLK

Query:  GNVLDMDIDIISSIPVNLNLR-DKLIWHFDKTEKYSVKSG
          V   D   I  I +  + + DK+IW+++ T +Y+V+SG
Subjt:  GNVLDMDIDIISSIPVNLNLR-DKLIWHFDKTEKYSVKSG

AT4G29090.1 Ribonuclease H-like superfamily protein2.9e-3532.6Show/hide
Query:  AIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILST
        A+PTY M+ F  PK +C++I    A FWW +  + + +HW  W+     K  GG+ F+D+  FN AL+ KQ+WR+LS P SL+++  K  YF+ SD L+ 
Subjt:  AIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILST

Query:  ELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICL------NYNMFNS--TVDEFLSHSG-NWDCDKLKGNVLDMDIDIISS
         LG +PS++WKS+   +E+L +G R  VGNG    +++  WL  +     + +       Y   +S   V + +  SG  W  D ++    +++  +I  
Subjt:  ELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPICL------NYNMFNS--TVDEFLSHSG-NWDCDKLKGNVLDMDIDIISS

Query:  I-PVNLNLRDKLIWHFDKTEKYSVKSG
        + P    + D   W +  +  Y+VKSG
Subjt:  I-PVNLNLRDKLIWHFDKTEKYSVKSG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.4e-3744.74Show/hide
Query:  AIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPK-GLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILS
        A+P YAMS F   K LC+++T +   FWW S + KRKI W  W+K C  K   GGL FRD+G FNQAL+AKQ +RI+  P +L+SR L+  YF +S ++ 
Subjt:  AIPTYAMSIFGFPKKLCEEITKSFARFWWGSNDKKRKIHWCKWEKFCLPK-GLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILS

Query:  TELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI
          +G +PSY W+S+I GRELL +G+   +G+G   +++ D W+  E+   P+
Subjt:  TELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQDPWLPRESTFKPI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.5e-0443.24Show/hide
Query:  WLIAMNEEMASLMKNETWVLSEKPPDHKLVDCKWLFK
        W  AM EE+ +L +N+TW+L   P +  ++ CKW+FK
Subjt:  WLIAMNEEMASLMKNETWVLSEKPPDHKLVDCKWLFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGCAAGTTGTGCATAGAACAGAGAAAGAAACAGCTTATGCCTTAAACTGTGATGAAATCTTGTTAAGTCATGAACCTGCAAATTTTGATGAAGCAATGCAGTC
GAAAGAAAGTGATATGTGGTTAATTGCTATGAATGAAGAAATGGCCTCATTGATGAAAAATGAGACTTGGGTGCTTTCAGAAAAACCGCCAGATCATAAGCTTGTTGATT
GCAAATGGTTGTTTAAATGGCTGCAGTGCAGGCCTGCCAGGGATTATGAAAATCATTTGTTGGAACGCTCGGGGGTTGGGGAACCCGAGAACGTTCCGTGCAATTCGAGA
CCTTATATTCGTAACCTGGATTGGGTTTACTCAGACCATAGACCTATTGAAACTAATTTGGAGAACATGGGAGCTGGTCAGGAAGTTGAAAAGGCAGTGAGTCAATTGTT
TCCTACTAAAGCTCCAGGGCCGGATGGTTACCCTGCCTTATTTTATAAAAAATACTGGTCTTTGGTAGGGTTGAAATGTTCAAATGCGCCAGAAATCTCTCACCTCCTAT
TCGCAGATGACAGCCTCATCTTTTGTAAGGTAGAAGAGGTGGAATTATTGGCCTTAAAAAACCTACTAAAGTCATACGATAGGGCTTCTGGAGAATGTATAAATTTTAAC
AAATCTGCCATTATGTTTTCTAAAGGAGTAGGTCTTGACACTAGGTCTATTCTCAGTTCAGTTTTAGGAAACAATTGTGTAGATAATTTTGGCAAATATCTTGGAGTTCC
CTCCGTATTTTCAAGGAATAAATCTAAGGATTTTAGCTATGTTATGGACAAAATTTGGAATTCAGTTCAGAGTTGGAAAAGGTCTTTTTTCTCTTTGGTTGGGAAGGAAA
TACTGATAAAGAGTATAGGACAAGCTATTCCAACCTATGCTATGAGTATCTTTGGATTCCCAAAAAAGCTTTGTGAAGAGATTACCAAAAGTTTTGCTAGATTTTGGTGG
GGCTCCAATGATAAGAAAAGAAAAATTCATTGGTGTAAATGGGAGAAATTTTGCCTACCAAAAGGCTTAGGGGGTCTTAATTTTAGAGATGTGGGAGGTTTTAACCAAGC
TTTAATAGCCAAACAAGTATGGAGAATTCTTTCCAATCCATCTTCTTTAGTTTCACGGTTTCTAAAAGGGATCTATTTTAATAATTCTGATATATTATCTACAGAATTAG
GGAGGAAGCCTTCTTATCTTTGGAAGAGTCTTATTTGGGGTCGTGAGCTTCTATTAAAAGGTATTAGATTTAGGGTAGGCAATGGATCATCCAATAGAATGTTCCAAGAT
CCGTGGTTACCTAGAGAATCTACCTTCAAGCCCATCTGTTTAAACTACAATATGTTTAACTCAACAGTAGATGAGTTTCTTTCTCATTCAGGTAATTGGGATTGTGATAA
ACTAAAGGGTAATGTTCTAGACATGGATATAGATATTATTAGCAGTATTCCGGTCAATCTTAATTTGAGGGATAAGCTAATATGGCATTTTGATAAAACCGAAAAATATT
CTGTTAAGAGTGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAGCAAGTTGTGCATAGAACAGAGAAAGAAACAGCTTATGCCTTAAACTGTGATGAAATCTTGTTAAGTCATGAACCTGCAAATTTTGATGAAGCAATGCAGTC
GAAAGAAAGTGATATGTGGTTAATTGCTATGAATGAAGAAATGGCCTCATTGATGAAAAATGAGACTTGGGTGCTTTCAGAAAAACCGCCAGATCATAAGCTTGTTGATT
GCAAATGGTTGTTTAAATGGCTGCAGTGCAGGCCTGCCAGGGATTATGAAAATCATTTGTTGGAACGCTCGGGGGTTGGGGAACCCGAGAACGTTCCGTGCAATTCGAGA
CCTTATATTCGTAACCTGGATTGGGTTTACTCAGACCATAGACCTATTGAAACTAATTTGGAGAACATGGGAGCTGGTCAGGAAGTTGAAAAGGCAGTGAGTCAATTGTT
TCCTACTAAAGCTCCAGGGCCGGATGGTTACCCTGCCTTATTTTATAAAAAATACTGGTCTTTGGTAGGGTTGAAATGTTCAAATGCGCCAGAAATCTCTCACCTCCTAT
TCGCAGATGACAGCCTCATCTTTTGTAAGGTAGAAGAGGTGGAATTATTGGCCTTAAAAAACCTACTAAAGTCATACGATAGGGCTTCTGGAGAATGTATAAATTTTAAC
AAATCTGCCATTATGTTTTCTAAAGGAGTAGGTCTTGACACTAGGTCTATTCTCAGTTCAGTTTTAGGAAACAATTGTGTAGATAATTTTGGCAAATATCTTGGAGTTCC
CTCCGTATTTTCAAGGAATAAATCTAAGGATTTTAGCTATGTTATGGACAAAATTTGGAATTCAGTTCAGAGTTGGAAAAGGTCTTTTTTCTCTTTGGTTGGGAAGGAAA
TACTGATAAAGAGTATAGGACAAGCTATTCCAACCTATGCTATGAGTATCTTTGGATTCCCAAAAAAGCTTTGTGAAGAGATTACCAAAAGTTTTGCTAGATTTTGGTGG
GGCTCCAATGATAAGAAAAGAAAAATTCATTGGTGTAAATGGGAGAAATTTTGCCTACCAAAAGGCTTAGGGGGTCTTAATTTTAGAGATGTGGGAGGTTTTAACCAAGC
TTTAATAGCCAAACAAGTATGGAGAATTCTTTCCAATCCATCTTCTTTAGTTTCACGGTTTCTAAAAGGGATCTATTTTAATAATTCTGATATATTATCTACAGAATTAG
GGAGGAAGCCTTCTTATCTTTGGAAGAGTCTTATTTGGGGTCGTGAGCTTCTATTAAAAGGTATTAGATTTAGGGTAGGCAATGGATCATCCAATAGAATGTTCCAAGAT
CCGTGGTTACCTAGAGAATCTACCTTCAAGCCCATCTGTTTAAACTACAATATGTTTAACTCAACAGTAGATGAGTTTCTTTCTCATTCAGGTAATTGGGATTGTGATAA
ACTAAAGGGTAATGTTCTAGACATGGATATAGATATTATTAGCAGTATTCCGGTCAATCTTAATTTGAGGGATAAGCTAATATGGCATTTTGATAAAACCGAAAAATATT
CTGTTAAGAGTGGTTAA
Protein sequenceShow/hide protein sequence
MMKQVVHRTEKETAYALNCDEILLSHEPANFDEAMQSKESDMWLIAMNEEMASLMKNETWVLSEKPPDHKLVDCKWLFKWLQCRPARDYENHLLERSGVGEPENVPCNSR
PYIRNLDWVYSDHRPIETNLENMGAGQEVEKAVSQLFPTKAPGPDGYPALFYKKYWSLVGLKCSNAPEISHLLFADDSLIFCKVEEVELLALKNLLKSYDRASGECINFN
KSAIMFSKGVGLDTRSILSSVLGNNCVDNFGKYLGVPSVFSRNKSKDFSYVMDKIWNSVQSWKRSFFSLVGKEILIKSIGQAIPTYAMSIFGFPKKLCEEITKSFARFWW
GSNDKKRKIHWCKWEKFCLPKGLGGLNFRDVGGFNQALIAKQVWRILSNPSSLVSRFLKGIYFNNSDILSTELGRKPSYLWKSLIWGRELLLKGIRFRVGNGSSNRMFQD
PWLPRESTFKPICLNYNMFNSTVDEFLSHSGNWDCDKLKGNVLDMDIDIISSIPVNLNLRDKLIWHFDKTEKYSVKSG