CuGenDBv2

Lag0032810 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032810
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr11:37816228..37817393
RNA-Seq Expression: Lag0032810
Synteny: Lag0032810
Gene Ontology terms: NA
InterPro domains: NA
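The genome location field above uses a `chromosome:start..end` layout. A minimal sketch of how such a string could be split into its parts follows. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr11:37816228..37817393'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr11:37816228..37817393")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 1166 bp on chr11.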


Homology
GenBank top hits (e value, % identity, alignment)
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] (e value: 4.8e-44; identity: 34.45%)
Query:  SSSSASAVVSASISSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLE---TNPVYGEWTTVDQA
        SSSS          S +   SS FG+ L+    +KLD  N++LW+ MV  I++G  +DG +  T+  P EF+  P+  G S     +NP Y +W   DQ 
Subjt:  SSSSASAVVSASISSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLE---TNPVYGEWTTVDQA

Query:  LLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPI
        L+GW++ SM+  +A  ++   T+  +WKALE ++GA  K+K N +R                                             GLDSEY+PI
Subjt:  LLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPI

Query:  VCTIEDKEINTRQELSSIMITFEGTLARY-TVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQ-HQQSNFSNNRGNSRGRGRGRNNFQ
        V  IE +E  T QE+   +++++  L     V A  + L   +AHLA    N+ N   N N     Q+ NQGGN+   +  F    G  RGRG GRN   
Subjt:  VCTIEDKEINTRQELSSIMITFEGTLARY-TVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQ-HQQSNFSNNRGNSRGRGRGRNNFQ

Query:  RNNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
         NNS+PT Q+CGK+ HSA  CY R+++++        SN+         + + ++ATPE ++D  W  +SGATNHVT D GNL LKS Y
Subjt:  RNNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense] (e value: 4.1e-43; identity: 33.75%)
Query:  SSSSASAVVSASISSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLET-----------NPVYG
        SSSS +        S +   SS FG+ L+    +KLD  N++LW+ MV  I++G  +DG +  T+  P EF+  P+  G    T           NP Y 
Subjt:  SSSSASAVVSASISSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLET-----------NPVYG

Query:  EWTTVDQALLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKVNQLR---------------------------------------------G
        +W   DQ L+GW++ SM+  +A  ++   T+  +WKALE ++GA  K+K N +R                                             G
Subjt:  EWTTVDQALLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKVNQLR---------------------------------------------G

Query:  LDSEYVPIVCTIEDKEINTRQELSSIMITFEGTLARY-TVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQ-HQQSNFSNNRGNSRGR
        LDSEY+PIV  IE +E  T QE+   +++++  L     V A  + L   +AHLA    N+ N   N N     Q+ NQGGN+   +  F    G  RGR
Subjt:  LDSEYVPIVCTIEDKEINTRQELSSIMITFEGTLARY-TVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQ-HQQSNFSNNRGNSRGR

Query:  GRGRNNFQRNNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
        G GRN    NNS+PT Q+CGK+ HSA  CY R+++++        SN+         + + ++ATPE ++D  W  +SGATNHVT D GNL LKS Y
Subjt:  GRGRNNFQRNNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] (e value: 2.3e-70; identity: 44.5%)
Query:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK
        ++SFGHPL TVLTVKLD+ NY LWRGMV A+LRGQ  DG+VLGT A+P +F+  P   G S  L+ NP Y EW  VDQALLGW+FGSM+P+IA D++ F+
Subjt:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK

Query:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT
        +SREVWKALE +YGAT KA++NQLR                                             GL++EY+PIVC IE K+  + QEL + ++T
Subjt:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT

Query:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH
        FE TL R  +   A    + D + +   + QN            GN+ F+Q   G  Q + S  SN+ + N RGRGRGR + ++ NNSKP+ QLCGKY H
Subjt:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH

Query:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
         A  CY+RF+E+FNN  ++N +            ++AY+A PEI+ +P WL +SGAT+HVT+D  NL +KS Y
Subjt:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] (e value: 2.3e-70; identity: 44.5%)
Query:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK
        ++SFGHPL TVLTVKLD+ NY LWRGMV A+LRGQ  DG+VLGT A+P +F+  P   G S  L+ NP Y EW  VDQALLGW+FGSM+P+IA D++ F+
Subjt:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK

Query:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT
        +SREVWKALE +YGAT KA++NQLR                                             GL++EY+PIVC IE K+  + QEL + ++T
Subjt:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT

Query:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH
        FE TL R  +   A    + D + +   + QN            GN+ F+Q   G  Q + S  SN+ + N RGRGRGR + ++ NNSKP+ QLCGKY H
Subjt:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH

Query:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
         A  CY+RF+E+FNN  ++N +            ++AY+A PEI+ +P WL +SGAT+HVT+D  NL +KS Y
Subjt:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

XP_030492910.1 uncharacterized protein LOC115709020 isoform X2 [Cannabis sativa] (e value: 1.6e-47; identity: 35.05%)
Query:  SASAVVSASISSAAKIASSSFGHPLSTV---LTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGK---SLETNPVYGEWTTVDQA
        S+ A V+++ + A     ++F  P ST+     +KLD NNY LW+ MV  I+RG  +DGFV GT+A P EF+   +  G+    ++ NP Y  W   DQ 
Subjt:  SASAVVSASISSAAKIASSSFGHPLSTV---LTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGK---SLETNPVYGEWTTVDQA

Query:  LLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKV---------------------------------------------NQLRGLDSEYVPI
        L+GW++ SM+ AIA +++   T+  +WKALE +YGA  K+K+                                             N L GLD+ Y+PI
Subjt:  LLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKV---------------------------------------------NQLRGLDSEYVPI

Query:  VCTIEDKEINTRQELSSIMITFEGTLARYTVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNS-RGRGRGRNNFQR
        V  IE +   T QEL  ++++F+  + R      +  L + + H A    N AN  +N +  RGN S     N  + +   NNRGN  RGRGRGR+N   
Subjt:  VCTIEDKEINTRQELSSIMITFEGTLARYTVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNS-RGRGRGRNNFQR

Query:  NNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
        NN+KPT Q+CGK+ HSA  CY R+ E+F       GS+    Q   +   +A+ A+PE+++   W  +SGA++HVT+DG N+  KS Y
Subjt:  NNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

TrEMBL top hits (e value, % identity, alignment)
A0A5C7HHE9 Uncharacterized protein (e value: 2.3e-44; identity: 34.45%)
Query:  SSSSASAVVSASISSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLE---TNPVYGEWTTVDQA
        SSSS          S +   SS FG+ L+    +KLD  N++LW+ MV  I++G  +DG +  T+  P EF+  P+  G S     +NP Y +W   DQ 
Subjt:  SSSSASAVVSASISSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLE---TNPVYGEWTTVDQA

Query:  LLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPI
        L+GW++ SM+  +A  ++   T+  +WKALE ++GA  K+K N +R                                             GLDSEY+PI
Subjt:  LLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPI

Query:  VCTIEDKEINTRQELSSIMITFEGTLARY-TVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQ-HQQSNFSNNRGNSRGRGRGRNNFQ
        V  IE +E  T QE+   +++++  L     V A  + L   +AHLA    N+ N   N N     Q+ NQGGN+   +  F    G  RGRG GRN   
Subjt:  VCTIEDKEINTRQELSSIMITFEGTLARY-TVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQ-HQQSNFSNNRGNSRGRGRGRNNFQ

Query:  RNNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
         NNS+PT Q+CGK+ HSA  CY R+++++        SN+         + + ++ATPE ++D  W  +SGATNHVT D GNL LKS Y
Subjt:  RNNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 (e value: 1.1e-70; identity: 44.5%)
Query:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK
        ++SFGHPL TVLTVKLD+ NY LWRGMV A+LRGQ  DG+VLGT A+P +F+  P   G S  L+ NP Y EW  VDQALLGW+FGSM+P+IA D++ F+
Subjt:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK

Query:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT
        +SREVWKALE +YGAT KA++NQLR                                             GL++EY+PIVC IE K+  + QEL + ++T
Subjt:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT

Query:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH
        FE TL R  +   A    + D + +   + QN            GN+ F+Q   G  Q + S  SN+ + N RGRGRGR + ++ NNSKP+ QLCGKY H
Subjt:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH

Query:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
         A  CY+RF+E+FNN  ++N +            ++AY+A PEI+ +P WL +SGAT+HVT+D  NL +KS Y
Subjt:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 (e value: 1.1e-70; identity: 44.5%)
Query:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK
        ++SFGHPL TVLTVKLD+ NY LWRGMV A+LRGQ  DG+VLGT A+P +F+  P   G S  L+ NP Y EW  VDQALLGW+FGSM+P+IA D++ F+
Subjt:  SSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKS--LETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK

Query:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT
        +SREVWKALE +YGAT KA++NQLR                                             GL++EY+PIVC IE K+  + QEL + ++T
Subjt:  TSREVWKALEKVYGATIKAKVNQLR---------------------------------------------GLDSEYVPIVCTIEDKEINTRQELSSIMIT

Query:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH
        FE TL R  +   A    + D + +   + QN            GN+ F+Q   G  Q + S  SN+ + N RGRGRGR + ++ NNSKP+ QLCGKY H
Subjt:  FEGTLARYTV--PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQ---GGNQHQQSNFSNN-RGNSRGRGRGR-NNFQRNNSKPTYQLCGKYEH

Query:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
         A  CY+RF+E+FNN  ++N +            ++AY+A PEI+ +P WL +SGAT+HVT+D  NL +KS Y
Subjt:  SAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

A0A803PAZ1 Uncharacterized protein (e value: 5.6e-46; identity: 34.57%)
Query:  SFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSES----GKSLETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK
        SFG+ L+   ++KLD NNY LWR +V  I+RG  ++G+V GTK  P+EF+  P       G  L+ NP Y  W   DQ L+GW++GSM+ +IA  ++   
Subjt:  SFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSES----GKSLETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFK

Query:  TSREVWKALEKVYGATIKAKV---------------------------------------------NQLRGLDSEYVPIVCTIE--DKEINTRQELSSIM
        ++R +W ALE +YGA  +AK+                                             N L GLD+EY+ IVC +E   K   T QE+  I+
Subjt:  TSREVWKALEKVYGATIKAKV---------------------------------------------NQLRGLDSEYVPIVCTIE--DKEINTRQELSSIM

Query:  ITFEGTLARYTV-PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQG-------GNQHQQSNFSNNRGNSRGRGRGRNNFQRNNSKPTYQLCGK
        ++F+  L R  +  A      +  A  +    N+ +   ++   RGN +F+         GN ++  N S N G  RGRGRGR      N+KPT Q+CG+
Subjt:  ITFEGTLARYTV-PANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQG-------GNQHQQSNFSNNRGNSRGRGRGRNNFQRNNSKPTYQLCGK

Query:  YEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
        Y HSA YCY R++E+F      N +N     G  + +  AYIATPEI++   W  +SGA+NH+T+D  N+  K+ Y
Subjt:  YEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

A0A803QD97 Uncharacterized protein (e value: 1.0e-47; identity: 34.31%)
Query:  SSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGK---SLETNPVYGEWTTVDQALLGWVFGSMSPAI
        SS++ ++   FG  L+    +KLD NN+ LW+ MV AI RG  +DG++ G +  P E++  P   G+   + E NP +  W   DQ L+GW++GSM+  I
Subjt:  SSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGK---SLETNPVYGEWTTVDQALLGWVFGSMSPAI

Query:  AADMISFKTSREVWKALEKVYGATIKAKV---------------------------------------------NQLRGLDSEYVPIVCTIEDKEINTRQ
        A +++   +S E+W +LE ++GA  KAK+                                             N L GLD EY+PIV  IE +E  T Q
Subjt:  AADMISFKTSREVWKALEKVYGATIKAKV---------------------------------------------NQLRGLDSEYVPIVCTIEDKEINTRQ

Query:  ELSSIMITFEGTLARYTVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNSRGRGRGRNNFQRNNSKPTYQLCGKYE
         L  ++++F+  L R      +S L + + H      N A+   N   + G+ ++N G N ++    S++  NSRGR  GR    R   KPT Q+CG+Y 
Subjt:  ELSSIMITFEGTLARYTVPANVSELPDLAAHLAFNCQNQANFGKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNSRGRGRGRNNFQRNNSKPTYQLCGKYE

Query:  HSAPYCYQRFEESF--NNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
        HSA YCY RF+E+F    P    G N++  Q     N+ A++ATPE+L D  W   SGA+NHVT++  NL  K+KY
Subjt:  HSAPYCYQRFEESF--NNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.1e-13; identity: 25.36%)
Query:  KLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGAT
        KL   NYL+W   V A+  G  + GF+ G+   P   I   +    +   NP Y  W   D+ +   V G++S ++   +    T+ ++W+ L K+Y   
Subjt:  KLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGAT

Query:  IKAKVNQLRGLDSEYVPIVCTIED----------------KEINTRQELSSIMITFE-------GTLARYTVPANVSELPD---------LAAHLAFNCQ
            V QLR    ++     TI+D                K ++  +++  ++             +A    P  ++E+ +         LA   A    
Subjt:  IKAKVNQLRGLDSEYVPIVCTIED----------------KEINTRQELSSIMITFE-------GTLARYTVPANVSELPD---------LAAHLAFNCQ

Query:  NQANFGKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNSRGRGRGRNNFQRNN--SKP---TYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGT-
          AN   + N    N   N  GN++ + +  NN  NS+   +   NF  NN  SKP     Q+CG   HSA  C Q         H  +  NSQ      
Subjt:  NQANFGKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNSRGRGRGRNNFQRNN--SKP---TYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGT-

Query:  TQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
        T     A +A     +   WL++SGAT+H+T+D  NL L   Y
Subjt:  TQTNSAAYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.1e-13; identity: 24.04%)
Query:  KLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGAT
        KL   NYL+W   V A+  G  + GF+ G+   P   I   +        NP Y  W   D+ +   + G++S ++   +    T+ ++W+ L K+Y   
Subjt:  KLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLETNPVYGEWTTVDQALLGWVFGSMSPAIAADMISFKTSREVWKALEKVYGAT

Query:  IKAKVNQLR--------------------------GLDSEYVPIVCTIEDKEI-NTRQELSSIMITFEGTLARYTVPANVSELPDLAAHLAFNCQNQANF
            V QLR                           L  +Y P++  I  K+   +  E+   +I  E  L    +  N +E+  + A++        N 
Subjt:  IKAKVNQLR--------------------------GLDSEYVPIVCTIEDKEI-NTRQELSSIMITFEGTLARYTVPANVSELPDLAAHLAFNCQNQANF

Query:  GKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNSRGRGRGRNNFQRNNSKP---TYQLCGKYEHSAPYCYQ--RFEESFNNPHAANGSNSQGFQGTTQTNSA
         +N N R  N+++N           +NNR NS       +       KP     Q+C    HSA  C Q  +F+ + N          Q     T     
Subjt:  GKNFNPRRGNQSFNQGGNQHQQSNFSNNRGNSRGRGRGRNNFQRNNSKP---TYQLCGKYEHSAPYCYQ--RFEESFNNPHAANGSNSQGFQGTTQTNSA

Query:  AYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY
        A +A     N   WL++SGAT+H+T+D  NL     Y
Subjt:  AYIATPEILNDPKWLVESGATNHVTADGGNLILKSKY

Arabidopsis top hits (e value, % identity, alignment)
No hits found
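
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how an identity percentage can be derived from such a midline, the snippet below counts these symbols for the first ten columns of the Q94HW2 alignment. The `midline_stats` helper is a hypothetical name, and the result for this short fragment is only illustrative; it is not the whole-alignment value reported in the table.

```python
from typing import Tuple


def midline_stats(midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for a BLAST-style midline."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First ten alignment columns of the Q94HW2 hit, copied from the page.
query = "KLDENNYLLW"
midline = "KL   NYL+W"

ident, pos, cols = midline_stats(midline)
pct_identity = 100.0 * ident / cols
print(f"{ident}/{cols} identities ({pct_identity:.1f}%), {pos}/{cols} positives")
```

Gap columns count toward the denominator, so the midline must keep its original width, including leading and trailing spaces, for the figure to be meaningful.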

Sequences
CDS sequence
ATGGGAGATGAAATTGGTTCTTCTTCTTCTGCTTCAGCAGTTGTCTCAGCCTCAATTTCTTCTGCTGCGAAAATTGCGAGTTCGTCATTTGGTCATCCACTCAGCACCGT
TCTCACGGTCAAATTAGATGAGAACAATTACTTGTTGTGGAGGGGAATGGTGTTTGCCATCCTCAGAGGTCAGAATGTAGATGGTTTTGTTCTTGGAACAAAAGCTCAAC
CTTCTGAATTCATTGAAGTCCCGTCAGAATCTGGAAAGAGTCTTGAAACAAATCCTGTCTATGGAGAGTGGACAACAGTCGATCAAGCGTTGCTAGGATGGGTGTTTGGT
TCTATGTCGCCTGCTATTGCTGCAGACATGATAAGTTTTAAAACATCTAGAGAAGTATGGAAGGCTTTAGAGAAGGTGTATGGAGCGACAATCAAGGCTAAGGTTAACCA
ACTGAGAGGGCTCGACTCGGAATATGTTCCTATTGTTTGCACCATTGAAGATAAGGAAATAAATACCCGGCAAGAGTTATCATCCATTATGATCACTTTTGAAGGGACTT
TGGCTCGATATACAGTCCCTGCAAATGTTAGTGAACTACCCGACCTTGCAGCTCATTTGGCTTTTAATTGCCAAAATCAGGCCAATTTTGGGAAGAACTTTAATCCTCGA
AGAGGGAATCAAAGTTTCAACCAAGGTGGTAATCAACACCAGCAGTCAAATTTTTCCAACAACCGTGGAAATAGTCGTGGCAGAGGACGAGGGCGAAATAACTTTCAACG
AAACAATTCCAAACCGACCTACCAGCTTTGTGGAAAATATGAGCATTCTGCTCCATATTGTTATCAACGGTTTGAAGAATCCTTCAACAATCCTCATGCGGCTAATGGCT
CAAACAGTCAAGGTTTTCAGGGAACAACTCAAACAAACTCAGCAGCCTATATTGCAACTCCGGAAATCCTGAATGACCCCAAATGGTTAGTAGAGAGTGGTGCTACTAAT
CATGTTACAGCAGATGGCGGTAATCTTATTTTAAAGTCTAAATACCTTTAG
mRNA sequence
ATGGGAGATGAAATTGGTTCTTCTTCTTCTGCTTCAGCAGTTGTCTCAGCCTCAATTTCTTCTGCTGCGAAAATTGCGAGTTCGTCATTTGGTCATCCACTCAGCACCGT
TCTCACGGTCAAATTAGATGAGAACAATTACTTGTTGTGGAGGGGAATGGTGTTTGCCATCCTCAGAGGTCAGAATGTAGATGGTTTTGTTCTTGGAACAAAAGCTCAAC
CTTCTGAATTCATTGAAGTCCCGTCAGAATCTGGAAAGAGTCTTGAAACAAATCCTGTCTATGGAGAGTGGACAACAGTCGATCAAGCGTTGCTAGGATGGGTGTTTGGT
TCTATGTCGCCTGCTATTGCTGCAGACATGATAAGTTTTAAAACATCTAGAGAAGTATGGAAGGCTTTAGAGAAGGTGTATGGAGCGACAATCAAGGCTAAGGTTAACCA
ACTGAGAGGGCTCGACTCGGAATATGTTCCTATTGTTTGCACCATTGAAGATAAGGAAATAAATACCCGGCAAGAGTTATCATCCATTATGATCACTTTTGAAGGGACTT
TGGCTCGATATACAGTCCCTGCAAATGTTAGTGAACTACCCGACCTTGCAGCTCATTTGGCTTTTAATTGCCAAAATCAGGCCAATTTTGGGAAGAACTTTAATCCTCGA
AGAGGGAATCAAAGTTTCAACCAAGGTGGTAATCAACACCAGCAGTCAAATTTTTCCAACAACCGTGGAAATAGTCGTGGCAGAGGACGAGGGCGAAATAACTTTCAACG
AAACAATTCCAAACCGACCTACCAGCTTTGTGGAAAATATGAGCATTCTGCTCCATATTGTTATCAACGGTTTGAAGAATCCTTCAACAATCCTCATGCGGCTAATGGCT
CAAACAGTCAAGGTTTTCAGGGAACAACTCAAACAAACTCAGCAGCCTATATTGCAACTCCGGAAATCCTGAATGACCCCAAATGGTTAGTAGAGAGTGGTGCTACTAAT
CATGTTACAGCAGATGGCGGTAATCTTATTTTAAAGTCTAAATACCTTTAG
Protein sequence
MGDEIGSSSSASAVVSASISSAAKIASSSFGHPLSTVLTVKLDENNYLLWRGMVFAILRGQNVDGFVLGTKAQPSEFIEVPSESGKSLETNPVYGEWTTVDQALLGWVFG
SMSPAIAADMISFKTSREVWKALEKVYGATIKAKVNQLRGLDSEYVPIVCTIEDKEINTRQELSSIMITFEGTLARYTVPANVSELPDLAAHLAFNCQNQANFGKNFNPR
RGNQSFNQGGNQHQQSNFSNNRGNSRGRGRGRNNFQRNNSKPTYQLCGKYEHSAPYCYQRFEESFNNPHAANGSNSQGFQGTTQTNSAAYIATPEILNDPKWLVESGATN
HVTADGGNLILKSKYL
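
The CDS and protein sequences listed above can be checked against each other by translating the nucleotides. The sketch below assumes the standard genetic code (the page does not say which translation table was used) and introduces a hypothetical `translate` helper. It translates only the first 30 and last 12 nucleotides of the CDS, copied from the page, rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGGAGATGAAATTGGTTCTTCTTCTTCT"  # first 30 nt of the CDS
cds_end = "AAATACCTTTAG"                      # last 12 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGDEIGSSSS` and `KYL`, matching the start and end of the protein sequence, with `TAG` as the terminating stop codon.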