CuGenDBv2

Lag0017663 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017663
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr5:6476477..6478243
RNA-Seq Expression: Lag0017663
Synteny: Lag0017663
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
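
The genome location listed for this gene can be turned into a feature length with a few lines of Python. This is a minimal sketch: it assumes the `chr:start..end` string uses 1-based, inclusive coordinates (the page does not state this), and `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:6476477..6478243")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the gene spans 1,767 bp on chr5.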


Homology
GenBank top hits | e value | % identity | Alignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 2.8e-169 | 56.01
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI
        MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+ 
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI

Query:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR
           SV EVMSLLLTQE++NESKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R   NRN+ QCQ+C K G++A R
Subjt:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR

Query:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP
        C+FRY    P +N + +SP  H       N  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Subjt:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP

Query:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV
        T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L++GLY+F +  PS       ++  + + +  +  S T L  
Subjt:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV

Query:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI
             L++WHRRL HP L IVK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S +G+RYYI+FVD +SRYTWI
Subjt:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI

Query:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
        YFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Subjt:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
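
The % identity column can be related to the alignment blocks by counting characters in the match line (the unlabeled line under each `Query:` line). The sketch below assumes the usual BLAST convention, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. `alignment_stats` is a hypothetical helper, and the input is only the first 12 columns of the alignment above, so the result is illustrative rather than the value reported for the whole hit.

```python
def alignment_stats(query, match_line):
    """Return (identities, positives, length) for one alignment block.

    The match line is padded to the query width because trailing
    spaces are often lost when a page is saved as plain text.
    """
    length = len(query)
    match = match_line.ljust(length)[:length]
    identities = sum(1 for ch in match if ch.isalpha())
    positives = identities + match.count("+")
    return identities, positives, length


def percent(part, whole):
    """Percentage of `part` in `whole`, or 0.0 for an empty alignment."""
    return 100.0 * part / whole if whole else 0.0


# First 12 alignment columns of the KAA0048297.1 hit.
ids, pos, n = alignment_stats("MSEDILHQMIHC", "MSE+IL+QM+HC")
print(ids, pos, n, percent(ids, n))
```

Summing the counts over every block of a hit, then dividing once, gives the figure for the full alignment.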

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum] | 1.8e-136 | 46.54
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI
        MSE    QMI C ++  +W+ + Q+F TR+ A++M+ K +LQT++KG +S+K+Y  K++ Y+D L+A G  +  +D IL IL G+G +YES+V  +++++
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI

Query:  GPQSVHEVMSLLLTQENRNES-KLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGR-TWNNRNRIQCQVCGKFGHTAQ
           S+ EV +LLL  E R E+  ++   TA PSVN+T +P    +E+   +   Y        RGRG GR  + RGGR  W+N  R  CQ+CG  GH A+
Subjt:  GPQSVHEVMSLLLTQENRNES-KLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGR-TWNNRNRIQCQVCGKFGHTAQ

Query:  RCYFRYAP---PGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFS
         CY+R+     P  S    +    FN+S+    +P  A   T  +   +  WYPDSGA++H+T+  GNLSV +EY GG++V VGNGAGL I N G S+ +
Subjt:  RCYFRYAP---PGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFS

Query:  SPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVL
            ++R F L NLLHVP ITKNLISVS+FA DN V+FEFHP+ C VKD A+  VLL+GTLH GLYRFNL     S P+     +Q+ +S       + L
Subjt:  SPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVL

Query:  SVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYT
         +   + L+ WH RL HPS+A VK VL     ++S N++  FC++C LGK H LPF  S T +SAP +++ SDLWGPA+IPS +G RYYI+FVD ++RYT
Subjt:  SVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYT

Query:  WIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
        WIYFLK KS+    F+ F+ + E      IKT Q+D GGEF+S ++   S GI HRF+CP+TSKQNG+VERKHRHVVDTGL+LL+H+S+P ++
Subjt:  WIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 1.3e-121 | 43.97
Query:  GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESP
        G+++++Y +K++ Y D L+  G  +   DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+SS++    SVN T S       S 
Subjt:  GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESP

Query:  KPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------------------SFSPHFN-
          N N YPSS       F G    RG    +  RG        + QCQ+C KFGHT  RC++RY P    N PA                  S S   N 
Subjt:  KPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------------------SFSPHFN-

Query:  -----QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSIT
              +     + +M AM+  P+   +  W+PDSGATNH+TH  GNL+ G EY G +++H+GNG GL I + G S F S +  N+V FL N+L VP+I 
Subjt:  -----QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSIT

Query:  KNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSSTPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHR
        KNL+SVSQFARDN V+FEFHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S   +  D    T  + SL    +S     + S     ++WH+
Subjt:  KNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSSTPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHR

Query:  RLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALN
        RL HP+  IV  VL   +   S  +    C+AC LGK+H+LPF  S TVY+ PLQL+VSDLWGPA I S  G+ YY++FVD +SRYTW+YFLK+KS    
Subjt:  RLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALN

Query:  VFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
         FL FK   E   G  +KTFQ+D GGEF+S  +     GI HR +CPHTSKQNGI+ERKHRH+V+ GL LL+ +S+PLKY
Subjt:  VFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 9.8e-130 | 43.09
Query:  LHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSV
        L Q++ CSS   +W+ + Q F +++ A++M  K+++Q ++K G+++++Y +K++ Y D L+  G  +   DHIL I+ GLG +YES+++VIS+K    S+
Subjt:  LHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSV

Query:  HEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQ
          V S L+  E R   K+SS++    SVN T S       S   N N YPSS       F G    RG    +  RG        + QCQ+C KFGHT  
Subjt:  HEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQ

Query:  RCYFRYAPPGPSNNPA------------------SFSPHFN------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGN
        RC++RY P    N PA                  S S   N       +     + +M AM+  P+   +  W+PDSGATNH+TH  GNL+ G EY G +
Subjt:  RCYFRYAPPGPSNNPA------------------SFSPHFN------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGN

Query:  QVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSS
        ++H+GNG GL I + G S F S +  N+V FL N+L VP+I KNL+SVSQFARDN V+FEFHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S
Subjt:  QVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSS

Query:  STPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIV
           +  D    T  + SL    +S     + S     ++WH+RL HP+  IV  VL   +   S  +    C+AC LGK+H+LPF  S TVY+ PLQL+V
Subjt:  STPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIV

Query:  SDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVER
        SDLWGPA I S  G+ YY++FVD +SRYTW+YFLK+KS     FL FK   E   G  +KTFQ+D GGEF+S  +     GI HR +CPHTSKQNGI+ER
Subjt:  SDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVER

Query:  KHRHVVDTGLALLSHSSMPLKY
        KHRH+V+ GL LL+ +S+PLKY
Subjt:  KHRHVVDTGLALLSHSSMPLKY

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 2.8e-169 | 56.01
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI
        MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+ 
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI

Query:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR
           SV EVMSLLLTQE++NESKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R   NRN+ QCQ+C K G++A R
Subjt:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR

Query:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP
        C+FRY    P +N + +SP  H       N  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Subjt:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP

Query:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV
        T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L++GLY+F +  PS       ++  + + +  +  S T L  
Subjt:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV

Query:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI
             L++WHRRL HP L IVK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S +G+RYYI+FVD +SRYTWI
Subjt:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI

Query:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
        YFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Subjt:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

TrEMBL top hits | e value | % identity | Alignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein | 8.9e-137 | 46.54
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI
        MSE    QMI C ++  +W+ + Q+F TR+ A++M+ K +LQT++KG +S+K+Y  K++ Y+D L+A G  +  +D IL IL G+G +YES+V  +++++
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI

Query:  GPQSVHEVMSLLLTQENRNES-KLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGR-TWNNRNRIQCQVCGKFGHTAQ
           S+ EV +LLL  E R E+  ++   TA PSVN+T +P    +E+   +   Y        RGRG GR  + RGGR  W+N  R  CQ+CG  GH A+
Subjt:  GPQSVHEVMSLLLTQENRNES-KLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGR-TWNNRNRIQCQVCGKFGHTAQ

Query:  RCYFRYAP---PGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFS
         CY+R+     P  S    +    FN+S+    +P  A   T  +   +  WYPDSGA++H+T+  GNLSV +EY GG++V VGNGAGL I N G S+ +
Subjt:  RCYFRYAP---PGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFS

Query:  SPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVL
            ++R F L NLLHVP ITKNLISVS+FA DN V+FEFHP+ C VKD A+  VLL+GTLH GLYRFNL     S P+     +Q+ +S       + L
Subjt:  SPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVL

Query:  SVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYT
         +   + L+ WH RL HPS+A VK VL     ++S N++  FC++C LGK H LPF  S T +SAP +++ SDLWGPA+IPS +G RYYI+FVD ++RYT
Subjt:  SVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYT

Query:  WIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
        WIYFLK KS+    F+ F+ + E      IKT Q+D GGEF+S ++   S GI HRF+CP+TSKQNG+VERKHRHVVDTGL+LL+H+S+P ++
Subjt:  WIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 6.2e-122 | 43.97
Query:  GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESP
        G+++++Y +K++ Y D L+  G  +   DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+SS++    SVN T S       S 
Subjt:  GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESP

Query:  KPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------------------SFSPHFN-
          N N YPSS       F G    RG    +  RG        + QCQ+C KFGHT  RC++RY P    N PA                  S S   N 
Subjt:  KPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------------------SFSPHFN-

Query:  -----QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSIT
              +     + +M AM+  P+   +  W+PDSGATNH+TH  GNL+ G EY G +++H+GNG GL I + G S F S +  N+V FL N+L VP+I 
Subjt:  -----QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSIT

Query:  KNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSSTPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHR
        KNL+SVSQFARDN V+FEFHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S   +  D    T  + SL    +S     + S     ++WH+
Subjt:  KNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSSTPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHR

Query:  RLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALN
        RL HP+  IV  VL   +   S  +    C+AC LGK+H+LPF  S TVY+ PLQL+VSDLWGPA I S  G+ YY++FVD +SRYTW+YFLK+KS    
Subjt:  RLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALN

Query:  VFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
         FL FK   E   G  +KTFQ+D GGEF+S  +     GI HR +CPHTSKQNGI+ERKHRH+V+ GL LL+ +S+PLKY
Subjt:  VFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 4.7e-130 | 43.09
Query:  LHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSV
        L Q++ CSS   +W+ + Q F +++ A++M  K+++Q ++K G+++++Y +K++ Y D L+  G  +   DHIL I+ GLG +YES+++VIS+K    S+
Subjt:  LHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSV

Query:  HEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQ
          V S L+  E R   K+SS++    SVN T S       S   N N YPSS       F G    RG    +  RG        + QCQ+C KFGHT  
Subjt:  HEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQ

Query:  RCYFRYAPPGPSNNPA------------------SFSPHFN------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGN
        RC++RY P    N PA                  S S   N       +     + +M AM+  P+   +  W+PDSGATNH+TH  GNL+ G EY G +
Subjt:  RCYFRYAPPGPSNNPA------------------SFSPHFN------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGN

Query:  QVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSS
        ++H+GNG GL I + G S F S +  N+V FL N+L VP+I KNL+SVSQFARDN V+FEFHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S
Subjt:  QVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSS

Query:  STPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIV
           +  D    T  + SL    +S     + S     ++WH+RL HP+  IV  VL   +   S  +    C+AC LGK+H+LPF  S TVY+ PLQL+V
Subjt:  STPIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIV

Query:  SDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVER
        SDLWGPA I S  G+ YY++FVD +SRYTW+YFLK+KS     FL FK   E   G  +KTFQ+D GGEF+S  +     GI HR +CPHTSKQNGI+ER
Subjt:  SDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVER

Query:  KHRHVVDTGLALLSHSSMPLKY
        KHRH+V+ GL LL+ +S+PLKY
Subjt:  KHRHVVDTGLALLSHSSMPLKY

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.4e-169 | 56.01
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI
        MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+ 
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI

Query:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR
           SV EVMSLLLTQE++NESKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R   NRN+ QCQ+C K G++A R
Subjt:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR

Query:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP
        C+FRY    P +N + +SP  H       N  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Subjt:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP

Query:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV
        T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L++GLY+F +  PS       ++  + + +  +  S T L  
Subjt:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV

Query:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI
             L++WHRRL HP L IVK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S +G+RYYI+FVD +SRYTWI
Subjt:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI

Query:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
        YFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Subjt:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.4e-169 | 56.01
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI
        MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+ 
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKI

Query:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR
           SV EVMSLLLTQE++NESKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R   NRN+ QCQ+C K G++A R
Subjt:  GPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQR

Query:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP
        C+FRY    P +N + +SP  H       N  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Subjt:  CYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP

Query:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV
        T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L++GLY+F +  PS       ++  + + +  +  S T L  
Subjt:  TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV

Query:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI
             L++WHRRL HP L IVK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S +G+RYYI+FVD +SRYTWI
Subjt:  SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWI

Query:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
        YFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Subjt:  YFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 7.0e-14 | 21.12
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQ-KGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAK
        +S+  L+      + + I   L  ++  ++LA  + ++ +L +++    MSL  +F    + +  L A G  ++  D I  +L  L S Y+ +++ I   
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQ-KGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAK

Query:  IGPQ-SVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQ
             ++  V + LL QE +   K   ++T+   +N  V            N N Y ++           R +  +     N++ +++C  CG+ GH  +
Subjt:  IGPQ-SVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQ

Query:  RCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPT
         C F Y      NN    +    Q+   +    M   +    +  +  +  DSGA++HL        +  E    + V V     + +   G   +++  
Subjt:  RCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPT

Query:  CANRV-----FFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPT
           R+       L ++L       NL+SV +  ++ G+  EF                                  S   I K+  +    S  L++ P 
Subjt:  CANRV-----FFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPT

Query:  V------LSVSGGSDLNVWHRRLDHPSLAIVKSVLR--LQQPQMSINN---DFQFCTACALGKTHSLPF--FPSHTVYSAPLQLIVSDLWGPAYIPSVSG
        +      ++    ++  +WH R  H S   +  + R  +   Q  +NN     + C  C  GK   LPF      T    PL ++ SD+ GP    ++  
Subjt:  V------LSVSGGSDLNVWHRRLDHPSLAIVKSVLR--LQQPQMSINN---DFQFCTACALGKTHSLPF--FPSHTVYSAPLQLIVSDLWGPAYIPSVSG

Query:  YRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKS--FSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLAL
          Y++ FVD F+ Y   Y +K KSD  ++F  F    E    L +     D+G E+ S          GIS+  T PHT + NG+ ER  R + +    +
Subjt:  YRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKS--FSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLAL

Query:  LSHSSM
        +S + +
Subjt:  LSHSSM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.3e-33 | 24.14
Query:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKG-GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAK
        +S+D+++ +I   + + IW+ L  ++ ++ L   + +K +L  +    G +   + +     +  L+ +G  ++ ED  + +L+ L S Y+++ + I   
Subjt:  MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKG-GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAK

Query:  IGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSN--RGGRTWNNRNRIQ-----CQVCGK
             + +V S LL  E                       K P+++           +     RGR   RSS+N  R G    ++NR +     C  C +
Subjt:  IGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSN--RGGRTWNNRNRIQ-----CQVCGK

Query:  FGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPD-----INH----------DTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVH
         GH  + C      P P       S   N  N        AAM+   D     IN           ++ W  D+ A++H T    +L      G    V 
Subjt:  FGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPD-----INH----------DTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVH

Query:  VGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKD
        +GN +   I   G       T       L ++ HVP +  NLIS     RD    +E +      +      V+ +G     LYR N  +          
Subjt:  VGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKD

Query:  TAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPS
                  L+++   +SV      ++WH+R+ H S   ++ + +      +     + C  C  GK H + F  S       L L+ SD+ GP  I S
Subjt:  TAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPS

Query:  VSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKS--FSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTG
        + G +Y++TF+D  SR  W+Y LK+K     VF KF   VE   G  +K  +SD+GGE+ S  F    +S+GI H  T P T + NG+ ER +R +V+  
Subjt:  VSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKS--FSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTG

Query:  LALLSHSSMPLKY
         ++L  + +P  +
Subjt:  LALLSHSSMPLKY

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein | 2.1e-13 | 25.34
Query:  LHVPSITKNLISVSQFARDNGVFFEFHPTLCYVK---DQASGRVLLQGTLHEGLYRFNLS--VPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLN
        LH P+I  +L+S+S+ A  N        T C+ +   +++ G VL     H   Y  +    +PS    I K T      S+S++  P  L         
Subjt:  LHVPSITKNLISVSQFARDNGVFFEFHPTLCYVK---DQASGRVLLQGTLHEGLYRFNLS--VPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLN

Query:  VWHRRLDHPSLAIVKSVLR------LQQPQMSINNDFQF-CTACALGK-THSLPFFPSHTVYS---APLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSR
          HR L H +   ++  L+      L++  +  +N   + C  C +GK T       S   Y     P Q + +D++GP +    S   Y+I+F D  +R
Subjt:  VWHRRLDHPSLAIVKSVLR------LQQPQMSINNDFQF-CTACALGK-THSLPFFPSHTVYS---APLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSR

Query:  YTWIYFL--KSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEF--KSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMP
        + W+Y L  + +   LNVF      ++N     +   Q D G E+  K+      + GI+  +T    S+ +G+ ER +R +++    LL  S +P
Subjt:  YTWIYFL--KSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEF--KSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.2e-74 | 33.56
Query:  IWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQEN
        IW  L +I+   +   + +++T+L+   KG  ++ +Y   +    D L+ +GKP+D ++ +  +L  L  +Y+ ++  I+AK  P ++ E+   LL  E+
Subjt:  IWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQEN

Query:  RNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTW----------NNRNRI---QCQVCGKFGHTAQRCYFRY
        +      SS T +P     VS +   + +   N          GNR       ++N   + W          NN+++    +CQ+CG  GH+A+RC    
Subjt:  RNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTW----------NNRNRI---QCQVCGKFGHTAQRCYFRY

Query:  APPGPSNNPASFSPHFNQSNRPNQFP--QMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANR
              +    F    N    P+ F   Q  A L         +W  DSGAT+H+T  F NLS+   Y GG+ V V +G+ +PI + G +S S+    +R
Subjt:  APPGPSNNPASFSPHFNQSNRPNQFP--QMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANR

Query:  VFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSD
           L+N+L+VP+I KNLISV +    NGV  EF P    VKD  +G  LLQG   + LY + +   +SS P+            SL +SP     S  + 
Subjt:  VFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSD

Query:  LNVWHRRLDHPSLAIVKSVL-RLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLK
         + WH RL HP+ +I+ SV+       ++ ++ F  C+ C + K++ +PF  S    + PL+ I SD+W  + I S   YRYY+ FVD F+RYTW+Y LK
Subjt:  LNVWHRRLDHPSLAIVKSVL-RLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLK

Query:  SKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
         KS     F+ FK  +EN     I TF SD+GGEF +     + +GISH  + PHT + NG+ ERKHRH+V+TGL LLSH+S+P  Y
Subjt:  SKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.4e-70 | 35.66
Query:  DALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGN
        D L+ +GKP+D ++ +  +L  L  DY+ ++  I+AK  P S+ E+   L+ +E++  + L+S+E    + N+       ++ + +   N   +     N
Subjt:  DALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGN

Query:  RGRGGGRSSSNRGGRTWNNRNRI---QCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTH
          R      S+ G R+ N + +    +CQ+C   GH+A+RC   +     +N   S SP      R N        L      +  +W  DSGAT+H+T 
Subjt:  RGRGGGRSSSNRGGRTWNNRNRI---QCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTH

Query:  SFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEG
         F NLS    Y GG+ V + +G+ +PI + G  S S PT ++R   LN +L+VP+I KNLISV +    N V  EF P    VKD  +G  LLQG   + 
Subjt:  SFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEG

Query:  LYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQ-PQMSINNDFQFCTACALGKTHSLPFFPSHTVY
        LY +         PI    AV      S+ +SP   +         WH RL HPSLAI+ SV+     P ++ ++    C+ C + K+H +PF  S    
Subjt:  LYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQ-PQMSINNDFQFCTACALGKTHSLPFFPSHTVY

Query:  SAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTS
        S PL+ I SD+W  + I S+  YRYY+ FVD F+RYTW+Y LK KS   + F+ FK  VEN     I T  SD+GGEF      L+ +GISH  + PHT 
Subjt:  SAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTS

Query:  KQNGIVERKHRHVVDTGLALLSHSSMPLKY
        + NG+ ERKHRH+V+ GL LLSH+S+P  Y
Subjt:  KQNGIVERKHRHVVDTGLALLSHSSMPLKY
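
Because a smaller e value indicates a more significant match, the SwissProt hits can be ranked by converting the e value strings to floats and sorting in ascending order. The sketch below is illustrative only: the dictionary is typed in by hand from the SwissProt hit lines above, and `swissprot_hits` is a hypothetical variable name.

```python
# Accession -> e value string, as read from the SwissProt hit lines.
swissprot_hits = {
    "P04146": "7.0e-14",
    "P10978": "2.3e-33",
    "Q07791": "2.1e-13",
    "Q94HW2": "1.2e-74",
    "Q9ZT94": "1.4e-70",
}

# Smaller e value means a more significant hit, so sort ascending.
ranked = sorted(swissprot_hits, key=lambda acc: float(swissprot_hits[acc]))

for accession in ranked:
    print(accession, swissprot_hits[accession])
```

On these values the RE1 and RE2 polyproteins (Q94HW2, Q9ZT94) rank ahead of the TNT 1-94 entry (P10978), even though the gene description names TNT 1-94.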

Arabidopsis top hits | e value | % identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.8e-04 | 25
Query:  SSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLL
        S+++ IW  +   F     A+ +++ ++L+T   G M + +Y+ K+++  D+L  V  PV   + ++++L+GL   ++++++VI  +    S  +  ++L
Subjt:  SSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLL

Query:  LTQENRNESKLSSSET----ALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRS-SSNRGGR-------TWNNRNR
          +E+R +  +  + T    +  S  L  S  PP +   +   N        G RGRG G +    RGGR       T+N+ NR
Subjt:  LTQENRNESKLSSSET----ALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRS-SSNRGGR-------TWNNRNR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.2e-08 | 28.8
Query:  MSEDILHQMIHCSST-KAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAK
        +++ +L  +I    T + +W  L  +F     A+ ++ + +L+T     +S+ EY  K++   D L+ V  P+     ++ +L+GL   Y+ +++VI  K
Subjt:  MSEDILHQMIHCSST-KAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAK

Query:  IGPQSVHEVMSLLLTQENR--NESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRS-SSNRGGRT----WNNRN
            S  E  S+LL +E+R  N+SK S S T  PS++  +   P   E        YP  +   N   G GRS   NRGG +    +NN N
Subjt:  IGPQSVHEVMSLLLTQENR--NESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRS-SSNRGGRT----WNNRN

ATMG00300.1 Gag-Pol-related retrotransposon family protein    1.1e-06    25
Query:  RVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHS
        R +L+G  H+ LY                     +L  S+ +  + L+ +   +  +WH RL H S   ++ +++      S  +  +FC  C  GKTH 
Subjt:  RVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHS

Query:  LPFFPSHTVYSAPLQLIVSDLWGPAYIP
        + F         PL  + SDLWG   +P
Subjt:  LPFFPSHTVYSAPLQLIVSDLWGPAYIP


Sequences
CDS sequence
ATGTCGGAAGATATCCTTCATCAGATGATTCATTGCTCTTCAACGAAAGCTATATGGTCTTGTCTCGGGCAAATCTTCACTACACGTAACTTGGCCCAAATGATGAAAAT
CAAAACCAAGTTACAAACCATCCAGAAGGGAGGTATGTCACTCAAAGAATATTTTTCAAAAATCCAACAATATGTTGATGCTTTGTCTGCTGTTGGGAAACCGGTGGATG
TTGAGGACCATATCTTATTTATTTTATCTGGTTTGGGCTCTGACTATGAATCGATGGTGTCTGTCATTTCTGCTAAAATTGGTCCTCAGTCGGTTCACGAAGTTATGTCG
CTTTTATTGACTCAAGAAAATCGTAATGAAAGTAAGTTGTCAAGTTCTGAAACTGCCCTCCCCTCTGTGAACCTTACGGTGAGTCCGAAACCTCCTGATTCTGAGTCCCC
GAAACCTAATCCCAATCCATATCCTTCATCCTTCACTGGTGGGAATCGAGGGCGTGGTGGTGGTCGTTCTAGTTCCAACCGTGGAGGACGCACCTGGAACAACCGTAATC
GGATTCAATGTCAAGTGTGTGGGAAATTTGGCCACACTGCTCAACGTTGCTATTTTCGTTATGCCCCGCCTGGTCCCTCTAATAATCCTGCCTCATTTTCTCCACACTTT
AATCAGTCTAATCGTCCAAATCAATTTCCACAGATGGCTGCTATGCTCACTGCTCCTGATATTAATCATGATACCAGCTGGTACCCTGACTCCGGTGCAACGAATCATCT
TACTCATTCCTTTGGTAATCTCTCAGTAGGTACCGAGTATGGTGGCGGAAATCAAGTTCATGTGGGAAATGGAGCAGGTTTGCCAATACTTAACTATGGTTACTCTTCTT
TTTCTTCTCCTACTTGTGCTAATCGGGTCTTCTTTTTAAATAATCTTCTTCATGTCCCTTCCATAACGAAAAATCTTATTAGTGTGAGTCAGTTTGCTAGAGATAATGGT
GTATTTTTTGAGTTTCATCCAACGTTGTGTTATGTGAAGGATCAAGCATCTGGTCGGGTTCTGCTCCAAGGGACTCTCCATGAAGGTCTCTACCGCTTCAACCTCTCAGT
TCCCTCGTCTTCCACGCCGATTAAGAAGGATACAGCGGTTCAGACTCTTCTTTCTCAGTCTCTCTCTTCTTCTCCGACTGTCTTATCTGTGTCTGGTGGTTCTGACTTAA
ATGTATGGCATAGACGTCTCGACCATCCTAGTTTAGCCATTGTTAAATCTGTTTTACGGTTACAACAGCCTCAAATGTCCATAAATAATGATTTTCAGTTCTGTACTGCC
TGTGCGTTGGGGAAAACTCATAGTTTACCTTTTTTTCCCTCTCATACAGTGTACTCTGCCCCTCTTCAATTAATAGTATCAGATCTTTGGGGCCCTGCTTATATACCTTC
TGTCTCAGGCTATCGTTACTATATTACATTTGTGGATGTCTTTAGTCGGTATACATGGATCTATTTTTTGAAATCTAAGTCTGATGCTTTGAATGTGTTTCTTAAATTCA
AATTACATGTGGAAAATCTTCTAGGTTTATCTATCAAAACCTTCCAATCTGACAGTGGAGGTGAGTTCAAATCTTTTTCTTCCATGTTGAATAGCTATGGCATTTCTCAT
CGCTTTACTTGTCCTCACACTTCCAAACAAAACGGCATTGTTGAGCGTAAGCACAGACATGTAGTGGATACAGGGTTAGCTCTTCTTTCTCATTCCTCTATGCCTTTAAA
ATATTGA
mRNA sequence
ATGTCGGAAGATATCCTTCATCAGATGATTCATTGCTCTTCAACGAAAGCTATATGGTCTTGTCTCGGGCAAATCTTCACTACACGTAACTTGGCCCAAATGATGAAAAT
CAAAACCAAGTTACAAACCATCCAGAAGGGAGGTATGTCACTCAAAGAATATTTTTCAAAAATCCAACAATATGTTGATGCTTTGTCTGCTGTTGGGAAACCGGTGGATG
TTGAGGACCATATCTTATTTATTTTATCTGGTTTGGGCTCTGACTATGAATCGATGGTGTCTGTCATTTCTGCTAAAATTGGTCCTCAGTCGGTTCACGAAGTTATGTCG
CTTTTATTGACTCAAGAAAATCGTAATGAAAGTAAGTTGTCAAGTTCTGAAACTGCCCTCCCCTCTGTGAACCTTACGGTGAGTCCGAAACCTCCTGATTCTGAGTCCCC
GAAACCTAATCCCAATCCATATCCTTCATCCTTCACTGGTGGGAATCGAGGGCGTGGTGGTGGTCGTTCTAGTTCCAACCGTGGAGGACGCACCTGGAACAACCGTAATC
GGATTCAATGTCAAGTGTGTGGGAAATTTGGCCACACTGCTCAACGTTGCTATTTTCGTTATGCCCCGCCTGGTCCCTCTAATAATCCTGCCTCATTTTCTCCACACTTT
AATCAGTCTAATCGTCCAAATCAATTTCCACAGATGGCTGCTATGCTCACTGCTCCTGATATTAATCATGATACCAGCTGGTACCCTGACTCCGGTGCAACGAATCATCT
TACTCATTCCTTTGGTAATCTCTCAGTAGGTACCGAGTATGGTGGCGGAAATCAAGTTCATGTGGGAAATGGAGCAGGTTTGCCAATACTTAACTATGGTTACTCTTCTT
TTTCTTCTCCTACTTGTGCTAATCGGGTCTTCTTTTTAAATAATCTTCTTCATGTCCCTTCCATAACGAAAAATCTTATTAGTGTGAGTCAGTTTGCTAGAGATAATGGT
GTATTTTTTGAGTTTCATCCAACGTTGTGTTATGTGAAGGATCAAGCATCTGGTCGGGTTCTGCTCCAAGGGACTCTCCATGAAGGTCTCTACCGCTTCAACCTCTCAGT
TCCCTCGTCTTCCACGCCGATTAAGAAGGATACAGCGGTTCAGACTCTTCTTTCTCAGTCTCTCTCTTCTTCTCCGACTGTCTTATCTGTGTCTGGTGGTTCTGACTTAA
ATGTATGGCATAGACGTCTCGACCATCCTAGTTTAGCCATTGTTAAATCTGTTTTACGGTTACAACAGCCTCAAATGTCCATAAATAATGATTTTCAGTTCTGTACTGCC
TGTGCGTTGGGGAAAACTCATAGTTTACCTTTTTTTCCCTCTCATACAGTGTACTCTGCCCCTCTTCAATTAATAGTATCAGATCTTTGGGGCCCTGCTTATATACCTTC
TGTCTCAGGCTATCGTTACTATATTACATTTGTGGATGTCTTTAGTCGGTATACATGGATCTATTTTTTGAAATCTAAGTCTGATGCTTTGAATGTGTTTCTTAAATTCA
AATTACATGTGGAAAATCTTCTAGGTTTATCTATCAAAACCTTCCAATCTGACAGTGGAGGTGAGTTCAAATCTTTTTCTTCCATGTTGAATAGCTATGGCATTTCTCAT
CGCTTTACTTGTCCTCACACTTCCAAACAAAACGGCATTGTTGAGCGTAAGCACAGACATGTAGTGGATACAGGGTTAGCTCTTCTTTCTCATTCCTCTATGCCTTTAAA
ATATTGA
Protein sequence
MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMS
LLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHF
NQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNG
VFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTA
CALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISH
RFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY