CuGenDBv2

Tan0001680 (gene) of Snake gourd v1 genome

Gene ID: Tan0001680
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG02:58075975..58078040
RNA-Seq Expression: Tan0001680
Synteny: Tan0001680
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025724 - GAG-pre-integrase domain
  IPR036875 - Zinc finger, CCHC-type superfamily
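The genome location in the record above follows a `SEQID:START..END` pattern. As a minimal sketch, the snippet below splits that string and computes the span of the gene. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the GFF convention), which this page does not state explicitly.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("LG02:58075975..58078040")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2066 bp on LG02.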


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.9e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.9e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.9e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.9e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.9e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value: 1.4e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

A0A5A7TWB9 Gag/pol protein; e value: 1.4e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

A0A5A7V4M1 Gag/pol protein; e value: 1.4e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

A0A5D3CPJ6 Gag/pol protein; e value: 1.4e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

A0A5D3CSZ6 Gag/pol protein; e value: 1.4e-131; % identity: 52.8
Query:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL
        EC Q+P + A R+VR+ Y+RW  ANEKA+ YI+ S+S+VLAKKHE M+TA+EIM+SLQE+FGQ S+Q++HD+LKY++NARM E +SVREHV +MM HFN+
Subjt:  ECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYVFNARMKEESSVREHVRDMMTHFNL

Query:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--
        AEMN A IDE+SQVSFILE+ P+SF                      LQ F+SLM+++  K EANVA   R ++RGSTSGTK + SS    K + K+G  
Subjt:  AEMNEASIDESSQVSFILETFPKSF----------------------LQNFQSLMRVRASKSEANVA--YRSYYRGSTSGTKPVASSCPKGKNRMKRG--

Query:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------
          K +   A+  KK    A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV +TCLV++ DS WI+DSGATNHV   F+G+    + E       
Subjt:  --KTDRGVAQKGKKVNKVAEKGKCFHCNGGGHWKRNCPKFLAERK--NQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCER------

Query:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR
                                          ++ NL                            ICSA LE+NLYV +  + K++LNTE+FK A T+
Subjt:  ----------------------------------VRGNL----------------------------ICSASLEHNLYVFKPNSVKSVLNTELFKMAETR

Query:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
         KR K+SPKENAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPV ESCLEGKMTK PF+GKG+RAKEPLELVH DLCGPMNVKARGG+EYF++F DD+
Subjt:  TKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value: 3.9e-09; % identity: 38
Query:  KENAHLWHLRLGHIN------LNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRA--KEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
        K N  LWH R GHI+      + R        LLN L E S  + E CL GK  + PF     +   K PL +VH D+CGP+         YFV F+D F
Subjt:  KENAHLWHLRLGHIN------LNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRA--KEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 6.4e-12; % identity: 37.21
Query:  LWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDD
        LWH R+GH++   ++ L K  L++  +  ++   + CL GK  +  F     R    L+LV+ D+CGPM +++ GG +YFV+FIDD
Subjt:  LWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDD

P25384 Transposon Ty2-C Gag-Pol polyprotein; e value: 6.8e-06; % identity: 29.69
Query:  SVKSVLNTELFKMAETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSL-----PVYE--SCLEGKMTKCPFSGKGYRAK-----EPL
        S K ++ + + K+      ++K   K    L H  LGH N   I+K +K   +  L+E+ +       Y+   CL GK TK     KG R K     EP 
Subjt:  SVKSVLNTELFKMAETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSL-----PVYE--SCLEGKMTKCPFSGKGYRAK-----EPL

Query:  ELVHFDLCGPMNVKARGGYEYFVSFIDD
        + +H D+ GP++   +    YF+SF D+
Subjt:  ELVHFDLCGPMNVKARGGYEYFVSFIDD

P93293 Uncharacterized mitochondrial protein AtMg00300; e value: 1.9e-08; % identity: 36.78
Query:  ETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNV
        ET       + K+   LWH RL H++   +E LVK G L+  + +SL   E C+ GK  +  FS   +  K PL+ VH DL G  +V
Subjt:  ETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNV

Q12491 Transposon Ty2-B Gag-Pol polyprotein; e value: 6.8e-06; % identity: 29.69
Query:  SVKSVLNTELFKMAETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSL-----PVYE--SCLEGKMTKCPFSGKGYRAK-----EPL
        S K ++ + + K+      ++K   K    L H  LGH N   I+K +K   +  L+E+ +       Y+   CL GK TK     KG R K     EP 
Subjt:  SVKSVLNTELFKMAETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSL-----PVYE--SCLEGKMTKCPFSGKGYRAK-----EPL

Query:  ELVHFDLCGPMNVKARGGYEYFVSFIDD
        + +H D+ GP++   +    YF+SF D+
Subjt:  ELVHFDLCGPMNVKARGGYEYFVSFIDD

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein; e value: 1.4e-09; % identity: 36.78
Query:  ETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNV
        ET       + K+   LWH RL H++   +E LVK G L+  + +SL   E C+ GK  +  FS   +  K PL+ VH DL G  +V
Subjt:  ETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNV


Sequences
CDS sequence
ATGATCCGCATGCTAGTTTGGCCTGGACCAGACAATCCCTTCGGAAGGCCTGATCATGGGAGTCAGAACACTGTGAATTCCCAAAAGGGATACAGTTTCCTGGAGTGTGC
TCAGATGCCAGGCTCGACTGCATTGCGAAGTGTTCGTGATGCATACGATCGATGGATCAGTGCCAATGAAAAGGCCAAGGTCTATATCATTGTCAGCATGTCTGATGTTT
TGGCAAAGAAGCATGAGCTAATGGTCACCGCTAAGGAGATCATGGAGTCCTTGCAGGAAATATTTGGACAACAGTCCTTTCAGGTCCGGCATGACTCGCTCAAATACGTC
TTCAACGCACGGATGAAAGAAGAATCGTCTGTCCGTGAACATGTTCGAGACATGATGACCCACTTTAATCTTGCTGAGATGAACGAGGCTTCGATCGACGAGTCGAGCCA
AGTCAGTTTTATCTTGGAGACTTTTCCGAAGAGTTTCCTTCAGAATTTCCAGTCCTTGATGAGAGTTAGGGCATCGAAATCTGAGGCAAATGTTGCTTACAGGTCTTATT
ACAGGGGTTCGACCTCTGGGACGAAACCTGTTGCTTCTTCATGCCCGAAAGGGAAGAACAGGATGAAGAGGGGTAAAACTGACCGAGGTGTCGCCCAGAAGGGCAAGAAG
GTCAACAAAGTTGCAGAAAAAGGAAAGTGTTTCCACTGCAATGGAGGCGGACACTGGAAGAGAAACTGTCCCAAATTCCTAGCCGAAAGGAAGAATCAAGGTAAATGTGA
TTTACTTGTGACAAAAACCTGTTTAGTGGACAGTAGTGACTCTACTTGGATATTGGATTCGGGCGCCACTAACCATGTTGTTCTTCTTTTCAGGGGATTGATTCCTGGCA
CCCGCTGCGAGAGGGTGAGAGGCAATCTTATTTGTTCCGCTTCACTTGAGCATAATCTGTATGTTTTCAAACCTAATTCGGTCAAAAGTGTTTTGAATACTGAATTGTTT
AAAATGGCAGAAACACGAACAAAAAGAGCGAAAGTTTCTCCTAAAGAAAATGCCCATCTTTGGCATCTGCGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGT
GAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTCTTTACCGGTGTATGAGTCATGCCTTGAGGGCAAAATGACCAAATGTCCTTTTAGTGGAAAAGGATATAGAGCCA
AAGAGCCCCTTGAGTTAGTACATTTTGACCTTTGTGGTCCGATGAATGTTAAAGCTCGAGGTGGTTATGAATACTTCGTGTCTTTCATAGACGATTTCTAG
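As a quick consistency check, the ends of the CDS above can be translated and compared with the protein sequence listed on this page. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), the helper name `translate` is hypothetical, and only the first 30 and last 24 nucleotides of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA string in frame 0; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGATCCGCATGCTAGTTTGGCCTGGACCA"   # first 30 nt of the CDS
cds_end = "GTGTCTTTCATAGACGATTTCTAG"           # last 24 nt of the CDS

print(translate(cds_start))  # MIRMLVWPGP
print(translate(cds_end))    # VSFIDDF* (the asterisk marks the stop codon)
```

Both fragments agree with the start (`MIRMLVWPGP...`) and end (`...VSFIDDF`) of the protein sequence given for Tan0001680.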
mRNA sequence
ATGATCCGCATGCTAGTTTGGCCTGGACCAGACAATCCCTTCGGAAGGCCTGATCATGGGAGTCAGAACACTGTGAATTCCCAAAAGGGATACAGTTTCCTGGAGTGTGC
TCAGATGCCAGGCTCGACTGCATTGCGAAGTGTTCGTGATGCATACGATCGATGGATCAGTGCCAATGAAAAGGCCAAGGTCTATATCATTGTCAGCATGTCTGATGTTT
TGGCAAAGAAGCATGAGCTAATGGTCACCGCTAAGGAGATCATGGAGTCCTTGCAGGAAATATTTGGACAACAGTCCTTTCAGGTCCGGCATGACTCGCTCAAATACGTC
TTCAACGCACGGATGAAAGAAGAATCGTCTGTCCGTGAACATGTTCGAGACATGATGACCCACTTTAATCTTGCTGAGATGAACGAGGCTTCGATCGACGAGTCGAGCCA
AGTCAGTTTTATCTTGGAGACTTTTCCGAAGAGTTTCCTTCAGAATTTCCAGTCCTTGATGAGAGTTAGGGCATCGAAATCTGAGGCAAATGTTGCTTACAGGTCTTATT
ACAGGGGTTCGACCTCTGGGACGAAACCTGTTGCTTCTTCATGCCCGAAAGGGAAGAACAGGATGAAGAGGGGTAAAACTGACCGAGGTGTCGCCCAGAAGGGCAAGAAG
GTCAACAAAGTTGCAGAAAAAGGAAAGTGTTTCCACTGCAATGGAGGCGGACACTGGAAGAGAAACTGTCCCAAATTCCTAGCCGAAAGGAAGAATCAAGGTAAATGTGA
TTTACTTGTGACAAAAACCTGTTTAGTGGACAGTAGTGACTCTACTTGGATATTGGATTCGGGCGCCACTAACCATGTTGTTCTTCTTTTCAGGGGATTGATTCCTGGCA
CCCGCTGCGAGAGGGTGAGAGGCAATCTTATTTGTTCCGCTTCACTTGAGCATAATCTGTATGTTTTCAAACCTAATTCGGTCAAAAGTGTTTTGAATACTGAATTGTTT
AAAATGGCAGAAACACGAACAAAAAGAGCGAAAGTTTCTCCTAAAGAAAATGCCCATCTTTGGCATCTGCGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGT
GAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTCTTTACCGGTGTATGAGTCATGCCTTGAGGGCAAAATGACCAAATGTCCTTTTAGTGGAAAAGGATATAGAGCCA
AAGAGCCCCTTGAGTTAGTACATTTTGACCTTTGTGGTCCGATGAATGTTAAAGCTCGAGGTGGTTATGAATACTTCGTGTCTTTCATAGACGATTTCTAG
Protein sequence
MIRMLVWPGPDNPFGRPDHGSQNTVNSQKGYSFLECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYV
FNARMKEESSVREHVRDMMTHFNLAEMNEASIDESSQVSFILETFPKSFLQNFQSLMRVRASKSEANVAYRSYYRGSTSGTKPVASSCPKGKNRMKRGKTDRGVAQKGKK
VNKVAEKGKCFHCNGGGHWKRNCPKFLAERKNQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCERVRGNLICSASLEHNLYVFKPNSVKSVLNTELF
KMAETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF
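The InterPro annotation lists a CCHC-type zinc finger (IPR001878). The sketch below scans the protein sequence above for the commonly cited CCHC "zinc knuckle" consensus, C-X2-C-X4-H-X4-C. This is a rough illustration under stated assumptions: InterPro assigns domains with profile-based models, not with this regular expression, so a regex hit only suggests where the annotated motif may lie. The variable names are hypothetical.

```python
import re

# Protein sequence of Tan0001680, copied from the record above.
PROTEIN = (
    "MIRMLVWPGPDNPFGRPDHGSQNTVNSQKGYSFLECAQMPGSTALRSVRDAYDRWISANEKAKVYIIVSMSDVLAKKHELMVTAKEIMESLQEIFGQQSFQVRHDSLKYV"
    "FNARMKEESSVREHVRDMMTHFNLAEMNEASIDESSQVSFILETFPKSFLQNFQSLMRVRASKSEANVAYRSYYRGSTSGTKPVASSCPKGKNRMKRGKTDRGVAQKGKK"
    "VNKVAEKGKCFHCNGGGHWKRNCPKFLAERKNQGKCDLLVTKTCLVDSSDSTWILDSGATNHVVLLFRGLIPGTRCERVRGNLICSASLEHNLYVFKPNSVKSVLNTELF"
    "KMAETRTKRAKVSPKENAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVYESCLEGKMTKCPFSGKGYRAKEPLELVHFDLCGPMNVKARGGYEYFVSFIDDF"
)

# Assumed consensus for a CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

hits = [(m.start() + 1, m.group()) for m in CCHC_PATTERN.finditer(PROTEIN)]
for position, motif in hits:
    # Positions are reported 1-based, counting from the initial methionine.
    print(position, motif)
```

The single hit, `CFHCNGGGHWKRNC`, falls in the same region that aligns to the conserved `CFHCN..GHWKRNCPK` stretch in the GenBank and TrEMBL alignments above.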