; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015889 (gene) of Snake gourd v1 genome

Gene IDTan0015889
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:49355932..49359073
RNA-Seq ExpressionTan0015889
SyntenyTan0015889
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-22860.47Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL + KEK+            
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------

Query:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------
                    GA NHVCSS QETSSFK+LE++                                                                  
Subjt:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------

Query:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT
                                                   +ISPNNNTYLWHLRLGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPFT
Subjt:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT

Query:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS
        GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+S
Subjt:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS

Query:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC
        QLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLC
Subjt:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC

Query:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL
        QFVGYPKETRGGLF+DPQEN+V VSTNATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYL
Subjt:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL

Query:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ
        GL ETQVVI DDGVEDPLSYKQAMNDVDKDQ
Subjt:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-22459.64Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL +  EK+            
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------

Query:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------
                    GA NHVCSS QETSSFK+LE++                                                                  
Subjt:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------

Query:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT
                                                   +ISPNNNTYLWHLRLGHINL+RI RL K+GLLNKL+D SLPPCESCLEGKMTKRPFT
Subjt:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT

Query:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS
        GKGYRA EPLEL+HSDLCGP+NVKARG +EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK  RSDRGGEYMDL FQDYMIEHGI+S
Subjt:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS

Query:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC
        QLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLC
Subjt:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC

Query:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL
        QFVGYPKETRGGLF+DP+EN+V VSTNATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYL
Subjt:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL

Query:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ
        GL ETQVVI DDGVEDPLSYKQAMNDVDKDQ
Subjt:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-23471.07Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL + KEK+GA NHVCSS QE
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE

Query:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR
        TSSFK+LE+++++    T        +   +LGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRA EPLEL+HSDLCGP+NVKAR
Subjt:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR

Query:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR
        GG+EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLSAP TPQQNGVSERRNRTLLDMVR
Subjt:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR

Query:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST
        SMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLF+DPQEN+V VST
Subjt:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST

Query:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND
        NATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYLGL ETQVVI DDGVEDPLSYKQAMND
Subjt:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND

Query:  VDKDQ
        VDKDQ
Subjt:  VDKDQ

KAA0062886.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-22068.21Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P + A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           E+FGQ S QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE
           R                   F  SSSG +K QK+K G GKG  +A + GKGKAK+  KGKCFHC +D HW +N PKYL + KEK+GA NHV SS QE
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE

Query:  TSSFKELEEAKISPNNNTYLWHLRLGHIN------LNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARG
        TSSFK+LEE++++         L++G  +      +   +RL KNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRA EPLEL+HSDLCGP+NVKARG
Subjt:  TSSFKELEEAKISPNNNTYLWHLRLGHIN------LNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARG

Query:  GYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRS
        G+EYFI+FIDDY RYGYLYLM HK EALEKFKEYK EVEN L K IK LR DRG EYMDLRFQDYMIEHGI+SQLSAP TPQQNGVSERRNRTLLDMV S
Subjt:  GYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRS

Query:  MMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTN
        MMSYAQL +SFWGYAVE  + ILN+V SKSVSETPFELW+GRKPSL HFRIWGCPA+VLVTNPKKLELRSRLCQFVGYPKETRGGLF+DPQEN+V+VSTN
Subjt:  MMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTN

Query:  ATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMNDV
        ATFLEEDHM +HKPR+KLVLNEAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYLG  ETQVVI DDGVEDPLSYKQAMNDV
Subjt:  ATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMNDV

Query:  DKDQ
        DKDQ
Subjt:  DKDQ

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-23471.07Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL + KEK+GA NHVCSS QE
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE

Query:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR
        TSSFK+LE+++++    T        +   +LGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRA EPLEL+HSDLCGP+NVKAR
Subjt:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR

Query:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR
        GG+EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLSAP TPQQNGVSERRNRTLLDMVR
Subjt:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR

Query:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST
        SMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLF+DPQEN+V VST
Subjt:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST

Query:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND
        NATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYLGL ETQVVI DDGVEDPLSYKQAMND
Subjt:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND

Query:  VDKDQ
        VDKDQ
Subjt:  VDKDQ

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein8.9e-22559.64Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL +  EK+            
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------

Query:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------
                    GA NHVCSS QETSSFK+LE++                                                                  
Subjt:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------

Query:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT
                                                   +ISPNNNTYLWHLRLGHINL+RI RL K+GLLNKL+D SLPPCESCLEGKMTKRPFT
Subjt:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT

Query:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS
        GKGYRA EPLEL+HSDLCGP+NVKARG +EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK  RSDRGGEYMDL FQDYMIEHGI+S
Subjt:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS

Query:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC
        QLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLC
Subjt:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC

Query:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL
        QFVGYPKETRGGLF+DP+EN+V VSTNATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYL
Subjt:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL

Query:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ
        GL ETQVVI DDGVEDPLSYKQAMNDVDKDQ
Subjt:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ

A0A5A7TZD0 Gag/pol protein2.3e-22860.47Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL + KEK+            
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKK------------

Query:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------
                    GA NHVCSS QETSSFK+LE++                                                                  
Subjt:  ------------GAINHVCSSFQETSSFKELEEA------------------------------------------------------------------

Query:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT
                                                   +ISPNNNTYLWHLRLGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPFT
Subjt:  -------------------------------------------KISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFT

Query:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS
        GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+S
Subjt:  GKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKS

Query:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC
        QLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLC
Subjt:  QLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLC

Query:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL
        QFVGYPKETRGGLF+DPQEN+V VSTNATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYL
Subjt:  QFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYL

Query:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ
        GL ETQVVI DDGVEDPLSYKQAMNDVDKDQ
Subjt:  GLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ

A0A5A7UYE8 Gag/pol protein5.5e-23571.07Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL + KEK+GA NHVCSS QE
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE

Query:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR
        TSSFK+LE+++++    T        +   +LGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRA EPLEL+HSDLCGP+NVKAR
Subjt:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR

Query:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR
        GG+EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLSAP TPQQNGVSERRNRTLLDMVR
Subjt:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR

Query:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST
        SMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLF+DPQEN+V VST
Subjt:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST

Query:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND
        NATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYLGL ETQVVI DDGVEDPLSYKQAMND
Subjt:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND

Query:  VDKDQ
        VDKDQ
Subjt:  VDKDQ

A0A5A7V8W0 Gag/pol protein1.0e-22068.21Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P + A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           E+FGQ S QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE
           R                   F  SSSG +K QK+K G GKG  +A + GKGKAK+  KGKCFHC +D HW +N PKYL + KEK+GA NHV SS QE
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE

Query:  TSSFKELEEAKISPNNNTYLWHLRLGHIN------LNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARG
        TSSFK+LEE++++         L++G  +      +   +RL KNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRA EPLEL+HSDLCGP+NVKARG
Subjt:  TSSFKELEEAKISPNNNTYLWHLRLGHIN------LNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARG

Query:  GYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRS
        G+EYFI+FIDDY RYGYLYLM HK EALEKFKEYK EVEN L K IK LR DRG EYMDLRFQDYMIEHGI+SQLSAP TPQQNGVSERRNRTLLDMV S
Subjt:  GYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRS

Query:  MMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTN
        MMSYAQL +SFWGYAVE  + ILN+V SKSVSETPFELW+GRKPSL HFRIWGCPA+VLVTNPKKLELRSRLCQFVGYPKETRGGLF+DPQEN+V+VSTN
Subjt:  MMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTN

Query:  ATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMNDV
        ATFLEEDHM +HKPR+KLVLNEAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYLG  ETQVVI DDGVEDPLSYKQAMNDV
Subjt:  ATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMNDV

Query:  DKDQ
        DKDQ
Subjt:  DKDQ

A0A5D3BUN8 Gag/pol protein5.5e-23571.07Show/hide
Query:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA
        CP  P ++A QSV++AYDRWTKANDKA+++ILAS+S+IL+KKHE M           EMFGQPS QI+ E+                          NVA
Subjt:  CPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGM-----------EMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVA

Query:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE
           R                  +F  S SG +K QK+K G GKG  +A +  KGKAK+A K KCFHCN+D HW  NCPKYL + KEK+GA NHVCSS QE
Subjt:  EMNRAVIDEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIG-GKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQE

Query:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR
        TSSFK+LE+++++    T        +   +LGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRA EPLEL+HSDLCGP+NVKAR
Subjt:  TSSFKELEEAKISPNNNT-------YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKAR

Query:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR
        GG+EYFI+FIDDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMDLRFQDYMIEHGI+SQLSAP TPQQNGVSERRNRTLLDMVR
Subjt:  GGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVR

Query:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST
        SMMSYAQLP+SFWGYAVET + ILN+VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLF+DPQEN+V VST
Subjt:  SMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVST

Query:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND
        NATFLEEDHMRNHKPRSKLVL+EAT E TRVV + GPSSRVD   +TS QS PSQSL MPRRSGRVVSQP+RYLGL ETQVVI DDGVEDPLSYKQAMND
Subjt:  NATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMND

Query:  VDKDQ
        VDKDQ
Subjt:  VDKDQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.4e-4735.86Show/hide
Query:  NNTYLWHLRLGHIN------LNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRA--VEPLELVHSDLCGPVNVKARGGYEYFITFIDDYS
        NN  LWH R GHI+      + R    S   LLN LE  S   CE CL GK  + PF     +     PL +VHSD+CGP+         YF+ F+D ++
Subjt:  NNTYLWHLRLGHIN------LNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRA--VEPLELVHSDLCGPVNVKARGGYEYFITFIDDYS

Query:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWG
         Y   YL+ +KS+    F+++ A+ E      +  L  D G EY+    + + ++ GI   L+ P+TPQ NGVSER  RT+ +  R+M+S A+L  SFWG
Subjt:  RYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWG

Query:  YAVETTIQILNSVPSKSV---SETPFELWKGRKPSLQHFRIWGCPAHVLVTNPK-KLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHM
         AV T   ++N +PS+++   S+TP+E+W  +KP L+H R++G   +V + N + K + +S    FVGY  E  G   +D    K IV+ +   ++E +M
Subjt:  YAVETTIQILNSVPSKSV---SETPFELWKGRKPSLQHFRIWGCPAHVLVTNPK-KLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHM

Query:  RNHK
         N +
Subjt:  RNHK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-6039.65Show/hide
Query:  LWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKS
        LWH R+GH++   ++ L+K  L++  +  ++ PC+ CL GK  +  F     R +  L+LV+SD+CGP+ +++ GG +YF+TFIDD SR  ++Y++  K 
Subjt:  LWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKS

Query:  EALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNS
        +  + F+++ A VE   G+ +K LRSD GGEY    F++Y   HGI+ + + P TPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+T   ++N 
Subjt:  EALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNS

Query:  VPSKSVS-ETPFELWKGRKPSLQHFRIWGCP--AHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEED
         PS  ++ E P  +W  ++ S  H +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + KVI S +  F E +
Subjt:  VPSKSVS-ETPFELWKGRKPSLQHFRIWGCP--AHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEED

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein1.0e-2329.18Show/hide
Query:  LWHLRLGHINLNRIER-LSKNGLLNKLEDD------SLPPCESCLEGKMTKRPFTGKGYR-----AVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYS
        L H  LGH N   I++ L KN +    E D      S   C  CL GK TK     KG R     + EP + +H+D+ GPV+   +    YFI+F D+ +
Subjt:  LWHLRLGHINLNRIER-LSKNGLLNKLEDD------SLPPCESCLEGKMTKRPFTGKGYR-----AVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYS

Query:  RYGYLYLMHHKSE--ALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASF
        R+ ++Y +H + E   L  F    A ++N     +  ++ DRG EY +     +    GI +  +     + +GV+ER NRTLL+  R+++  + LP   
Subjt:  RYGYLYLMHHKSE--ALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASF

Query:  WGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRI----------WGCPAHVLVTNP-KKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTN
        W  AVE +  I NS+ S           K RK + QH  +          +G P  V   NP  K+  R      +   + + G + Y P   K + +TN
Subjt:  WGYAVETTIQILNSVPSKSVSETPFELWKGRKPSLQHFRI----------WGCPAHVLVTNP-KKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTN

Query:  ATFLE
           L+
Subjt:  ATFLE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.4e-3929.21Show/hide
Query:  WHLRLGHINLNRIERLSKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKS
        WH RLGH   + +  +  N  L+ L        C  CL  K  K PF+     +  PLE ++SD+     + +   Y Y++ F+D ++RY +LY +  KS
Subjt:  WHLRLGHINLNRIERLSKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKS

Query:  EALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNS
        +  E F  +K  +EN     I T  SD GGE++ L   +Y  +HGI    S P+TP+ NG+SER++R +++   +++S+A +P ++W YA    + ++N 
Subjt:  EALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNS

Query:  VPSKSVS-ETPFELWKGRKPSLQHFRIWGCPAHVLVT--NPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEE-----------DHMRN
        +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ +SR C F+GY       L    Q +++ +S +  F E              ++ 
Subjt:  VPSKSVS-ETPFELWKGRKPSLQHFRIWGCPAHVLVT--NPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEE-----------DHMRN

Query:  HKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVS
         +  S  V +  T  PTR      PS      A+T     P  S   P R+ +V S
Subjt:  HKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQSLGMPRRSGRVVS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-3933.59Show/hide
Query:  WHLRLGHINLNRIERLSKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKS
        WH RLGH +L  +  +  N  L  L     L  C  C   K  K PF+     + +PLE ++SD+     + +   Y Y++ F+D ++RY +LY +  KS
Subjt:  WHLRLGHINLNRIERLSKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKS

Query:  EALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNS
        +  + F  +K+ VEN     I TL SD GGE++ LR  DY+ +HGI    S P+TP+ NG+SER++R +++M  +++S+A +P ++W YA    + ++N 
Subjt:  EALEKFKEYKAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNS

Query:  VPSKSVS-ETPFELWKGRKPSLQHFRIWGCPAHVLVT--NPKKLELRSRLCQFVGY
        +P+  +  ++PF+   G+ P+ +  +++GC  +  +   N  KLE +S+ C F+GY
Subjt:  VPSKSVS-ETPFELWKGRKPSLQHFRIWGCPAHVLVT--NPKKLELRSRLCQFVGY

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.1e-0837.36Show/hide
Query:  ETSSFKELEEAKISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNV
        ET      E AK    + T LWH RL H++   +E L K G L+  +  SL  CE C+ GK  +  F+   +    PL+ VHSDL G  +V
Subjt:  ETSSFKELEEAKISPNNNTYLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.5e-0634.78Show/hide
Query:  NRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPAHV
        NRT+++ VRSM+    LP +F   A  T + I+N  PS +++   P E+W    P+  + R +GC A++
Subjt:  NRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPAHV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGGCTACGCGGCGTCAATAGGGTGTCCCCCTACGGCATGTCCTCAGATTCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGATCGCTGGACCAAGGC
CAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAATTCTGGCCAAAAAGCACGAGGGCATGGAAATGTTTGGACAACCGTCTGGACAGATTCGGCACGAAT
CCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGATGAGAGAACACGTTCTTGATCTGATGGTCCATTTCAACGTGGCTGAGATGAACAGAGCGGTCATT
GACGAGCAGAGTCAGAGGTTCCAGAAAGGTTCATCCTCTGGGACCAAGTTCTGTAGCTCATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAGGGAAAGGGAA
GGCACTTGCTACTGACAAGGGCAAGGGAAAAGCCAAGCTTGCAGATAAAGGAAAGTGTTTCCACTGCAACATGGATGGGCACTGGAACAGAAACTGCCCAAAATACCTTT
ATGAGCTCAAAGAGAAGAAAGGAGCCATTAATCACGTTTGCTCTTCCTTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGCAAAGATTTCTCCAAATAATAATACC
TATCTGTGGCATCTTAGACTTGGTCATATTAATCTCAATCGGATTGAGAGACTTTCTAAGAATGGACTTCTAAACAAGTTAGAAGATGACTCTTTACCGCCTTGCGAGTC
TTGCTTGGAAGGTAAAATGACCAAACGACCTTTTACTGGAAAAGGTTACAGAGCCGTAGAGCCTCTAGAACTTGTGCATTCGGATCTTTGTGGTCCGGTGAATGTTAAAG
CTCGAGGAGGGTACGAATATTTCATCACTTTCATAGATGATTATTCGAGGTATGGTTATCTATACCTAATGCATCACAAGTCTGAAGCTCTTGAAAAGTTCAAAGAGTAT
AAGGCTGAAGTAGAGAATGCATTAGGGAAAACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAGGACTATATGATAGAACATGGAAT
TAAATCTCAACTCTCTGCACCTAATACACCACAGCAAAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCTATGATGAGCTATGCTCAATTAC
CTGCCTCGTTTTGGGGATACGCAGTAGAGACTACAATTCAAATCTTGAACAGTGTTCCATCAAAAAGTGTTTCAGAAACACCTTTTGAACTTTGGAAGGGGCGTAAACCT
AGTTTACAACACTTCAGGATTTGGGGTTGTCCGGCACACGTGCTCGTGACAAACCCAAAGAAACTGGAACTTCGTTCAAGATTGTGCCAATTTGTTGGCTATCCCAAAGA
AACGAGAGGTGGTCTTTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACAAACGCCACATTCTTGGAGGAAGATCACATGAGGAACCATAAACCGCGTAGTAAAT
TAGTACTAAATGAAGCTACACATGAACCAACAAGAGTTGTTGGTCAAGCTGGACCTTCATCAAGAGTTGATGGAAGAGCCAGCACCTCAAGTCAGTCTCGTCCTTCTCAA
TCGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGCTACTTGGGTTTAGCTGAAACTCAAGTTGTCATACATGATGACGGCGTAGAAGATCCATT
GTCTTATAAACAGGCAATGAATGACGTAGACAAGGACCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCGGCTACGCGGCGTCAATAGGGTGTCCCCCTACGGCATGTCCTCAGATTCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGATCGCTGGACCAAGGC
CAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAATTCTGGCCAAAAAGCACGAGGGCATGGAAATGTTTGGACAACCGTCTGGACAGATTCGGCACGAAT
CCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGATGAGAGAACACGTTCTTGATCTGATGGTCCATTTCAACGTGGCTGAGATGAACAGAGCGGTCATT
GACGAGCAGAGTCAGAGGTTCCAGAAAGGTTCATCCTCTGGGACCAAGTTCTGTAGCTCATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAGGGAAAGGGAA
GGCACTTGCTACTGACAAGGGCAAGGGAAAAGCCAAGCTTGCAGATAAAGGAAAGTGTTTCCACTGCAACATGGATGGGCACTGGAACAGAAACTGCCCAAAATACCTTT
ATGAGCTCAAAGAGAAGAAAGGAGCCATTAATCACGTTTGCTCTTCCTTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGCAAAGATTTCTCCAAATAATAATACC
TATCTGTGGCATCTTAGACTTGGTCATATTAATCTCAATCGGATTGAGAGACTTTCTAAGAATGGACTTCTAAACAAGTTAGAAGATGACTCTTTACCGCCTTGCGAGTC
TTGCTTGGAAGGTAAAATGACCAAACGACCTTTTACTGGAAAAGGTTACAGAGCCGTAGAGCCTCTAGAACTTGTGCATTCGGATCTTTGTGGTCCGGTGAATGTTAAAG
CTCGAGGAGGGTACGAATATTTCATCACTTTCATAGATGATTATTCGAGGTATGGTTATCTATACCTAATGCATCACAAGTCTGAAGCTCTTGAAAAGTTCAAAGAGTAT
AAGGCTGAAGTAGAGAATGCATTAGGGAAAACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAGGACTATATGATAGAACATGGAAT
TAAATCTCAACTCTCTGCACCTAATACACCACAGCAAAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCTATGATGAGCTATGCTCAATTAC
CTGCCTCGTTTTGGGGATACGCAGTAGAGACTACAATTCAAATCTTGAACAGTGTTCCATCAAAAAGTGTTTCAGAAACACCTTTTGAACTTTGGAAGGGGCGTAAACCT
AGTTTACAACACTTCAGGATTTGGGGTTGTCCGGCACACGTGCTCGTGACAAACCCAAAGAAACTGGAACTTCGTTCAAGATTGTGCCAATTTGTTGGCTATCCCAAAGA
AACGAGAGGTGGTCTTTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACAAACGCCACATTCTTGGAGGAAGATCACATGAGGAACCATAAACCGCGTAGTAAAT
TAGTACTAAATGAAGCTACACATGAACCAACAAGAGTTGTTGGTCAAGCTGGACCTTCATCAAGAGTTGATGGAAGAGCCAGCACCTCAAGTCAGTCTCGTCCTTCTCAA
TCGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGCTACTTGGGTTTAGCTGAAACTCAAGTTGTCATACATGATGACGGCGTAGAAGATCCATT
GTCTTATAAACAGGCAATGAATGACGTAGACAAGGACCAATAG
Protein sequenceShow/hide protein sequence
MLGYAASIGCPPTACPQIPARNAPQSVKEAYDRWTKANDKAKVYILASVSEILAKKHEGMEMFGQPSGQIRHESLKYVYNSRMKEGSSMREHVLDLMVHFNVAEMNRAVI
DEQSQRFQKGSSSGTKFCSSSSGLKKTQKKKIGGKGKALATDKGKGKAKLADKGKCFHCNMDGHWNRNCPKYLYELKEKKGAINHVCSSFQETSSFKELEEAKISPNNNT
YLWHLRLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAVEPLELVHSDLCGPVNVKARGGYEYFITFIDDYSRYGYLYLMHHKSEALEKFKEY
KAEVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETTIQILNSVPSKSVSETPFELWKGRKP
SLQHFRIWGCPAHVLVTNPKKLELRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVGQAGPSSRVDGRASTSSQSRPSQ
SLGMPRRSGRVVSQPDRYLGLAETQVVIHDDGVEDPLSYKQAMNDVDKDQ