CuGenDBv2

Tan0015873 (gene) of Snake gourd v1 genome

Gene ID: Tan0015873
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG06:42020811..42024077
RNA-Seq Expression: Tan0015873
Synteny: Tan0015873
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits | e value | % identity | Alignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | e value: 4.3e-179 | % identity: 39.27
Query:  ASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASID
        A+R+V +AYDR ++AN+KA+ YI+ASM+DVLAKKH+ + TAK IM+SL+EMFGQ S+ +RH+++K+++  RM+EG+ VR+HVLDM+ HFN+A++NG  ID
Subjt:  ASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASID

Query:  ESS------------------------------------QNFQSLMRVKAPESEAN--VTYRSYHRGLTSGTKHVAPSRPKGKKRMKRGKTDRVAAQKGK
        E++                                    Q FQ+L   K  E EAN  VT R + RG +S  K V PS+ + KK+ K GK     A    
Subjt:  ESS------------------------------------QNFQSLMRVKAPESEAN--VTYRSYHRGLTSGTKHVAPSRPKGKKRMKRGKTDRVAAQKGK

Query:  KVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK--------------------------------------------------------------------
        KVK+ A+KGKCFHCN+  HWKRNCPK++A +K                                                                    
Subjt:  KVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKS
                                                         +G RAK PLELVHS LCG MNVKARGGYEYF+SFIDD+SRYG++YL+H KS
Subjt:  ------------------------------------------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKS

Query:  ETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD----------------------------------------------
        E+ EKFKEYK +VEN +GK++KTLRSDRG EYMD++FQDY+IE  I SQLSAP                                               
Subjt:  ETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD----------------------------------------------

Query:  ----------------------------------------------------YPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMDS
                                                            YPKE+ GGLFY P+ENKV VSTN  FLEEDH R+H PRSKIVL EM  
Subjt:  ----------------------------------------------------YPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMDS

Query:  TSARVADGASTSTSVVD-PNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC---------
         +    D  S+ST VVD  N S Q  +SQ+L +PRRSGRVV QP+RY+GL ET ++  DD  EDPLTY QAM DVD+D+WIK MN EM            
Subjt:  TSARVADGASTSTSVVD-PNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC---------

Query:  ------------------------------------------------------------------------------TSILSGSL-----------WIN
                                                                                      T+ L+G+L           +I 
Subjt:  ------------------------------------------------------------------------------TSILSGSL-----------WIN

Query:  Q---------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGI
        Q           SIYGLKQ SRSWNIRFD  IKSYGF+QNVDEPCVYKKIV+  VAFL+LYVDDILLIGN+VE+LTDVKKWL +QFQMKDLGE QY+LGI
Subjt:  Q---------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGI

Query:  QIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN
        QIVRN KN+TLA+SQASYIDK+LSRYKMQNSKKG LPFRHG+HLSK+QC KTPQ+VEDMR IPY+SAVGSLMY MLCTRPDICY++GIV+RYQSN
Subjt:  QIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.0e-196 | % identity: 46.67
Query:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA
        ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A MN A
Subjt:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA

Query:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR
         IDE+S                                    Q F+SLM++K  + EANV  + R +HRG TSGTK +  S    K + K+G    K + 
Subjt:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR

Query:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM
         AA+  KK K  A KG CF CN+  HWKRNCPK++A +K              +G+RAKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Subjt:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM

Query:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD------------------------------------------
          KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLSAPD                                          
Subjt:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD------------------------------------------

Query:  --------------------------------------------------------YPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN
                                                                YPK T GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKIVLN
Subjt:  --------------------------------------------------------YPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN

Query:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--
        E+       S RV +  S    VV   +S++    Q L  PRRSGRV   P RYM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Subjt:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--

Query:  -------------------------------------------------------------------------------------TSILSGSL----WIN
                                                                                             T+ L+G+L    ++ 
Subjt:  -------------------------------------------------------------------------------------TSILSGSL----WIN

Query:  Q----------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGE
        Q                  SIYGLKQ SRSWNIRFD  IKSYGFDQ VDEPCVYK+I++K+VAFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE
Subjt:  Q----------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGE

Query:  TQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQ
         Q+VLGIQI R+ KN+ LALSQASYIDK++ +Y MQNSK+GLLPFRHGV LSK+QC KTPQDVE+MR IPYASAVGSLMY MLCTRPDICYA+GIV+RYQ
Subjt:  TQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQ

Query:  SN
        SN
Subjt:  SN

KAA0037371.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.4e-192 | % identity: 51.35
Query:  MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMN
        +S++ A+R+V +AY+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+M 
Subjt:  MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMN

Query:  GASIDESS-------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHC
         A IDE+S             Q F+SLM++K  + EANV  + R +HRG  SG K +  S    K + K+G    K +  AA+  KK K+   KG CFHC
Subjt:  GASIDESS-------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHC

Query:  NEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLL
        N+  HWKRNCPK++A +K              +G+R KEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK +VEN L
Subjt:  NEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLL

Query:  --------GKS-------LKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------------
                G S       L  +RS      +   F  Y ++  +      P                                                 
Subjt:  --------GKS-------LKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------------

Query:  DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYM
         YPK T GG FYDPK+NKV VSTN  FLEED++R+H PRSKIVLNE+       S RV +  S  T VV   + ++    Q L  PRRSGRV   P RYM
Subjt:  DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYM

Query:  GLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEM--------------SRCTSILS-------------GSLWINQ-------MGSIYGLKQGS
         L ET  V  D D EDPLT+ +AM DVDKDEWIK MN E+              S  T+ L+             GS+ + Q         SIYGLKQ S
Subjt:  GLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEM--------------SRCTSILS-------------GSLWINQ-------MGSIYGLKQGS

Query:  RSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDK
        RSWNIRFD  IKSYGFDQ VDEPCVYK+I++  VAFLVLYV DILLIGN+V  LTD+K+WLA+QFQMKDLG TQ+VLGIQI R+ KN+ LALSQASYIDK
Subjt:  RSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDK

Query:  MLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSNL
        ++ +Y MQNSK+GLLPFRH V LSK+QC KTPQD+E+MR IPYASAVGSLMYVMLC RP ICYA+GI +RYQSNL
Subjt:  MLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSNL

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 5.6e-203 | % identity: 48.72
Query:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA
        ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA
Subjt:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA

Query:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR
         IDE+S                                    Q F+SLM++K  + EANV  + R +HRG TSGTK +  S    K + K+G    K + 
Subjt:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR

Query:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM
         AA+  KK K  A KG CFHCN+  HWKRNCPK++A +K              +G++AKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Subjt:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM

Query:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------
          KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLS P                                           
Subjt:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------

Query:  -------------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN
                                                                YPK T GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKIVLN
Subjt:  -------------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN

Query:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--
        E+       S RV +  S  T VV   +S++    Q L  PRRSGRV   P  YM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Subjt:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--

Query:  ---------------------------------------------TSILSGSL----WINQ----------------MGSIYGLKQGSRSWNIRFDETIK
                                                     T+ L+G+L    ++ Q                  SIYGLKQ SRSWNIRFD  IK
Subjt:  ---------------------------------------------TSILSGSL----WINQ----------------MGSIYGLKQGSRSWNIRFDETIK

Query:  SYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKK
        SYGFDQ VDEPCVYK+I++K+VAFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE Q+VLGIQI R+ KN+TLALSQASYIDK++ +Y MQNSK+
Subjt:  SYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKK

Query:  GLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN
        GLLPFRHGV LSK+QC KTPQDVE+MR IPYASA+GSLMY MLCTRPDICYA+GIV+RYQSN
Subjt:  GLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.7e-191 | % identity: 46.05
Query:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA
        ++ A+++V + Y+R  + NEK + YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ + HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA
Subjt:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA

Query:  SIDESSQNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVA
         IDE+SQ           + EANV  + R +HRG TSGTK +  S    K + K+G    K +  AA+  KK K  A KG CFH N+  HWKRNCPK++A
Subjt:  SIDESSQNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVA

Query:  GRK----------------------------------------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKK
         +K                                               +G+RAKEPLELVHS LCG MNVKARG +EYF++F DDYSRYGY+YLM  K
Subjt:  GRK----------------------------------------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKK

Query:  SETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP----------------------------------------------
        SE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E EI SQLSAP                                              
Subjt:  SETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP----------------------------------------------

Query:  ----------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMD
                                                             YPK T GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKIVLNE+ 
Subjt:  ----------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMD

Query:  ----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC-----
              S RV +  S  T VV   +S++    Q L  PRRSGRV   P RYM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+        
Subjt:  ----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC-----

Query:  ----------------------------------------------------------------------------------TSILSGSL----WINQ--
                                                                                          T+ L+G+L    ++ Q  
Subjt:  ----------------------------------------------------------------------------------TSILSGSL----WINQ--

Query:  --------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQY
                        SIYGLKQ SRSWNIRFD  IKSYGFDQ VDEPCVYK+I++K+VAFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE Q+
Subjt:  --------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQY

Query:  VLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN
        VLGIQI R+ KN+ LALSQASYIDK++ +Y MQNSK+GLLPFRHGV LSK+QC KTPQDVE+MR IPYASAVGSLMY MLCTRPDICYA+GIV+RYQSN
Subjt:  VLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SMH8 Gag/pol protein | e value: 6.3e-176 | % identity: 38.28
Query:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA
        ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA
Subjt:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA

Query:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR
         IDE+S                                    Q F+SLM++K  + EANV  + R +HRG TSGTK +  S    K + K+G    K + 
Subjt:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR

Query:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------------------------------------------------------
         AA+  KK K  A KG CFHCN+  HWKRNCPK++A +K                                                             
Subjt:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYI
                                                                +G+RAKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+
Subjt:  -------------------------------------------------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYI

Query:  YLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP----------------------------------------
        YLM  KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLSAP                                        
Subjt:  YLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP----------------------------------------

Query:  ----------------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKI
                                                                   YPK T GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKI
Subjt:  ----------------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKI

Query:  VLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSR
        VLNE+       S RV +  S  T VV   +S++    Q L  PRRSGRV   P RYM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+  
Subjt:  VLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSR

Query:  C---------------------------------------------------------------------------------------TSILSGSL----
                                                                                                T+ L+G+L    
Subjt:  C---------------------------------------------------------------------------------------TSILSGSL----

Query:  WINQ----------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKD
        ++ Q                  SIYGLKQ SRSWNIRFD  IKSYGFDQ VDEPCVYK+I++K+VAFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKD
Subjt:  WINQ----------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKD

Query:  LGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVN
        LGE Q+VLGIQI R+ KN+ LALSQASYIDK++ +Y MQNSK+GLLPFRHGV LSK+QC KTPQDVE+MR IPYASAVGSLMY MLCTRPDICYA+GIV+
Subjt:  LGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVN

Query:  RYQSN
        RYQSN
Subjt:  RYQSN

A0A5A7SNP8 Gag/pol protein | e value: 4.9e-197 | % identity: 46.67
Query:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA
        ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A MN A
Subjt:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA

Query:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR
         IDE+S                                    Q F+SLM++K  + EANV  + R +HRG TSGTK +  S    K + K+G    K + 
Subjt:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR

Query:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM
         AA+  KK K  A KG CF CN+  HWKRNCPK++A +K              +G+RAKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Subjt:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM

Query:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD------------------------------------------
          KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLSAPD                                          
Subjt:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD------------------------------------------

Query:  --------------------------------------------------------YPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN
                                                                YPK T GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKIVLN
Subjt:  --------------------------------------------------------YPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN

Query:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--
        E+       S RV +  S    VV   +S++    Q L  PRRSGRV   P RYM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Subjt:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--

Query:  -------------------------------------------------------------------------------------TSILSGSL----WIN
                                                                                             T+ L+G+L    ++ 
Subjt:  -------------------------------------------------------------------------------------TSILSGSL----WIN

Query:  Q----------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGE
        Q                  SIYGLKQ SRSWNIRFD  IKSYGFDQ VDEPCVYK+I++K+VAFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE
Subjt:  Q----------------MGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGE

Query:  TQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQ
         Q+VLGIQI R+ KN+ LALSQASYIDK++ +Y MQNSK+GLLPFRHGV LSK+QC KTPQDVE+MR IPYASAVGSLMY MLCTRPDICYA+GIV+RYQ
Subjt:  TQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQ

Query:  SN
        SN
Subjt:  SN

A0A5A7T706 Gag/pol protein | e value: 2.2e-192 | % identity: 51.35
Query:  MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMN
        +S++ A+R+V +AY+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+M 
Subjt:  MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMN

Query:  GASIDESS-------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHC
         A IDE+S             Q F+SLM++K  + EANV  + R +HRG  SG K +  S    K + K+G    K +  AA+  KK K+   KG CFHC
Subjt:  GASIDESS-------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDRVAAQKGKKVKEVAEKGKCFHC

Query:  NEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLL
        N+  HWKRNCPK++A +K              +G+R KEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK +VEN L
Subjt:  NEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLL

Query:  --------GKS-------LKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------------
                G S       L  +RS      +   F  Y ++  +      P                                                 
Subjt:  --------GKS-------LKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------------

Query:  DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYM
         YPK T GG FYDPK+NKV VSTN  FLEED++R+H PRSKIVLNE+       S RV +  S  T VV   + ++    Q L  PRRSGRV   P RYM
Subjt:  DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNEMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYM

Query:  GLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEM--------------SRCTSILS-------------GSLWINQ-------MGSIYGLKQGS
         L ET  V  D D EDPLT+ +AM DVDKDEWIK MN E+              S  T+ L+             GS+ + Q         SIYGLKQ S
Subjt:  GLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEM--------------SRCTSILS-------------GSLWINQ-------MGSIYGLKQGS

Query:  RSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDK
        RSWNIRFD  IKSYGFDQ VDEPCVYK+I++  VAFLVLYV DILLIGN+V  LTD+K+WLA+QFQMKDLG TQ+VLGIQI R+ KN+ LALSQASYIDK
Subjt:  RSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDK

Query:  MLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSNL
        ++ +Y MQNSK+GLLPFRH V LSK+QC KTPQD+E+MR IPYASAVGSLMYVMLC RP ICYA+GI +RYQSNL
Subjt:  MLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSNL

A0A5A7U869 Gag/pol protein | e value: 2.7e-203 | % identity: 48.72
Query:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA
        ++ A+R+V + Y+R  +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA
Subjt:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA

Query:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR
         IDE+S                                    Q F+SLM++K  + EANV  + R +HRG TSGTK +  S    K + K+G    K + 
Subjt:  SIDESS------------------------------------QNFQSLMRVKAPESEANV--TYRSYHRGLTSGTKHVAPSRPKGKKRMKRG----KTDR

Query:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM
         AA+  KK K  A KG CFHCN+  HWKRNCPK++A +K              +G++AKEPLELVHS LCG MNVKARGG+EYF++F DDYSRYGY+YLM
Subjt:  VAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRK-------------NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLM

Query:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------
          KSE LEKFKEYK +VEN L K++KT RSDRG EYMD +FQ+Y++E  I SQLS P                                           
Subjt:  HKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAP-------------------------------------------

Query:  -------------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN
                                                                YPK T GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKIVLN
Subjt:  -------------------------------------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLN

Query:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--
        E+       S RV +  S  T VV   +S++    Q L  PRRSGRV   P  YM L ET  V SD D EDPLT+ +AM DVDKDEWIK MN E+     
Subjt:  EMD----STSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRC--

Query:  ---------------------------------------------TSILSGSL----WINQ----------------MGSIYGLKQGSRSWNIRFDETIK
                                                     T+ L+G+L    ++ Q                  SIYGLKQ SRSWNIRFD  IK
Subjt:  ---------------------------------------------TSILSGSL----WINQ----------------MGSIYGLKQGSRSWNIRFDETIK

Query:  SYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKK
        SYGFDQ VDEPCVYK+I++K+VAFLVLYVDDILLIGN++  LTD+K+WLA+QFQMKDLGE Q+VLGIQI R+ KN+TLALSQASYIDK++ +Y MQNSK+
Subjt:  SYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKK

Query:  GLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN
        GLLPFRHGV LSK+QC KTPQDVE+MR IPYASA+GSLMY MLCTRPDICYA+GIV+RYQSN
Subjt:  GLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN

A0A5A7UXG8 Gag/pol protein | e value: 4.4e-177 | % identity: 50.07
Query:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA
        ++ A R+V +AY+RR +ANEKA+ YI+AS+S VLAKKHE M+TA+EIM+SLQEMFGQ S+ ++HD+LKY++NARM EG+ VR+HVL+M+ HFN+A+MNGA
Subjt:  SSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGA

Query:  SIDESSQNFQSLMRVKAPESEANVTYRSYHRGLTSGTKHVAPSRPKGKKRMKRGKTDRVA-------------AQKGKKVKEVAEKGK-----------C
         IDE+       +R KA  ++       +H       K   P     KK+ K+G T+ V              A +   ++ + + G            C
Subjt:  SIDESSQNFQSLMRVKAPESEANVTYRSYHRGLTSGTKHVAPSRPKGKKRMKRGKTDRVA-------------AQKGKKVKEVAEKGK-----------C

Query:  FHCNEGEHWKRNCPKFVAGRKNQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSD
          C EG+  KR           +G+RAKEPLELVHS LC  MNVKARGG+EYF++F DDYSRY Y+YLM  KSE LEKFKEYK +VEN L K++KT R +
Subjt:  FHCNEGEHWKRNCPKFVAGRKNQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSD

Query:  RGREYMDTEFQDYMIEHEITSQLSAP----------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNE
        RG EYMD +FQ+Y++E  I SQL AP                             YPK T GG FYDPK+NKV VSTN  FLEEDH+R+H PRSKIVLNE
Subjt:  RGREYMDTEFQDYMIEHEITSQLSAP----------------------------DYPKETGGGLFYDPKENKVLVSTNVIFLEEDHVRDHLPRSKIVLNE

Query:  MDS----TSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRCTSI
        + +     S RV +  S  T V    +S++    Q L  P+RSG                         +PLT+ +AM DVDKDEWIK MN E+    S+
Subjt:  MDS----TSARVADGASTSTSVVDPNTSSQI-SSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMSRCTSI

Query:  LSGSLW-----------------------------------INQMG----------SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVA
           S+W                                   I+  G          SIYGLKQ SRSWNIRFD  IKSYGFDQ+VDEPCVYK+I++  +A
Subjt:  LSGSLW-----------------------------------INQMG----------SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVA

Query:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDV
        FLVLYVD ILL GN++  LTD+K+WLA+QFQMKDLGE Q+VLGIQI ++ KN+TLALSQASYIDK++ +Y MQNSK+GLLPFRHGV LSK+QC KTPQ+V
Subjt:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDV

Query:  EDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN
        E MR IPYASAVGSLMY MLCTRPDICYA+GIV+RYQSN
Subjt:  EDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | e value: 4.5e-22 | % identity: 32.8
Query:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDK----TVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKN
        +IYGLKQ +R W   F++ +K   F  +  + C+Y  I+DK       +++LYVDD+++   ++  + + K++L  +F+M DL E ++ +GI+I   ++ 
Subjt:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDK----TVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKN

Query:  RTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVH---LSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQS
          + LSQ++Y+ K+LS++ M+N      P    ++   L+ D+   T          P  S +G LMY+MLCTRPD+  A+ I++RY S
Subjt:  RTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVH---LSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQS

P04146 Copia protein | e value: 3.4e-09 | % identity: 31.13
Query:  KEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD
        K PL +VHS +CG +         YFV F+D ++ Y   YL+  KS+    F+++  K E      +  L  D GREY+  E + + ++  I+  L+ P 
Subjt:  KEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPD

Query:  YPKETG
         P+  G
Subjt:  YPKETG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 4.1e-39 | % identity: 43.48
Query:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVY-KKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTL
        S+YGLKQ  R W ++FD  +KS  + +   +PCVY K+  +     L+LYVDD+L++G +   +  +K  L+  F MKDLG  Q +LG++IVR   +R L
Subjt:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVY-KKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTL

Query:  ALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN
         LSQ  YI+++L R+ M+N+K    P    + LSK  C  T ++  +M  +PY+SAVGSLMY M+CTRPDI +A+G+V+R+  N
Subjt:  ALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    1.7e-16    39.81
Query:  LELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPK
        L+LV+S +CG M +++ GG +YFV+FIDD SR  ++Y++  K +  + F+++   VE   G+ LK LRSD G EY   EF++Y   H I  + + P  P+
Subjt:  LELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPK

Query:  ETG
          G
Subjt:  ETG

P25600 Putative transposon Ty5-1 protein YCL074W    5.4e-15    30.39
Query:  GSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTL
        G +YGLKQ    WN   + T+K  GF ++  E  +Y +       ++ +YVDD+L+     +    VK+ L   + MKDLG+    LG+ I ++  N  +
Subjt:  GSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTL

Query:  ALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY
         LS   YI K  S  ++   K    P  +    SK     T   ++D+   PY S VG L++     RPDI Y + +++R+
Subjt:  ALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    8.3e-16    32.22
Query:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLA
        ++YGLKQ  R+W +     + + GF  +V +  ++     K++ ++++YVDDIL+ GN+   L +    L+ +F +KD  E  Y LGI+  R      L 
Subjt:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLA

Query:  LSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY
        LSQ  YI  +L+R  M  +K    P      LS     K     E      Y   VGSL Y+   TRPDI YA+  ++++
Subjt:  LSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    1.4e-07    28
Query:  PLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYP
        PLE ++S +  S  + +   Y Y+V F+D ++RY ++Y + +KS+  E F  +K  +EN     + T  SD G E++     +Y  +H I+   S P  P
Subjt:  PLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYP

Query:  KETGGGLFYDPKENKVLVSTNVIFL
        +  G       ++++ +V T +  L
Subjt:  KETGGGLFYDPKENKVLVSTNVIFL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.8e-15    30
Query:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLA
        +IYGLKQ  R+W +     + + GF  ++ +  ++     +++ ++++YVDDIL+ GN+   L      L+ +F +K+  +  Y LGI+  R    + L 
Subjt:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLA

Query:  LSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY
        LSQ  Y   +L+R  M  +K    P      L+     K P   E      Y   VGSL Y+   TRPD+ YA+  +++Y
Subjt:  LSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.7e-08    31.25
Query:  NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITS
        N    + +PLE ++S +  S  + +   Y Y+V F+D ++RY ++Y + +KS+  + F  +K+ VEN     + TL SD G E++    +DY+ +H I+ 
Subjt:  NQGYRAKEPLELVHSGLCGSMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITS

Query:  QLSAPDYPKETG
          S P  P+  G
Subjt:  QLSAPDYPKETG

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    3.8e-16    30
Query:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLA
        SIYGLKQ SR W ++F  T+  +GF Q+  +   + KI       +++YVDDI++  N    + ++K  L S F+++DLG  +Y LG++I R+     + 
Subjt:  SIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLA

Query:  LSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY
        + Q  Y   +L    +   K   +P    V  S      +  D  D +   Y   +G LMY+ + TR DI +A+  ++++
Subjt:  LSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRY

ATMG00810.1 DNA/RNA polymerases superfamily protein    7.4e-12    38.81
Query:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQC-FKTPQD
        +L+LYVDDILL G+    L  +   L+S F MKDLG   Y LGIQI  +     L LSQ  Y +++L+   M + K    P    ++ S     +  P D
Subjt:  FLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNLKNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQC-FKTPQD

Query:  VEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIV
                + S VG+L Y+ L TRPDI YA+ IV
Subjt:  VEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIV


Sequences
CDS sequence
ATGTCGAGCTCGACTGCGTCGCGAAGTGTTCACGATGCATACGATCGACGGATCAGGGCCAATGAAAAGGCCAAGGACTATATCATTGCCAGCATGTCTGATGTTTTGGC
AAAGAAGCATGAGCTGATGGTCACCGCTAAAGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCTTTTCACGTCCGACATGACTCGCTCAAATACGTCTTCA
ACGCACGGATGGAAGAGGGGTCGTTTGTCCGTAAACATGTTCTAGACATGATAACCCACTTTAATCTAGCGAAGATGAATGGGGCTTCGATCGACGAGTCAAGCCAGAAT
TTCCAGTCCTTGATGAGGGTCAAGGCACCGGAATCTGAGGCAAATGTTACCTACAGGTCTTATCACAGGGGTTTGACCTCTGGGACTAAACATGTTGCTCCTTCACGCCC
GAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGTGTTGCCGCCCAAAAAGGCAAGAAGGTCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGAGG
GCGAACACTGGAAGAGGAACTGTCCCAAATTCGTAGCAGGGAGGAAGAATCAAGGATATAGAGCCAAAGAGCCCCTTGAGTTAGTACATTCTGGCCTCTGTGGTTCGATG
AATGTTAAAGCTCGAGGTGGTTATGAATACTTCGTGTCTTTCATTGACGATTACTCGAGGTATGGGTATATTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTT
CAAGGAGTACAAGACTAAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTAGAGAGTACATGGACACTGAATTCCAGGACTATATGATAG
AACACGAAATTACGTCCCAACTCTCAGCACCTGATTACCCTAAAGAGACTGGGGGTGGTCTATTTTACGATCCTAAGGAAAATAAGGTGCTTGTGTCGACAAACGTCATT
TTCCTAGAGGAAGACCATGTCAGGGATCATTTACCAAGGAGTAAAATTGTGTTAAATGAAATGGACAGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAG
TGTTGTTGATCCTAACACGTCTAGTCAAATTAGTTCCCAAAAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAA
CCTCAGTTGTCGCTTCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGTTGATGTTGACAAAGACGAATGGATTAAAGATATGAACCAGGAAATGAGT
CGATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCAGATGGGGTCTATTTATGGACTGAAACAAGGTTCGAGGTCTTGGAATATAAGGTTTGATGAGACGATCAA
ATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACAAGAAAATCGTTGACAAAACTGTCGCATTTTTAGTGTTGTATGTGGATGATATTCTTCTCATTGGAA
ATGAGGTAGAATTTCTTACTGACGTGAAGAAATGGCTAGCTTCGCAATTTCAAATGAAAGATTTGGGAGAAACTCAGTATGTTCTAGGTATCCAGATAGTCCGGAACCTG
AAGAACAGAACGCTAGCCTTGTCTCAGGCGTCTTATATTGACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGCTTGTTGCCTTTCAGGCATGGGGTTCA
CTTGTCTAAGGATCAGTGTTTTAAGACTCCTCAAGATGTTGAGGATATGAGATGGATTCCATATGCTTCAGCTGTAGGGAGCCTGATGTATGTCATGTTGTGTACTAGGC
CCGACATCTGTTATGCAATAGGGATTGTCAATAGGTATCAATCCAATCTAGAGATTAGATCTTTGGGACGGTAG
mRNA sequence
ATGTCGAGCTCGACTGCGTCGCGAAGTGTTCACGATGCATACGATCGACGGATCAGGGCCAATGAAAAGGCCAAGGACTATATCATTGCCAGCATGTCTGATGTTTTGGC
AAAGAAGCATGAGCTGATGGTCACCGCTAAAGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCTTTTCACGTCCGACATGACTCGCTCAAATACGTCTTCA
ACGCACGGATGGAAGAGGGGTCGTTTGTCCGTAAACATGTTCTAGACATGATAACCCACTTTAATCTAGCGAAGATGAATGGGGCTTCGATCGACGAGTCAAGCCAGAAT
TTCCAGTCCTTGATGAGGGTCAAGGCACCGGAATCTGAGGCAAATGTTACCTACAGGTCTTATCACAGGGGTTTGACCTCTGGGACTAAACATGTTGCTCCTTCACGCCC
GAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGTGTTGCCGCCCAAAAAGGCAAGAAGGTCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGAGG
GCGAACACTGGAAGAGGAACTGTCCCAAATTCGTAGCAGGGAGGAAGAATCAAGGATATAGAGCCAAAGAGCCCCTTGAGTTAGTACATTCTGGCCTCTGTGGTTCGATG
AATGTTAAAGCTCGAGGTGGTTATGAATACTTCGTGTCTTTCATTGACGATTACTCGAGGTATGGGTATATTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTT
CAAGGAGTACAAGACTAAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTAGAGAGTACATGGACACTGAATTCCAGGACTATATGATAG
AACACGAAATTACGTCCCAACTCTCAGCACCTGATTACCCTAAAGAGACTGGGGGTGGTCTATTTTACGATCCTAAGGAAAATAAGGTGCTTGTGTCGACAAACGTCATT
TTCCTAGAGGAAGACCATGTCAGGGATCATTTACCAAGGAGTAAAATTGTGTTAAATGAAATGGACAGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAG
TGTTGTTGATCCTAACACGTCTAGTCAAATTAGTTCCCAAAAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAA
CCTCAGTTGTCGCTTCTGATGATGACTGTGAGGATCCATTGACCTATGATCAGGCAATGGTTGATGTTGACAAAGACGAATGGATTAAAGATATGAACCAGGAAATGAGT
CGATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCAGATGGGGTCTATTTATGGACTGAAACAAGGTTCGAGGTCTTGGAATATAAGGTTTGATGAGACGATCAA
ATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACAAGAAAATCGTTGACAAAACTGTCGCATTTTTAGTGTTGTATGTGGATGATATTCTTCTCATTGGAA
ATGAGGTAGAATTTCTTACTGACGTGAAGAAATGGCTAGCTTCGCAATTTCAAATGAAAGATTTGGGAGAAACTCAGTATGTTCTAGGTATCCAGATAGTCCGGAACCTG
AAGAACAGAACGCTAGCCTTGTCTCAGGCGTCTTATATTGACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGCTTGTTGCCTTTCAGGCATGGGGTTCA
CTTGTCTAAGGATCAGTGTTTTAAGACTCCTCAAGATGTTGAGGATATGAGATGGATTCCATATGCTTCAGCTGTAGGGAGCCTGATGTATGTCATGTTGTGTACTAGGC
CCGACATCTGTTATGCAATAGGGATTGTCAATAGGTATCAATCCAATCTAGAGATTAGATCTTTGGGACGGTAG
Protein sequence
MSSSTASRSVHDAYDRRIRANEKAKDYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFHVRHDSLKYVFNARMEEGSFVRKHVLDMITHFNLAKMNGASIDESSQN
FQSLMRVKAPESEANVTYRSYHRGLTSGTKHVAPSRPKGKKRMKRGKTDRVAAQKGKKVKEVAEKGKCFHCNEGEHWKRNCPKFVAGRKNQGYRAKEPLELVHSGLCGSM
NVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTKVENLLGKSLKTLRSDRGREYMDTEFQDYMIEHEITSQLSAPDYPKETGGGLFYDPKENKVLVSTNVI
FLEEDHVRDHLPRSKIVLNEMDSTSARVADGASTSTSVVDPNTSSQISSQKLGMPRRSGRVVRQPDRYMGLAETSVVASDDDCEDPLTYDQAMVDVDKDEWIKDMNQEMS
RCTSILSGSLWINQMGSIYGLKQGSRSWNIRFDETIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLTDVKKWLASQFQMKDLGETQYVLGIQIVRNL
KNRTLALSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCFKTPQDVEDMRWIPYASAVGSLMYVMLCTRPDICYAIGIVNRYQSNLEIRSLGR