; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G03020 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G03020
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPolyprotein
Genome locationChr7:2454920..2457415
RNA-Seq ExpressionCSPI07G03020
SyntenyCSPI07G03020
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025964.1 uncharacterized protein E6C27_scaffold34G002790 [Cucumis melo var. makuwa]1.8e-25562.03Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAA E LVDY AQSK +KP P K  G   K ++  +++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+       GLL+G+++I G+E I MFDTG SHNFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + G VK VN++ QAI GVA+ V VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY  T+K  + EE  P +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAP +LEEL+RQLKELL+  Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS---------------------------QIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQF
        IQ SKAPYGA       +D S                           Q+R           VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLD+F
Subjt:  IQHSKAPYGA------HQDRS---------------------------QIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQF

Query:  VVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS-------------------------------KGYS
        VVVYLDDIVVYS+T+EEH+QHLRQVFQVLR+NELYIKLEKCSFAK EV+FLGHWI+EG+LMMD +                               KGYS
Subjt:  VVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS-------------------------------KGYS

Query:  NVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRA
         +AAPLTNLLKK+QTW W+ ECQQAFDRLK AVS+EP M LADHTKPFEVHTDASDFAIGGVLMQ+GHPIA++S KLND ERRYTVQEKEMTAIVH +R 
Subjt:  NVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRA

Query:  WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI
        WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDF+L+YK G+ANVVADALSRKA+L +I
Subjt:  WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI

KAA0040659.1 uncharacterized protein E6C27_scaffold370G00130 [Cucumis melo var. makuwa]6.8e-25560.71Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIA AE LVDY AQSK +KP P K  G   K ++  +++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+       GLL+G+++I G+E   MFDTGASHNF++V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + G VK VN++ QAI GVA+ V VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY VT+K  + EE    +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAPP+LEEL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN
        IQ SKAPYGA       +D S                                               Q+R           VTRYGAYEFLVMPFGLTN
Subjt:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN

Query:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------
        APATFCTLMNKLFQ FLD+FVVVYLDDIVVYS+T+EEH+QHLRQVFQVLRDNELYIKLEKCSFAK EVEFL HWI+EG+LMMD +               
Subjt:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------

Query:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT
                        KGYS + APLTNLLKK+QTW W+ ECQQAFDRLK AVS+EP M LADHTKPFEVHTDASDFAIGGVLMQ+GHPIA++S KLNDT
Subjt:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT

Query:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI
        ERRYTVQEKEMTAIVHC+R WRHYLLGSKFT+MTDNVATSYFQTQKKLTPKQARWQDFLAEFDF+L+YK G+ANVVADALSRKAEL  I
Subjt:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI

KAA0050296.1 reverse transcriptase [Cucumis melo var. makuwa]3.1e-24761.82Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+ML+I DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIA AE LVDY AQSK +KP P K  G   K +    ++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+   T    LL+G+V+I G+E   MFDTGASHNFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + GTVK VN++ Q I G+A+ V VKIGDWQ +LDF++LPMDDF IVLGLGFF+KV+ +++S+  TL+I+DG +TTI +++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GV+K+ECY  T+K  + EE    +  VP  +  VL+EYK++M AELPKK PP R VDH+IELEPGAKPP MAPYRMAP + +EL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSK---------------------------APYGAHQDRSQIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLD
        IQ +                            +    H    Q+R           VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLD+FVVVYLD
Subjt:  IQHSK---------------------------APYGAHQDRSQIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLD

Query:  DIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPSKGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEP
        DIVVYS+T+EEH+QHLRQVFQV RDNELYIKLEKCSFAK EVEFLGHWI+EG+LMMD +KGYS + APLTNLLKK+QTW W+ ECQ+AFDRLK A+ +EP
Subjt:  DIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPSKGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEP

Query:  AMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQD
         M LADHTKPFEVHTDASDFAIGGVLMQ+ HPIA++S KLNDTERRYTVQEKEMTAIVHC+R WRHYLLGSKFTV+TDNVATSYFQTQKKLTPKQA+WQD
Subjt:  AMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQD

Query:  FLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI
        FLAEFDF+L+YK G+AN+VADALS KAEL  I  ++
Subjt:  FLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI

KAA0065760.1 polyprotein [Cucumis melo var. makuwa]2.0e-25460.71Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE LVDY AQSK +KP P K  G   K ++  +R+ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+       GLL+G+++I G+E   MFDTGASHNFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLK+  + GTVK VN++ Q I GVA+ V VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+P+ +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY  T+K  + EE    +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAP +LEEL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN
        IQ SKAPYGA       +D S                                               Q+R           VTRYGAYEFLVMPFGLTN
Subjt:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN

Query:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------
        APATFCTLMNKLFQPFLD+FVVVYLDDIVVYS+T+EEH+QHL+QVFQVLRDNELYIKLEKCSFAK EVEFLGHWI+EG+LMMD +               
Subjt:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------

Query:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT
                        KGYS++AAPLTNLLKK+QTW W+ ECQQAFDRLK AVS+E  M LADHTKPFEVHTDASDFAIGGVL+Q+GHPIA++S KLNDT
Subjt:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT

Query:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI
        ERRYTVQEKEMTAIVHC+R WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDF+L+YK G+ANVVADALSRKAEL  I
Subjt:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI

XP_008446938.1 PREDICTED: uncharacterized protein LOC103489499 [Cucumis melo]6.6e-25059.39Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE LVDY  Q K +KP P K  G   K ++  +++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q I  +K+       GLL+G+++I G+E   MFDTGAS+NFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + G VK V ++ QAI GVA+ V VKIGDWQ +LDF VLPMDDF IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY  T+K  + E+  P +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAPP+LEEL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN
        IQ SKAPYGA       +D S                                               Q+R           VTRYGAYEFLVMPFGLTN
Subjt:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN

Query:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------
        APATFCTLMNKLFQPFLD+FV+VYLDDIVVYS+T+EEH+ HLRQVFQVLRDNELYIKLEKCSFAK EVEFLGHWI+EG+LMMD +               
Subjt:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------

Query:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT
                        KGYS +AAPLTNLLKK+QTW W+ ECQ+AFD+LK AVS+EP M L DHTKPF+VHTDASDFAIGGVLMQ+ HPIA++S KLNDT
Subjt:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT

Query:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI
        ERRYTVQEKEMTAIVHC+R WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEF+F+L+YK G+ANVVADALSRKAEL +I  ++
Subjt:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI

TrEMBL top hitse value%identityAlignment
A0A1S3BG92 uncharacterized protein LOC1034894993.2e-25059.39Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE LVDY  Q K +KP P K  G   K ++  +++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q I  +K+       GLL+G+++I G+E   MFDTGAS+NFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + G VK V ++ QAI GVA+ V VKIGDWQ +LDF VLPMDDF IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY  T+K  + E+  P +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAPP+LEEL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN
        IQ SKAPYGA       +D S                                               Q+R           VTRYGAYEFLVMPFGLTN
Subjt:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN

Query:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------
        APATFCTLMNKLFQPFLD+FV+VYLDDIVVYS+T+EEH+ HLRQVFQVLRDNELYIKLEKCSFAK EVEFLGHWI+EG+LMMD +               
Subjt:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------

Query:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT
                        KGYS +AAPLTNLLKK+QTW W+ ECQ+AFD+LK AVS+EP M L DHTKPF+VHTDASDFAIGGVLMQ+ HPIA++S KLNDT
Subjt:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT

Query:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI
        ERRYTVQEKEMTAIVHC+R WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEF+F+L+YK G+ANVVADALSRKAEL +I  ++
Subjt:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI

A0A5A7SJU9 Uncharacterized protein8.6e-25662.03Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAA E LVDY AQSK +KP P K  G   K ++  +++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+       GLL+G+++I G+E I MFDTG SHNFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + G VK VN++ QAI GVA+ V VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY  T+K  + EE  P +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAP +LEEL+RQLKELL+  Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS---------------------------QIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQF
        IQ SKAPYGA       +D S                           Q+R           VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLD+F
Subjt:  IQHSKAPYGA------HQDRS---------------------------QIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQF

Query:  VVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS-------------------------------KGYS
        VVVYLDDIVVYS+T+EEH+QHLRQVFQVLR+NELYIKLEKCSFAK EV+FLGHWI+EG+LMMD +                               KGYS
Subjt:  VVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS-------------------------------KGYS

Query:  NVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRA
         +AAPLTNLLKK+QTW W+ ECQQAFDRLK AVS+EP M LADHTKPFEVHTDASDFAIGGVLMQ+GHPIA++S KLND ERRYTVQEKEMTAIVH +R 
Subjt:  NVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRA

Query:  WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI
        WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDF+L+YK G+ANVVADALSRKA+L +I
Subjt:  WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI

A0A5A7THC0 Reverse transcriptase domain-containing protein3.3e-25560.71Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIA AE LVDY AQSK +KP P K  G   K ++  +++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+       GLL+G+++I G+E   MFDTGASHNF++V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + G VK VN++ QAI GVA+ V VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY VT+K  + EE    +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAPP+LEEL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN
        IQ SKAPYGA       +D S                                               Q+R           VTRYGAYEFLVMPFGLTN
Subjt:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN

Query:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------
        APATFCTLMNKLFQ FLD+FVVVYLDDIVVYS+T+EEH+QHLRQVFQVLRDNELYIKLEKCSFAK EVEFL HWI+EG+LMMD +               
Subjt:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------

Query:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT
                        KGYS + APLTNLLKK+QTW W+ ECQQAFDRLK AVS+EP M LADHTKPFEVHTDASDFAIGGVLMQ+GHPIA++S KLNDT
Subjt:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT

Query:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI
        ERRYTVQEKEMTAIVHC+R WRHYLLGSKFT+MTDNVATSYFQTQKKLTPKQARWQDFLAEFDF+L+YK G+ANVVADALSRKAEL  I
Subjt:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI

A0A5A7VEX8 Polyprotein9.6e-25560.71Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+MLEI DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIAAAE LVDY AQSK +KP P K  G   K ++  +R+ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+       GLL+G+++I G+E   MFDTGASHNFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLK+  + GTVK VN++ Q I GVA+ V VKIGDWQ +LDF+VLPMDDF+IVLGLGFF+KV+ +++S+  TL+I+DG +TTIP+++G+P+ +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GVTK++CY  T+K  + EE    +  VP  + KVL+EYK++MP+ELPKK PPRR VDHEIELEPGAKPPAMAPYRMAP +LEEL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN
        IQ SKAPYGA       +D S                                               Q+R           VTRYGAYEFLVMPFGLTN
Subjt:  IQHSKAPYGA------HQDRS-----------------------------------------------QIR-----------VTRYGAYEFLVMPFGLTN

Query:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------
        APATFCTLMNKLFQPFLD+FVVVYLDDIVVYS+T+EEH+QHL+QVFQVLRDNELYIKLEKCSFAK EVEFLGHWI+EG+LMMD +               
Subjt:  APATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPS---------------

Query:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT
                        KGYS++AAPLTNLLKK+QTW W+ ECQQAFDRLK AVS+E  M LADHTKPFEVHTDASDFAIGGVL+Q+GHPIA++S KLNDT
Subjt:  ----------------KGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDT

Query:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI
        ERRYTVQEKEMTAIVHC+R WRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDF+L+YK G+ANVVADALSRKAEL  I
Subjt:  ERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSRKAELIVI

A0A5D3BWR0 Reverse transcriptase1.5e-24761.82Show/hide
Query:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------
        +EFTT+ML+I DLPEKEALFQF+DGLKDWAK+ELDRRNVQTLDDAIA AE LVDY AQSK +KP P K  G   K +    ++ GK              
Subjt:  QEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSKERKPSPNKREGNHYKHRDSEQREKGK--------------

Query:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE
                 KP                             KQ +D   PQIGS+Q IG +K+   T    LL+G+V+I G+E   MFDTGASHNFM+V E
Subjt:  ---------KP-----------------------------KQGDDT--PQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNE

Query:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM
        AKRLGLKF  + GTVK VN++ Q I G+A+ V VKIGDWQ +LDF++LPMDDF IVLGLGFF+KV+ +++S+  TL+I+DG +TTI +++G+PV +L A+
Subjt:  AKRLGLKFTNKEGTVKAVNSEAQAIKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAM

Query:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY
        QFK GV+K+ECY  T+K  + EE    +  VP  +  VL+EYK++M AELPKK PP R VDH+IELEPGAKPP MAPYRMAP + +EL+RQLKELL+A Y
Subjt:  QFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKEVHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASY

Query:  IQHSK---------------------------APYGAHQDRSQIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLD
        IQ +                            +    H    Q+R           VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLD+FVVVYLD
Subjt:  IQHSK---------------------------APYGAHQDRSQIR-----------VTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLD

Query:  DIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPSKGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEP
        DIVVYS+T+EEH+QHLRQVFQV RDNELYIKLEKCSFAK EVEFLGHWI+EG+LMMD +KGYS + APLTNLLKK+QTW W+ ECQ+AFDRLK A+ +EP
Subjt:  DIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPSKGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEP

Query:  AMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQD
         M LADHTKPFEVHTDASDFAIGGVLMQ+ HPIA++S KLNDTERRYTVQEKEMTAIVHC+R WRHYLLGSKFTV+TDNVATSYFQTQKKLTPKQA+WQD
Subjt:  AMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQD

Query:  FLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI
        FLAEFDF+L+YK G+AN+VADALS KAEL  I  ++
Subjt:  FLAEFDFQLQYKSGKANVVADALSRKAELIVIPAAI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.2e-5539.67Show/hide
Query:  TRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMD
        T++G YE+L MPFGL NAPATF   MN + +P L++  +VYLDDI+V+S +++EH+Q L  VF+ L    L ++L+KC F K E  FLGH +    +  +
Subjt:  TRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMD

Query:  PSK-------------------------------GYSNVAAPLTNLLKKSQTWHWSG-ECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGV
        P K                                ++++A P+T  LKK+     +  E   AF +LK  +S++P + + D TK F + TDASD A+G V
Subjt:  PSK-------------------------------GYSNVAAPLTNLLKKSQTWHWSG-ECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGV

Query:  LMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSR
        L Q+GHP++Y S  LN+ E  Y+  EKE+ AIV   + +RHYLLG  F + +D+   S+    K    K  RW+  L+EFDF ++Y  GK N VADALSR
Subjt:  LMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSR

P0CT34 Transposon Tf2-1 polyprotein5.0e-4333.11Show/hide
Query:  GAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIRE---------
        G +E+LVMP+G++ APA F   +N +     +  VV Y+DDI+++S++  EH++H++ V Q L++  L I   KC F + +V+F+G+ I E         
Subjt:  GAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIRE---------

Query:  -----------GRLMMDPSKG-----------YSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQE
                    R  +    G            S +  PL NLLKK   W W+    QA + +K+ +   P +   D +K   + TDASD A+G VL Q+
Subjt:  -----------GRLMMDPSKG-----------YSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQE

Query:  G-----HPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVA
              +P+ Y S K++  +  Y+V +KEM AI+  ++ WRHYL  +   F ++TD  N+        +    + ARWQ FL +F+F++ Y+ G AN +A
Subjt:  G-----HPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVA

Query:  DALSR
        DALSR
Subjt:  DALSR

P0CT35 Transposon Tf2-2 polyprotein5.0e-4333.11Show/hide
Query:  GAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIRE---------
        G +E+LVMP+G++ APA F   +N +     +  VV Y+DDI+++S++  EH++H++ V Q L++  L I   KC F + +V+F+G+ I E         
Subjt:  GAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIRE---------

Query:  -----------GRLMMDPSKG-----------YSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQE
                    R  +    G            S +  PL NLLKK   W W+    QA + +K+ +   P +   D +K   + TDASD A+G VL Q+
Subjt:  -----------GRLMMDPSKG-----------YSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQE

Query:  G-----HPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVA
              +P+ Y S K++  +  Y+V +KEM AI+  ++ WRHYL  +   F ++TD  N+        +    + ARWQ FL +F+F++ Y+ G AN +A
Subjt:  G-----HPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVA

Query:  DALSR
        DALSR
Subjt:  DALSR

P0CT41 Transposon Tf2-12 polyprotein5.0e-4333.11Show/hide
Query:  GAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIRE---------
        G +E+LVMP+G++ APA F   +N +     +  VV Y+DDI+++S++  EH++H++ V Q L++  L I   KC F + +V+F+G+ I E         
Subjt:  GAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIRE---------

Query:  -----------GRLMMDPSKG-----------YSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQE
                    R  +    G            S +  PL NLLKK   W W+    QA + +K+ +   P +   D +K   + TDASD A+G VL Q+
Subjt:  -----------GRLMMDPSKG-----------YSNVAAPLTNLLKKSQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQE

Query:  G-----HPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVA
              +P+ Y S K++  +  Y+V +KEM AI+  ++ WRHYL  +   F ++TD  N+        +    + ARWQ FL +F+F++ Y+ G AN +A
Subjt:  G-----HPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVA

Query:  DALSR
        DALSR
Subjt:  DALSR

P20825 Retrovirus-related Pol polyprotein from transposon 2972.7e-5233.6Show/hide
Query:  PAELPKKRPPRRGVD-HEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASYIQHSKAPYGAHQDRSQIRVTRY----GAYEFLVMPFGLTNAPATF
        P  +  K+P   G + + + ++         P R   P ++E+  +L +    + I  +K  +    D   I  T +    G YE+L MPFGL NAPATF
Subjt:  PAELPKKRPPRRGVD-HEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASYIQHSKAPYGAHQDRSQIRVTRY----GAYEFLVMPFGLTNAPATF

Query:  CTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPSK-------------------
           MN + +P L++  +VYLDDI+++S ++ EH+  ++ VF  L D  L ++L+KC F K E  FLGH +    +  +P K                   
Subjt:  CTLMNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPSK-------------------

Query:  ------------GYSNVAAPLTNLLKK-SQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRY
                     Y+++A P+T+ LKK ++      E  +AF++LK  + ++P + L D  K F + TDAS+ A+G VL Q GHPI++ S  LND E  Y
Subjt:  ------------GYSNVAAPLTNLLKK-SQTWHWSGECQQAFDRLKEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRY

Query:  TVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSR
        +  EKE+ AIV   + +RHYLLG +F + +D+    +    K+   K  RW+  L+E+ F++ Y  GK N VADALSR
Subjt:  TVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFQLQYKSGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCACAACTTGGCGTCGACGGTACGCCGACCAAAACGGGAATGCCATCCAGACGTGGGAGCAACTCAAGATCTAATTAAGGAAGCACTTTCTTCCCCCACAATGCC
GAAATGGAGTCAAGAGCCAAACTTCGAAGGCTACGACACACAGGAATTTACCACTATTATGCTTGAAATTAGTGATCTGCCCGAGAAGGAAGCCCTATTCCAATTCAGAG
ATGGACTAAAAGATTGGGCTAAGGTTGAGCTAGACCGTCGCAATGTTCAAACATTGGATGACGCTATAGCTGCAGCCGAGATACTTGTCGATTATATAGCCCAATCAAAG
GAGAGAAAACCCAGTCCAAATAAACGTGAAGGGAATCACTACAAGCATAGAGACTCCGAACAAAGAGAAAAAGGAAAGAAGCCCAAGCAAGGAGATGATACACCACAGAT
TGGTTCCTTGCAACACATTGGAGCTCTAAAGCAGGCCAATCAAACAACTAAAGGAGGCCTACTCCATGGCAACGTAAAGATAAACGGGAGAGAGACTATTGTCATGTTTG
ACACCGGTGCAAGCCATAACTTCATGAACGTGAATGAGGCTAAGAGACTAGGCCTCAAGTTCACAAACAAAGAGGGGACTGTCAAGGCGGTAAACTCAGAAGCACAAGCC
ATCAAAGGTGTAGCAAGAGATGTAACAGTCAAAATTGGCGATTGGCAAGGTAAGCTCGACTTCACCGTGCTACCTATGGATGACTTTGACATTGTGCTTGGCCTAGGGTT
CTTCAATAAGGTAATTGCCATCATTAATTCTTCCGGAAGTACCCTCACCATAGTGGATGGTCAAATAACAACCATCCCACTGAAGAAGGGAAGGCCGGTGACCGTGCTCT
TAGCCATGCAATTCAAGGGAGGGGTCACCAAGCACGAGTGCTATGCAGTTACCCTAAAGGAGGCACAACCAGAGGAGACTGGTCCTACACAAAATACCGTCCCCAAAGAA
GTACATAAAGTCCTTGAAGAATACAAGAACGTCATGCCCGCCGAGTTGCCGAAGAAACGTCCCCCGAGAAGGGGAGTAGATCACGAGATTGAGCTGGAGCCAGGAGCCAA
ACCACCCGCTATGGCTCCCTATAGAATGGCCCCTCCAAAACTTGAGGAGTTAAAGAGGCAATTAAAAGAACTCCTCGAAGCTAGCTACATTCAACACTCGAAGGCCCCAT
ATGGTGCACATCAAGATCGATCTCAGATCAGGGTAACCAGGTACGGCGCTTACGAATTCTTAGTAATGCCCTTCGGATTAACAAATGCCCCGGCCACTTTCTGCACTCTG
ATGAACAAACTCTTCCAGCCATTCCTAGATCAGTTTGTTGTTGTGTATCTGGACGACATTGTAGTATACAGTCGAACCATGGAAGAACATATTCAACATCTCAGACAAGT
ATTCCAGGTACTCCGCGACAACGAGTTATATATTAAGCTGGAAAAATGCTCATTTGCCAAACCAGAAGTGGAGTTTTTGGGACATTGGATTAGAGAAGGCAGATTAATGA
TGGATCCATCAAAGGGGTACTCAAATGTTGCTGCACCCCTCACCAACTTACTAAAGAAAAGCCAAACCTGGCATTGGAGTGGGGAGTGTCAACAAGCCTTTGACAGATTG
AAAGAAGCAGTCTCGAAAGAGCCTGCGATGACTTTAGCTGATCACACCAAACCATTTGAGGTGCACACTGATGCATCAGACTTTGCAATAGGGGGTGTCTTGATGCAGGA
AGGCCATCCTATAGCCTACAAGAGCTGCAAACTCAATGATACGGAGCGGCGATACACCGTGCAAGAAAAGGAGATGACCGCCATCGTCCACTGCGTACGAGCGTGGAGAC
ACTACCTGTTAGGCAGTAAATTCACGGTGATGACAGACAATGTGGCAACCAGCTACTTCCAGACGCAGAAGAAACTCACACCAAAGCAAGCTCGATGGCAAGACTTTCTT
GCCGAGTTTGATTTTCAACTACAATACAAGTCGGGCAAAGCTAATGTAGTTGCTGATGCCTTAAGCCGAAAGGCAGAGCTTATTGTTATTCCTGCTGCCATAATCTCATA
A
mRNA sequenceShow/hide mRNA sequence
ATGCCGCACAACTTGGCGTCGACGGTACGCCGACCAAAACGGGAATGCCATCCAGACGTGGGAGCAACTCAAGATCTAATTAAGGAAGCACTTTCTTCCCCCACAATGCC
GAAATGGAGTCAAGAGCCAAACTTCGAAGGCTACGACACACAGGAATTTACCACTATTATGCTTGAAATTAGTGATCTGCCCGAGAAGGAAGCCCTATTCCAATTCAGAG
ATGGACTAAAAGATTGGGCTAAGGTTGAGCTAGACCGTCGCAATGTTCAAACATTGGATGACGCTATAGCTGCAGCCGAGATACTTGTCGATTATATAGCCCAATCAAAG
GAGAGAAAACCCAGTCCAAATAAACGTGAAGGGAATCACTACAAGCATAGAGACTCCGAACAAAGAGAAAAAGGAAAGAAGCCCAAGCAAGGAGATGATACACCACAGAT
TGGTTCCTTGCAACACATTGGAGCTCTAAAGCAGGCCAATCAAACAACTAAAGGAGGCCTACTCCATGGCAACGTAAAGATAAACGGGAGAGAGACTATTGTCATGTTTG
ACACCGGTGCAAGCCATAACTTCATGAACGTGAATGAGGCTAAGAGACTAGGCCTCAAGTTCACAAACAAAGAGGGGACTGTCAAGGCGGTAAACTCAGAAGCACAAGCC
ATCAAAGGTGTAGCAAGAGATGTAACAGTCAAAATTGGCGATTGGCAAGGTAAGCTCGACTTCACCGTGCTACCTATGGATGACTTTGACATTGTGCTTGGCCTAGGGTT
CTTCAATAAGGTAATTGCCATCATTAATTCTTCCGGAAGTACCCTCACCATAGTGGATGGTCAAATAACAACCATCCCACTGAAGAAGGGAAGGCCGGTGACCGTGCTCT
TAGCCATGCAATTCAAGGGAGGGGTCACCAAGCACGAGTGCTATGCAGTTACCCTAAAGGAGGCACAACCAGAGGAGACTGGTCCTACACAAAATACCGTCCCCAAAGAA
GTACATAAAGTCCTTGAAGAATACAAGAACGTCATGCCCGCCGAGTTGCCGAAGAAACGTCCCCCGAGAAGGGGAGTAGATCACGAGATTGAGCTGGAGCCAGGAGCCAA
ACCACCCGCTATGGCTCCCTATAGAATGGCCCCTCCAAAACTTGAGGAGTTAAAGAGGCAATTAAAAGAACTCCTCGAAGCTAGCTACATTCAACACTCGAAGGCCCCAT
ATGGTGCACATCAAGATCGATCTCAGATCAGGGTAACCAGGTACGGCGCTTACGAATTCTTAGTAATGCCCTTCGGATTAACAAATGCCCCGGCCACTTTCTGCACTCTG
ATGAACAAACTCTTCCAGCCATTCCTAGATCAGTTTGTTGTTGTGTATCTGGACGACATTGTAGTATACAGTCGAACCATGGAAGAACATATTCAACATCTCAGACAAGT
ATTCCAGGTACTCCGCGACAACGAGTTATATATTAAGCTGGAAAAATGCTCATTTGCCAAACCAGAAGTGGAGTTTTTGGGACATTGGATTAGAGAAGGCAGATTAATGA
TGGATCCATCAAAGGGGTACTCAAATGTTGCTGCACCCCTCACCAACTTACTAAAGAAAAGCCAAACCTGGCATTGGAGTGGGGAGTGTCAACAAGCCTTTGACAGATTG
AAAGAAGCAGTCTCGAAAGAGCCTGCGATGACTTTAGCTGATCACACCAAACCATTTGAGGTGCACACTGATGCATCAGACTTTGCAATAGGGGGTGTCTTGATGCAGGA
AGGCCATCCTATAGCCTACAAGAGCTGCAAACTCAATGATACGGAGCGGCGATACACCGTGCAAGAAAAGGAGATGACCGCCATCGTCCACTGCGTACGAGCGTGGAGAC
ACTACCTGTTAGGCAGTAAATTCACGGTGATGACAGACAATGTGGCAACCAGCTACTTCCAGACGCAGAAGAAACTCACACCAAAGCAAGCTCGATGGCAAGACTTTCTT
GCCGAGTTTGATTTTCAACTACAATACAAGTCGGGCAAAGCTAATGTAGTTGCTGATGCCTTAAGCCGAAAGGCAGAGCTTATTGTTATTCCTGCTGCCATAATCTCATA
A
Protein sequenceShow/hide protein sequence
MPHNLASTVRRPKRECHPDVGATQDLIKEALSSPTMPKWSQEPNFEGYDTQEFTTIMLEISDLPEKEALFQFRDGLKDWAKVELDRRNVQTLDDAIAAAEILVDYIAQSK
ERKPSPNKREGNHYKHRDSEQREKGKKPKQGDDTPQIGSLQHIGALKQANQTTKGGLLHGNVKINGRETIVMFDTGASHNFMNVNEAKRLGLKFTNKEGTVKAVNSEAQA
IKGVARDVTVKIGDWQGKLDFTVLPMDDFDIVLGLGFFNKVIAIINSSGSTLTIVDGQITTIPLKKGRPVTVLLAMQFKGGVTKHECYAVTLKEAQPEETGPTQNTVPKE
VHKVLEEYKNVMPAELPKKRPPRRGVDHEIELEPGAKPPAMAPYRMAPPKLEELKRQLKELLEASYIQHSKAPYGAHQDRSQIRVTRYGAYEFLVMPFGLTNAPATFCTL
MNKLFQPFLDQFVVVYLDDIVVYSRTMEEHIQHLRQVFQVLRDNELYIKLEKCSFAKPEVEFLGHWIREGRLMMDPSKGYSNVAAPLTNLLKKSQTWHWSGECQQAFDRL
KEAVSKEPAMTLADHTKPFEVHTDASDFAIGGVLMQEGHPIAYKSCKLNDTERRYTVQEKEMTAIVHCVRAWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFL
AEFDFQLQYKSGKANVVADALSRKAELIVIPAAIIS