; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015614 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015614
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr12:17701343..17710680
RNA-Seq ExpressionLag0015614
SyntenyLag0015614
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.9e-21356.83Show/hide
Query:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES
        + +  L  GE+TL+VG GE++SA+ VGD+ +FF ++RY++L +V  VP +KRNLISI+C+LE  Y++SF VN  FI  KG  ICSA  ENNLY LRP  +
Subjt:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES

Query:  RAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL--------------
          +LN EMF+T ETQ K+QKVS + YLWHLRLGHINLNRI RLVK+G+L+QLED +LP CESCLEGKMTKR FTGKG RA  PLEL              
Subjt:  RAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL--------------

Query:  ------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDMVR
                                      K  EY                                        LSA  TPQQNGVSERRNRTLLDMVR
Subjt:  ------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDMVR

Query:  SMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV------------KIVL-----SELSG------------KAT
        SMMSYAQLP SFWGYA+ETA++ILN  PSKSV ETP+ELWKGRK SLR+FRIWGCPAHV            K+ L      E  G             +T
Subjt:  SMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV------------KIVL-----SELSG------------KAT

Query:  NESTRVVDEA---GPSTRV---------ADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMN
        N +    D      P +++          D+PSS    ST+V+D+ + S Q+H SQELR+ RRSGR+V QP RYLGL ET  +IPDDGVEDPLT++QAMN
Subjt:  NESTRVVDEA---GPSTRV---------ADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMN

Query:  DVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIW
        DVD DQW+KAM+LEMESMYFN VW LVDLP  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQKE VDYEETFSPVAMLKSIRILLSIATFY+YEIW
Subjt:  DVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIW

Query:  QKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNSVAFLVLYVDDILLIGNDVE
        Q DVKTAFLNGNL+E+I+M QPEGF+AQ QEQK                                 NVDEPCVYK+I+N+ VAFL+LYVDDILLIGNDVE
Subjt:  QKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNSVAFLVLYVDDILLIGNDVE

Query:  YLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
        YL D+KKWL+T+FQMK LGEAQY+LGIQI+RN
Subjt:  YLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]3.8e-23059.12Show/hide
Query:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES
        + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL+E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E+
Subjt:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES

Query:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------
        +A+LN+EMF+TA TQ KRQ++S   +TYLWHLRLGHINL+RIGRLVKNGLL++L+D +LP CESCLEGKMTKRPFTGKGYRA EPLEL            
Subjt:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------

Query:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM
                                        K  EY                                        LSA GTPQQNGVSERRNRTLLDM
Subjt:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM

Query:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------
        VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAHV                                       
Subjt:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------

Query:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP
                           K+VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM RRSGR+V QP RYLGL ET 
Subjt:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP

Query:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM
         VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+E VDYEETFSPVAM
Subjt:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM

Query:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS
        LKSIRILLSIATFYDYEIWQ DVKTAFLNGNL+E+IFMSQPEGF+ QGQEQK                                 NVDEPCVYK+I    
Subjt:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS

Query:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
        VAFLVLYVDDILLIGNDV YL D+K WL+ +FQMK LGEAQYVLGIQIIR+
Subjt:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-22758.59Show/hide
Query:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES
        + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL+E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E+
Subjt:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES

Query:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------
        +A+LN+EMF+TA TQ KRQ++S   +TYLWHLRLGHINL+RIGRLVK+GLL++L+D +LP CESCLEGKMTKRPFTGKGYRA EPLEL            
Subjt:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------

Query:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM
                                        K  EY                                        LSA GTPQQNGVSERRNRTLLDM
Subjt:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM

Query:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------
        VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAHV                                       
Subjt:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------

Query:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP
                           K+VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM RRSGR+V QP RYLGL ET 
Subjt:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP

Query:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM
         VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT+KE VDYEETFS VAM
Subjt:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM

Query:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS
        LKSIRILLSIA FYDYEIWQ DVKTAFLNGNL+E+IFMSQPEGF+ QGQEQK                                 NVDEPCVYK+I    
Subjt:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS

Query:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
        VAFLVLYVDDILLIGNDV YL D+K WL+ +FQMK LGE QYVLGIQIIR+
Subjt:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-20654.32Show/hide
Query:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI
        G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR +
Subjt:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI

Query:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------
         S+A+LN EMFKTA TQ KR K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGKG+RA EPLEL          
Subjt:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------

Query:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL
                                          K  EY                                        LSA GTPQQNGVSERRNRTLL
Subjt:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL

Query:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------
        DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCPAHV                                     
Subjt:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------

Query:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE
                             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR  RRSGR+   P RY+ L E
Subjt:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE

Query:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV
        T TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMYFN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Subjt:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN
        AMLKSIRILLSIA ++DYEIWQ DVKTAFLNGNL+E I+M QPEGF+  GQEQK                                  VDEPCVYKRIIN
Subjt:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN

Query:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
         SVAFLVLYVDDILLIGND+  L DIK+WL+T+FQMK LGEAQ+VLGIQI R+
Subjt:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-20654.32Show/hide
Query:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI
        G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR +
Subjt:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI

Query:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------
         S+A+LN EMFKTA TQ KR K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGKG+RA EPLEL          
Subjt:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------

Query:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL
                                          K  EY                                        LSA GTPQQNGVSERRNRTLL
Subjt:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL

Query:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------
        DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCPAHV                                     
Subjt:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------

Query:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE
                             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR  RRSGR+   P RY+ L E
Subjt:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE

Query:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV
        T TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMYFN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Subjt:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN
        AMLKSIRILLSIA ++DYEIWQ DVKTAFLNGNL+E I+M QPEGF+  GQEQK                                  VDEPCVYKRIIN
Subjt:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN

Query:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
         SVAFLVLYVDDILLIGND+  L DIK+WL+T+FQMK LGEAQ+VLGIQI R+
Subjt:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.4e-20754.32Show/hide
Query:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI
        G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR +
Subjt:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI

Query:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------
         S+A+LN EMFKTA TQ KR K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGKG+RA EPLEL          
Subjt:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------

Query:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL
                                          K  EY                                        LSA GTPQQNGVSERRNRTLL
Subjt:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL

Query:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------
        DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCPAHV                                     
Subjt:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------

Query:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE
                             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR  RRSGR+   P RY+ L E
Subjt:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE

Query:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV
        T TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMYFN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Subjt:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN
        AMLKSIRILLSIA ++DYEIWQ DVKTAFLNGNL+E I+M QPEGF+  GQEQK                                  VDEPCVYKRIIN
Subjt:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN

Query:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
         SVAFLVLYVDDILLIGND+  L DIK+WL+T+FQMK LGEAQ+VLGIQI R+
Subjt:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

A0A5A7T2V9 Gag/pol protein1.1e-22758.59Show/hide
Query:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES
        + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL+E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E+
Subjt:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES

Query:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------
        +A+LN+EMF+TA TQ KRQ++S   +TYLWHLRLGHINL+RIGRLVK+GLL++L+D +LP CESCLEGKMTKRPFTGKGYRA EPLEL            
Subjt:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------

Query:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM
                                        K  EY                                        LSA GTPQQNGVSERRNRTLLDM
Subjt:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM

Query:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------
        VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAHV                                       
Subjt:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------

Query:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP
                           K+VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM RRSGR+V QP RYLGL ET 
Subjt:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP

Query:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM
         VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT+KE VDYEETFS VAM
Subjt:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM

Query:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS
        LKSIRILLSIA FYDYEIWQ DVKTAFLNGNL+E+IFMSQPEGF+ QGQEQK                                 NVDEPCVYK+I    
Subjt:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS

Query:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
        VAFLVLYVDDILLIGNDV YL D+K WL+ +FQMK LGE QYVLGIQIIR+
Subjt:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

A0A5A7TZD0 Gag/pol protein1.9e-23059.12Show/hide
Query:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES
        + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL+E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E+
Subjt:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES

Query:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------
        +A+LN+EMF+TA TQ KRQ++S   +TYLWHLRLGHINL+RIGRLVKNGLL++L+D +LP CESCLEGKMTKRPFTGKGYRA EPLEL            
Subjt:  RAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL------------

Query:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM
                                        K  EY                                        LSA GTPQQNGVSERRNRTLLDM
Subjt:  --------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDM

Query:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------
        VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAHV                                       
Subjt:  VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV---------------------------------------

Query:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP
                           K+VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM RRSGR+V QP RYLGL ET 
Subjt:  -------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETP

Query:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM
         VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+E VDYEETFSPVAM
Subjt:  TVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM

Query:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS
        LKSIRILLSIATFYDYEIWQ DVKTAFLNGNL+E+IFMSQPEGF+ QGQEQK                                 NVDEPCVYK+I    
Subjt:  LKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNS

Query:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
        VAFLVLYVDDILLIGNDV YL D+K WL+ +FQMK LGEAQYVLGIQIIR+
Subjt:  VAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

A0A5D3CPJ6 Gag/pol protein6.4e-20754.32Show/hide
Query:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI
        G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR +
Subjt:  GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPI

Query:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------
         S+A+LN EMFKTA TQ KR K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGKG+RA EPLEL          
Subjt:  ESRAILNNEMFKTAETQPKRQKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL----------

Query:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL
                                          K  EY                                        LSA GTPQQNGVSERRNRTLL
Subjt:  ----------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLL

Query:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------
        DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCPAHV                                     
Subjt:  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV-------------------------------------

Query:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE
                             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR  RRSGR+   P RY+ L E
Subjt:  ---------------------KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAE

Query:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV
        T TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMYFN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Subjt:  TPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV

Query:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN
        AMLKSIRILLSIA ++DYEIWQ DVKTAFLNGNL+E I+M QPEGF+  GQEQK                                  VDEPCVYKRIIN
Subjt:  AMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIIN

Query:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
         SVAFLVLYVDDILLIGND+  L DIK+WL+T+FQMK LGEAQ+VLGIQI R+
Subjt:  NSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

E2GK51 Gag/pol protein (Fragment)9.2e-21456.83Show/hide
Query:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES
        + +  L  GE+TL+VG GE++SA+ VGD+ +FF ++RY++L +V  VP +KRNLISI+C+LE  Y++SF VN  FI  KG  ICSA  ENNLY LRP  +
Subjt:  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIES

Query:  RAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL--------------
          +LN EMF+T ETQ K+QKVS + YLWHLRLGHINLNRI RLVK+G+L+QLED +LP CESCLEGKMTKR FTGKG RA  PLEL              
Subjt:  RAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLEL--------------

Query:  ------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDMVR
                                      K  EY                                        LSA  TPQQNGVSERRNRTLLDMVR
Subjt:  ------------------------------KNMEY----------------------------------------LSALGTPQQNGVSERRNRTLLDMVR

Query:  SMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV------------KIVL-----SELSG------------KAT
        SMMSYAQLP SFWGYA+ETA++ILN  PSKSV ETP+ELWKGRK SLR+FRIWGCPAHV            K+ L      E  G             +T
Subjt:  SMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV------------KIVL-----SELSG------------KAT

Query:  NESTRVVDEA---GPSTRV---------ADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMN
        N +    D      P +++          D+PSS    ST+V+D+ + S Q+H SQELR+ RRSGR+V QP RYLGL ET  +IPDDGVEDPLT++QAMN
Subjt:  NESTRVVDEA---GPSTRV---------ADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMN

Query:  DVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIW
        DVD DQW+KAM+LEMESMYFN VW LVDLP  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQKE VDYEETFSPVAMLKSIRILLSIATFY+YEIW
Subjt:  DVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIW

Query:  QKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNSVAFLVLYVDDILLIGNDVE
        Q DVKTAFLNGNL+E+I+M QPEGF+AQ QEQK                                 NVDEPCVYK+I+N+ VAFL+LYVDDILLIGNDVE
Subjt:  QKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQK---------------------------------NVDEPCVYKRIINNSVAFLVLYVDDILLIGNDVE

Query:  YLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
        YL D+KKWL+T+FQMK LGEAQY+LGIQI+RN
Subjt:  YLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.8e-3926.03Show/hide
Query:  GKGYRANEPLEL---KNMEY-LSALGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSV---SETPFELWKGRKVSLRHFR
        G+ Y +NE  +    K + Y L+   TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA Y++N  PS+++   S+TP+E+W  +K  L+H R
Subjt:  GKGYRANEPLEL---KNMEY-LSALGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSV---SETPFELWKGRKVSLRHFR

Query:  IWGCPAHVKIVLSELSGKATNESTR---------------------------VVDEAG---------PSTRVADEPSSE----AGPSTRVI--DEPSTSG
        ++G   +V I      GK  ++S +                           VVDE            +  + D   SE       S ++I  + P+ S 
Subjt:  IWGCPAHVKIVLSELSGKATNESTR---------------------------VVDEAG---------PSTRVADEPSSE----AGPSTRVI--DEPSTSG

Query:  QAHPSQELRMLRRS--------GRIVIQPE---------------------RYL-------------------------GLAETPTVIPDDGVEDP----
        +    Q L+  + S         R +IQ E                     +Y                            +ET   + + G+++P    
Subjt:  QAHPSQELRMLRRS--------GRIVIQPE---------------------RYL-------------------------GLAETPTVIPDDGVEDP----

Query:  -------------------------------LTFRQAMNDV-----------DHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAG
                                       L      NDV           D   W +A++ E+ +   N  W +   PE    +  +W++  K +  G
Subjt:  -------------------------------LTFRQAMNDV-----------DHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAG

Query:  KVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEG-------------------------F
            +KARLVA+G+TQK ++DYEETF+PVA + S R +LS+   Y+ ++ Q DVKTAFLNG L E I+M  P+G                         F
Subjt:  KVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEG-------------------------F

Query:  VAQGQEQKNVD------EPCVY---KRIINNSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQI
            Q  K  +      + C+Y   K  IN ++ +++LYVDD+++   D+  + + K++L  +F+M  L E ++ +GI+I
Subjt:  VAQGQEQKNVD------EPCVY---KRIINNSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-6434.08Show/hide
Query:  GTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCP--AHV-KIVLSELSGK------
        GTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA Y++N +PS  ++ E P  +W  ++VS  H +++GC   AHV K   ++L  K      
Subjt:  GTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCP--AHV-KIVLSELSGK------

Query:  -----------------------------------ATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQ-------------------AHPSQ
                                           A + S +V +   P+       S+    +    DE S  G+                    HP+Q
Subjt:  -----------------------------------ATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQ-------------------AHPSQ

Query:  ---ELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQ
           + + LRRS R  ++  RY     T  V+  D   +P + ++ ++  + +Q +KAM  EMES+  N  + LV+LP+G +P+ CKW++K K+D   K+ 
Subjt:  ---ELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQ

Query:  TFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQ---------------------
         +KARLV KG+ QK+ +D++E FSPV  + SIR +LS+A   D E+ Q DVKTAFL+G+L+E I+M QPEGF   G++                      
Subjt:  TFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQ---------------------

Query:  ------------KNVDEPCVY-KRIINNSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPERDAEITWV
                    K   +PCVY KR   N+   L+LYVDD+L++G D   +A +K  LS  F MK LG AQ +LG++I+R  ER +   W+
Subjt:  ------------KNVDEPCVY-KRIINNSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPERDAEITWV

P92520 Uncharacterized mitochondrial protein AtMg008204.7e-1340.7Show/hide
Query:  WVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N+ W LV  P     +GCKW++K K  + G +   KARLVAKG+ Q+E + + ET+SPV    +IR +L++A
Subjt:  WVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-3129.3Show/hide
Query:  SELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEP------STSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQA
        S+L+   +  +        P+T  +   +S   PS  +   P      + + QA P     M  R+   +I+P     LA     +      +P T  QA
Subjt:  SELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEP------STSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQA

Query:  MNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEG-VKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDY
        + D   ++W  AM  E+ +   N  WDLV  P   V  +GC+WI+ +K ++ G +  +KARLVAKGY Q+  +DY ETFSPV    SIRI+L +A    +
Subjt:  MNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEG-VKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDY

Query:  EIWQKDVKTAFLNGNLDENIFMSQPEGFVAQ--------------GQEQ-------------------KNVDEPCVYKRIINNSVAFLVLYVDDILLIGN
         I Q DV  AFL G L ++++MSQP GF+ +              G +Q                    +V +  ++      S+ ++++YVDDIL+ GN
Subjt:  EIWQKDVKTAFLNGNLDENIFMSQPEGFVAQ--------------GQEQ-------------------KNVDEPCVYKRIINNSVAFLVLYVDDILLIGN

Query:  DVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPE--RDAEITWVVSAERDAEIITTVVLIPPSPPSPNI
        D   L +    LS  F +K   E  Y LGI+  R P     ++  +++       +IT   +  P  PSP +
Subjt:  DVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPE--RDAEITWVVSAERDAEIITTVVLIPPSPPSPNI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.5e-0832.91Show/hide
Query:  TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCPAH
        TP+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N  P+  +  E+PF+   G   +    R++GC  +
Subjt:  TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCPAH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.4e-3235.51Show/hide
Query:  DPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLV-DLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILL
        +P T  QAM D   D+W +AM  E+ +   N  WDLV   P  V  +GC+WI+ +K ++ G +  +KARLVAKGY Q+  +DY ETFSPV    SIRI+L
Subjt:  DPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLV-DLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQ--------------GQEQ-------------------KNVDEPCVYKRIINNSVAFLVLYV
         +A    + I Q DV  AFL G L + ++MSQP GFV +              G +Q                    ++ +  ++      S+ ++++YV
Subjt:  SIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQ--------------GQEQ-------------------KNVDEPCVYKRIINNSVAFLVLYV

Query:  DDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPE
        DDIL+ GND   L      LS  F +K   +  Y LGI+  R P+
Subjt:  DDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-0928.89Show/hide
Query:  SQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLELKNMEYLSALG---------TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYIL
        SQ++D T  + +S +E +   R  T       E + L+  +YLS  G         TP+ NG+SER++R +++M  +++S+A +P ++W YA   AVY++
Subjt:  SQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLELKNMEYLSALG---------TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYIL

Query:  NVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCPAH
        N  P+  +  ++PF+   G+  +    +++GC  +
Subjt:  NVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCPAH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.2e-3733.6Show/hide
Query:  EDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILL
        ++P T+ +A   +    W  AMD E+ +M     W++  LP   KPIGCKW+YK K ++ G ++ +KARLVAKGYTQ+E +D+ ETFSPV  L S++++L
Subjt:  EDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQKNVDEPCVYKRII---------------------------NNSVAFL----------
        +I+  Y++ + Q D+  AFLNG+LDE I+M  P G+ A+  +    +  C  K+ I                           ++   FL          
Subjt:  SIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQKNVDEPCVYKRII---------------------------NNSVAFL----------

Query:  VLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN
        ++YVDDI++  N+   + ++K  L + F+++ LG  +Y LG++I R+
Subjt:  VLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRN

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.3e-0633.75Show/hide
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCPAHVKIVLSELSGKA
        NRT+++ VRSM+    LP +F   A  TAV+I+N  PS +++   P E+W     +  + R +GC A++     +L  +A
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGRKVSLRHFRIWGCPAHVKIVLSELSGKA

ATMG00810.1 DNA/RNA polymerases superfamily protein4.1e-0450Show/hide
Query:  FLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNP
        +L+LYVDDILL G+    L  +   LS+ F MK LG   Y LGIQI  +P
Subjt:  FLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.3e-1440.7Show/hide
Query:  WVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N+ W LV  P     +GCKW++K K  + G +   KARLVAKG+ Q+E + + ET+SPV    +IR +L++A
Subjt:  WVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTATTCCGGGAATAGGGATTGGCTGAGATTGTCTCAATTCAGAGGAGGAGTAGCTGACTACCCTACGTTAGATGCTGGGGAAATGACTCTCAGAGTTGGGATTGG
GGAGCTCATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACGTTATATGTTATTAGATAACGTATATATAGTTCCCAGAATAAAAAGAAATTTGA
TTTCTATTTCTTGTTTGCTAGAACAAGAGTATTCAGTGTCCTTTTCTGTGAATGGAGCTTTCATTACTAAAAAGGGTGCTGATATATGTTCTGCAAAAATGGAGAACAAT
TTGTATGTCTTAAGACCAATAGAGTCTAGGGCCATTTTGAACAATGAAATGTTTAAAACAGCTGAGACTCAACCAAAAAGGCAAAAAGTTTCTCAAAGTACCTATCTTTG
GCACTTGAGACTTGGCCACATAAATCTCAATAGGATTGGGAGATTGGTTAAGAACGGACTTCTAAGCCAGTTAGAAGATGGTACTTTACCTCTATGTGAGTCATGTCTCG
AAGGTAAGATGACCAAACGACCTTTTACTGGAAAAGGTTATCGTGCCAATGAGCCCTTAGAACTCAAGAACATGGAATATCTCTCAGCACTAGGTACACCTCAGCAGAAT
GGTGTATCTGAAAGGAGAAACAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCCAGTTCTTTTTGGGGTTATGCAGTAGAGACTGCAGTTTA
TATTTTGAATGTTGCTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTTTGGAAAGGGCGTAAAGTTAGTTTACGTCATTTCAGAATTTGGGGCTGTCCTGCACATG
TTAAAATTGTATTAAGTGAACTTTCTGGTAAAGCTACAAATGAATCAACAAGAGTTGTTGATGAGGCTGGACCTTCAACAAGAGTTGCTGATGAGCCCAGTTCAGAGGCT
GGACCTTCAACAAGAGTTATTGATGAGCCTAGTACATCTGGTCAAGCCCATCCTTCTCAAGAGTTGAGAATGCTTCGACGTAGTGGGAGGATTGTTATTCAACCTGAACG
CTATCTGGGTTTAGCAGAAACTCCAACTGTCATACCTGATGATGGGGTTGAGGATCCATTGACCTTTAGACAGGCAATGAATGATGTGGACCATGACCAGTGGGTCAAAG
CTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTCTGGGATCTTGTAGACCTACCTGAAGGGGTTAAACCCATAGGGTGCAAATGGATCTACAAGAGAAAACGA
GACACAGCTGGGAAAGTACAGACTTTCAAAGCTCGACTTGTGGCAAAGGGTTATACCCAAAAGGAGGAGGTGGACTATGAGGAAACCTTCTCCCCAGTTGCCATGTTAAA
GTCGATAAGAATTCTCTTGTCCATTGCCACCTTTTATGATTATGAGATTTGGCAAAAGGACGTCAAGACTGCCTTTCTGAATGGCAATCTTGATGAGAACATCTTTATGT
CTCAACCAGAGGGGTTCGTTGCTCAAGGTCAGGAGCAAAAAAACGTTGATGAGCCTTGTGTTTACAAAAGGATTATCAACAATTCAGTAGCTTTTCTGGTGTTGTATGTA
GATGATATCCTTCTCATTGGGAATGATGTAGAATATCTGGCTGACATTAAAAAATGGCTATCGACTGAGTTCCAAATGAAATATTTGGGTGAAGCTCAGTATGTTCTAGG
GATTCAAATTATCAGGAATCCGGAGCGTGACGCGGAAATCACGTGGGTTGTGAGTGCGGAGCGTGACGCGGAAATCATTACCACGGTTGTACTGATACCCCCTTCCCCAC
CTTCCCCAAACATTTTAGATGTTACAGGTATCGAGGATGATCCGGACCTTGGTGGCGAGGAGGAGAACTGGGAGGAAAGATTCCTAGAGTTTCTAGAGTTCAGAGTTAAG
GGTTGTAGTAGTGTCCCTAGGCTAAGGGTTGGAAAACCTGGGGCGTTACAAAACTACAGTCCATCTTCTAAGATGAGTAGTCTCTCTCTCTGTGGAACTCTCTGTAGCTT
TGATCTACAAAGTCTTCAAGAAGCTTTTGTCTTGAGTCTTGAAGGAGGATTTGAGCTTGAACCTTCATTTTTGCAGGAGAATCTGAGCTTCAATCTTCAGTTCTCGTCGA
GACGCTGTAAAGACAGCGTCGCGACGCTGCCTCTAAACACGCGCGTTCTGGAAAGGAAATTACGGCAGCGTCGAGACGCTAAGGTCGCAGCGTCGAGACGCTGCGATGAT
TTCGCGTCTATCCAAATAAGGAATGCTAGCGTCGAGACGCCTAAGGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTATTCCGGGAATAGGGATTGGCTGAGATTGTCTCAATTCAGAGGAGGAGTAGCTGACTACCCTACGTTAGATGCTGGGGAAATGACTCTCAGAGTTGGGATTGG
GGAGCTCATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACGTTATATGTTATTAGATAACGTATATATAGTTCCCAGAATAAAAAGAAATTTGA
TTTCTATTTCTTGTTTGCTAGAACAAGAGTATTCAGTGTCCTTTTCTGTGAATGGAGCTTTCATTACTAAAAAGGGTGCTGATATATGTTCTGCAAAAATGGAGAACAAT
TTGTATGTCTTAAGACCAATAGAGTCTAGGGCCATTTTGAACAATGAAATGTTTAAAACAGCTGAGACTCAACCAAAAAGGCAAAAAGTTTCTCAAAGTACCTATCTTTG
GCACTTGAGACTTGGCCACATAAATCTCAATAGGATTGGGAGATTGGTTAAGAACGGACTTCTAAGCCAGTTAGAAGATGGTACTTTACCTCTATGTGAGTCATGTCTCG
AAGGTAAGATGACCAAACGACCTTTTACTGGAAAAGGTTATCGTGCCAATGAGCCCTTAGAACTCAAGAACATGGAATATCTCTCAGCACTAGGTACACCTCAGCAGAAT
GGTGTATCTGAAAGGAGAAACAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCCAGTTCTTTTTGGGGTTATGCAGTAGAGACTGCAGTTTA
TATTTTGAATGTTGCTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTTTGGAAAGGGCGTAAAGTTAGTTTACGTCATTTCAGAATTTGGGGCTGTCCTGCACATG
TTAAAATTGTATTAAGTGAACTTTCTGGTAAAGCTACAAATGAATCAACAAGAGTTGTTGATGAGGCTGGACCTTCAACAAGAGTTGCTGATGAGCCCAGTTCAGAGGCT
GGACCTTCAACAAGAGTTATTGATGAGCCTAGTACATCTGGTCAAGCCCATCCTTCTCAAGAGTTGAGAATGCTTCGACGTAGTGGGAGGATTGTTATTCAACCTGAACG
CTATCTGGGTTTAGCAGAAACTCCAACTGTCATACCTGATGATGGGGTTGAGGATCCATTGACCTTTAGACAGGCAATGAATGATGTGGACCATGACCAGTGGGTCAAAG
CTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTCTGGGATCTTGTAGACCTACCTGAAGGGGTTAAACCCATAGGGTGCAAATGGATCTACAAGAGAAAACGA
GACACAGCTGGGAAAGTACAGACTTTCAAAGCTCGACTTGTGGCAAAGGGTTATACCCAAAAGGAGGAGGTGGACTATGAGGAAACCTTCTCCCCAGTTGCCATGTTAAA
GTCGATAAGAATTCTCTTGTCCATTGCCACCTTTTATGATTATGAGATTTGGCAAAAGGACGTCAAGACTGCCTTTCTGAATGGCAATCTTGATGAGAACATCTTTATGT
CTCAACCAGAGGGGTTCGTTGCTCAAGGTCAGGAGCAAAAAAACGTTGATGAGCCTTGTGTTTACAAAAGGATTATCAACAATTCAGTAGCTTTTCTGGTGTTGTATGTA
GATGATATCCTTCTCATTGGGAATGATGTAGAATATCTGGCTGACATTAAAAAATGGCTATCGACTGAGTTCCAAATGAAATATTTGGGTGAAGCTCAGTATGTTCTAGG
GATTCAAATTATCAGGAATCCGGAGCGTGACGCGGAAATCACGTGGGTTGTGAGTGCGGAGCGTGACGCGGAAATCATTACCACGGTTGTACTGATACCCCCTTCCCCAC
CTTCCCCAAACATTTTAGATGTTACAGGTATCGAGGATGATCCGGACCTTGGTGGCGAGGAGGAGAACTGGGAGGAAAGATTCCTAGAGTTTCTAGAGTTCAGAGTTAAG
GGTTGTAGTAGTGTCCCTAGGCTAAGGGTTGGAAAACCTGGGGCGTTACAAAACTACAGTCCATCTTCTAAGATGAGTAGTCTCTCTCTCTGTGGAACTCTCTGTAGCTT
TGATCTACAAAGTCTTCAAGAAGCTTTTGTCTTGAGTCTTGAAGGAGGATTTGAGCTTGAACCTTCATTTTTGCAGGAGAATCTGAGCTTCAATCTTCAGTTCTCGTCGA
GACGCTGTAAAGACAGCGTCGCGACGCTGCCTCTAAACACGCGCGTTCTGGAAAGGAAATTACGGCAGCGTCGAGACGCTAAGGTCGCAGCGTCGAGACGCTGCGATGAT
TTCGCGTCTATCCAAATAAGGAATGCTAGCGTCGAGACGCCTAAGGGCTAG
Protein sequenceShow/hide protein sequence
MSYSGNRDWLRLSQFRGGVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENN
LYVLRPIESRAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLELKNMEYLSALGTPQQN
GVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHVKIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEA
GPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKR
DTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQKNVDEPCVYKRIINNSVAFLVLYV
DDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPERDAEITWVVSAERDAEIITTVVLIPPSPPSPNILDVTGIEDDPDLGGEEENWEERFLEFLEFRVK
GCSSVPRLRVGKPGALQNYSPSSKMSSLSLCGTLCSFDLQSLQEAFVLSLEGGFELEPSFLQENLSFNLQFSSRRCKDSVATLPLNTRVLERKLRQRRDAKVAASRRCDD
FASIQIRNASVETPKG