; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G04400 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G04400
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr04:15204329..15206775
RNA-Seq ExpressionClc04G04400
SyntenyClc04G04400
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]2.7e-18347.97Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        I  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGTRS TK+P+ +++SY+NLS +FRAFTA+LD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

KAA0061447.1 Beta-galactosidase [Cucumis melo var. makuwa]5.5e-18448.57Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP      
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDPDNTIICVENDIEHMRVIMSQKPTTE
                                            P    +RRNLRKE+  P    PA +  +EP + Q+    D T          +R+  S     +
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDPDNTIICVENDIEHMRVIMSQKPTTE

Query:  EESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
          + K ++Y  SLD+P ALRKGTRS TK+P+ +++SY+NLS +FRAFTA+LD+  IPKNI+ A+E  E
Subjt:  EESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

KAA0061456.1 Beta-galactosidase [Cucumis melo var. makuwa]4.2e-18448.1Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        +  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGTRS TKYP+ +++SY+NLS +FRAFTASLD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

TYK03453.1 Beta-galactosidase [Cucumis melo var. makuwa]3.5e-18347.97Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F VCLEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRP  GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        I  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGT+S TK+P+ +++SY+NLS +FRAFTASLD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]3.5e-18347.97Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        I  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGTRS TK+P+ +++SY+NLS +FRAFTA+LD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

TrEMBL top hitse value%identityAlignment
A0A5A7SQW1 Beta-galactosidase1.3e-18347.97Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        I  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGTRS TK+P+ +++SY+NLS +FRAFTA+LD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

A0A5A7V3J5 Beta-galactosidase2.6e-18448.57Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP      
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDPDNTIICVENDIEHMRVIMSQKPTTE
                                            P    +RRNLRKE+  P    PA +  +EP + Q+    D T          +R+  S     +
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDPDNTIICVENDIEHMRVIMSQKPTTE

Query:  EESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
          + K ++Y  SLD+P ALRKGTRS TK+P+ +++SY+NLS +FRAFTA+LD+  IPKNI+ A+E  E
Subjt:  EESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

A0A5A7V4Q4 Beta-galactosidase2.0e-18448.1Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        +  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGTRS TKYP+ +++SY+NLS +FRAFTASLD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

A0A5D3BUT8 Beta-galactosidase1.7e-18347.97Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F VCLEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRP  GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        I  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGT+S TK+P+ +++SY+NLS +FRAFTASLD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

A0A5D3E603 Beta-galactosidase1.7e-18347.97Show/hide
Query:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS
        MDL RE +WD P       KL +   V  FLAGLN KFD + GRILGQR  P+LMEV F V LEEDR++AM ++ TP   S AFSA+SS    DK NGKS
Subjt:  MDLGRELIWDCPCGEFNIIKLRKL-TVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKS

Query:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL
          VCEHCKK WHTKDQCW+LHGRPP GK+RS N+K N  R  +SET                                               +GATDHL
Subjt:  PLVCEHCKKPWHTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSET-----------------------------------------------AGATDHL

Query:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG
        TGS +HF+SY PCAGNEKIRIADGS AP+A KG I PFDG  LQNVLH+PK+SYNL S+SKITR+L+C   F  + V+                 GL + 
Subjt:  TGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVF----------------SGLELG

Query:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS
        DDD  C  +   +SL+                                           L+   V G            FVTFIDDHTRLTWV+L++DKS
Subjt:  DDDWHCPIIGDSISLM-----------------------------------------MILLLGVVIG-----------LFVTFIDDHTRLTWVFLLTDKS

Query:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR
        EV SIFQ FY TI+TQF+ KIAI RSDNGREF  + L EFL+ +GIVHQ  CAYTPQQNGVAERKNRHL+EVA SLMLSTS PSYLWGDA+LTA HLINR
Subjt:  EVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINR

Query:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----
        MPSR+LHLQTPL+CLKESYP+T  + +VPL VFGC A+VH+  PNQTKFT   +  VFVGYPLHQ GYKCFHP S+KYF++MD+TF +++P+FP+     
Subjt:  MPSRVLHLQTPLECLKESYPTT--LPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIG----

Query:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---
                                            P    +RRNLRKE+  P    PA +  +EP + Q   +P     +NT        I  +EN   
Subjt:  ------------------------------------PLDNLHRRNLRKEIVFPPDS-PASILAYEPTQAQDTTDP-----DNT--------IICVEN---

Query:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE
            D   +R+  S     +  + K ++Y  SLD+P ALRKGTRS TK+P+ +++SY+NLS +FRAFTA+LD+  IPKNI+ A+E  E
Subjt:  ----DIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.5e-2734.17Show/hide
Query:  FVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLS
        FV F+D  T     +L+  KS+V S+FQ F    E  FN K+     DNGRE+L+N + +F   +GI +     +TPQ NGV+ER  R + E A +++  
Subjt:  FVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLS

Query:  TSHPSYLWGDAVLTATHLINRMPSRVL--HLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKF--THVLRVFVGYPLHQRGYKCFHPTSQKYF
               WG+AVLTAT+LINR+PSR L    +TP E      P       L VFG   +VH  +  Q KF       +FVGY     G+K +   ++K+ 
Subjt:  TSHPSYLWGDAVLTATHLINRMPSRVL--HLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKF--THVLRVFVGYPLHQRGYKCFHPTSQKYF

Query:  ISMDITFLKDKPFFPIGPLDNLHRRNLRKEIVFPPDSPAS
        ++ D+          +   + ++ R ++ E VF  DS  S
Subjt:  ISMDITFLKDKPFFPIGPLDNLHRRNLRKEIVFPPDSPAS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-3130.16Show/hide
Query:  FVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLS
        FVTFIDD +R  WV++L  K +V  +FQ+F+  +E +   K+   RSDNG E+ +    E+ S  GI H+     TPQ NGVAER NR ++E   S++  
Subjt:  FVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLS

Query:  TSHPSYLWGDAVLTATHLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKF--THVLRVFVGYPLHQRGYKCFHPTSQKYFIS
           P   WG+AV TA +LINR PS  L  + P           +    L VFGC AF H     +TK     +  +F+GY   + GY+ + P  +K   S
Subjt:  TSHPSYLWGDAVLTATHLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKF--THVLRVFVGYPLHQRGYKCFHPTSQKYFIS

Query:  MDITFLKDKPFFPIGPLDNLHRRNLRKEIVFPPDSPASILAYEPTQAQDTTDPDNTIICVENDIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKG
         D+ F + +        + +    +   +  P  S        PT A+ TTD       V    E    ++ Q    +E  ++ E      +    LR+ 
Subjt:  MDITFLKDKPFFPIGPLDNLHRRNLRKEIVFPPDSPASILAYEPTQAQDTTDPDNTIICVENDIEHMRVIMSQKPTTEEESDKPEDYYASLDMPNALRKG

Query:  TRSYTKYPMYSFLSY
         R   +   Y    Y
Subjt:  TRSYTKYPMYSFLSY

P47024 Transposon Ty4-J Gag-Pol polyprotein4.6e-0826.04Show/hide
Query:  IETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINRMPSRVLHLQTPL
        +ETQF+ K+    SD G EF  + + E+   +GI H          NG AER  R ++  A +L+  ++     W  AV +AT++ N +  +    + PL
Subjt:  IETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINRMPSRVLHLQTPL

Query:  ECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKFTHVLRVFVGYPLHQRGYKCFHPTSQKYFISMDIT
        + +    P T+  +    FG    + +H+  + K + +  + +    +  GYK F P+  K   S + T
Subjt:  ECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKFTHVLRVFVGYPLHQRGYKCFHPTSQKYFISMDIT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-3132.33Show/hide
Query:  ASKGHISP--FDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVFSGLELG---DDDWHCPIIGDSISLMMILLLGVVIGLFVTFIDDHTRLTW
        A  GH +P   + ++    L +   S+   S S    + +  V FS   + S   L     D W  PI+                  +V F+D  TR TW
Subjt:  ASKGHISP--FDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVFSGLELG---DDDWHCPIIGDSISLMMILLLGVVIGLFVTFIDDHTRLTW

Query:  VFLLTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVL
        ++ L  KS+V   F  F   +E +F  +I  F SDNG EF+   L E+ S  GI H     +TP+ NG++ERK+RH++E  L+L+   S P   W  A  
Subjt:  VFLLTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVL

Query:  TATHLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFP
         A +LINR+P+ +L L++P + L   + T+     L VFGC  +      NQ K     R  VF+GY L Q  Y C H  + + +IS  + F  D+  FP
Subjt:  TATHLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.1e-2931.65Show/hide
Query:  GH--ISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVFSGLELG---DDDWHCPIIGDSISLMMILLLGVVIGLFVTFIDDHTRLTWVFL
        GH  ++  + ++  + L +   S+ L S S    + +  V FS+  + S   L     D W  PI+  SI              +V F+D  TR TW++ 
Subjt:  GH--ISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNCHVAFSHDDVFSGLELG---DDDWHCPIIGDSISLMMILLLGVVIGLFVTFIDDHTRLTWVFL

Query:  LTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTAT
        L  KS+V   F  F + +E +F  +I    SDNG EF+   L ++LS  GI H     +TP+ NG++ERK+RH++E+ L+L+   S P   W  A   A 
Subjt:  LTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGIVHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTAT

Query:  HLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFP
        +LINR+P+ +L LQ+P + L    P       L VFGC  +      N+ K     +   F+GY L Q  Y C H  + + + S  + F  D+  FP
Subjt:  HLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKFTHVLR--VFVGYPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFP

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0434.25Show/hide
Query:  NRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVH
        NR ++E   S++     P     DA  TA H+IN+ PS  ++   P E   +S PT      L  FGC+A++H
Subjt:  NRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTGGGTCGGGAACTTATCTGGGACTGTCCTTGTGGGGAGTTCAATATTATAAAGTTGAGGAAATTGACTGTGTGTGCTTTCTTAGCAGGTCTAAACTCTAAATT
TGATGTCATTCGAGGTAGGATACTGGGACAGAGACTGACACCAACACTTATGGAAGTCTATTTCGGGGTTTGTCTTGAGGAAGACAGATCCAGTGCTATGAACATCATAG
CTACTCCCGTGACGGTCTCAGTTGCCTTCAGTGCTAAATCATCTGGTTCTACTAGAGACAAAAAAAATGGGAAATCGCCTTTAGTGTGTGAGCACTGTAAGAAACCCTGG
CATACGAAAGATCAGTGTTGGAGGTTACATGGTCGACCACCAAATGGCAAACGACGATCTCCGAATGACAAGCCTAATCCTAACCGAATTTTGGTGAGTGAAACTGCTGG
AGCTACAGATCATCTGACTGGATCCTTTGATCATTTCCTATCATATCACCCGTGTGCTGGTAATGAAAAAATCCGGATTGCTGATGGATCCTTTGCTCCAGTTGCGAGCA
AGGGTCACATTTCTCCCTTTGATGGCTTAGTATTACAGAATGTGCTGCACATCCCTAAAATATCTTACAATTTATTTTCTGTGAGTAAGATAACTAGAGATTTGAATTGT
CATGTGGCGTTCTCACATGATGATGTTTTTTCAGGACTTGAGCTCGGGGACGATGATTGGCACTGCCCAATAATAGGGGACTCTATTTCCTTGATGATGATACTTCTTCT
AGGCGTAGTTATAGGACTATTTGTGACCTTCATCGATGACCACACTCGCCTTACTTGGGTATTTCTCCTCACTGATAAATCTGAGGTCTCATCCATTTTTCAACAATTTT
ACACTACCATTGAGACTCAGTTCAATGCCAAAATTGCCATCTTTCGGAGTGACAATGGTCGAGAGTTCCTTACTAATACCCTTTGTGAGTTTCTGTCCATTGAAGGTATT
GTTCATCAGAACTTGTGTGCCTATACCCCTCAACAGAATGGAGTGGCTGAAAGGAAAAATCGTCATCTTCTCGAGGTTGCCTTATCTCTGATGCTGTCTACCTCTCATCC
GTCTTACTTGTGGGGGGATGCAGTTCTGACTGCCACTCATCTTATTAATCGGATGCCTTCTCGTGTCCTCCATCTTCAAACTCCTCTTGAATGCCTCAAAGAGTCTTACC
CTACCACGCTTCCTGATGTTCCCCTCTGGGTGTTTGGGTGCATTGCGTTTGTCCATAGTCATGACCCTAACCAGACTAAGTTTACCCATGTGCTCAGAGTCTTTGTTGGG
TATCCTCTCCACCAGCGAGGTTACAAATGCTTCCATCCCACTTCCCAAAAATACTTCATCTCTATGGATATCACCTTCCTTAAGGATAAACCCTTCTTTCCCATAGGTCC
CCTGGATAACTTACATAGGAGGAATCTCAGAAAGGAAATTGTGTTCCCTCCTGATTCGCCTGCTTCGATCCTAGCATATGAACCAACACAGGCTCAAGATACTACTGACC
CTGATAATACTATTATTTGTGTTGAAAATGATATTGAGCACATGAGAGTGATCATGTCTCAGAAACCTACTACTGAGGAGGAATCCGACAAACCAGAAGATTATTATGCT
TCTCTTGACATGCCCAATGCTCTGAGAAAGGGCACCAGATCCTACACCAAATATCCCATGTATAGCTTCCTCTCTTATAATAATCTGTCTTCTAAGTTTAGAGCATTTAC
TGCCAGCCTTGACACTGTAACGATACCAAAGAACATACATGTGGCTATGGAAATTCTTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCTGGGTCGGGAACTTATCTGGGACTGTCCTTGTGGGGAGTTCAATATTATAAAGTTGAGGAAATTGACTGTGTGTGCTTTCTTAGCAGGTCTAAACTCTAAATT
TGATGTCATTCGAGGTAGGATACTGGGACAGAGACTGACACCAACACTTATGGAAGTCTATTTCGGGGTTTGTCTTGAGGAAGACAGATCCAGTGCTATGAACATCATAG
CTACTCCCGTGACGGTCTCAGTTGCCTTCAGTGCTAAATCATCTGGTTCTACTAGAGACAAAAAAAATGGGAAATCGCCTTTAGTGTGTGAGCACTGTAAGAAACCCTGG
CATACGAAAGATCAGTGTTGGAGGTTACATGGTCGACCACCAAATGGCAAACGACGATCTCCGAATGACAAGCCTAATCCTAACCGAATTTTGGTGAGTGAAACTGCTGG
AGCTACAGATCATCTGACTGGATCCTTTGATCATTTCCTATCATATCACCCGTGTGCTGGTAATGAAAAAATCCGGATTGCTGATGGATCCTTTGCTCCAGTTGCGAGCA
AGGGTCACATTTCTCCCTTTGATGGCTTAGTATTACAGAATGTGCTGCACATCCCTAAAATATCTTACAATTTATTTTCTGTGAGTAAGATAACTAGAGATTTGAATTGT
CATGTGGCGTTCTCACATGATGATGTTTTTTCAGGACTTGAGCTCGGGGACGATGATTGGCACTGCCCAATAATAGGGGACTCTATTTCCTTGATGATGATACTTCTTCT
AGGCGTAGTTATAGGACTATTTGTGACCTTCATCGATGACCACACTCGCCTTACTTGGGTATTTCTCCTCACTGATAAATCTGAGGTCTCATCCATTTTTCAACAATTTT
ACACTACCATTGAGACTCAGTTCAATGCCAAAATTGCCATCTTTCGGAGTGACAATGGTCGAGAGTTCCTTACTAATACCCTTTGTGAGTTTCTGTCCATTGAAGGTATT
GTTCATCAGAACTTGTGTGCCTATACCCCTCAACAGAATGGAGTGGCTGAAAGGAAAAATCGTCATCTTCTCGAGGTTGCCTTATCTCTGATGCTGTCTACCTCTCATCC
GTCTTACTTGTGGGGGGATGCAGTTCTGACTGCCACTCATCTTATTAATCGGATGCCTTCTCGTGTCCTCCATCTTCAAACTCCTCTTGAATGCCTCAAAGAGTCTTACC
CTACCACGCTTCCTGATGTTCCCCTCTGGGTGTTTGGGTGCATTGCGTTTGTCCATAGTCATGACCCTAACCAGACTAAGTTTACCCATGTGCTCAGAGTCTTTGTTGGG
TATCCTCTCCACCAGCGAGGTTACAAATGCTTCCATCCCACTTCCCAAAAATACTTCATCTCTATGGATATCACCTTCCTTAAGGATAAACCCTTCTTTCCCATAGGTCC
CCTGGATAACTTACATAGGAGGAATCTCAGAAAGGAAATTGTGTTCCCTCCTGATTCGCCTGCTTCGATCCTAGCATATGAACCAACACAGGCTCAAGATACTACTGACC
CTGATAATACTATTATTTGTGTTGAAAATGATATTGAGCACATGAGAGTGATCATGTCTCAGAAACCTACTACTGAGGAGGAATCCGACAAACCAGAAGATTATTATGCT
TCTCTTGACATGCCCAATGCTCTGAGAAAGGGCACCAGATCCTACACCAAATATCCCATGTATAGCTTCCTCTCTTATAATAATCTGTCTTCTAAGTTTAGAGCATTTAC
TGCCAGCCTTGACACTGTAACGATACCAAAGAACATACATGTGGCTATGGAAATTCTTGAGTAG
Protein sequenceShow/hide protein sequence
MDLGRELIWDCPCGEFNIIKLRKLTVCAFLAGLNSKFDVIRGRILGQRLTPTLMEVYFGVCLEEDRSSAMNIIATPVTVSVAFSAKSSGSTRDKKNGKSPLVCEHCKKPW
HTKDQCWRLHGRPPNGKRRSPNDKPNPNRILVSETAGATDHLTGSFDHFLSYHPCAGNEKIRIADGSFAPVASKGHISPFDGLVLQNVLHIPKISYNLFSVSKITRDLNC
HVAFSHDDVFSGLELGDDDWHCPIIGDSISLMMILLLGVVIGLFVTFIDDHTRLTWVFLLTDKSEVSSIFQQFYTTIETQFNAKIAIFRSDNGREFLTNTLCEFLSIEGI
VHQNLCAYTPQQNGVAERKNRHLLEVALSLMLSTSHPSYLWGDAVLTATHLINRMPSRVLHLQTPLECLKESYPTTLPDVPLWVFGCIAFVHSHDPNQTKFTHVLRVFVG
YPLHQRGYKCFHPTSQKYFISMDITFLKDKPFFPIGPLDNLHRRNLRKEIVFPPDSPASILAYEPTQAQDTTDPDNTIICVENDIEHMRVIMSQKPTTEEESDKPEDYYA
SLDMPNALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTASLDTVTIPKNIHVAMEILE