; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0097031 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0097031
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionBeta-galactosidase
Genome locationCMiso1.1chr04:11703269..11704891
RNA-Seq ExpressionCmc04g0097031
SyntenyCmc04g0097031
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005525 - GTP binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]8.0e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

KAA0045897.1 Beta-galactosidase [Cucumis melo var. makuwa]8.0e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

KAA0047763.1 Beta-galactosidase [Cucumis melo var. makuwa]6.2e-28881.05Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNPITSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

KAA0048203.1 Beta-galactosidase [Cucumis melo var. makuwa]8.0e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

KAA0056107.1 Beta-galactosidase [Cucumis melo var. makuwa]8.0e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

TrEMBL top hitse value%identityAlignment
A0A5A7TVI6 Beta-galactosidase3.9e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

A0A5A7TW20 Beta-galactosidase3.0e-28881.05Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNPITSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

A0A5A7TX68 Beta-galactosidase3.9e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

A0A5A7UNC5 Beta-galactosidase3.9e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

A0A5A7VLQ7 Beta-galactosidase3.9e-28880.89Show/hide
Query:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL
        MYSKNP+TSFPN QSNYITGSL    GNFS EKLNGQNYFSWSQSI+MFLEGR+QF FLTGETVRP P DALERLWKGEDSLIRSMLINSMEPQI KPLL
Subjt:  MYSKNPITSFPNLQSNYITGSL----GNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLL

Query:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI
        YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCG I
Subjt:  YAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHI

Query:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------
        LGQRPLPSLMEVCFEVRLEED TNAMGVLTT                        +P               W                           
Subjt:  LGQRPLPSLMEVCFEVRLEEDHTNAMGVLTT------------------------LP---------------W---------------------------

Query:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
                          TPTLGAIAQSGMPQSLGLISVD KNPWILDSGATDHLTGS EHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN
Subjt:  ------------------TPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQN

Query:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF
        VLHV KLSYNLLSISKITRELHCKAIFLPESVYFQDMS+GRTIG   H RGLYILDDDTSCSSLSRVSLLSSYFSTSEQ+CMLWHFRLGH NFTYMQ+LF
Subjt:  VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF

Query:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK
        PHLFSK+DVSSLSCDVCIR KQHRVSFPSQP+KPTQ FNLIH+DVWGPSKVTTSSGKRWF+TFIDDHTRLTWVYLI+DK EVPSIFQNFYHTIKTQFHTK
Subjt:  PHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTK

Query:  IAILRSDNGREFQKHNLSEFLASKGIVH
        IAILRSDNGREFQ HNLSEFLASKGIVH
Subjt:  IAILRSDNGREFQKHNLSEFLASKGIVH

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-1927.03Show/hide
Query:  WILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDG---FALQNVLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGR
        ++LDSGA+DHL      +          KI +A       A K  IV         L++VL   + + NL+S+ ++             S+ F D S   
Subjt:  WILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDG---FALQNVLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQDMSTGR

Query:  TIGIVWHRRGLYILDDDTSCSSLSRVSLLS-SYFSTSEQNCMLWHFRLGHLNFTYM-------QYLFPHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFK
          G+   + GL ++ +    +++  ++  + S  +  + N  LWH R GH++   +        +    L + L++S   C+ C+  KQ R+ F     K
Subjt:  TIGIVWHRRGLYILDDDTSCSSLSRVSLLS-SYFSTSEQNCMLWHFRLGHLNFTYM-------QYLFPHLFSKLDVSSLSCDVCIRTKQHRVSFPSQPFK

Query:  ---PTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQKHNLSEFLASKGI
              LF ++H+DV GP    T   K +F+ F+D  T     YLI  K +V S+FQ+F    +  F+ K+  L  DNGRE+  + + +F   KGI
Subjt:  ---PTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQKHNLSEFLASKGI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-2729.1Show/hide
Query:  KNPWILDSGATDHLTGSLEHFISYAPCAGN-EKIRIADGSLAPIAGKGQIVPFDG----FALQNVLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQD
        ++ W++D+ A+ H T   + F  Y   AG+   +++ + S + IAG G I           L++V HV  L  NL+S   + R+ +          YF +
Subjt:  KNPWILDSGATDHLTGSLEHFISYAPCAGN-EKIRIADGSLAPIAGKGQIVPFDG----FALQNVLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQD

Query:  MSTGRTIGIVWHRRG-----LYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF-PHLFSKLDVSSLS-CDVCIRTKQHRVSFPS
             T G +   +G     LY  + +     L+         +  E +  LWH R+GH++   +Q L    L S    +++  CD C+  KQHRVSF +
Subjt:  MSTGRTIGIVWHRRG-----LYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLF-PHLFSKLDVSSLS-CDVCIRTKQHRVSFPS

Query:  QPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQKHNLSEFLASKGIVH
           +   + +L+++DV GP ++ +  G ++F+TFIDD +R  WVY++  K +V  +FQ F+  ++ +   K+  LRSDNG E+      E+ +S GI H
Subjt:  QPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQKHNLSEFLASKGIVH

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein8.2e-1726.3Show/hide
Query:  ILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSL--APIAGKGQIVPFDGFALQN-------VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQD
        ++DSGA+  L  S  H++ +A    N +I I D      PI   G +     F  QN        LH   ++Y+LLS+S++T + +  A F   ++   +
Subjt:  ILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSL--APIAGKGQIVPFDGFALQN-------VLHVSKLSYNLLSISKITRELHCKAIFLPESVYFQD

Query:  MSTGRTIG-IVWHRRGLYILDDDTSCSSLSRVSL--LSSYFSTSEQNCMLWHFRLGHLNFTYMQ---------YLFPHLFSKLDVSSLSCDVCI---RTK
         S G  +  IV H    ++       S +S++++  ++   S ++    L H  LGH NF  +Q         YL        + S+  C  C+    TK
Subjt:  MSTGRTIG-IVWHRRGLYILDDDTSCSSLSRVSL--LSSYFSTSEQNCMLWHFRLGHLNFTYMQ---------YLFPHLFSKLDVSSLSCDVCI---RTK

Query:  QHRVSFPSQPFKPT-QLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFE--VPSIFQNFYHTIKTQFHTKIAILRSDNGREFQKHNLS
           V      ++ + + F  +H D++GP      S   +FI+F D+ TR  WVY + D+ E  + ++F +    IK QF+ ++ +++ D G E+    L 
Subjt:  QHRVSFPSQPFKPT-QLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFE--VPSIFQNFYHTIKTQFHTKIAILRSDNGREFQKHNLS

Query:  EFLASKGI
        +F  ++GI
Subjt:  EFLASKGI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-3724.96Show/hide
Query:  KLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSP---RDALERL------WKGEDSLIRSMLINSMEPQIVKPLLYAATAKDLWDTTQTLYSKRQNAS
        KL   NY  WS+ +    +G     FL G T  P      DA  R+      WK +D LI S ++ ++   +   +  A TA  +W+T + +Y+   +  
Subjt:  KLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSP---RDALERL------WKGEDSLIRSMLINSMEPQIVKPLLYAATAKDLWDTTQTLYSKRQNAS

Query:  RLYTLRKQVHNCKQGTLDVTTY-------FNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHILGQRPLPSLMEVCFEV
         +  LR Q+    +GT  +  Y       F++L+LL + MD          +D       E+ +RV   L  L  ++  V   I  +   P+L E+   +
Subjt:  RLYTLRKQVHNCKQGTLDVTTY-------FNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHILGQRPLPSLMEVCFEV

Query:  RLEEDHTNAMGVLTTLPWT---------------------------------------------------PTLG----------------------AIAQ
           E    A+   T +P T                                                   P LG                      +   
Subjt:  RLEEDHTNAMGVLTTLPWT---------------------------------------------------PTLG----------------------AIAQ

Query:  SGMPQS----------LGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQI---VPFDGFALQNVLHVSKLSYNLLSI
        S  P S          L L S    N W+LDSGAT H+T    +   + P  G + + +ADGS  PI+  G            L N+L+V  +  NL+S+
Subjt:  SGMPQS----------LGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQI---VPFDGFALQNVLHVSKLSYNLLSI

Query:  SKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGH-----LNFTYMQYLFPHLFSKLDV
         ++         F P S   +D++TG  +     +  LY    +   +S   VSL +S   +S+     WH RLGH     LN     Y      S L+ 
Subjt:  SKITRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGH-----LNFTYMQYLFPHLFSKLDV

Query:  SS--LSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSD
        S   LSC  C+  K ++V F       T+    I++DVW  S + +    R+++ F+D  TR TW+Y +  K +V   F  F + ++ +F T+I    SD
Subjt:  SS--LSCDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSD

Query:  NGREFQKHNLSEFLASKGIVH
        NG EF    L E+ +  GI H
Subjt:  NGREFQKHNLSEFLASKGIVH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.9e-3524.27Show/hide
Query:  LQSNYITGSLGNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSP---RDALERL------WKGEDSLIRSMLINSMEPQIVKPLLYAATAKD
        + +N +  ++ N +  KL   NY  WS+ +    +G     FL G T  P      DA+ R+      W+ +D LI S ++ ++   +   +  A TA  
Subjt:  LQSNYITGSLGNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSP---RDALERL------WKGEDSLIRSMLINSMEPQIVKPLLYAATAKD

Query:  LWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMD-------------------LCRETVWDTPNDSTQ-YAKL--EEADRVYDF
        +W+T + +Y+   N S  +  +          L   T F++L+LL + MD                   + +    DTP   T+ + +L   E+  +   
Subjt:  LWDTTQTLYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMD-------------------LCRETVWDTPNDSTQ-YAKL--EEADRVYDF

Query:  LAGLNPKFDNVCGH--------------------------------------------ILGQRPLPSLM----EVCFEVRLEEDHTNAMGVLTTL-PWTP
         A + P   NV  H                                             LG+  + S+     + C ++   +  TN     +   PW P
Subjt:  LAGLNPKFDNVCGH--------------------------------------------ILGQRPLPSLM----EVCFEVRLEEDHTNAMGVLTTL-PWTP

Query:  TLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQI---VPFDGFALQNVLHVSKLSYNLLSISKI
                    +L + S    N W+LDSGAT H+T    +   + P  G + + IADGS  PI   G            L  VL+V  +  NL+S+ ++
Subjt:  TLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQI---VPFDGFALQNVLHVSKLSYNLLSISKI

Query:  TRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLFP-HLFSKLDVSS--LSC
                 F P S   +D++TG  +     +  LY    +   +S   VS+ +S  S +  +   WH RLGH +   +  +   H    L+ S   LSC
Subjt:  TRELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLFP-HLFSKLDVSS--LSC

Query:  DVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQK
          C   K H+V F +     ++    I++DVW  S + +    R+++ F+D  TR TW+Y +  K +V   F  F   ++ +F T+I  L SDNG EF  
Subjt:  DVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQK

Query:  HNLSEFLASKGIVH
          L ++L+  GI H
Subjt:  HNLSEFLASKGIVH

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.1e-1629.53Show/hide
Query:  NFSSEKL--NGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLLYAATAKDLWDTTQTLYSKRQNASRL
        +FS +KL  +  NY +W    R FL    +F F+ G   +P P   L + W+  ++++   L+NSM  ++++ ++YA TA  +W+  + ++    +  ++
Subjt:  NFSSEKL--NGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLLYAATAKDLWDTTQTLYSKRQNASRL

Query:  YTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMD----LCRETVWDTPNDSTQYA-KLEEADRVYDFLAG--LNPKFDNVCGHILGQRPLPSLME
        Y LR+++   +QG   V  YF KLS +W E+     +          + T+ A +  E ++ Y+FL G  LN  F+ V   I+ Q+P PSL E
Subjt:  YTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMD----LCRETVWDTPNDSTQYA-KLEEADRVYDFLAG--LNPKFDNVCGHILGQRPLPSLME

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.0e-0724.06Show/hide
Query:  NYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEP-QIVKPLLYAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCK
        NY +W +   +FL     F  +        P +A +  W+  D +++  L  ++ P Q     + ++T++D+W   +  +   ++A R   L  ++    
Subjt:  NYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEP-QIVKPLLYAATAKDLWDTTQTLYSKRQNASRLYTLRKQVHNCK

Query:  QGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADR--VYDFLAGLNPKFDNVCGHILGQRPLPSLMEVCFEVRLEED
         G + V  Y+ K+  L                DS +   +   DR  V   L GLNPKFDN+   I  ++P PS  +    ++ EED
Subjt:  QGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADR--VYDFLAGLNPKFDNVCGHILGQRPLPSLMEVCFEVRLEED

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.5e-0533.72Show/hide
Query:  SSYFSTSEQNCMLWHFRLGHLNFTYMQYLFPHLF-SKLDVSSLS-CDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTS
        S+   T++    LWH RL H++   M+ L    F     VSSL  C+ CI  K HRV+F +         + +H+D+WG   V  S
Subjt:  SSYFSTSEQNCMLWHFRLGHLNFTYMQYLFPHLF-SKLDVSSLS-CDVCIRTKQHRVSFPSQPFKPTQLFNLIHNDVWGPSKVTTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTCCAAGAACCCGATAACTTCGTTCCCTAATTTACAGTCAAATTATATAACTGGCTCTTTGGGGAATTTTTCAAGTGAAAAATTAAATGGTCAAAATTATTTTTC
TTGGTCTCAATCAATAAGGATGTTTCTCGAAGGTCGACACCAGTTCAGCTTCTTAACTGGAGAGACTGTACGTCCTTCACCAAGAGACGCCTTGGAACGACTCTGGAAAG
GAGAGGACTCACTTATTCGGTCCATGCTGATTAATAGTATGGAACCACAGATCGTCAAGCCTCTACTATATGCAGCCACAGCAAAAGATTTGTGGGATACAACTCAGACC
CTTTACTCGAAACGACAGAATGCCTCTCGGTTATATACACTGCGAAAACAGGTCCATAATTGCAAACAAGGGACCCTGGACGTAACTACCTATTTTAACAAGCTCTCTCT
CCTCTGGCAAGAGATGGATTTGTGCAGAGAGACAGTTTGGGACACACCAAATGACAGTACACAATATGCTAAACTTGAAGAGGCTGACCGTGTTTATGACTTCCTTGCAG
GACTTAATCCCAAATTTGATAATGTTTGTGGTCATATACTCGGACAAAGACCTCTTCCCTCCCTAATGGAAGTTTGTTTTGAAGTCCGCCTGGAAGAGGATCACACTAAT
GCCATGGGTGTATTGACTACCCTACCATGGACACCGACTCTGGGTGCCATTGCTCAGTCAGGTATGCCTCAGTCCCTTGGGCTTATTAGCGTTGATAGGAAGAATCCCTG
GATCTTAGACTCGGGGGCTACAGATCACTTGACAGGTTCTTTGGAACACTTTATCTCATATGCCCCGTGTGCCGGTAATGAAAAAATTCGAATAGCCGATGGCTCTCTAG
CTCCGATCGCTGGCAAAGGACAAATAGTTCCCTTTGACGGTTTTGCTCTCCAGAATGTTTTACATGTCTCTAAATTGTCTTACAATTTGTTATCTATAAGCAAGATCACT
CGTGAGTTGCATTGTAAAGCTATCTTCTTACCTGAATCGGTTTATTTTCAGGACATGAGCACGGGGAGGACGATTGGCATTGTCTGGCATAGGAGGGGACTTTACATCCT
TGATGATGATACCTCATGTAGTAGTTTGTCTAGGGTTAGTTTACTGTCATCCTACTTTAGCACTTCTGAACAAAACTGTATGTTGTGGCATTTTCGACTGGGCCACCTAA
ACTTTACATATATGCAATATTTATTTCCCCACCTTTTTTCTAAACTTGATGTCTCTTCTCTATCTTGTGATGTGTGTATCCGGACTAAACAACATCGAGTCTCTTTTCCC
TCACAACCATTTAAACCTACACAACTGTTTAACCTCATCCATAATGACGTTTGGGGTCCTTCCAAGGTCACCACCTCCTCGGGAAAGCGGTGGTTTATAACTTTCATTGA
TGACCATACCCGTCTTACCTGGGTATACCTTATCACAGATAAATTCGAGGTTCCATCCATTTTCCAAAACTTCTATCATACTATCAAAACACAATTTCATACAAAAATTG
CAATTCTTCGAAGTGATAATGGTCGGGAATTCCAAAAACATAACCTTAGTGAATTTCTAGCCTCCAAGGGGATTGTTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATTCCAAGAACCCGATAACTTCGTTCCCTAATTTACAGTCAAATTATATAACTGGCTCTTTGGGGAATTTTTCAAGTGAAAAATTAAATGGTCAAAATTATTTTTC
TTGGTCTCAATCAATAAGGATGTTTCTCGAAGGTCGACACCAGTTCAGCTTCTTAACTGGAGAGACTGTACGTCCTTCACCAAGAGACGCCTTGGAACGACTCTGGAAAG
GAGAGGACTCACTTATTCGGTCCATGCTGATTAATAGTATGGAACCACAGATCGTCAAGCCTCTACTATATGCAGCCACAGCAAAAGATTTGTGGGATACAACTCAGACC
CTTTACTCGAAACGACAGAATGCCTCTCGGTTATATACACTGCGAAAACAGGTCCATAATTGCAAACAAGGGACCCTGGACGTAACTACCTATTTTAACAAGCTCTCTCT
CCTCTGGCAAGAGATGGATTTGTGCAGAGAGACAGTTTGGGACACACCAAATGACAGTACACAATATGCTAAACTTGAAGAGGCTGACCGTGTTTATGACTTCCTTGCAG
GACTTAATCCCAAATTTGATAATGTTTGTGGTCATATACTCGGACAAAGACCTCTTCCCTCCCTAATGGAAGTTTGTTTTGAAGTCCGCCTGGAAGAGGATCACACTAAT
GCCATGGGTGTATTGACTACCCTACCATGGACACCGACTCTGGGTGCCATTGCTCAGTCAGGTATGCCTCAGTCCCTTGGGCTTATTAGCGTTGATAGGAAGAATCCCTG
GATCTTAGACTCGGGGGCTACAGATCACTTGACAGGTTCTTTGGAACACTTTATCTCATATGCCCCGTGTGCCGGTAATGAAAAAATTCGAATAGCCGATGGCTCTCTAG
CTCCGATCGCTGGCAAAGGACAAATAGTTCCCTTTGACGGTTTTGCTCTCCAGAATGTTTTACATGTCTCTAAATTGTCTTACAATTTGTTATCTATAAGCAAGATCACT
CGTGAGTTGCATTGTAAAGCTATCTTCTTACCTGAATCGGTTTATTTTCAGGACATGAGCACGGGGAGGACGATTGGCATTGTCTGGCATAGGAGGGGACTTTACATCCT
TGATGATGATACCTCATGTAGTAGTTTGTCTAGGGTTAGTTTACTGTCATCCTACTTTAGCACTTCTGAACAAAACTGTATGTTGTGGCATTTTCGACTGGGCCACCTAA
ACTTTACATATATGCAATATTTATTTCCCCACCTTTTTTCTAAACTTGATGTCTCTTCTCTATCTTGTGATGTGTGTATCCGGACTAAACAACATCGAGTCTCTTTTCCC
TCACAACCATTTAAACCTACACAACTGTTTAACCTCATCCATAATGACGTTTGGGGTCCTTCCAAGGTCACCACCTCCTCGGGAAAGCGGTGGTTTATAACTTTCATTGA
TGACCATACCCGTCTTACCTGGGTATACCTTATCACAGATAAATTCGAGGTTCCATCCATTTTCCAAAACTTCTATCATACTATCAAAACACAATTTCATACAAAAATTG
CAATTCTTCGAAGTGATAATGGTCGGGAATTCCAAAAACATAACCTTAGTGAATTTCTAGCCTCCAAGGGGATTGTTCACTAA
Protein sequenceShow/hide protein sequence
MYSKNPITSFPNLQSNYITGSLGNFSSEKLNGQNYFSWSQSIRMFLEGRHQFSFLTGETVRPSPRDALERLWKGEDSLIRSMLINSMEPQIVKPLLYAATAKDLWDTTQT
LYSKRQNASRLYTLRKQVHNCKQGTLDVTTYFNKLSLLWQEMDLCRETVWDTPNDSTQYAKLEEADRVYDFLAGLNPKFDNVCGHILGQRPLPSLMEVCFEVRLEEDHTN
AMGVLTTLPWTPTLGAIAQSGMPQSLGLISVDRKNPWILDSGATDHLTGSLEHFISYAPCAGNEKIRIADGSLAPIAGKGQIVPFDGFALQNVLHVSKLSYNLLSISKIT
RELHCKAIFLPESVYFQDMSTGRTIGIVWHRRGLYILDDDTSCSSLSRVSLLSSYFSTSEQNCMLWHFRLGHLNFTYMQYLFPHLFSKLDVSSLSCDVCIRTKQHRVSFP
SQPFKPTQLFNLIHNDVWGPSKVTTSSGKRWFITFIDDHTRLTWVYLITDKFEVPSIFQNFYHTIKTQFHTKIAILRSDNGREFQKHNLSEFLASKGIVH