; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010826 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010826
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr1:7324138..7331066
RNA-Seq ExpressionLag0010826
SyntenyLag0010826
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]6.7e-16863.37Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE
        M  ++A+SK +A+S++Y GPITRSRSK      +Q   IA+ +L+++ ES +  IV  EN + + + S S +  + + PDVMSVMM D    E  MAE+E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE

Query:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN
        +KI++L+K+V+E+D EI +L+ QM++R+ AESSQTP +K  DKGK + Q+    Q S S+ASLSVQQLQDMI NSIRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN

Query:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG
        LRMP GYQPPKFQQFDGKGNPKQHV HFVETCENAG+RGDQLVRQFVR+LKGNAF WYTDLEPE+I+SWEQL KEFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-
        EPV+DYINRWRALSLDCKD+LTELSAVEMC QGMHWGLLYILQGIKPR+FEELATRAHDMELSIA+RG KD +VP+ +K+KKE+K  +K  KST+KESM 
Subjt:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-

Query:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN
                                                           LLEKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKELIL 
Subjt:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN

Query:  NESEQ
           E+
Subjt:  NESEQ

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]3.3e-16763.17Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE
        M  ++A+SK +A+S++Y GPITRSRSK      +Q   IA+ +L+++ ES +  IV  EN + + + S S +  + + PDVMSVMM D    E  MAE+E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE

Query:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN
        +KI++L+K+V+E+D EI +L+ QM++R+ AESSQTP +K  DKGK + Q+    Q S S+ASLSVQQLQDMI +SIRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN

Query:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG
        LRMP GYQPPKFQQFDGKGNPKQHV HFVETCENAG+RGDQLVRQFVR+LKGNAF WYTDLEPE+I+SWEQL KEFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-
        EPV+DYINRWRALSLDCKD+LTELSAVEMC QGMHWGLLYILQGIKPR+FEELATRAHDMELSIA+RG KD +VP+ +K+KKE+K  +K  KST KESM 
Subjt:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-

Query:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN
                                                           LLEKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKELIL 
Subjt:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN

Query:  NESEQ
           E+
Subjt:  NESEQ

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]2.5e-16763.17Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE
        M  ++A+SK +A+S++Y GPITRSRSK      +Q   IA+ +L+++ ES +  IV  EN + + + S S +  + + PDVMSVMM D    E  MAE+E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE

Query:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN
        +KI++L+K+V+E+D EI +L+ QM++R+ AESSQTP +K  DKGK + Q+    Q S S+ASLSVQQLQDMI +SIRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN

Query:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG
        LRMP GYQPPKFQQFDGKGNPKQHV HFVETCENAG+RGDQLVRQFVR+LKGNAF WYTDLEPE+I+SWEQL KEFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-
        EPV+DYINRWRALSLDCKD+LTELSAVEMC QGMHWGLLYILQGIKPR+FEELATRAHDMELSIA+RG KD +VP+ +K+KKE+K  +K  KST+KESM 
Subjt:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-

Query:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN
                                                           LLEKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKELIL 
Subjt:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN

Query:  NESEQ
           E+
Subjt:  NESEQ

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]4.3e-16763.17Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE
        M  ++A+SK +A+S++Y GPITRSRSK      +Q   IA+ +L+++ ES +  IV  EN + + + S S +  + + PDVMSVMM D    E  MAE+E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE

Query:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN
        +KI++L+K+V+E+D EI +L+ QM++R+ AESSQTP +K  DKGK + Q+    Q S S+ASLSVQQLQDMI +SIRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN

Query:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG
        LRMP GYQPPKFQQFDGKGNPKQHV HFVETCENAG+RGDQLVRQFVR+LKGNAF WYTDLEPE+I+SWEQL KEFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-
        EPV+DYINRWRALSLDCKD+LTELSAVEMC QGMHWGLLYILQGIKPR+FEELATRAHDMELSIA+RG KD +VP+ +K+KKE+K  +K  KST KESM 
Subjt:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-

Query:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN
                                                           LLEKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKELIL 
Subjt:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN

Query:  NESEQ
           E+
Subjt:  NESEQ

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]6.7e-16863.37Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE
        M  ++A+SK +A+S++Y GPITRSRSK      +Q   IA+ +L+++ ES +  IV  EN + + + S S +  + + PDVMSVMM D    E  MAE+E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE

Query:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN
        +KI++L+K+V+E+D EI +L+ QM++R+ AESSQTP +K  DKGK + Q+    Q S S+ASLSVQQLQDMI NSIRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN

Query:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG
        LRMP GYQPPKFQQFDGKGNPKQHV HFVETCENAG+RGDQLVRQFVR+LKGNAF WYTDLEPE+I+SWEQL KEFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-
        EPV+DYINRWRALSLDCKD+LTELSAVEMC QGMHWGLLYILQGIKPR+FEELATRAHDMELSIA+RG KD +VP+ +K+KKE+K  +K  KST+KESM 
Subjt:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-

Query:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN
                                                           LLEKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKELIL 
Subjt:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN

Query:  NESEQ
           E+
Subjt:  NESEQ

TrEMBL top hitse value%identityAlignment
A0A5A7SXL9 Ty3-gypsy retrotransposon protein5.5e-15258.57Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTP--VIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAE
        M   +A+SK + +S+SY+G +T+S  K +  E  Q    V+ KK L ++ ES +  I+  +N + +     S   ++ S  +V+SVMM D  T+E  + E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTP--VIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAE

Query:  VEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRI
        +E+KI+ L+K++EE+D EI +L++QM++ +  ESSQTP +K  DKGK + Q+    Q S S+ASLSVQQLQDMI NSIRAQYGGP Q S +YSK YTKRI
Subjt:  VEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRI

Query:  DNLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQR
        DNLRMP GYQPPKFQQFDG+GNPKQH+ HFVETCENAG+RGDQLV+QFVR+LKGNAF WYTDLEPE ID+WEQL  EFLNRFYSTRR VSM ELT+TKQR
Subjt:  DNLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQR

Query:  KGEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKES
        KGEPV+DYINRWRALSLDCKDKLTELSAVEMC QGMHW LLYILQGIKPR+FEELATRAHDMELSIA +G KD +V   R ++ E+ +TKK   +   ES
Subjt:  KGEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKES

Query:  M-----------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKEL
        M                                                     L+EKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKEL
Subjt:  M-----------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKEL

Query:  IL
        IL
Subjt:  IL

A0A5A7TZU9 Ribonuclease H5.3e-16364.13Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPV-IAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPN-ETSSPDVMSVMMTDAGTSEERMAE
        MT +  +SK  A S+    P TRSRS+E Q  ++  P  +AK +  ++++  +  IV  EN  + +  S S++ N E   P++MSVM+TD  TSE+RMAE
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPV-IAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPN-ETSSPDVMSVMMTDAGTSEERMAE

Query:  VEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQD-QSQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRID
        +EKK++ML+K VEE+D EI  L+N +ESRD AESS T TIK  +KGK I Q+ Q Q+S SIASLSVQQLQ+MI NSI+ QYGGP Q   LYSKPYTKRID
Subjt:  VEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQD-QSQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRID

Query:  NLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRK
        N+RMPHGYQPPKFQQFDGKGNPKQHV HF+ETCE AGTRGD LV+QFVRTLKGNAF WYTDLEPE+IDSWEQL ++FLNRFYSTRR VSM ELT+TKQRK
Subjt:  NLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRK

Query:  GEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM
        GEPV+DYINRWRALSLDCKD+LTELSAVEMC QGMHWGLLYILQGIKPR+FEELATRAHDMELSIA RGN D +VP+ RKEKKEVK+T+K  K  TKE+M
Subjt:  GEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM

Query:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELIL
                                                           LLEKQLI+LPECKRP E GRVNDPNYCKYHRVISHPVEKCFVLKELIL
Subjt:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELIL

A0A5D3CD35 Ty3-gypsy retrotransposon protein2.8e-15660.2Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE
        M  ++ +S  + +S++Y GPIT S SK    E +Q   +A+ +L+++ ES ++ IV  EN +   F   S +  + + PDVM VMM D  T+E  MA++E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVE

Query:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQ--DQSQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN
        KKI+ L+K +EE D EI +LR QM +R+ AESSQTP +K  DKGK + Q     Q S SIASLSVQ+LQDMI NSIRAQYGGP Q + +YSKPYTKRIDN
Subjt:  KKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQ--DQSQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDN

Query:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG
        LRMP GYQPPKFQQFDGKGNPKQH+ HFVETC NAG+RGDQLVRQFVR+LKGNAF WYT+LE E ID+WEQL KEFL+RFYS RRTVSM ELT+TKQ+KG
Subjt:  LRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-
        EPV+DYINRWRALSLDCKD+LTELS+VEMC QGMHW LLYILQGIKPR+FEE ATRAHDMELSIA+RG KD  VP+ RK+KKE K+ +K  KST K SM 
Subjt:  EPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM-

Query:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN
                                                           LLEKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKEL L 
Subjt:  ---------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILN

Query:  NESEQ
           E+
Subjt:  NESEQ

A0A5D3D4X3 Ty3-gypsy retrotransposon protein2.1e-15161.75Show/hide
Query:  IAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETS-SPDVMSVMMTDAGTSEERMAEVEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPT
        +AK +  ++++  +  +V  ENL + K  S S++ NE +   ++MSVM+TD   SE+RMAE+EKK++ML+K+VEE+D EI  L+N +ESRD AESS   T
Subjt:  IAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETS-SPDVMSVMMTDAGTSEERMAEVEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPT

Query:  IK--DKGKIIAQD-QSQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDNLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTR
        +K  DKGK + Q+ Q Q+S SIASLSVQQLQ+MI +SI+ QYGGP Q   LYSKPYTKRIDNLRMP+GYQPPKFQQFDGKGNPKQHV HF+ETCE AGTR
Subjt:  IK--DKGKIIAQD-QSQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDNLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTR

Query:  GDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKGEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGL
        GD LV+QFVRTLKGNAF WY DLEPE+ID+WEQL ++FLNRFYSTRR VSM ELT+T+Q+KGE V+DYINRWRALSLDCKD+LTELSAVEMC QGMHWGL
Subjt:  GDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKGEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGL

Query:  LYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM---------------------------------------
        LYILQGIKPR+FEELATRAHDMELSIA RG KD ++P  R +K E+ +TKK   S  KESM                                       
Subjt:  LYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESM---------------------------------------

Query:  --------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILNNESEQ
                      LLE QLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKELIL    E+
Subjt:  --------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKELILNNESEQ

E5GCP6 Ty3-gypsy retrotransposon protein5.5e-15258.57Show/hide
Query:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTP--VIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAE
        M   +A+SK + +S+SY+G +T+S  K +  E  Q    V+ KK L ++ ES +  I+  +N + +     S   ++ S  +V+SVMM D  T+E  + E
Subjt:  MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTP--VIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAE

Query:  VEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRI
        +E+KI+ L+K++EE+D EI +L++QM++ +  ESSQTP +K  DKGK + Q+    Q S S+ASLSVQQLQDMI NSIRAQYGGP Q S +YSK YTKRI
Subjt:  VEKKISMLLKMVEEKDQEIVSLRNQMESRDVAESSQTPTIK--DKGKIIAQDQ--SQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRI

Query:  DNLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQR
        DNLRMP GYQPPKFQQFDG+GNPKQH+ HFVETCENAG+RGDQLV+QFVR+LKGNAF WYTDLEPE ID+WEQL  EFLNRFYSTRR VSM ELT+TKQR
Subjt:  DNLRMPHGYQPPKFQQFDGKGNPKQHVVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQR

Query:  KGEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKES
        KGEPV+DYINRWRALSLDCKDKLTELSAVEMC QGMHW LLYILQGIKPR+FEELATRAHDMELSIA +G KD +V   R ++ E+ +TKK   +   ES
Subjt:  KGEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGMHWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKES

Query:  M-----------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKEL
        M                                                     L+EKQLI+LPECKRPE+ G+V+DPNYCKYHRVISHPVEKCFVLKEL
Subjt:  M-----------------------------------------------------LLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKEL

Query:  IL
        IL
Subjt:  IL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06790.1 RNA polymerase Rpb7 N-terminal domain-containing protein8.5e-0459.46Show/hide
Query:  GADCYLASVVFTLIVFRPFVGEIILGKVKESDVNGLR
        GA  Y   V   ++VFRPFVGE+I  K KESD NGLR
Subjt:  GADCYLASVVFTLIVFRPFVGEIILGKVKESDVNGLR

AT1G06790.2 RNA polymerase Rpb7 N-terminal domain-containing protein8.5e-0459.46Show/hide
Query:  GADCYLASVVFTLIVFRPFVGEIILGKVKESDVNGLR
        GA  Y   V   ++VFRPFVGE+I  K KESD NGLR
Subjt:  GADCYLASVVFTLIVFRPFVGEIILGKVKESDVNGLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTTCGAAGAAGCTTCTTCAAAGATCGCTGCTTCAAGCAACTCCTATGTGGGGCCAATTACTCGCAGTCGCTCCAAGGAAACCCAACACGAAGACAACCAAACTCC
TGTCATTGCAAAAAAGATGCTGAGACGCATGACAGAATCTGCTCAACTTGAAATCGTCGCTGCAGAAAATTTGATGCTTAGCAAATTCATCTCTTATTCTCAGCAACCAA
ACGAAACATCTAGTCCTGATGTGATGTCTGTTATGATGACTGATGCTGGAACTAGTGAAGAACGAATGGCTGAAGTTGAGAAGAAAATCAGCATGCTACTGAAGATGGTC
GAAGAGAAAGACCAAGAAATCGTCTCTCTCAGAAACCAGATGGAAAGTCGAGATGTTGCTGAATCGAGTCAGACCCCTACCATCAAAGATAAGGGAAAGATCATTGCACA
AGATCAGTCACAACATTCCGCTTCTATTGCTTCACTATCCGTTCAGCAGCTCCAGGATATGATCATGAATTCCATCAGAGCTCAATATGGTGGACCGACTCAGAACTCTC
TCTTATATTCCAAGCCGTATACCAAAAGAATCGACAATCTTAGAATGCCTCATGGATACCAGCCACCAAAGTTTCAACAATTCGACGGGAAGGGCAATCCCAAGCAGCAC
GTTGTTCATTTCGTCGAGACATGTGAAAACGCTGGTACTAGAGGAGACCAACTAGTCAGGCAGTTTGTACGAACACTGAAGGGGAATGCTTTCTACTGGTATACCGATCT
TGAGCCTGAGACTATCGATAGTTGGGAACAACTTGCGAAGGAGTTCCTCAACCGCTTCTACAGCACAAGACGAACCGTTAGTATGACGGAGCTCACTAGCACTAAACAAC
GGAAAGGTGAACCGGTTCTTGACTACATCAACCGTTGGAGAGCTTTGAGTTTAGATTGCAAGGACAAACTTACCGAATTGTCTGCTGTTGAAATGTGCAACCAAGGCATG
CATTGGGGACTCCTCTATATCCTGCAAGGAATAAAGCCCCGCTCTTTCGAGGAGTTGGCAACCCGCGCTCATGACATGGAATTGAGCATTGCTACCAGAGGGAACAAAGA
TTCTATTGTCCCCGATGGCAGGAAAGAAAAGAAAGAAGTTAAAAATACTAAGAAAACTACCAAAAGCACTACGAAAGAGTCAATGCTCCTGGAAAAACAACTTATTGAGC
TACCAGAATGCAAGCGACCAGAAGAGACAGGGAGAGTTAATGATCCAAACTACTGCAAATATCACAGAGTCATCAGCCACCCGGTGGAAAAGTGCTTCGTTCTGAAGGAG
CTAATTTTGAACAACGAATCTGAACAACCGAGTAACTCCAAGCCCGGTTCAGGATTCAGTAGCAAGTTTGATCCAGTTCGGGACATTGGAGCCGATTGTTATTTGGCTTC
AGTTGTGTTCACATTGATTGTGTTTCGTCCGTTCGTGGGGGAGATAATTCTTGGAAAAGTTAAAGAATCAGATGTAAATGGTTTGCGAAGTTCTTCAAGTTCGATGTACG
AGCAAGATTACGCTGACAAGAAAGGAAGCTGCATCGATGGCTCTTCTTTGGAGCTGCTGGTCGTCGACGACGACTACACAAGAGAGAGAGATCGCCACATCTGTGTTGTT
GTAGCTCGAAAAAAACCCACTGCTCTAGATCCGGCGACTGTTTTGTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTTTCGAAGAAGCTTCTTCAAAGATCGCTGCTTCAAGCAACTCCTATGTGGGGCCAATTACTCGCAGTCGCTCCAAGGAAACCCAACACGAAGACAACCAAACTCC
TGTCATTGCAAAAAAGATGCTGAGACGCATGACAGAATCTGCTCAACTTGAAATCGTCGCTGCAGAAAATTTGATGCTTAGCAAATTCATCTCTTATTCTCAGCAACCAA
ACGAAACATCTAGTCCTGATGTGATGTCTGTTATGATGACTGATGCTGGAACTAGTGAAGAACGAATGGCTGAAGTTGAGAAGAAAATCAGCATGCTACTGAAGATGGTC
GAAGAGAAAGACCAAGAAATCGTCTCTCTCAGAAACCAGATGGAAAGTCGAGATGTTGCTGAATCGAGTCAGACCCCTACCATCAAAGATAAGGGAAAGATCATTGCACA
AGATCAGTCACAACATTCCGCTTCTATTGCTTCACTATCCGTTCAGCAGCTCCAGGATATGATCATGAATTCCATCAGAGCTCAATATGGTGGACCGACTCAGAACTCTC
TCTTATATTCCAAGCCGTATACCAAAAGAATCGACAATCTTAGAATGCCTCATGGATACCAGCCACCAAAGTTTCAACAATTCGACGGGAAGGGCAATCCCAAGCAGCAC
GTTGTTCATTTCGTCGAGACATGTGAAAACGCTGGTACTAGAGGAGACCAACTAGTCAGGCAGTTTGTACGAACACTGAAGGGGAATGCTTTCTACTGGTATACCGATCT
TGAGCCTGAGACTATCGATAGTTGGGAACAACTTGCGAAGGAGTTCCTCAACCGCTTCTACAGCACAAGACGAACCGTTAGTATGACGGAGCTCACTAGCACTAAACAAC
GGAAAGGTGAACCGGTTCTTGACTACATCAACCGTTGGAGAGCTTTGAGTTTAGATTGCAAGGACAAACTTACCGAATTGTCTGCTGTTGAAATGTGCAACCAAGGCATG
CATTGGGGACTCCTCTATATCCTGCAAGGAATAAAGCCCCGCTCTTTCGAGGAGTTGGCAACCCGCGCTCATGACATGGAATTGAGCATTGCTACCAGAGGGAACAAAGA
TTCTATTGTCCCCGATGGCAGGAAAGAAAAGAAAGAAGTTAAAAATACTAAGAAAACTACCAAAAGCACTACGAAAGAGTCAATGCTCCTGGAAAAACAACTTATTGAGC
TACCAGAATGCAAGCGACCAGAAGAGACAGGGAGAGTTAATGATCCAAACTACTGCAAATATCACAGAGTCATCAGCCACCCGGTGGAAAAGTGCTTCGTTCTGAAGGAG
CTAATTTTGAACAACGAATCTGAACAACCGAGTAACTCCAAGCCCGGTTCAGGATTCAGTAGCAAGTTTGATCCAGTTCGGGACATTGGAGCCGATTGTTATTTGGCTTC
AGTTGTGTTCACATTGATTGTGTTTCGTCCGTTCGTGGGGGAGATAATTCTTGGAAAAGTTAAAGAATCAGATGTAAATGGTTTGCGAAGTTCTTCAAGTTCGATGTACG
AGCAAGATTACGCTGACAAGAAAGGAAGCTGCATCGATGGCTCTTCTTTGGAGCTGCTGGTCGTCGACGACGACTACACAAGAGAGAGAGATCGCCACATCTGTGTTGTT
GTAGCTCGAAAAAAACCCACTGCTCTAGATCCGGCGACTGTTTTGTTGTAG
Protein sequenceShow/hide protein sequence
MTFEEASSKIAASSNSYVGPITRSRSKETQHEDNQTPVIAKKMLRRMTESAQLEIVAAENLMLSKFISYSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLKMV
EEKDQEIVSLRNQMESRDVAESSQTPTIKDKGKIIAQDQSQHSASIASLSVQQLQDMIMNSIRAQYGGPTQNSLLYSKPYTKRIDNLRMPHGYQPPKFQQFDGKGNPKQH
VVHFVETCENAGTRGDQLVRQFVRTLKGNAFYWYTDLEPETIDSWEQLAKEFLNRFYSTRRTVSMTELTSTKQRKGEPVLDYINRWRALSLDCKDKLTELSAVEMCNQGM
HWGLLYILQGIKPRSFEELATRAHDMELSIATRGNKDSIVPDGRKEKKEVKNTKKTTKSTTKESMLLEKQLIELPECKRPEETGRVNDPNYCKYHRVISHPVEKCFVLKE
LILNNESEQPSNSKPGSGFSSKFDPVRDIGADCYLASVVFTLIVFRPFVGEIILGKVKESDVNGLRSSSSSMYEQDYADKKGSCIDGSSLELLVVDDDYTRERDRHICVV
VARKKPTALDPATVLL