; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025493 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025493
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:13880920..13890832
RNA-Seq ExpressionLag0025493
SyntenyLag0025493
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025314 - Domain of unknown function DUF4219
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
DAD45748.1 TPA_asm: hypothetical protein HUJ06_003978 [Nelumbo nucifera]4.9e-10544.17Show/hide
Query:  MNGG---NNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGK-----
        MNGG   +NICVDKL  +NYSYWKLCMEA+LQGQDLWDL++ D+  IP D  +N E ++KWK+KCGK LFALRT I KEYIEHV D+    +  +     
Subjt:  MNGG---NNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGK-----

Query:  --------------HLKGCLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRC
                       L G +    +Q   I   NLL   +EAL+KQM  N+     +VE A++   + K N S K   NDS+ +++EGQSKGN K C+RC
Subjt:  --------------HLKGCLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRC

Query:  GKPGHIK----RDCRAKVS--------------------------------------------------------------------------------Y
        GKPGHI+    R+CR  ++                                                                                +
Subjt:  GKPGHIK----RDCRAKVS--------------------------------------------------------------------------------Y

Query:  CNE---------------EGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDA
        C +               EG  NV +++    G+SL+DVYHVPGLKKNL SVSQI D GRYVLFGPN+V+I+ N+K  EAD+L TGKRK+SLYVLSA+DA
Subjt:  CNE---------------EGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDA

Query:  YVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFV
        YV++TGQN SAALWHARLGH+GYQLLQ+IS +KLLDG+P+FK+ HHD+VY                               MGPT+TPSYSG +YVM+FV
Subjt:  YVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFV

Query:  DDFSRFTWVYFLKAKSETFSKFV
        DDF RFTWVYFL+ KSE FSKF+
Subjt:  DDFSRFTWVYFLKAKSETFSKFV

KAA8537014.1 hypothetical protein F0562_029492 [Nyssa sinensis]2.7e-10356.95Show/hide
Query:  DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN
        DLWDL+ GDD  IP DT  N E ++KWK+KCGK LFALRT I +EYIEHVRDV    K G    G    +      GCS      + L  E         
Subjt:  DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN

Query:  NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL
          + HY   A V              ++++S    V                                +EG  NVK+D  N  GVSL+DVYHVPGLKKNL
Subjt:  NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL

Query:  VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV
         SVSQIAD+GRYVLF PNDVKI+ NIK  EADV+ TGKRKDSLYVLSASDAYVE+TGQ+ S  LWHARLGHVGYQLLQ+IS KKLLDGVPLFKEIH DVV
Subjt:  VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV

Query:  YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
         LGCQ+GKSHRLPFPNS NR    L +VHSDLMGPTRTPSY G  YVMV VD FSRFTWV+FL+ KSETFSKF+
Subjt:  YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

KAA8540328.1 hypothetical protein F0562_024753 [Nyssa sinensis]9.6e-10944.96Show/hide
Query:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------
        MNGG+++ +DKL  +NYSYWKLCMEA+LQGQDLWDL+SGDD  IP DT +N E +RKWK+KCGK LFALRTSI +EYI+HVRD                 
Subjt:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------

Query:  ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
                                   + +++ C +              L+  L++  R          QG     SI  +  L   +EALMKQM  +N
Subjt:  ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN

Query:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
        KQ   +VE A+Y KDK K NS  K SS D+K SK EGQS+GN + C+RCGK GH+KRDCR KV                                     
Subjt:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------

Query:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
                                +Y N                                                 +EG  NVK D  N  GVSL+DVY
Subjt:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY

Query:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
        HVPGLKKNL SVSQIAD GRYVLFGP+DVKI+ NIK  EADVL TGKRKDSLYVLSASDAYVE+ GQN S  LWHARLGHVGYQLL +IS KKLLDGVPL
Subjt:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL

Query:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV
        FKEIH DVV  GCQ+GKSHR PFPNS NR   AL +
Subjt:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV

KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis]2.7e-10342.02Show/hide
Query:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---
        MNGG+++ VDKL  +NYSY KLCMEA+LQGQ+LWDL+SGDD  I  DT +NVE +RKWK+K GK LFALRTSI +EYI+HVRD     +  K L+     
Subjt:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---

Query:  -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN
                LK    G +   +++L+ F                                                             +EALMKQ+  NN
Subjt:  -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN

Query:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
        KQ   +VE A+Y KDK K NS  K SS DSK SK +GQS+GN K  +RCGK GH+KRDC  KV                                     
Subjt:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------

Query:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
                                +Y N                                                 +EG  NVK D  NV+GVSL+DVY
Subjt:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY

Query:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
        HVP LKKNL SVSQI D GRYVLFGP+DVKI+ NIK  EADVL TGKRKDSLYVLSASDAYVE+TGQN S  LWHARLGHVGYQ LQ+IS KKLLDGVPL
Subjt:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL

Query:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
        FKEIH DVV  GCQ+GKSH LPF NS N+   AL +                                V+FL+ KSETFSKF+
Subjt:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

RWR74934.1 Integrase, catalytic core [Cinnamomum micranthum f. kanehirae]7.3e-11744.79Show/hide
Query:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------
        MNGG+N+ +DKL  +NY+YWKLCMEA+LQGQDLWDL+SGD+  IP DTS+N +  RKWK+KCGK LFALRTSI ++YI  VRDV+               
Subjt:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------

Query:  -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
                                     +++ C +              L   L++  R          QG     SI  +  L   +EAL+KQM  N+
Subjt:  -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN

Query:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------
        K+    VE A+Y KD+G  N   K+ S+D++ S  EG+ +GN KGCFRCG+ GHIKRDC A+V  + C + G      RV + +   NVA          
Subjt:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------

Query:  -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK
                                                                                             GVSL +VYHV GLKK
Subjt:  -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK

Query:  NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD
        NL SVSQI D  RYVLFGP +V+I+ NIK  EADVL TG+RK+SLYVLSASDAYVE+T QN+SA LWH+RLGHVGYQLLQ+IS KKLL+G+PLFKEIHHD
Subjt:  NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD

Query:  VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
        VV  GCQ+ KSHRLPFP S NR +  L +VHSDLMGPT+T SYS  RYVM+ VDDFSRFTWVYFL+ KSE FSKFV
Subjt:  VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

TrEMBL top hitse value%identityAlignment
A0A443N8T5 Integrase, catalytic core3.5e-11744.79Show/hide
Query:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------
        MNGG+N+ +DKL  +NY+YWKLCMEA+LQGQDLWDL+SGD+  IP DTS+N +  RKWK+KCGK LFALRTSI ++YI  VRDV+               
Subjt:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVN---------------

Query:  -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
                                     +++ C +              L   L++  R          QG     SI  +  L   +EAL+KQM  N+
Subjt:  -----------------------------LQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN

Query:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------
        K+    VE A+Y KD+G  N   K+ S+D++ S  EG+ +GN KGCFRCG+ GHIKRDC A+V  + C + G      RV + +   NVA          
Subjt:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV--SYCNEEG------RVNVKDDAPNVA----------

Query:  -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK
                                                                                             GVSL +VYHV GLKK
Subjt:  -------------------------------------------------------------------------------------GVSLEDVYHVPGLKK

Query:  NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD
        NL SVSQI D  RYVLFGP +V+I+ NIK  EADVL TG+RK+SLYVLSASDAYVE+T QN+SA LWH+RLGHVGYQLLQ+IS KKLL+G+PLFKEIHHD
Subjt:  NLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHD

Query:  VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
        VV  GCQ+ KSHRLPFP S NR +  L +VHSDLMGPT+T SYS  RYVM+ VDDFSRFTWVYFL+ KSE FSKFV
Subjt:  VVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

A0A5J5B3A5 Integrase catalytic domain-containing protein1.3e-10356.95Show/hide
Query:  DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN
        DLWDL+ GDD  IP DT  N E ++KWK+KCGK LFALRT I +EYIEHVRDV    K G    G    +      GCS      + L  E         
Subjt:  DLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGCLLKR---TRQGCSIWRMNLLKLFKEALMKQMFGN

Query:  NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL
          + HY   A V              ++++S    V                                +EG  NVK+D  N  GVSL+DVYHVPGLKKNL
Subjt:  NKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNL

Query:  VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV
         SVSQIAD+GRYVLF PNDVKI+ NIK  EADV+ TGKRKDSLYVLSASDAYVE+TGQ+ S  LWHARLGHVGYQLLQ+IS KKLLDGVPLFKEIH DVV
Subjt:  VSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVV

Query:  YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
         LGCQ+GKSHRLPFPNS NR    L +VHSDLMGPTRTPSY G  YVMV VD FSRFTWV+FL+ KSETFSKF+
Subjt:  YLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

A0A5J5B552 Uncharacterized protein1.1e-9152.94Show/hide
Query:  YSVKMNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKG
        + VKMN G+++ VDKL  +NYSYWKLC+EA+LQGQDLWDL+SGDD  IP DT +N E +RKWK+KCGK LFALRTSI +EYI+HVRD  L + C  H  G
Subjt:  YSVKMNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKG

Query:  CLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCN
                                                 A++ ++D+         ++++S    V                                
Subjt:  CLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKVSYCN

Query:  EEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHAR
        +E   NVK D  N  GVSL+DVYHVPGLKKNL SVSQIAD  RYVLFGP+DVKI+ NIK  EADVL TGKRKDSLYVLS SDAYVE+TGQN S  LWHAR
Subjt:  EEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHAR

Query:  LGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVAL
        LGHVGYQLLQ+IS KKLLD   LFKEIH +VV  GCQ+GKSHRLPFPNSNNR   AL
Subjt:  LGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVAL

A0A5J5BCB3 Uncharacterized protein4.6e-10944.96Show/hide
Query:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------
        MNGG+++ +DKL  +NYSYWKLCMEA+LQGQDLWDL+SGDD  IP DT +N E +RKWK+KCGK LFALRTSI +EYI+HVRD                 
Subjt:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRD-----------------

Query:  ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN
                                   + +++ C +              L+  L++  R          QG     SI  +  L   +EALMKQM  +N
Subjt:  ---------------------------VNLQSKCGK-------------HLKGCLLKRTR----------QGC----SIWRMNLLKLFKEALMKQMFGNN

Query:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
        KQ   +VE A+Y KDK K NS  K SS D+K SK EGQS+GN + C+RCGK GH+KRDCR KV                                     
Subjt:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------

Query:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
                                +Y N                                                 +EG  NVK D  N  GVSL+DVY
Subjt:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY

Query:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
        HVPGLKKNL SVSQIAD GRYVLFGP+DVKI+ NIK  EADVL TGKRKDSLYVLSASDAYVE+ GQN S  LWHARLGHVGYQLL +IS KKLLDGVPL
Subjt:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL

Query:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV
        FKEIH DVV  GCQ+GKSHR PFPNS NR   AL +
Subjt:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHV

A0A5J5C3K7 Uncharacterized protein1.3e-10342.02Show/hide
Query:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---
        MNGG+++ VDKL  +NYSY KLCMEA+LQGQ+LWDL+SGDD  I  DT +NVE +RKWK+K GK LFALRTSI +EYI+HVRD     +  K L+     
Subjt:  MNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRTSIGKEYIEHVRDVNLQSKCGKHLKGC---

Query:  -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN
                LK    G +   +++L+ F                                                             +EALMKQ+  NN
Subjt:  -------LLKRTRQGCSIWRMNLLKLF-------------------------------------------------------------KEALMKQMFGNN

Query:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------
        KQ   +VE A+Y KDK K NS  K SS DSK SK +GQS+GN K  +RCGK GH+KRDC  KV                                     
Subjt:  KQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKPGHIKRDCRAKV-------------------------------------

Query:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY
                                +Y N                                                 +EG  NVK D  NV+GVSL+DVY
Subjt:  ------------------------SYCN-------------------------------------------------EEGRVNVKDDAPNVAGVSLEDVY

Query:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL
        HVP LKKNL SVSQI D GRYVLFGP+DVKI+ NIK  EADVL TGKRKDSLYVLSASDAYVE+TGQN S  LWHARLGHVGYQ LQ+IS KKLLDGVPL
Subjt:  HVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALWHARLGHVGYQLLQRISMKKLLDGVPL

Query:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
        FKEIH DVV  GCQ+GKSH LPF NS N+   AL +                                V+FL+ KSETFSKF+
Subjt:  FKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-1327.49Show/hide
Query:  YCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALW
        Y  + G V +++D      ++LEDV        NL+SV ++ + G  + F  + V I  N       V+      +++ V++   AY       ++  LW
Subjt:  YCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAALW

Query:  HARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDV-VYLGCQFGKSHRLPFPNSNNRVAV--ALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYF
        H R GH+    L  I  K +     L   +     +   C  GK  RLPF    ++  +   L VVHSD+ GP    +     Y ++FVD F+ +   Y 
Subjt:  HARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDV-VYLGCQFGKSHRLPFPNSNNRVAV--ALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYF

Query:  LKAKSETFSKF
        +K KS+ FS F
Subjt:  LKAKSETFSKF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.2e-1826.87Show/hide
Query:  SIKRSSNDSKDSKVEGQSKGNFK----GCFRCGKPGHIKRDC----RAK----------------------VSYCNEE----------------------
        S +RSSN+   S   G+SK   K     C+ C +PGH KRDC    + K                      V + NEE                      
Subjt:  SIKRSSNDSKDSKVEGQSKGNFK----GCFRCGKPGHIKRDC----RAK----------------------VSYCNEE----------------------

Query:  ----------------GRVNVKDDA-PNVAGVS-------------LEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRK
                        G V + + +   +AG+              L+DV HVP L+ NL+S   +   G    F     ++        + V+  G  +
Subjt:  ----------------GRVNVKDDA-PNVAGVS-------------LEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRK

Query:  DSLYVLSASDAYVEQTGQND--SAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRT
         +LY  +A     E     D  S  LWH R+GH+  + LQ ++ K L+           D     C FGK HR+ F  S+ R    L +V+SD+ GP   
Subjt:  DSLYVLSASDAYVEQTGQND--SAALWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRT

Query:  PSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKF
         S  G +Y + F+DD SR  WVY LK K + F  F
Subjt:  PSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-1531.63Show/hide
Query:  VSLEDVYHVPGLKKNLVSVSQIAD-YGRYVLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ
        ++L ++ +VP + KNL+SV ++ +  G  V F P      + +K     V LL GK KD LY    AS   V       S A    WHARLGH    +L 
Subjt:  VSLEDVYHVPGLKKNLVSVSQIAD-YGRYVLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ

Query:  RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
         +     L    +    H  +    C   KS+++PF  S       L  ++SD+   +   S+   RY ++FVD F+R+TW+Y LK KS+    F+
Subjt:  RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.8e-1733.16Show/hide
Query:  VSLEDVYHVPGLKKNLVSVSQIADYGRY-VLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ
        + L  V +VP + KNL+SV ++ +  R  V F P      + +K     V LL GK KD LY    AS   V       S A    WH+RLGH    +L 
Subjt:  VSLEDVYHVPGLKKNLVSVSQIADYGRY-VLFGPNDVKIVYNIKQFEADV-LLTGKRKDSLYVLS-ASDAYVEQTGQNDSAAL---WHARLGHVGYQLLQ

Query:  RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV
         +     L   P+    H  +    C   KSH++PF NS    +  L  ++SD+   +   S    RY ++FVD F+R+TW+Y LK KS+    F+
Subjt:  RISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKFV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAGGATGGAGGGAGGGGCCGGAGGGTTGAAATCGGAGGAGGGAGGGGTCGGAGGGTCGAAATCGGAGGAGGGAGGTCCTTCCTCGTCGGAAACCTCCCTCTTCA
GTCGGTTTCGTTCCACGAACCCAAGCCCACGCCCAGCTTGAACGCCCAGCTCGCCGTCGTTCCACAAGCCCACGCCCAACTCGCCGCCGCTCGTTTGGTTTCGCCCAGAT
CGAACGCCCTCCCTCTTCAGTCAGGTTTGTTCCACGAACCCACGCCCACGCCCACGCCCACGCCCAGCTCGCCGTCGTTCCACGAACCCACGCTCAGCTCGCTGCCGCTC
GTTCGGTTTCGCTCAGATCGAATGTCAGATCGAACATCCCTTCTTCTTCTTCTTCGCCCAGATCTGAGGGAAGACGGTGTGGGTGAGCTGACTACGAACGGGCGGCGAAC
ATCAACCAACGAGGGACTACGAAGGGACGTGAACGAACGACGAAAGACTACAAATGGGCGGAACGATGGCGAAAAAATGAGAAGAAACGAAAAAAATGTTATCGTCGACG
ACGGCCGGTCGCCGACGGTTGCCGGCGAAGTTTCTCTCGGACTAGGAGAGGACGATGCGCCTATGTTCAAGACCCGGAATCAGCCCTTAAGGGAACACACATCTACTTAC
CCCAATAGGGGAAGGAGTGAATTCCATCTTGTAATGTTATGTTCCCAGCTCCCACTCGGTCTTGTCCCTAAAATGGATACCCCCACTCGCATGTCTACTACATGGATGTT
TTGGATCATTGCATCTTTATCGAATACAAAGCGGATCATATCACATAGTGTCACTAGGATAAGAGTCAATATGGATAGTATCCCAGAAGAAAGTTCGGTCCATGTTGTGG
AGAAATCAATCGGAGCTAAGAAAAGCAAAGTAGATCATTCTAGTCCAACCAAAAAGCGTAAGAATACTAAAATCTTTGTAGTTTGGGATCATTTTGAGAAAATTAAAGAT
TGTGATCCTAATGATCCTTATGCTAAGTGTAAATACTATGGGACTAAATATGCATGTCATTCCAAGCGTAATGAGCTTCCAGGTTCTTTATTTCGTGTTTGCTTATACTC
AGTGAAAATGAATGGTGGAAATAATATTTGTGTGGACAAGCTTACCAGTGATAATTATAGCTATTGGAAGCTATGTATGGAAGCTTTTCTACAAGGACAAGATTTGTGGG
ATCTTGTTTCCGGTGATGATACTAAAATTCCAAGAGATACTTCGGAAAATGTTGAGGCACAGAGGAAGTGGAAGGTCAAATGTGGCAAAGTTTTATTTGCTTTGCGAACT
TCTATTGGCAAGGAGTATATTGAGCATGTTCGTGATGTGAATCTCCAAAGCAAGTGTGGGAAACACTTGAAAGGTTGTTTGCTCAAAAGAACACGACAAGGTTGCAGTAT
TTGGAGAATGAACTTGCTGAAACTATTCAAGGAAGCGTTGATGAAACAAATGTTTGGCAACAACAAGCAAATTCACTATGAGGTGGAAGCTGCTGTTTATGCAAAAGATA
AAGGAAAAGTTAATTCTTCTATCAAGCGTTCTTCAAATGATAGCAAGGATTCCAAGGTTGAAGGGCAGTCCAAGGGCAATTTCAAAGGATGTTTTAGATGCGGCAAGCCA
GGACACATCAAACGTGATTGTCGGGCGAAGGTGAGTTATTGCAACGAAGAAGGGCGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGT
TTATCATGTTCCAGGCCTAAAGAAGAATTTGGTTTCAGTCTCTCAGATTGCTGATTATGGGAGGTATGTTCTCTTTGGTCCAAATGATGTGAAAATTGTTTATAATATTA
AGCAATTTGAAGCTGATGTTTTATTAACTGGAAAGAGGAAAGATTCTCTCTACGTTTTGTCTGCAAGTGATGCTTATGTTGAACAGACAGGTCAAAATGATAGTGCAGCA
CTTTGGCATGCTCGATTAGGCCATGTTGGTTATCAGTTACTACAAAGAATTTCTATGAAAAAGCTTCTTGACGGTGTTCCTCTCTTTAAGGAAATTCATCATGATGTGGT
TTATCTTGGTTGTCAATTTGGAAAATCACATCGTCTTCCTTTCCCAAATTCAAATAACAGGGTTGCTGTTGCATTGCATGTGGTTCATTCAGATTTGATGGGGCCAACTA
GAACACCCAGTTATTCTGGTTGTCGTTATGTGATGGTTTTTGTGGATGATTTTTCTCGATTCACGTGGGTGTATTTCTTGAAAGCTAAAAGTGAGACTTTCTCCAAGTTT
GTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAGGATGGAGGGAGGGGCCGGAGGGTTGAAATCGGAGGAGGGAGGGGTCGGAGGGTCGAAATCGGAGGAGGGAGGTCCTTCCTCGTCGGAAACCTCCCTCTTCA
GTCGGTTTCGTTCCACGAACCCAAGCCCACGCCCAGCTTGAACGCCCAGCTCGCCGTCGTTCCACAAGCCCACGCCCAACTCGCCGCCGCTCGTTTGGTTTCGCCCAGAT
CGAACGCCCTCCCTCTTCAGTCAGGTTTGTTCCACGAACCCACGCCCACGCCCACGCCCACGCCCAGCTCGCCGTCGTTCCACGAACCCACGCTCAGCTCGCTGCCGCTC
GTTCGGTTTCGCTCAGATCGAATGTCAGATCGAACATCCCTTCTTCTTCTTCTTCGCCCAGATCTGAGGGAAGACGGTGTGGGTGAGCTGACTACGAACGGGCGGCGAAC
ATCAACCAACGAGGGACTACGAAGGGACGTGAACGAACGACGAAAGACTACAAATGGGCGGAACGATGGCGAAAAAATGAGAAGAAACGAAAAAAATGTTATCGTCGACG
ACGGCCGGTCGCCGACGGTTGCCGGCGAAGTTTCTCTCGGACTAGGAGAGGACGATGCGCCTATGTTCAAGACCCGGAATCAGCCCTTAAGGGAACACACATCTACTTAC
CCCAATAGGGGAAGGAGTGAATTCCATCTTGTAATGTTATGTTCCCAGCTCCCACTCGGTCTTGTCCCTAAAATGGATACCCCCACTCGCATGTCTACTACATGGATGTT
TTGGATCATTGCATCTTTATCGAATACAAAGCGGATCATATCACATAGTGTCACTAGGATAAGAGTCAATATGGATAGTATCCCAGAAGAAAGTTCGGTCCATGTTGTGG
AGAAATCAATCGGAGCTAAGAAAAGCAAAGTAGATCATTCTAGTCCAACCAAAAAGCGTAAGAATACTAAAATCTTTGTAGTTTGGGATCATTTTGAGAAAATTAAAGAT
TGTGATCCTAATGATCCTTATGCTAAGTGTAAATACTATGGGACTAAATATGCATGTCATTCCAAGCGTAATGAGCTTCCAGGTTCTTTATTTCGTGTTTGCTTATACTC
AGTGAAAATGAATGGTGGAAATAATATTTGTGTGGACAAGCTTACCAGTGATAATTATAGCTATTGGAAGCTATGTATGGAAGCTTTTCTACAAGGACAAGATTTGTGGG
ATCTTGTTTCCGGTGATGATACTAAAATTCCAAGAGATACTTCGGAAAATGTTGAGGCACAGAGGAAGTGGAAGGTCAAATGTGGCAAAGTTTTATTTGCTTTGCGAACT
TCTATTGGCAAGGAGTATATTGAGCATGTTCGTGATGTGAATCTCCAAAGCAAGTGTGGGAAACACTTGAAAGGTTGTTTGCTCAAAAGAACACGACAAGGTTGCAGTAT
TTGGAGAATGAACTTGCTGAAACTATTCAAGGAAGCGTTGATGAAACAAATGTTTGGCAACAACAAGCAAATTCACTATGAGGTGGAAGCTGCTGTTTATGCAAAAGATA
AAGGAAAAGTTAATTCTTCTATCAAGCGTTCTTCAAATGATAGCAAGGATTCCAAGGTTGAAGGGCAGTCCAAGGGCAATTTCAAAGGATGTTTTAGATGCGGCAAGCCA
GGACACATCAAACGTGATTGTCGGGCGAAGGTGAGTTATTGCAACGAAGAAGGGCGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTCTCTTGAAGATGT
TTATCATGTTCCAGGCCTAAAGAAGAATTTGGTTTCAGTCTCTCAGATTGCTGATTATGGGAGGTATGTTCTCTTTGGTCCAAATGATGTGAAAATTGTTTATAATATTA
AGCAATTTGAAGCTGATGTTTTATTAACTGGAAAGAGGAAAGATTCTCTCTACGTTTTGTCTGCAAGTGATGCTTATGTTGAACAGACAGGTCAAAATGATAGTGCAGCA
CTTTGGCATGCTCGATTAGGCCATGTTGGTTATCAGTTACTACAAAGAATTTCTATGAAAAAGCTTCTTGACGGTGTTCCTCTCTTTAAGGAAATTCATCATGATGTGGT
TTATCTTGGTTGTCAATTTGGAAAATCACATCGTCTTCCTTTCCCAAATTCAAATAACAGGGTTGCTGTTGCATTGCATGTGGTTCATTCAGATTTGATGGGGCCAACTA
GAACACCCAGTTATTCTGGTTGTCGTTATGTGATGGTTTTTGTGGATGATTTTTCTCGATTCACGTGGGTGTATTTCTTGAAAGCTAAAAGTGAGACTTTCTCCAAGTTT
GTCTAG
Protein sequenceShow/hide protein sequence
MKEDGGRGRRVEIGGGRGRRVEIGGGRSFLVGNLPLQSVSFHEPKPTPSLNAQLAVVPQAHAQLAAARLVSPRSNALPLQSGLFHEPTPTPTPTPSSPSFHEPTLSSLPL
VRFRSDRMSDRTSLLLLLRPDLREDGVGELTTNGRRTSTNEGLRRDVNERRKTTNGRNDGEKMRRNEKNVIVDDGRSPTVAGEVSLGLGEDDAPMFKTRNQPLREHTSTY
PNRGRSEFHLVMLCSQLPLGLVPKMDTPTRMSTTWMFWIIASLSNTKRIISHSVTRIRVNMDSIPEESSVHVVEKSIGAKKSKVDHSSPTKKRKNTKIFVVWDHFEKIKD
CDPNDPYAKCKYYGTKYACHSKRNELPGSLFRVCLYSVKMNGGNNICVDKLTSDNYSYWKLCMEAFLQGQDLWDLVSGDDTKIPRDTSENVEAQRKWKVKCGKVLFALRT
SIGKEYIEHVRDVNLQSKCGKHLKGCLLKRTRQGCSIWRMNLLKLFKEALMKQMFGNNKQIHYEVEAAVYAKDKGKVNSSIKRSSNDSKDSKVEGQSKGNFKGCFRCGKP
GHIKRDCRAKVSYCNEEGRVNVKDDAPNVAGVSLEDVYHVPGLKKNLVSVSQIADYGRYVLFGPNDVKIVYNIKQFEADVLLTGKRKDSLYVLSASDAYVEQTGQNDSAA
LWHARLGHVGYQLLQRISMKKLLDGVPLFKEIHHDVVYLGCQFGKSHRLPFPNSNNRVAVALHVVHSDLMGPTRTPSYSGCRYVMVFVDDFSRFTWVYFLKAKSETFSKF
V