; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005665 (gene) of Snake gourd v1 genome

Gene IDTan0005665
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG06:68879090..68880406
RNA-Seq ExpressionTan0005665
SyntenyTan0005665
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025724 - GAG-pre-integrase domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]7.2e-7338.02Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FEV +F+G GDF LW+ K+ AI+ Q KV   +L+ + +P+NIT  + ++++++A+  I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR
                         ++ D+F                            E+Y +VK+A+K+G D+L+  IVL+A++ +  E+K+ +K + E L  RGR
Subjt:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR

Query:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD
        SE +K  + K R  RSKSKG S R+C+ C KEGH ++ CP  +++++S      +S  N+++ YN +                  LMVS+    D WI+D
Subjt:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD

Query:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL
        SGC+FHMTP+R++L +FQ   GGKV+LG+   C V+G  +V++  HDG +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L
Subjt:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL

Query:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        +  LY LEG+TV GS+ +      +   LWHKRL H+ E+G++ L +Q ++  +K
Subjt:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.7e-7438.46Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FEV +F+G GDF LW+ K+ AI+ Q KV   +L+ + +P+NIT  + ++++++A+S I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR
                         ++ D+F                            E+Y +VK+A+K+GRD+L+  IVL+A++ +  E+K+ +K + E L  RGR
Subjt:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR

Query:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD
        SE +K  + K R  RSKSKG S R+C+ C KEGH ++ CP  +++++S      +S  N+++ YN +                  LMVS+    D WI+D
Subjt:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD

Query:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL
        SGC+FHMTP+R++L +FQ   GGKV+LG+   C V+G  +V++  HDG +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L
Subjt:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL

Query:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        +  LY LEG+TV GS+ +          LWHKRL H+ E+G++ L +Q ++  +K
Subjt:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

KAA0067607.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]3.0e-7137.53Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FE+ +F+G GDF +W+ K+  I+ Q KV   +L+ +++P NIT  + ++++++ +S I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGRSENR
                     ++ D+F                            E+Y +VK+A+K+GRD+L+  IVL+A++ +  E+K+ +K + E L  RGRSE +
Subjt:  -------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGRSENR

Query:  KKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSE-----EYNHSLMVSNTREDDTWILDSGCSFHMTPNREWLEDFQ
             +   RSKSKG S R+C+ C K GH ++ CP  +++++S      +   N +E     E    LMVS+    D WI+DSGC+FHMTP+R++L +FQ
Subjt:  KKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSE-----EYNHSLMVSNTREDDTWILDSGCSFHMTPNREWLEDFQ

Query:  TGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDL
         G GGKV+LG+   C V+   +V++  HD  +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L+  LY LEG+TV GS+ +
Subjt:  TGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDL

Query:  VKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
                  LWH RL H+ E+G++ L +Q ++  +K
Subjt:  VKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

KAD4178028.1 hypothetical protein E3N88_26619 [Mikania micrantha]4.1e-6836.57Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS--------
        STKFE+E+FDG+ DF LW++KM A++  Q V  AL  A+E+P  ++  + KE+ + AHS II  L D ++R++  E+TAAK+W KLES Y +        
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS--------

Query:  ----------------------FDKFFTE-----------------------SYNDVKSAMKFGRDTLSTEIVLNAIRVKE-QELKETKKSNSETLYVRG
                              F+K   +                       SY      + FGR++L+ E V+ A+  KE Q  KE K  N + LYVRG
Subjt:  ----------------------FDKFFTE-----------------------SYNDVKSAMKFGRDTLSTEIVLNAIRVKE-QELKETKKSNSETLYVRG

Query:  RSENRKKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKK---SSGKGKEESSVVNLSEEYNHS--LMVSNTREDDTWILDSGCSFHMTPNREW
        RSE+R     +   RSKSKG   RRC+ C  E H +R CP+ + +K   +  K K   S  +  + Y  +  L++S    D +WILDSGCS+HMTP++E+
Subjt:  RSENRKKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKK---SSGKGKEESSVVNLSEEYNHS--LMVSNTREDDTWILDSGCSFHMTPNREW

Query:  LEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKD-TLYSLEGSTV
         ++ ++G  G V LG+ H C ++G  TV  +L + ++  L  VRY+P+++RNLISL   ++ G+ + L  G  KV+KG LV M G  K+  +Y L+G  V
Subjt:  LEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKD-TLYSLEGSTV

Query:  IGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        I  + + +    S    WH+RLGHM  QG+ EL KQ+VI  ++
Subjt:  IGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.7e-7438.46Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FEV +F+G GDF LW+ K+ AI+ Q KV   +L+ + +P+NIT  + ++++++A+S I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR
                         ++ D+F                            E+Y +VK+A+K+GRD+L+  IVL+A++ +  E+K+ +K + E L  RGR
Subjt:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR

Query:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD
        SE +K  + K R  RSKSKG S R+C+ C KEGH ++ CP  +++++S      +S  N+++ YN +                  LMVS+    D WI+D
Subjt:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD

Query:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL
        SGC+FHMTP+R++L +FQ   GGKV+LG+   C V+G  +V++  HDG +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L
Subjt:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL

Query:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        +  LY LEG+TV GS+ +          LWHKRL H+ E+G++ L +Q ++  +K
Subjt:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

TrEMBL top hitse value%identityAlignment
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class3.5e-7338.02Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FEV +F+G GDF LW+ K+ AI+ Q KV   +L+ + +P+NIT  + ++++++A+  I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR
                         ++ D+F                            E+Y +VK+A+K+G D+L+  IVL+A++ +  E+K+ +K + E L  RGR
Subjt:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR

Query:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD
        SE +K  + K R  RSKSKG S R+C+ C KEGH ++ CP  +++++S      +S  N+++ YN +                  LMVS+    D WI+D
Subjt:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD

Query:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL
        SGC+FHMTP+R++L +FQ   GGKV+LG+   C V+G  +V++  HDG +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L
Subjt:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL

Query:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        +  LY LEG+TV GS+ +      +   LWHKRL H+ E+G++ L +Q ++  +K
Subjt:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

A0A5A7UB25 Putative gag-pol polyprotein8.3e-7538.46Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FEV +F+G GDF LW+ K+ AI+ Q KV   +L+ + +P+NIT  + ++++++A+S I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR
                         ++ D+F                            E+Y +VK+A+K+GRD+L+  IVL+A++ +  E+K+ +K + E L  RGR
Subjt:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR

Query:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD
        SE +K  + K R  RSKSKG S R+C+ C KEGH ++ CP  +++++S      +S  N+++ YN +                  LMVS+    D WI+D
Subjt:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD

Query:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL
        SGC+FHMTP+R++L +FQ   GGKV+LG+   C V+G  +V++  HDG +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L
Subjt:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL

Query:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        +  LY LEG+TV GS+ +          LWHKRL H+ E+G++ L +Q ++  +K
Subjt:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

A0A5A7VKC2 Retrotransposon protein, putative, Ty1-copia subclass1.5e-7137.53Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FE+ +F+G GDF +W+ K+  I+ Q KV   +L+ +++P NIT  + ++++++ +S I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGRSENR
                     ++ D+F                            E+Y +VK+A+K+GRD+L+  IVL+A++ +  E+K+ +K + E L  RGRSE +
Subjt:  -------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGRSENR

Query:  KKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSE-----EYNHSLMVSNTREDDTWILDSGCSFHMTPNREWLEDFQ
             +   RSKSKG S R+C+ C K GH ++ CP  +++++S      +   N +E     E    LMVS+    D WI+DSGC+FHMTP+R++L +FQ
Subjt:  KKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSE-----EYNHSLMVSNTREDDTWILDSGCSFHMTPNREWLEDFQ

Query:  TGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDL
         G GGKV+LG+   C V+   +V++  HD  +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L+  LY LEG+TV GS+ +
Subjt:  TGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDL

Query:  VKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
                  LWH RL H+ E+G++ L +Q ++  +K
Subjt:  VKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

A0A5D3DNU1 Putative gag-pol polyprotein8.3e-7538.46Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------
        ST+FEV +F+G GDF LW+ K+ AI+ Q KV   +L+ + +P+NIT  + ++++++A+S I+ YLSD ++R +    T  +LW KLES Y          
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTY----------

Query:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR
                         ++ D+F                            E+Y +VK+A+K+GRD+L+  IVL+A++ +  E+K+ +K + E L  RGR
Subjt:  -----------------QSFDKF--------------------------FTESYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGR

Query:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD
        SE +K  + K R  RSKSKG S R+C+ C KEGH ++ CP  +++++S      +S  N+++ YN +                  LMVS+    D WI+D
Subjt:  SENRKKHRSKSRG-RSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHS------------------LMVSNTREDDTWILD

Query:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL
        SGC+FHMTP+R++L +FQ   GGKV+LG+   C V+G  +V++  HDG +R LT VRYVP++KRNLISL  LD+ G TIK E GV+KV KG LV ++G L
Subjt:  SGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLL

Query:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        +  LY LEG+TV GS+ +          LWHKRL H+ E+G++ L +Q ++  +K
Subjt:  KDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

A0A5N6MUI5 CCHC-type domain-containing protein2.0e-6836.57Show/hide
Query:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS--------
        STKFE+E+FDG+ DF LW++KM A++  Q V  AL  A+E+P  ++  + KE+ + AHS II  L D ++R++  E+TAAK+W KLES Y +        
Subjt:  STKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS--------

Query:  ----------------------FDKFFTE-----------------------SYNDVKSAMKFGRDTLSTEIVLNAIRVKE-QELKETKKSNSETLYVRG
                              F+K   +                       SY      + FGR++L+ E V+ A+  KE Q  KE K  N + LYVRG
Subjt:  ----------------------FDKFFTE-----------------------SYNDVKSAMKFGRDTLSTEIVLNAIRVKE-QELKETKKSNSETLYVRG

Query:  RSENRKKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKK---SSGKGKEESSVVNLSEEYNHS--LMVSNTREDDTWILDSGCSFHMTPNREW
        RSE+R     +   RSKSKG   RRC+ C  E H +R CP+ + +K   +  K K   S  +  + Y  +  L++S    D +WILDSGCS+HMTP++E+
Subjt:  RSENRKKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKK---SSGKGKEESSVVNLSEEYNHS--LMVSNTREDDTWILDSGCSFHMTPNREW

Query:  LEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKD-TLYSLEGSTV
         ++ ++G  G V LG+ H C ++G  TV  +L + ++  L  VRY+P+++RNLISL   ++ G+ + L  G  KV+KG LV M G  K+  +Y L+G  V
Subjt:  LEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKD-TLYSLEGSTV

Query:  IGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
        I  + + +    S    WH+RLGHM  QG+ EL KQ+VI  ++
Subjt:  IGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-0922.74Show/hide
Query:  KFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS----------
        K  ++ FDG+  + +WK ++ A++ +Q V   L     +  N   D  K+  + A S II YLSD+ +    S+ TA ++   L++ Y+           
Subjt:  KFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS----------

Query:  --------------------FDKFFTE-----------------------SYNDVKSAMK-FGRDTLSTEIVLNAIRVKEQELK------ETKKS-----
                            FD+  +E                        Y+ + +A++    + L+   V N  R+ +QE+K      +T K      
Subjt:  --------------------FDKFFTE-----------------------SYNDVKSAMK-FGRDTLSTEIVLNAIRVKEQELK------ETKKS-----

Query:  --NSETLYVRGRSENR-KKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHSLM---VSNTREDDT--WILDS
          N+   Y     +NR  K +   +G SK K     +C+HC +EGHI++ C   + +  + K KE    V  +  +  + M   V+NT   D   ++LDS
Subjt:  --NSETLYVRGRSENR-KKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHSLM---VSNTREDDT--WILDS

Query:  GCSFHMTPNREWLEDFQTGVGG-KVMLGNR-HFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMK--
        G S H+  +     D    V   K+ +  +  F     +  VRLR +D  I  L  V +  E   NL+S+  L + G +I+ +   + + K  L+ +K  
Subjt:  GCSFHMTPNREWLEDFQTGVGG-KVMLGNR-HFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMK--

Query:  GLLKDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQV
        G+L +         +   +  +     +   LWH+R GH+ +  + E+ ++ +
Subjt:  GLLKDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-3827.01Show/hide
Query:  KFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS----------
        K+EV +F+G   F  W+ +M  ++ QQ +   L    + P+ + A+   ++++ A S I  +LSD ++  +  EDTA  +WT+LES Y S          
Subjt:  KFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQS----------

Query:  --------------------FDKFFTE-----------------------SYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYV-RGRS
                            F+   T+                       SY+++ + +  G+ T+  + V +A+ + E+  K+ +      +   RGRS
Subjt:  --------------------FDKFFTE-----------------------SYNDVKSAMKFGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYV-RGRS

Query:  ENRKKH---RSKSRGRSKSKGASN-RRCYHCQKEGHIRRFCPDLRNKK--SSGKGKEESSVVNLSEEYNHSLMVSNTRE-------DDTWILDSGCSFHM
          R  +   RS +RG+SK++  S  R CY+C + GH +R CP+ R  K  +SG+  ++++   +    N  L ++   E       +  W++D+  S H 
Subjt:  ENRKKH---RSKSRGRSKSKGASN-RRCYHCQKEGHIRRFCPDLRNKK--SSGKGKEESSVVNLSEEYNHSLMVSNTRE-------DDTWILDSGCSFHM

Query:  TPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKDTLYSL
        TP R+    +  G  G V +GN  +  + G   + ++ + G    L  VR+VP+++ NLIS   LD+ GY         ++ KG LV  KG+ + TLY  
Subjt:  TPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLKDTLYSL

Query:  EGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK
             I   +L    D   + LWHKR+GHM E+G++ L K+ +I   K
Subjt:  EGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIK

P93293 Uncharacterized mitochondrial protein AtMg003003.2e-0739.73Show/hide
Query:  GVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIKM
        GV+KV+KG    +KG   D+LY L+GS   G S+L  E+   +  LWH RL HM ++G+  L+K+  + + K+
Subjt:  GVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIKM

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.3e-0839.73Show/hide
Query:  GVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIKM
        GV+KV+KG    +KG   D+LY L+GS   G S+L  E+   +  LWH RL HM ++G+  L+K+  + + K+
Subjt:  GVIKVIKGILVAMKGLLKDTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIKM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTACCAAGTTCGAGGTGGAGCGGTTTGATGGGAAAGGAGATTTTGGTCTATGGAAGATCAAAATGTTGGCCATCGTTCGTCAACAAAAGGTCGATTACGCACTTTT
GGAGGCTAAGGAAATTCCGAACAACATAACAGCGGACCAACTGAAAGAGATAAATAAGGTAGCTCACAGCCTTATAATCTTTTATTTGTCTGATACAATTATAAGGCAAA
TGCGTAGTGAAGATACCGCTGCTAAACTTTGGACAAAATTAGAATCTACCTACCAAAGTTTCGATAAATTCTTTACCGAATCTTATAATGATGTGAAATCTGCTATGAAG
TTTGGTAGGGATACTTTGTCCACTGAAATTGTCTTAAATGCCATTAGAGTAAAAGAACAAGAACTAAAAGAAACCAAGAAATCTAATAGTGAGACTTTGTACGTTAGAGG
GAGATCAGAGAATAGGAAAAAGCATAGAAGCAAGAGTAGGGGTAGATCAAAGTCTAAGGGGGCTAGTAATAGGAGGTGTTATCATTGTCAAAAAGAAGGCCACATTAGAA
GGTTTTGTCCTGACCTTAGGAATAAAAAGTCGTCGGGTAAAGGGAAAGAAGAGTCTAGTGTAGTAAACCTTAGTGAAGAGTATAACCACTCTCTTATGGTTAGTAACACT
AGAGAGGATGACACTTGGATCCTAGATTCAGGTTGTTCTTTCCATATGACCCCTAATAGAGAATGGCTAGAAGACTTCCAAACCGGTGTGGGAGGTAAAGTCATGTTAGG
GAACCGACACTTTTGTAGTGTAGAAGGTCAAGAAACAGTACGCCTTAGGCTTCATGACGGAAATATTAGGTTTCTAACTGGAGTTAGGTACGTACCTGAAGTTAAAAGAA
ACCTAATATCCCTTAGCATGTTAGATAAGCAAGGGTACACCATTAAGTTAGAGGGTGGTGTTATAAAAGTCATAAAGGGCATCTTAGTTGCAATGAAAGGGCTTCTTAAG
GATACCTTATACTCTTTAGAGGGTTCGACGGTCATAGGGTCAAGTGATCTAGTTAAAGAGAGTGACAATTCCAAAATTACCCTTTGGCATAAACGCTTAGGTCACATGGG
TGAGCAAGGAATTAGGGAGTTAATAAAACAACAAGTAATTCCTAACATAAAAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTACCAAGTTCGAGGTGGAGCGGTTTGATGGGAAAGGAGATTTTGGTCTATGGAAGATCAAAATGTTGGCCATCGTTCGTCAACAAAAGGTCGATTACGCACTTTT
GGAGGCTAAGGAAATTCCGAACAACATAACAGCGGACCAACTGAAAGAGATAAATAAGGTAGCTCACAGCCTTATAATCTTTTATTTGTCTGATACAATTATAAGGCAAA
TGCGTAGTGAAGATACCGCTGCTAAACTTTGGACAAAATTAGAATCTACCTACCAAAGTTTCGATAAATTCTTTACCGAATCTTATAATGATGTGAAATCTGCTATGAAG
TTTGGTAGGGATACTTTGTCCACTGAAATTGTCTTAAATGCCATTAGAGTAAAAGAACAAGAACTAAAAGAAACCAAGAAATCTAATAGTGAGACTTTGTACGTTAGAGG
GAGATCAGAGAATAGGAAAAAGCATAGAAGCAAGAGTAGGGGTAGATCAAAGTCTAAGGGGGCTAGTAATAGGAGGTGTTATCATTGTCAAAAAGAAGGCCACATTAGAA
GGTTTTGTCCTGACCTTAGGAATAAAAAGTCGTCGGGTAAAGGGAAAGAAGAGTCTAGTGTAGTAAACCTTAGTGAAGAGTATAACCACTCTCTTATGGTTAGTAACACT
AGAGAGGATGACACTTGGATCCTAGATTCAGGTTGTTCTTTCCATATGACCCCTAATAGAGAATGGCTAGAAGACTTCCAAACCGGTGTGGGAGGTAAAGTCATGTTAGG
GAACCGACACTTTTGTAGTGTAGAAGGTCAAGAAACAGTACGCCTTAGGCTTCATGACGGAAATATTAGGTTTCTAACTGGAGTTAGGTACGTACCTGAAGTTAAAAGAA
ACCTAATATCCCTTAGCATGTTAGATAAGCAAGGGTACACCATTAAGTTAGAGGGTGGTGTTATAAAAGTCATAAAGGGCATCTTAGTTGCAATGAAAGGGCTTCTTAAG
GATACCTTATACTCTTTAGAGGGTTCGACGGTCATAGGGTCAAGTGATCTAGTTAAAGAGAGTGACAATTCCAAAATTACCCTTTGGCATAAACGCTTAGGTCACATGGG
TGAGCAAGGAATTAGGGAGTTAATAAAACAACAAGTAATTCCTAACATAAAAATGTAG
Protein sequenceShow/hide protein sequence
MSTKFEVERFDGKGDFGLWKIKMLAIVRQQKVDYALLEAKEIPNNITADQLKEINKVAHSLIIFYLSDTIIRQMRSEDTAAKLWTKLESTYQSFDKFFTESYNDVKSAMK
FGRDTLSTEIVLNAIRVKEQELKETKKSNSETLYVRGRSENRKKHRSKSRGRSKSKGASNRRCYHCQKEGHIRRFCPDLRNKKSSGKGKEESSVVNLSEEYNHSLMVSNT
REDDTWILDSGCSFHMTPNREWLEDFQTGVGGKVMLGNRHFCSVEGQETVRLRLHDGNIRFLTGVRYVPEVKRNLISLSMLDKQGYTIKLEGGVIKVIKGILVAMKGLLK
DTLYSLEGSTVIGSSDLVKESDNSKITLWHKRLGHMGEQGIRELIKQQVIPNIKM