CuGenDBv2

Lag0036587 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036587
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:48909024..48910982
RNA-Seq Expression: Lag0036587
Synteny: Lag0036587
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
                  IPR036875 - Zinc finger, CCHC-type superfamily
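
The CCHC-type zinc finger annotation can be illustrated with a simple pattern search. The sketch below assumes the classic "zinc knuckle" spacing C-X2-C-X4-H-X4-C and scans a fragment copied from this record's protein sequence. The regular expression, the function name `find_cchc`, and the choice of fragment are illustrative assumptions; the page does not state which residues the InterPro match actually covers.

```python
import re

# Fragment copied from this record's protein sequence (region containing Cys/His residues).
FRAGMENT = "KKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANTVYDALV"

# Assumed consensus spacing for a CCHC "zinc knuckle": C-X2-C-X4-H-X4-C.
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")

def find_cchc(seq):
    """Return (0-based start, matched text) for each non-overlapping pattern match."""
    return [(m.start(), m.group()) for m in CCHC.finditer(seq)]

print(find_cchc(FRAGMENT))
```

A regex hit like this is only a quick sanity check; domain databases use profile models rather than a fixed spacing rule.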


Homology
GenBank top hits | e value | % identity | Alignment
TKR74765.1 hypothetical protein D5086_0000292320 [Populus alba] | 1.8e-73 | 42.49
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ M +E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ASV  ++N   T+ NQL SV I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K  +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
        GG HG+V + +G    I G GDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba] | 1.8e-73 | 42.23
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ M  E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
        GG HG+V + +G    I GIGDV +KT  G    L+++R VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba] | 1.3e-74 | 43.01
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ M  E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+ V       +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
        GG HG+V + +G   KI GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS13843.1 hypothetical protein D5086_0000049350 [Populus alba] | 2.4e-73 | 42.23
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKM+++DYL  +K+H   L  +P+ M  E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
        GG HG++ + +G    I GIGDV +KT  G    L++VR VP +K  LIS+G+L D GY   F     K+  G+ V+A G +  TL
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS18269.1 hypothetical protein D5086_0000005180 [Populus alba] | 1.4e-73 | 39.95
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ + +EDW  LD +V+  +R+ LS  V   V  E +  +LMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        +++VE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+  SAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTLYKFVRHSNELKKSL
        GG HG+V + +G    I GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TLY   R ++ +  + 
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTLYKFVRHSNELKKSL

Query:  RRVEASKWKARAVAKVKG
           +A  W ++AV   +G
Subjt:  RRVEASKWKARAVAKVKG

TrEMBL top hits | e value | % identity | Alignment
A0A4U5P1P0 CCHC-type domain-containing protein | 8.9e-74 | 42.23
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ M  E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
        GG HG+V + +G    I GIGDV +KT  G    L+++R VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

A0A4U5PY83 CCHC-type domain-containing protein | 6.2e-75 | 43.01
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ M  E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+ V       +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
        GG HG+V + +G   KI GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

A0A4U5QGR0 Uncharacterized protein | 1.2e-73 | 41.39
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ M  E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTLYKFVRHSNELKKSL
        GG HG+V + +G   KI GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL      SN L   L
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTLYKFVRHSNELKKSL

Query:  RRVEASKWKARAVAKVKG
          ++    KA  V  + G
Subjt:  RRVEASKWKARAVAKVKG

A0A4U5R597 CCHC-type domain-containing protein | 6.8e-74 | 39.95
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ + +EDW  LD +V+  +R+ LS  V   V  E +  +LMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        +++VE+ SV  ++N   T+ NQL SV+I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+  SAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K+ +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTLYKFVRHSNELKKSL
        GG HG+V + +G    I GIGDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TLY   R ++ +  + 
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTLYKFVRHSNELKKSL

Query:  RRVEASKWKARAVAKVKG
           +A  W ++AV   +G
Subjt:  RRVEASKWKARAVAKVKG

A0A4V6XW18 CCHC-type domain-containing protein | 8.9e-74 | 42.49
Query:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  +K+H   L  +P+ M +E+W+ LD +V+  IR+ LS  V   V  E +  KLMEAL+  YE PSANNKV+L+KK FN
Subjt:  GVTKFDGKNFGYWKMQVKDYLTCRKMH-KALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFN

Query:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-
        ++MVE+ASV  ++N   T+ NQL SV I+F DE+  + LL SLP SWE M+T VSNS G + LK  ++ DL + EE+ R+ S + S+ GSAL + T+G+ 
Subjt:  MQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSAL-VMTKGK-

Query:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT
         D+        S   S+ K+ +R +VEC+ C K GHF   C K K  +     A T  V DAL+         +   +WILDS AS H      +  ++ 
Subjt:  -DKVDEENEPSS---SRKKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANT--VYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFT

Query:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
        GG HG+V + +G    I G GDV +KT  G    L++VR VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  GGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

SwissProt top hits | e value | % identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 4.3e-49 | 34.19
Query:  VTKFDGKN-FGYWKMQVKDYLTCRKMHKAL---KERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKF
        V KF+G N F  W+ +++D L  + +HK L    ++P  M  EDW  LDE   ++IR+ LS DV + ++ E TA  +   L + Y + +  NK+YL K+ 
Subjt:  VTKFDGKN-FGYWKMQVKDYLTCRKMHKAL---KERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKF

Query:  FNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSALVMTKGK
        + + M E  +  S++N    LI QL ++ +   +E   I LL SLP S++ + TT+ +  G  T+++ +V    +  E  R+   K    G AL+ T+G+
Subjt:  FNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSALVMTKGK

Query:  DKVDEENE----PSSSRKKWKNRNEV---ECFYCHKKGHFKSEC---RKLK-EDQKRKQEANTV-----YDALVCV----ESDTRYSNHSSDWILDSAAS
         +  + +      S +R K KNR++     C+ C++ GHFK +C   RK K E   +K + NT       D +V      E     S   S+W++D+AAS
Subjt:  DKVDEENE----PSSSRKKWKNRNEV---ECFYCHKKGHFKSEC---RKLK-EDQKRKQEANTV-----YDALVCV----ESDTRYSNHSSDWILDSAAS

Query:  VHIASDRNLFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL
         H    R+LF  +  G  G V+MGN   SKI+GIGD+ +KT  G  LVL+DVR VP+++MNLIS   L  DGY+  F +++ +L  GS V+A G  + TL
Subjt:  VHIASDRNLFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTL

Query:  YKFVRH--SNELKKSLRRVEASKWKAR
        Y+        EL  +   +    W  R
Subjt:  YKFVRH--SNELKKSLRRVEASKWKAR

P25601 Putative transposon Ty5-1 protein YCL075W | 3.4e-06 | 35.23
Query:  VCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI
        +C+ S T  +  SS+WI D+  + H+  DR++F+SFT         G G +  I G G V++ T     + L DV +VP++ +NLIS+
Subjt:  VCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.4e-10 | 23.45
Query:  NFDGVTKFDGKNFGYWKMQVKDYLTCRKMHKAL---KERPKGMTDED-----------WEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYE
        N   VTK    N+  W  QV       ++   L      P      D           W+  D+ + +++   +SM V   V   TTA ++ E L   Y 
Subjt:  NFDGVTKFDGKNFGYWKMQVKDYLTCRKMHKAL---KERPKGMTDED-----------WEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYE

Query:  NPSANNKVYLVKKFFNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNK
        NPS  + V  ++           +++ Y+  + T  +QL  +      +  V ++L +LP+ ++ +   ++      TL            EIH +  N 
Subjt:  NPSANNKVYLVKKFFNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNK

Query:  ES---TVGSALVMTKGKDKVDEENEPSSSRKKWKNRNE--------------------------------VECFYCHKKGHFKSECRKLKE-----DQKR
        ES    V SA V+    + V   N  +++     NRN                                  +C  C  +GH    C +L+      + ++
Subjt:  ES---TVGSALVMTKGKDKVDEENEPSSSRKKWKNRNE--------------------------------VECFYCHKKGHFKSECRKLKE-----DQKR

Query:  KQEANTVYDALVCVESDTRYSNHSSDWILDSAASVHIASDRN---LFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLI
             T +     +   + YS  S++W+LDS A+ HI SD N   L   +TGG    V + +G T  IS  G  SL T+    L L ++ +VPNI  NLI
Subjt:  KQEANTVYDALVCVESDTRYSNHSSDWILDSAASVHIASDRN---LFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLI

Query:  SIGKLVD-DGYKCEFGSRQCKLKFGSQVVAV--GHRKSTLYKFVRHSNE----LKKSLRRVEASKWKAR
        S+ +L + +G   EF     ++K  +  V +  G  K  LY++   S++          +   S W AR
Subjt:  SIGKLVD-DGYKCEFGSRQCKLKFGSQVVAV--GHRKSTLYKFVRHSNE----LKKSLRRVEASKWKAR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 9.7e-09 | 23.47
Query:  NFDGVTKFDGKNFGYWKMQVKDYLTCRKMHKAL---KERPKGMTDED-----------WEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYE
        N   VTK    N+  W  QV       ++   L      P      D           W   D+ + ++I   +SM V   V   TTA ++ E L   Y 
Subjt:  NFDGVTKFDGKNFGYWKMQVKDYLTCRKMHKAL---KERPKGMTDED-----------WEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYE

Query:  NPSANNKVYLVKKFFNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQC-SN
        NPS     ++ +  F  +  + A +   ++    +   L+++  D+   ++ I    + P   E+ +  ++  +    L  +EV  +      HR   +N
Subjt:  NPSANNKVYLVKKFFNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQC-SN

Query:  KESTVGSALVMTKGKDKVDEENEPSSSRKKWKNRNEV----ECFYCHKKGHFKSECRKLKE-----DQKRKQEANTVYDALVCVESDTRYSNHSSDWILD
        +              +      +PSSS  +  NR        C  C  +GH    C +L +     +Q++     T +     +  ++ Y  ++++W+LD
Subjt:  KESTVGSALVMTKGKDKVDEENEPSSSRKKWKNRNEV----ECFYCHKKGHFKSECRKLKE-----DQKRKQEANTVYDALVCVESDTRYSNHSSDWILD

Query:  SAASVHIASDRN---LFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKL
        S A+ HI SD N       +TGG    V + +G T  I+  G  SL T     L L  V +VPNI  NLIS+ +L
Subjt:  SAASVHIASDRN---LFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKL

Arabidopsis top hits | e value | % identity | Alignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein | 4.5e-09 | 21.72
Query:  LVKKFFNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIH---RQCSNKESTVGS
        L K+  +++MV+  S +SY+++   ++ +L   K++ +D      + T+L  S++ +     +S     + V ++   ++ E  +    + S +E+  G 
Subjt:  LVKKFFNMQMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIH---RQCSNKESTVGS

Query:  ALVMTKGKDKVDEENEPSSSRKKWKNRNEVECFYCHKKGHFKSECR-KLKEDQKRKQEANTVYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFT
               KD             + K+++E  C  C+K  H + +C+ ++  D++ K++   V   L  V +    +     WI+   A +++      FT
Subjt:  ALVMTKGKDKVDEENEPSSSRKKWKNRNEVECFYCHKKGHFKSECR-KLKEDQKRKQEANTVYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFT

Query:  SFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFG
        +        V   +G    + G GDV ++ + G K  +R+V FVP +  N++S GK+V   Y    G
Subjt:  SFTGGHHGLVRMGNGRTSKISGIGDVSLKTECGDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFG

AT3G29785.1 unknown protein | 1.5e-09 | 32.95
Query:  KFDGKNFGYWKMQVKDYLTCRKMHKALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKV
        K DG ++ + +M+++DYL  +K+H+ L ++ + M+ +DW  L  +V+  IR+ +S ++   V  E +   LM+ L++ Y+ PS NN V
Subjt:  KFDGKNFGYWKMQVKDYLTCRKMHKALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKV


Sequences
CDS sequence
ATGGGTTTTGTAGAGCCAAGAAATTTCGATGGAGTCACGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAGGAAAATGCA
TAAGGCATTGAAGGAGAGACCGAAAGGAATGACGGACGAAGATTGGGAAGCTCTAGATGAAGAGGTAGTTGCAAGCATAAGGATGTGCTTATCAATGGATGTGACAAGTC
TAGTGGTCCATGAGACAACTGCAGTTAAATTAATGGAAGCACTTACAAACAGGTATGAAAACCCCTCTGCTAATAATAAAGTATACCTAGTTAAGAAGTTCTTCAACATG
CAAATGGTTGAGGATGCTTCTGTGAATTCCTATATTAACGAAGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGATTTTACTGATGAGGTGAATGTTATTCA
GTTATTAACATCTTTACCTGATAGTTGGGAAATGATGAAGACAACAGTGTCTAATTCGACTGGAAATAATACTTTAAAAGTTTCAGAAGTTTGTGATTTAGCCATAACTG
AGGAAATTCATAGACAGTGTAGTAATAAAGAGTCTACAGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGATGAAGAAAATGAACCGAGTAGCAGTAGG
AAAAAGTGGAAAAATAGGAATGAGGTAGAATGTTTTTACTGCCACAAGAAAGGTCACTTCAAGAGTGAGTGTAGAAAACTTAAAGAGGATCAGAAAAGAAAACAAGAGGC
AAATACAGTGTATGATGCCTTAGTTTGTGTTGAGAGTGACACAAGGTACAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTACACATAGCTTCAGATA
GGAATTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGAGGATGGGGAATGGTAGAACCTCCAAAATCAGTGGGATTGGAGATGTTAGTCTGAAGACAGAGTGT
GGAGATAAATTAGTACTGCGAGATGTCAGGTTTGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAATTGGTAGATGATGGTTACAAGTGTGAGTTTGGTAGTCG
CCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAAATTTGTCAGACACTCGAATGAATTGAAGAAGTCGCTTAGGCGAG
TTGAGGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGGAGACGTGTTGCTTGTTTTGTCTCCAAGTGGGAGATTGTT
GAGTTTGGAGCCCAAAACAAGGCCCACCAAATCATTGCGGTGAGAAAAGTCATTGCAGCGAGAAAAGTCATTGCGGCGAGTTCCTTTCTTTGCGGCGAGAAAGTCACTGC
GGCGAAGTGTGAACATAAGAATTTATTGCGGCTGTCTTAA
mRNA sequence
ATGGGTTTTGTAGAGCCAAGAAATTTCGATGGAGTCACGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAGGAAAATGCA
TAAGGCATTGAAGGAGAGACCGAAAGGAATGACGGACGAAGATTGGGAAGCTCTAGATGAAGAGGTAGTTGCAAGCATAAGGATGTGCTTATCAATGGATGTGACAAGTC
TAGTGGTCCATGAGACAACTGCAGTTAAATTAATGGAAGCACTTACAAACAGGTATGAAAACCCCTCTGCTAATAATAAAGTATACCTAGTTAAGAAGTTCTTCAACATG
CAAATGGTTGAGGATGCTTCTGTGAATTCCTATATTAACGAAGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGATTTTACTGATGAGGTGAATGTTATTCA
GTTATTAACATCTTTACCTGATAGTTGGGAAATGATGAAGACAACAGTGTCTAATTCGACTGGAAATAATACTTTAAAAGTTTCAGAAGTTTGTGATTTAGCCATAACTG
AGGAAATTCATAGACAGTGTAGTAATAAAGAGTCTACAGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGATGAAGAAAATGAACCGAGTAGCAGTAGG
AAAAAGTGGAAAAATAGGAATGAGGTAGAATGTTTTTACTGCCACAAGAAAGGTCACTTCAAGAGTGAGTGTAGAAAACTTAAAGAGGATCAGAAAAGAAAACAAGAGGC
AAATACAGTGTATGATGCCTTAGTTTGTGTTGAGAGTGACACAAGGTACAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTACACATAGCTTCAGATA
GGAATTTGTTCACATCATTCACAGGAGGACATCATGGTCTAGTGAGGATGGGGAATGGTAGAACCTCCAAAATCAGTGGGATTGGAGATGTTAGTCTGAAGACAGAGTGT
GGAGATAAATTAGTACTGCGAGATGTCAGGTTTGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAATTGGTAGATGATGGTTACAAGTGTGAGTTTGGTAGTCG
CCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAAATTTGTCAGACACTCGAATGAATTGAAGAAGTCGCTTAGGCGAG
TTGAGGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGGAGACGTGTTGCTTGTTTTGTCTCCAAGTGGGAGATTGTT
GAGTTTGGAGCCCAAAACAAGGCCCACCAAATCATTGCGGTGAGAAAAGTCATTGCAGCGAGAAAAGTCATTGCGGCGAGTTCCTTTCTTTGCGGCGAGAAAGTCACTGC
GGCGAAGTGTGAACATAAGAATTTATTGCGGCTGTCTTAA
Protein sequence
MGFVEPRNFDGVTKFDGKNFGYWKMQVKDYLTCRKMHKALKERPKGMTDEDWEALDEEVVASIRMCLSMDVTSLVVHETTAVKLMEALTNRYENPSANNKVYLVKKFFNM
QMVEDASVNSYINEVTTLINQLKSVKIDFTDEVNVIQLLTSLPDSWEMMKTTVSNSTGNNTLKVSEVCDLAITEEIHRQCSNKESTVGSALVMTKGKDKVDEENEPSSSR
KKWKNRNEVECFYCHKKGHFKSECRKLKEDQKRKQEANTVYDALVCVESDTRYSNHSSDWILDSAASVHIASDRNLFTSFTGGHHGLVRMGNGRTSKISGIGDVSLKTEC
GDKLVLRDVRFVPNIKMNLISIGKLVDDGYKCEFGSRQCKLKFGSQVVAVGHRKSTLYKFVRHSNELKKSLRRVEASKWKARAVAKVKGQVSSLVTGRRVACFVSKWEIV
EFGAQNKAHQIIAVRKVIAARKVIAASSFLCGEKVTAAKCEHKNLLRLS
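
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard genetic code; the function name `translate` and the two short excerpts (the first 36 bases and the last 9 bases of the CDS listed on this page) are chosen here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Excerpts copied from the CDS sequence on this page.
cds_start = "ATGGGTTTTGTAGAGCCAAGAAATTTCGATGGAGTC"
cds_end = "CTGTCTTAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Running the same function over the full CDS (with line breaks removed) should reproduce the listed protein if the record is internally consistent.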