CuGenDBv2

Lag0025638 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025638
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr10:16799724..16804562
RNA-Seq Expression: Lag0025638
Synteny: Lag0025638
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
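
The genome location listed above can be split into its parts programmatically. The following is a minimal sketch, assuming the `seqid:start..end` notation uses 1-based, inclusive coordinates (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr10:16799724..16804562")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4839 bp on chr10, which is longer than the 1995 nt coding sequence and so would be consistent with introns or untranslated regions inside the span.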


Homology
GenBank top hits (e value, % identity, alignment)
TKR74765.1 hypothetical protein D5086_0000292320 [Populus alba] | e value: 3.0e-74 | % identity: 42.89
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M++E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ASV  ++N   T+ NQL SV IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K  +     A    V++ ++  V S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG+V + +G      G GDV ++T  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba] | e value: 2.3e-74 | % identity: 42.64
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG+V + +G      GIGDV ++T  G    L+++R+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba] | e value: 1.6e-75 | % identity: 42.16
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS
         GG HG+V + +G   K  GIGDV ++T  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL     +L V++ +
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS

Query:  ERQWMPVK
        E +W  +K
Subjt:  ERQWMPVK

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba] | e value: 4.6e-75 | % identity: 43.15
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG+V + +G   K  GIGDV ++T  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS13843.1 hypothetical protein D5086_0000049350 [Populus alba] | e value: 3.9e-74 | % identity: 42.38
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKM+++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG++ + +G      GIGDV ++T  G    L++VR+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

TrEMBL top hits (e value, % identity, alignment)
A0A4U5P1P0 CCHC-type domain-containing protein | e value: 1.1e-74 | % identity: 42.64
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG+V + +G      GIGDV ++T  G    L+++R+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

A0A4U5PY83 CCHC-type domain-containing protein | e value: 7.6e-76 | % identity: 42.16
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS
         GG HG+V + +G   K  GIGDV ++T  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL     +L V++ +
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS

Query:  ERQWMPVK
        E +W  +K
Subjt:  ERQWMPVK

A0A4U5QGR0 Uncharacterized protein | e value: 2.2e-75 | % identity: 43.15
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG+V + +G   K  GIGDV ++T  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

A0A4U5QS59 CCHC-type domain-containing protein | e value: 1.9e-74 | % identity: 42.38
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKM+++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ SV  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG++ + +G      GIGDV ++T  G    L++VR+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

A0A4V6XW18 CCHC-type domain-containing protein | e value: 1.4e-74 | % identity: 42.89
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M++E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-
        ++M E+ASV  ++N   T+ NQL SV IEF DE+ A+ LL SLP SWE M+TAVSNS G + LK+ ++ DL +AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF   C K K  +     A    V++ ++  V S         +WILDS AS H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG HG+V + +G      G GDV ++T  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 5.3e-50 | % identity: 34.88
Query:  VMKFDGKN-FGYWKMQVKDYLTCKKVHKAL---KERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKF
        V KF+G N F  W+ +++D L  + +HK L    ++P  MK EDW  LDE A + IR+ LS DV + +  E TA  +   L + Y   +  NK+YL K+ 
Subjt:  VMKFDGKN-FGYWKMQVKDYLTCKKVHKAL---KERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKF

Query:  FNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LAIAEEIRRQGSNKESTVGSALVMT-K
        + + MSE  +  S++N    LI QL ++ ++  +E  AI LL SLP S++ + T + +  G  T++  +V   L + E++R++  N+    G AL+   +
Subjt:  FNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LAIAEEIRRQGSNKESTVGSALVMT-K

Query:  GKDKVDEDNEPSSSKKKWKGRNEVE-----CYYCHKKGHFKYQC---RKLK-EDQKRKPEANIV------EEVVLACVESD--TKYSNHSSDWILDSAAS
        G+      N    S  + K +N  +     CY C++ GHFK  C   RK K E   +K + N        + VVL   E +     S   S+W++D+AAS
Subjt:  GKDKVDEDNEPSSSKKKWKGRNEVE-----CYYCHKKGHFKYQC---RKLK-EDQKRKPEANIV------EEVVLACVESD--TKYSNHSSDWILDSAAS

Query:  VHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         H    R LF  +  G  G V+MGN   SK  GIGD+ ++T  G  LVL+DVR+VP+++MNLIS   L  DG+   F +++ +L  GS V+A G  + TL
Subjt:  VHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

Query:  YRCQLNVAKG
        YR    + +G
Subjt:  YRCQLNVAKG

P25601 Putative transposon Ty5-1 protein YCL075W | e value: 2.1e-06 | % identity: 36.78
Query:  CVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISI
        C+ S T  +  SS+WI D+  + H+  DRS+F+SFT         G G +    G G V++     G + L DV YVP++ +NLIS+
Subjt:  CVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 4.2e-07 | % identity: 23.05
Query:  WEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTS
        W+  D+   + +   +SM V   V+   TA ++ E+L   Y  PS  + V  ++           +++ Y+  + T  +QL  +      +    ++L +
Subjt:  WEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTS

Query:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSKKKWK--------GRNEV--
        LP+ ++ +             T +     N+  K   V    +        S++ +T  +        ++ D  N  ++S K W+          N+   
Subjt:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSKKKWK--------GRNEV--

Query:  ---ECYYCHKKGHFKYQCRKLK----EDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKT
           +C  C  +GH   +C +L+        ++P +        A +   + YS  S++W+LDS A+ HI SD    SL   +TGG    V + +G T   
Subjt:  ---ECYYCHKKGHFKYQCRKLK----EDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKT

Query:  RGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTD-DGFMCEF
           G  SL T+    L L ++ YVPNI  NLIS+ +L + +G   EF
Subjt:  RGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTD-DGFMCEF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.2e-04 | % identity: 22.96
Query:  WEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTS
        W   D+   + I   +SM V   V+   TA ++ E+L   Y  PS     ++ +  F  +  + A +   ++    +   L+++  ++   ++ I    +
Subjt:  WEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTS

Query:  LPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDN--EPSSSKKKWKGRNEV----ECYYCHKKGHFKYQCR
         P   E  +  ++  +    L  +EV  +  A  +  + +N      +        +  +  N  +PSSS  +   R        C  C  +GH   +C 
Subjt:  LPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDN--EPSSSKKKWKGRNEV----ECYYCHKKGHFKYQCR

Query:  KLKEDQKRKPEANIVEEVV----LACVESDTKYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLR
        +L + Q    +             A +  ++ Y  ++++W+LDS A+ HI SD    S    +TGG    V + +G T      G  SL T     L L 
Subjt:  KLKEDQKRKPEANIVEEVV----LACVESDTKYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLR

Query:  DVRYVPNIKMNLISIGKL
         V YVPNI  NLIS+ +L
Subjt:  DVRYVPNIKMNLISIGKL

Arabidopsis top hits (e value, % identity, alignment)
AT3G20980.1 Gag-Pol-related retrotransposon family protein | e value: 9.7e-07 | % identity: 31.19
Query:  NIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKT-----RGIGDVSLQTECGGKLVLRDVRYVPNIKMNLIS
        +++ EV     E  +KY+ H + W++ S  S H+      FT+        V+  +G  S+T      GIGDV+  T  G K  +++V YVP I+ N +S
Subjt:  NIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKT-----RGIGDVSLQTECGGKLVLRDVRYVPNIKMNLIS

Query:  IGKLTDDGF
        + +L  +GF
Subjt:  IGKLTDDGF

AT3G21000.1 Gag-Pol-related retrotransposon family protein | e value: 1.9e-07 | % identity: 20.83
Query:  LVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALV
        L K+  +++M +  S +SY+++   ++ +L   K+E SD      + T+L  S++ + + +      + +    + +       R   S+ E  +   L 
Subjt:  LVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALV

Query:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFT
                 +D    S  +KW       C  C+K  H +  C+      K + E  IV +  L  V +    +     WI+   A +++      FT+  
Subjt:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFT

Query:  GGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFG
              V   +G      G GDV ++ + G K  +R+V +VP +  N++S GK+    +    G
Subjt:  GGHHGLVRMGNGRTSKTRGIGDVSLQTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFG

AT3G29785.1 unknown protein | e value: 7.2e-10 | % identity: 36.36
Query:  KFDGKNFGYWKMQVKDYLTCKKVHKALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKV
        K DG ++ + +M+++DYL  KK+H+ L ++ + M  +DW  L  + +  IR+ +S ++A  VA E +   LM+ L++ Y+KPS NN V
Subjt:  KFDGKNFGYWKMQVKDYLTCKKVHKALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKV
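
The middle line of each alignment block above follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a % identity figure relates to that line, the hypothetical helper below counts identical columns against the alignment length taken from the query line. The values listed for each hit cover the hit's whole alignment, so a single block will not necessarily reproduce them.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    `query` is the aligned query row (gaps included) and sets the column
    count, because trailing spaces on the midline are often lost in text.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment row")
    identities = sum(ch.isalpha() for ch in midline[:columns])
    return 100.0 * identities / columns


# Toy five-column block: three identical residues, one '+', one mismatch.
print(percent_identity("GVMKF", "G+ KF"))
```

Using the query row for the denominator means gap columns count toward the alignment length, which matches how BLAST reports identities over aligned columns.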


Sequences
CDS sequence
ATGATATCGGTGGTGGAGGTAACTGGGAGAGTCGGTAAGCGGTCACTACCGGTGGACACAGTGCTGGAAGAAGAGGCCATAAGGAAGACGACAGATGTCCCGAAACACCT
CCGAATTCCTAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTGGCCTACACCAAGCCGGTGTGCAGCGGTTTTTGCTGGTCTTGCAGGTCACGTCTTCCCCAG
TTTCTACAAATTCACTGTTGGTGTCACGTGAAGGACGGCAGAAAGTGGCTTTTCAGAGTTTTGTGGCTGAGAGCAGAATTGAGGGGCAGCCAGCTTTTGGTTCAGGAGTC
ATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAATTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAGACCGAAAGATATGAAGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCGATGGATGTGGCAAGTC
TAGTAGCCCATGAGATAACTGCAGTCAAGTTGATGGAATCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTCTGTGAATTCCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAATTTTCTGATGAGGTGAATGCTATTCA
GTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTG
AGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACGGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGATGAAGATAATGAACCGAGTAGCAGTAAG
AAAAAGTGGAAAGGTAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCAGTGTAGGAAACTTAAAGAGGATCAGAAAAGAAAACCAGAGGC
AAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAATCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTTCACATAGCTTCAG
ATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGAATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTCTACAGACAGAA
TGTGGAGGTAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGACAGATGATGGTTTCATGTGTGAGTTTGGCAG
TCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGA
TGCCGGTTAAAGCTGCCGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAAGTTCGATCAGTTCGATCAAGATCCTTCAGTTCAGAAACAATTGGGA
AGTCTAGGAGAGAAAGTTGATGGCTATTGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGAGC
AGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGTTTGAATAGAGGACTCAAGCCATTCTCAGAGTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGA
AGATGACAGGACGGCAGAAAGTGGCTTTTCAGAGTTTTGTGGCTGAGAGCAGAATTGAGGGGCAGCCGCTTCCTGTAATTTCCCAAATACTAGTAGAACCTGCAGTTGCA
GAAGACTGGACGTAG
mRNA sequence
ATGATATCGGTGGTGGAGGTAACTGGGAGAGTCGGTAAGCGGTCACTACCGGTGGACACAGTGCTGGAAGAAGAGGCCATAAGGAAGACGACAGATGTCCCGAAACACCT
CCGAATTCCTAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTGGCCTACACCAAGCCGGTGTGCAGCGGTTTTTGCTGGTCTTGCAGGTCACGTCTTCCCCAG
TTTCTACAAATTCACTGTTGGTGTCACGTGAAGGACGGCAGAAAGTGGCTTTTCAGAGTTTTGTGGCTGAGAGCAGAATTGAGGGGCAGCCAGCTTTTGGTTCAGGAGTC
ATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAATTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAGACCGAAAGATATGAAGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCGATGGATGTGGCAAGTC
TAGTAGCCCATGAGATAACTGCAGTCAAGTTGATGGAATCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTCTGTGAATTCCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAATTTTCTGATGAGGTGAATGCTATTCA
GTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTG
AGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACGGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATAAGGTTGATGAAGATAATGAACCGAGTAGCAGTAAG
AAAAAGTGGAAAGGTAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCAGTGTAGGAAACTTAAAGAGGATCAGAAAAGAAAACCAGAGGC
AAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAATCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTTCACATAGCTTCAG
ATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGAATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTCTACAGACAGAA
TGTGGAGGTAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGACAGATGATGGTTTCATGTGTGAGTTTGGCAG
TCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGA
TGCCGGTTAAAGCTGCCGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAAGTTCGATCAGTTCGATCAAGATCCTTCAGTTCAGAAACAATTGGGA
AGTCTAGGAGAGAAAGTTGATGGCTATTGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGAGC
AGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGTTTGAATAGAGGACTCAAGCCATTCTCAGAGTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGA
AGATGACAGGACGGCAGAAAGTGGCTTTTCAGAGTTTTGTGGCTGAGAGCAGAATTGAGGGGCAGCCGCTTCCTGTAATTTCCCAAATACTAGTAGAACCTGCAGTTGCA
GAAGACTGGACGTAG
Protein sequence
MISVVEVTGRVGKRSLPVDTVLEEEAIRKTTDVPKHLRIPKNPRRTNRHRRRCGLHQAGVQRFLLVLQVTSSPVSTNSLLVSREGRQKVAFQSFVAESRIEGQPAFGSGV
MGFVEPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHEITAVKLMESLTNRYEKPSANNKVYLVKKFFNM
QMSEDASVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSK
KKWKGRNEVECYYCHKKGHFKYQCRKLKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLQTE
CGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLNVAKGSERQWMPVKAADGSCRGTVEPAARIAKFDQFDQDPSVQKQLG
SLGEKVDGYCESPVVRRSNELKKSLRRVEASKWKARAVAKVKGQVSSLVTGLNRGLKPFSECIFFRNSCSGWKKMTGRQKVAFQSFVAESRIEGQPLPVISQILVEPAVA
EDWT
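
The protein sequence above corresponds to the CDS sequence read codon by codon. The sketch below illustrates that relationship, assuming the standard nuclear genetic code applies (the record does not name a translation table); `translate` is a hypothetical helper, not a CuGenDB function.

```python
# Standard genetic code, ordered by codon with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First six and last five codons of the CDS listed in this record.
print(translate("ATGATATCGGTGGTGGAG"))  # start of the protein
print(translate("GAAGACTGGACGTAG"))     # end of the protein, before the stop
```

The two fragments translate to `MISVVE` and `EDWT`, matching the first and last residues of the protein sequence shown in this record.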