; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028843 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028843
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr8:31742242..31744542
RNA-Seq ExpressionLag0028843
SyntenyLag0028843
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]8.9e-6036.89Show/hide
Query:  EPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLV
        E  S  G+ KFDG +F +W+MQ++DYL  KK+H+ L  +P+ M  E+W+ LD + +  IR+ LS +VA  VA E T   LM+ L++ YEKPSANNKV+L+
Subjt:  EPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLV

Query:  KKFFNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQGS-------------------------NKESTVGSALVMT
        KK F+++M E   V +++NE  T++NQL SV+IEF DEV A+ L+ SLP+SWE M+ AV  S                          + ST  +  V  
Subjt:  KKFFNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQGS-------------------------NKESTVGSALVMT

Query:  KGKDKVDEDNEPSSSKKK-----WKGRNEVECYYCHKKGHFKYQ--GRKFKEDQKRKPKANIVDEV---VLACVESDTKYSNHSS---------------
        +G+D+   +     SK +      K R  VEC+ C K GHFK        KE+ K      + DE+   +L  V+S  K   H +               
Subjt:  KGKDKVDEDNEPSSSKKK-----WKGRNEVECYYCHKKGHFKYQ--GRKFKEDQKRKPKANIVDEV---VLACVESDTKYSNHSS---------------

Query:  --DWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQ
           W+LDS  S H   DR++  ++  G+ G V + NG      GIGD++LK   G    +  VR+VP +  N+IS+G L D G+   FG    K+K GS 
Subjt:  --DWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQ

Query:  VVAVGHRKSTLY
        VVA GH++ +LY
Subjt:  VVAVGHRKSTLY

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]3.0e-6037.38Show/hide
Query:  EPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLV
        E  S  G+ KFDG +F +W+MQ++DYL  KK+H+ L  +P+ M  E+W+ LD + +  IR+ LS +VA  VA E T   LM+ L++ YEKPSANNKV+L+
Subjt:  EPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLV

Query:  KKFFNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQGS------------------------NKESTVGSAL-VMT
        KK F+++M E   V +++NE  T++NQL SV+IEF DEV A+ LL SLP+SWE M+ AV  S                          E+++ SA  V  
Subjt:  KKFFNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQGS------------------------NKESTVGSAL-VMT

Query:  KGKDKVDEDNEPSSSKKK-----WKGRNEVECYYCHKKGHFKYQ--GRKFKEDQKRKPKANIVDEV---VLACVESDTKYSNHSS---------------
        +G+D+   +     SK +      K R  VEC+ C K GHFK        KE+ K      + DE+   +L  V+S  K   H +               
Subjt:  KGKDKVDEDNEPSSSKKK-----WKGRNEVECYYCHKKGHFKYQ--GRKFKEDQKRKPKANIVDEV---VLACVESDTKYSNHSS---------------

Query:  --DWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQ
           W+LDS  S H   DR++  ++  G+ G V + NG      GIGD++LK   G    +  VR+VP +  N+IS+G L D G+   FG    K+K GS 
Subjt:  --DWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQ

Query:  VVAVGHRKSTLY
        VVA GH++ +LY
Subjt:  VVAVGHRKSTLY

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba]1.7e-5838.24Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-
        ++M E+  V  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAV                         + S + S+ GSAL + T+G+ 
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF     K K+ +     A    V + ++  V+S         +WILDS  S H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF

Query:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG  G+V + +G      GIGDV +KT  G    L+++R+VP +K  +IS+G L D G+   F     K+  G+ V+A G +  TL
Subjt:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba]1.5e-5937.99Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-
        ++M E+  V  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAV                         + S + S+ GSAL + T+G+ 
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF     K K+ +     A    V + ++  V+S         +WILDS  S H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF

Query:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS
         GG  G+V + +G   K  GIGDV +KT  G    L++VR+VP +K  +IS+G L D G+   F     K+  G+ V+A G +  TL     +L V++ +
Subjt:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS

Query:  ERHWMPVK
        E  W  +K
Subjt:  ERHWMPVK

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba]3.4e-5938.76Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-
        ++M E+  V  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAV                         + S + S+ GSAL + T+G+ 
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF     K K+ +     A    V + ++  V+S         +WILDS  S H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF

Query:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL
         GG  G+V + +G   K  GIGDV +KT  G    L++VR+VP +K  +IS+G L D G+   F     K+  G+ V+A G +  TL
Subjt:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL

TrEMBL top hitse value%identityAlignment
A0A2N9FA13 Uncharacterized protein4.3e-6037.75Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M+D +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQ---GSNK----------------------ESTVGSALV-------
        ++M+E A V  ++NE  T+ NQL SV+IEF DE+ A+ +L SLP+SWE M+ AV    G  K                      ++  G A V       
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQ---GSNK----------------------ESTVGSALV-------

Query:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKY----------QGRKFKEDQKRKPKANIVDE--VVLACVESDTKY-SNHSSDWILDSATSV
          K ++     N+ S S+ K +     EC++C KKGH +           + +  K D ++   A + DE  VVL+  E   ++  N   +W++DSA + 
Subjt:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKY----------QGRKFKEDQKRKPKANIVDE--VVLACVESDTKY-SNHSSDWILDSATSV

Query:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  N+IS   +   GY    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9G318 CCHC-type domain-containing protein1.1e-6037.99Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M+D +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQ---GSNK----------------------ESTVGSALV-------
        ++M+E A V  Y+NE  T+ NQL SV+IEF DE+ A+ +L SLP+SWE M+ AV    G  K                      ++  G A V       
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQ---GSNK----------------------ESTVGSALV-------

Query:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQ---------KRKPKANIVDE---VVLACVESDTKY-SNHSSDWILDSATSV
          K ++     N+ S S+ K +     EC++C KKGH +   + ++++Q           K    IVD+   VVL+  E   ++  N   +W++DSA + 
Subjt:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQ---------KRKPKANIVDE---VVLACVESDTKY-SNHSSDWILDSATSV

Query:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  N+IS   +   GY    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9GXF9 CCHC-type domain-containing protein4.3e-6036.24Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M+D +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKP ANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQGSNK-------------------------ESTVGSALVMT---KG
        ++M+E A V  ++NE  T+ NQL  V+IEF+DE+ A+ +L SLP+SWE M+ AV  S                           ++  G A V     + 
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQGSNK-------------------------ESTVGSALVMT---KG

Query:  KDKVDED----NEPSSSKKKWKGRNEVECYYCHKKGHFKYQGRKF----------KEDQKRKPKANIVDE--VVLACVESDTKY-SNHSSDWILDSATSV
        ++K   D    N+ S S  K +     EC++C KKGH +   R +          K D K+   A +VDE  VVL+  E   ++  N   +W++DSA + 
Subjt:  KDKVDED----NEPSSSKKKWKGRNEVECYYCHKKGHFKYQGRKF----------KEDQKRKPKANIVDE--VVLACVESDTKY-SNHSSDWILDSATSV

Query:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT +  G    V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  N+IS   +   GY    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAKGSERHWMPVKAADGNGRGTVEPAARITNFDQSDQDPSVQ
        R ++   KG  +  +P  A D       +      N +Q ++ P+++
Subjt:  RCQLNVAKGSERHWMPVKAADGNGRGTVEPAARITNFDQSDQDPSVQ

A0A2N9IS39 CCHC-type domain-containing protein5.6e-6037.75Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M+D +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQ---GSNK----------------------ESTVGSALV-------
        ++M+E A V  ++NE  T+ NQL SV+IEF DE+ A+ +L SLP+SWE M+ AV    G  K                      ++  G A V       
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQ---GSNK----------------------ESTVGSALV-------

Query:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKY----------QGRKFKEDQKRKPKANIVDE--VVLACVESDTKY-SNHSSDWILDSATSV
          K ++     N+ S S+ K +     EC++C KKGH +           + +  K D ++   A + DE  VVL+  E   ++  N   +W++DSA + 
Subjt:  MTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKY----------QGRKFKEDQKRKPKANIVDE--VVLACVESDTKY-SNHSSDWILDSATSV

Query:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  N+IS   +   GY    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A4U5PY83 CCHC-type domain-containing protein7.3e-6037.99Show/hide
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-
        ++M E+  V  ++N   T+ NQL SV+IEF DE+ A+ LL SLP SWE M+TAV                         + S + S+ GSAL + T+G+ 
Subjt:  MQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV-------------------------QGSNKESTVGSAL-VMTKGK-

Query:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF
         D+        S   SK K+  R +VEC+ C K GHF     K K+ +     A    V + ++  V+S         +WILDS  S H      +  ++
Subjt:  -DKVDEDNEPSS---SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANI--VDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSF

Query:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS
         GG  G+V + +G   K  GIGDV +KT  G    L++VR+VP +K  +IS+G L D G+   F     K+  G+ V+A G +  TL     +L V++ +
Subjt:  TGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVAKGS

Query:  ERHWMPVK
        E  W  +K
Subjt:  ERHWMPVK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-4533.17Show/hide
Query:  VMKFDGKN-FGYWKMQVKDYLTCKKVHKTL---KERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKF
        V KF+G N F  W+ +++D L  + +HK L    ++P  MK EDW  LDE A + IR+ LS DV + +  E TA  +   L + Y   +  NK+YL K+ 
Subjt:  VMKFDGKN-FGYWKMQVKDYLTCKKVHKTL---KERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKF

Query:  FNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV--------------------QGSNKESTVGSALVMT-KGKDKVD
        + + MSE     S++N    LI QL ++ ++  +E  AI LL SLP S++ + T +                    +   K    G AL+   +G+    
Subjt:  FNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAV--------------------QGSNKESTVGSALVMT-KGKDKVD

Query:  EDNEPSSSKKKWKGRNEVE-----CYYCHKKGHFK------YQGRKFKEDQKRKPKANIV----DEVVLACVESD--TKYSNHSSDWILDSATSVHIASD
          N    S  + K +N  +     CY C++ GHFK       +G+     QK       +    D VVL   E +     S   S+W++D+A S H    
Subjt:  EDNEPSSSKKKWKGRNEVE-----CYYCHKKGHFK------YQGRKFKEDQKRKPKANIV----DEVVLACVESD--TKYSNHSSDWILDSATSVHIASD

Query:  RSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLN
        R LF  +  G  G V+MGN   SK  GIGD+ +KT  G  LVL+DVR+VP+++MN+IS   L  DGY   F +++ +L  GS V+A G  + TLYR    
Subjt:  RSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLN

Query:  VAKG
        + +G
Subjt:  VAKG

P25601 Putative transposon Ty5-1 protein YCL075W2.6e-0636.78Show/hide
Query:  CVESDTKYSNHSSDWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISI
        C+ S T  +  SS+WI D+  + H+  DRS+F+SFT   R     G G +    G G V++ T     + L DV YVP++ +N+IS+
Subjt:  CVESDTKYSNHSSDWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISI

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein1.1e-0419.51Show/hide
Query:  TIRMCLSMDVASLV--AHETTAVKLMESLTNRYEKPSANNKVYLVKKFFNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETM
        T+    + DV  L+   +E   ++ +E +T R           L K+  +++M +    +SY+++   ++ +L   K+E SD      + T+L  S++ +
Subjt:  TIRMCLSMDVASLV--AHETTAVKLMESLTNRYEKPSANNKVYLVKKFFNMQMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETM

Query:  KTAVQGSNKESTVGSALVMTKGKDKVDEDNEPSS-----SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANIVDEVVLACVESDTKYSNHSS
         + ++       + S  ++     +V E +   +        + K ++E  C  C+K  H +   +      K + +  IV +  L  V +    +    
Subjt:  KTAVQGSNKESTVGSALVMTKGKDKVDEDNEPSS-----SKKKWKGRNEVECYYCHKKGHFKYQGRKFKEDQKRKPKANIVDEVVLACVESDTKYSNHSS

Query:  DWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFG
         WI+     +++      FT+     +  V   +G      G GDV ++ + G K  +R+V +VP +  N++S G +    Y    G
Subjt:  DWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGTLTDDGYMCEFG

AT3G29785.1 unknown protein5.3e-1036.36Show/hide
Query:  KFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKV
        K DG ++ + +M+++DYL  KK+H+ L ++ + M  +DW  L  + +  IR+ +S ++A  VA E +   LM+ L++ Y+KPS NN V
Subjt:  KFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGACATTGAAGGAGAGACCGAAAGATATGAAGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCGATGGATGTGGCAAGTC
TTGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAATCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTATGTGAATTCCTATATTAATGAGGTTACCACTTTGATTAATCAATTAAAATCTGTTAAGATAGAATTTTCTGATGAGGTGAATGCTATTCA
GTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGCAGGGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATA
AGGTTGATGAAGATAATGAACCGAGTAGCAGTAAGAAAAAGTGGAAAGGTAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCAGGGTAGG
AAATTTAAAGAGGATCAGAAAAGAAAACCAAAGGCAAATATAGTGGATGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGAT
ATTAGACAGTGCAACTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCGTGGCCTTGTGAGGATGGGGAATGGTAGAACCTCCAAGA
CTAGAGGGATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGATAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATATTATTTCTATTGGTACA
TTGACAGATGATGGTTACATGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCA
GTTGAATGTTGCCAAAGGTTCAGAGAGACATTGGATGCCAGTTAAAGCTGCAGATGGTAATGGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAACCAATTTCGATCAGT
CCGATCAAGATCCTTCAGTTCAGAAACAATTGGGAAGTCTAGAAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTT
AGGCGAGTCGAGGCATCAAAGTGGAAGGCCAGGGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAAGTTTGAATAGAGGATTCAAGCAATTCTCAGATTG
TATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGCCTTGGAGTTTACCAAGAAGATCAGAGATAACAGTTGTAATGGGAGTAGATCCTTGGAGTTTGCCA
AGATGTTTAGAGTTTTGTTGCAGTGGGAGACGAGATCTTTGTTTTGTCTCCAAGTGGGAAATTGTTGGAGTGTGGAGTCAAAACTCCTTCAGCCTCAATTTGTCGTGCAT
GGAAAGAAAAGAAGGGAATGGGCCCAAGGCCCATTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGGGCAGTTGC
TAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAAGTTTGAATAGAGGATTCAAGCAATTCTCAGATTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGA
CAGGTATTTTGAGCCTTGGAGTTTACCAAGAAGATCAGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGACATTGAAGGAGAGACCGAAAGATATGAAGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCGATGGATGTGGCAAGTC
TTGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAATCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTATGTGAATTCCTATATTAATGAGGTTACCACTTTGATTAATCAATTAAAATCTGTTAAGATAGAATTTTCTGATGAGGTGAATGCTATTCA
GTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGCAGGGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTGGTTATGACTAAAGGTAAAGATA
AGGTTGATGAAGATAATGAACCGAGTAGCAGTAAGAAAAAGTGGAAAGGTAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCAGGGTAGG
AAATTTAAAGAGGATCAGAAAAGAAAACCAAAGGCAAATATAGTGGATGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGAT
ATTAGACAGTGCAACTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCGTGGCCTTGTGAGGATGGGGAATGGTAGAACCTCCAAGA
CTAGAGGGATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGATAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATATTATTTCTATTGGTACA
TTGACAGATGATGGTTACATGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCA
GTTGAATGTTGCCAAAGGTTCAGAGAGACATTGGATGCCAGTTAAAGCTGCAGATGGTAATGGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAACCAATTTCGATCAGT
CCGATCAAGATCCTTCAGTTCAGAAACAATTGGGAAGTCTAGAAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTT
AGGCGAGTCGAGGCATCAAAGTGGAAGGCCAGGGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAAGTTTGAATAGAGGATTCAAGCAATTCTCAGATTG
TATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGCCTTGGAGTTTACCAAGAAGATCAGAGATAACAGTTGTAATGGGAGTAGATCCTTGGAGTTTGCCA
AGATGTTTAGAGTTTTGTTGCAGTGGGAGACGAGATCTTTGTTTTGTCTCCAAGTGGGAAATTGTTGGAGTGTGGAGTCAAAACTCCTTCAGCCTCAATTTGTCGTGCAT
GGAAAGAAAAGAAGGGAATGGGCCCAAGGCCCATTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGGGCAGTTGC
TAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAAGTTTGAATAGAGGATTCAAGCAATTCTCAGATTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGA
CAGGTATTTTGAGCCTTGGAGTTTACCAAGAAGATCAGAGATAA
Protein sequenceShow/hide protein sequence
MGFVEPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFNM
QMSEDAYVNSYINEVTTLINQLKSVKIEFSDEVNAIQLLTSLPDSWETMKTAVQGSNKESTVGSALVMTKGKDKVDEDNEPSSSKKKWKGRNEVECYYCHKKGHFKYQGR
KFKEDQKRKPKANIVDEVVLACVESDTKYSNHSSDWILDSATSVHIASDRSLFTSFTGGHRGLVRMGNGRTSKTRGIGDVSLKTECGDKLVLRDVRYVPNIKMNIISIGT
LTDDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLNVAKGSERHWMPVKAADGNGRGTVEPAARITNFDQSDQDPSVQKQLGSLEEKVDGYRESPVVRRSNELKKSL
RRVEASKWKARAVAKVKGQVSSLVTSLNRGFKQFSDCIFFRNSCSGWKKMTALEFTKKIRDNSCNGSRSLEFAKMFRVLLQWETRSLFCLQVGNCWSVESKLLQPQFVVH
GKKRREWAQGPFVRRSNELKKSLRRVEASKWKARAVAKVKGQVSSLVTSLNRGFKQFSDCIFFRNSCSGWKKMTGILSLGVYQEDQR