CuGenDBv2

Lag0015071 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015071
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr12:7252676..7255881
RNA-Seq Expression: Lag0015071
Synteny: Lag0015071
Gene Ontology terms:
GO:0009987 - cellular process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
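The genome location above is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page), the string can be split and the gene span computed as follows. The function name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into its parts and the span length.

    Assumes 1-based, inclusive coordinates (not confirmed by the record).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Inclusive span: both the start and end positions are counted.
    return chrom, start, end, end - start + 1


print(parse_location("chr12:7252676..7255881"))
```

Under that assumption the gene spans 3206 bp on chr12.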


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG5549868.1 hypothetical protein RHGRI_014986 [Rhododendron griersonianum] | e value: 1.2e-51 | identity: 34.24%
Query:  VMKFDGKNFGYWKMQVKDYLTCKKVHKTLK---ERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFF
        ++ F+G N+  WK++++D L CK +H  ++    +P DMK EDW  L+ +AV  IR  +   V   V+ ET+A  L + L + Y++ SA NK +L KK  
Subjt:  VMKFDGKNFGYWKMQVKDYLTCKKVHKTLK---ERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFF

Query:  NMQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------------------------GSNKESTVGSALVMTKGKDKV------------------
        N++  E  S+  ++NE+ +++NQL S+KI F DE                           G   +S V S+L+  + + K                   
Subjt:  NMQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------------------------GSNKESTVGSALVMTKGKDKV------------------

Query:  -------DEDNEPSSSRKKWKYRNEVECYYCHKKGHLKYQC--RKFKEDQKRKPEANIVE---------EVVLACVESDTKYSNHSSDWILDSAASVHIA
               + D    SSR +   + ++EC+YC KKGH+K +C   KFKE+ + K      E         E+V+ C ES     +  ++W++DS AS H+ 
Subjt:  -------DEDNEPSSSRKKWKYRNEVECYYCHKKGHLKYQC--RKFKEDQKRKPEANIVE---------EVVLACVESDTKYSNHSSDWILDSAASVHIA

Query:  SDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQ
        S    FTS+  G  G VRMGN   SK  G+G++ L+T  G KLVL+DVR+VP+I++NLIS GKL D+G+   FG+ + KL  GS VVA G +  +LY  +
Subjt:  SDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQ

Query:  LNVAKG
          ++KG
Subjt:  LNVAKG

TKR74765.1 hypothetical protein D5086_0000292320 [Populus alba] | e value: 1.2e-51 | identity: 36.15%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+EE+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-----------------------------------------------GSNKESTVGSAL-VMTKGKD
        ++M E+ASV  ++N   T+ NQL SV IEF DE                                                S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-----------------------------------------------GSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLF
            D   S  R K +YR+        +VEC+ C K GH    C K K  +     A    V++ ++  V S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLF

Query:  TSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         ++ GG HG+V + +G      G GDV +KT  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba] | e value: 1.5e-52 | identity: 35.52%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-----------------------------------------------GSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF DE                                                S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-----------------------------------------------GSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLF
            D   S  R K +YR+        +VEC+ C K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLF

Query:  TSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVA
         ++ GG HG+V + +G   K  GIGDV +KT  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL     +L V+
Subjt:  TSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVA

Query:  KGSERQWMAVK
        + +E +W  +K
Subjt:  KGSERQWMAVK

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba] | e value: 3.3e-52 | identity: 36.15%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E +  KLME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-----------------------------------------------GSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF DE                                                S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-----------------------------------------------GSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLF
            D   S  R K +YR+        +VEC+ C K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLF

Query:  TSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL
         ++ GG HG+V + +G   K  GIGDV +KT  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TL
Subjt:  TSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS18269.1 hypothetical protein D5086_0000005180 [Populus alba] | e value: 7.3e-52 | identity: 36.34%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L  +P+D++EEDW  LD + +  +R+ LS  VA  V  E +  +LME+L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-------GSNKESTVG--SALVMTKGKDKVDEDN----------------EPSSSR-----------
        +++ E+ SV  ++N   T+ NQL SV+IEF DE        S   S  G  +A+  + GK K+  D+                E SSSR           
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE-------GSNKESTVG--SALVMTKGKDKVDEDN----------------EPSSSR-----------

Query:  -----------------KKWKYRNEVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF
                          K+  R +VEC+ C K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      +  ++
Subjt:  -----------------KKWKYRNEVECYYCHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSF

Query:  TGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY
         GG HG+V + +G      GIGDV +KT  G    L++VR+VP +K  LIS+G+L D G    F     K+  G+ V+A G +  TLY
Subjt:  TGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A2N9FA13 Uncharacterized protein | e value: 1.1e-53 | identity: 35.54%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M++ +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------
        ++M+E A+V  ++NE  T+ NQL SV+IEF DE           N    +  A+  + GK K+  D                                  
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------

Query:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFKEDQ---------KRKPEANIV--EEVVLACVESD--TKYSNHSSDWILDSAASV
                   N+ S SR K ++    EC++C KKGH++  C+ ++++Q           K    IV  EEVV+  V+        N   +W++DSAA+ 
Subjt:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFKEDQ---------KRKPEANIV--EEVVLACVESD--TKYSNHSSDWILDSAASV

Query:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  NLIS   +   G+    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9FQY2 CCHC-type domain-containing protein | e value: 8.4e-54 | identity: 35.54%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M++ +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------
        ++M+E A+V  ++NE  T+ NQL SV+IEF DE           N    +  A+  + GK K+  D                                  
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------

Query:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASV
                   N+ S SR K ++    EC++C KKGH++  C+ ++      ED K   E         EEVV+  V+        N   +W++DSAA+ 
Subjt:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASV

Query:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  NLIS   +   G+    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9G318 CCHC-type domain-containing protein | e value: 2.9e-54 | identity: 35.78%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M++ +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------
        ++M+E A+V  Y+NE  T+ NQL SV+IEF DE           N    +  A+  + GK K+  D                                  
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------

Query:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFKEDQ---------KRKPEANIV--EEVVLACVESD--TKYSNHSSDWILDSAASV
                   N+ S SR K ++    EC++C KKGH++  C+ ++++Q           K    IV  EEVV+  V+        N   +W++DSAA+ 
Subjt:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFKEDQ---------KRKPEANIV--EEVVLACVESD--TKYSNHSSDWILDSAASV

Query:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  NLIS   +   G+    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9G6J0 CCHC-type domain-containing protein | e value: 8.4e-54 | identity: 35.54%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M++ +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------
        ++M+E A+V  ++NE  T+ NQL SV+IEF DE           N    +  A+  + GK K+  D                                  
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------

Query:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASV
                   N+ S SR K ++    EC++C KKGH++  C+ ++      ED K   E         EEVV+  V+        N   +W++DSAA+ 
Subjt:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASV

Query:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  NLIS   +   G+    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9IGK1 CCHC-type domain-containing protein | e value: 8.4e-54 | identity: 35.54%
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN
        G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P  M++ +W  LD + +  IR+ LS  VA  V  ETT V LM +L+  YEKPSANNKV+L+KK FN
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------
        ++M+E A+V  ++NE  T+ NQL SV+IEF DE           N    +  A+  + GK K+  D                                  
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSDE---------GSNKESTVGSALVMTKGKDKVDED----------------------------------

Query:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASV
                   N+ S SR K ++    EC++C KKGH++  C+ ++      ED K   E         EEVV+  V+        N   +W++DSAA+ 
Subjt:  -----------NEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASV

Query:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK  GIGDV +KT  G  ++L++VR+VP++  NLIS   +   G+    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

SwissProt top hits (columns: hit, e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.5e-39 | identity: 33.66%
Query:  VMKFDGKN-FGYWKMQVKDYLTCKKVHKTL---KERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKF
        V KF+G N F  W+ +++D L  + +HK L    ++P  MK EDW  LDE A + IR+ LS DV + +  E TA  +   L + Y   +  NK+YL K+ 
Subjt:  VMKFDGKN-FGYWKMQVKDYLTCKKVHKTL---KERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKF

Query:  FNMQMSEDASVNSYINEVTTLINQLKS--VKIEFSDEG-----------SNKEST------------VGSALVMTKGKDKVDEDNEP-------------
        + + MSE  +  S++N    LI QL +  VKIE  D+             N  +T            V SAL++ +   K  E+                
Subjt:  FNMQMSEDASVNSYINEVTTLINQLKS--VKIEFSDEG-----------SNKEST------------VGSALVMTKGKDKVDEDNEP-------------

Query:  -------SSSRKKWKYRNEV---ECYYCHKKGHLKYQC---RKFK-EDQKRKPEANIV------EEVVLACVESD--TKYSNHSSDWILDSAASVHIASD
               S +R K K R++     CY C++ GH K  C   RK K E   +K + N        + VVL   E +     S   S+W++D+AAS H    
Subjt:  -------SSSRKKWKYRNEV---ECYYCHKKGHLKYQC---RKFK-EDQKRKPEANIV------EEVVLACVESD--TKYSNHSSDWILDSAASVHIASD

Query:  RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLN
        R LF  +  G  G V+MGN   SK  GIGD+ +KT  G  LVL+DVR+VP+++MNLIS   L  DG+   F +++ +L  GS V+A G  + TLYR    
Subjt:  RSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLN

Query:  VAKG
        + +G
Subjt:  VAKG

P25601 Putative transposon Ty5-1 protein YCL075W | e value: 1.7e-06 | identity: 36.78%
Query:  CVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISI
        C+ S T  +  SS+WI D+  + H+  DRS+F+SFT         G G +    G G V++     G + L DV YVP++ +NLIS+
Subjt:  CVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 3.1e-05 | identity: 31.94%
Query:  ECYYCHKKGHLKYQCRKFK----EDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGI
        +C  C  +GH   +C + +        ++P +        A +   + YS  S++W+LDS A+ HI SD    SL   +TGG    V + +G T      
Subjt:  ECYYCHKKGHLKYQCRKFK----EDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASD---RSLFTSFTGGHHGLVRMGNGRTSKTRGI

Query:  GDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTD-DGFMCEF
        G  SL T+    L L ++ YVPNI  NLIS+ +L + +G   EF
Subjt:  GDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTD-DGFMCEF

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G20980.1 Gag-Pol-related retrotransposon family protein | e value: 7.6e-07 | identity: 31.19%
Query:  NIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKT-----RGIGDVSLKTECGGKLVLRDVRYVPNIKMNLIS
        +++ EV     E  +KY+ H + W++ S  S H+      FT+        V+  +G  S+T      GIGDV+  T  G K  +++V YVP I+ N +S
Subjt:  NIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKT-----RGIGDVSLKTECGGKLVLRDVRYVPNIKMNLIS

Query:  IGKLTDDGF
        + +L  +GF
Subjt:  IGKLTDDGF

AT3G21000.1 Gag-Pol-related retrotransposon family protein | e value: 2.9e-06 | identity: 24.48%
Query:  KYRNEVECYYCHKKGHLKYQCRKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIG
        K ++E  C  C+K  H +  C+      K + E  IV +  L  V +    +     WI+   A +++      FT+        V   +G      G G
Subjt:  KYRNEVECYYCHKKGHLKYQCRKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHHGLVRMGNGRTSKTRGIG

Query:  DVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFG
        DV ++ + G K  +R+V +VP +  N++S GK+    +    G
Subjt:  DVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFG

AT3G29785.1 unknown protein | e value: 3.3e-10 | identity: 36.36%
Query:  KFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKV
        K DG ++ + +M+++DYL  KK+H+ L ++ + M ++DW  L  + +  IR+ +S ++A  VA E +   LM+ L++ Y+KPS NN V
Subjt:  KFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKV
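The % identity values in the hit lists can be checked against the alignment blocks. The sketch below assumes the usual BLAST-style middle line, where a letter marks an identical residue, `+` marks a conservative substitution and a space marks a mismatch, and that identity is the number of identical positions divided by the alignment length. The function name `percent_identity` is hypothetical. It uses the AT3G29785.1 alignment, which has no gaps.

```python
# Query line and middle line of the AT3G29785.1 alignment, copied from the record.
QUERY = ("KFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKEEDWEALDEEAVATIRMCLSMDVAS"
         "LVAHETTAVKLMESLTNRYEKPSANNKV")
MIDLINE = ("K DG ++ + +M+++DYL  KK+H+ L ++ + M ++DW  L  + +  IR+ +S ++A"
           "  VA E +   LM+ L++ Y+KPS NN V")


def percent_identity(query_line, midline):
    """Return (identical positions, percent identity) for one alignment.

    Assumes letters on the middle line mark identical residues and that the
    query line (gaps included) gives the alignment length.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return identical, 100.0 * identical / len(query_line)


identical, pct = percent_identity(QUERY, MIDLINE)
print(identical, round(pct, 2))
```

For this alignment the count is 32 identical positions over 88 aligned residues, which agrees with the 36.36 listed for AT3G29785.1.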


Sequences
CDS sequence
ATGACAAACTGTATTTACAAGCGACTGGAAGAGAATTGGAGAAGGTTCATCGGAATGAAGAAAAAACAGAGTCGGATGAACAGCCCATCTGACGGAAGAGCGAGC
GAGCGGCGGAGCAGGGCGGAGGATGCGGCGGCGTGGTGGGAATTAGCTTTTGGTTCAGGAGTCATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAG
TTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAAGATTACTTAACTTGCAAGAAAGTGCATAAGACATTGAAGGAGAGACCGAAAGATATGAAGGAA
GAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACAATTAGGATGTGTTTGTCGATGGATGTGGCAAGTCTTGTAGCCCATGAGACAACTGCAGTCAAGTTG
ATGGAATCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATGCAAATGTCTGAGGATGCTTCTGTGAAT
TCCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAATTTTCTGATGAGGGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTG
GTTATGACTAAAGGTAAAGATAAAGTTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAATATAGGAATGAGGTAGAATGTTATTACTGCCATAAG
AAAGGTCACTTGAAGTATCAGTGTAGGAAATTTAAAGAGGATCAGAAAAGAAAACCAGAGGCAAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAAAGTGAC
ACAAAGTATAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCAT
GGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTGCGAGATGTCAGG
TACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGACAGATGATGGTTTCATGTGTGAGTTTGGCAGTCGCCAGTGTAAACTCAAGTTCGGATCC
CAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTATACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGATGGCGGTTAAAGCTGCAGATGGT
AGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTTCGATCAAGATCCTTCAGTTCAGAAACAATTGGGAAGTCTAGGAGAGAAGTTT
ATGGCTATCGTGAATCCCCAGTATTTTGAACCTTGGAGTTTACCAAGAAGATCAGAGATAACAGTTGTAGTGGGAGTAGATCCTTGGAGTTTGCCAAGATGTTTA
GAGTTTTGTTGCAGTGGGAGACGAGATCGTTGTTTTGTCTCCAAGTGGGAAATTGTTGGTGTGTGGAGACAAAATCCACAAAACTCCCTCACCTGCCGTGCATGA
mRNA sequence
ATGACAAACTGTATTTACAAGCGACTGGAAGAGAATTGGAGAAGGTTCATCGGAATGAAGAAAAAACAGAGTCGGATGAACAGCCCATCTGACGGAAGAGCGAGC
GAGCGGCGGAGCAGGGCGGAGGATGCGGCGGCGTGGTGGGAATTAGCTTTTGGTTCAGGAGTCATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAG
TTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAAGATTACTTAACTTGCAAGAAAGTGCATAAGACATTGAAGGAGAGACCGAAAGATATGAAGGAA
GAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACAATTAGGATGTGTTTGTCGATGGATGTGGCAAGTCTTGTAGCCCATGAGACAACTGCAGTCAAGTTG
ATGGAATCGCTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATGCAAATGTCTGAGGATGCTTCTGTGAAT
TCCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAATTTTCTGATGAGGGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTG
GTTATGACTAAAGGTAAAGATAAAGTTGATGAAGATAATGAACCGAGTAGCAGTAGGAAAAAGTGGAAATATAGGAATGAGGTAGAATGTTATTACTGCCATAAG
AAAGGTCACTTGAAGTATCAGTGTAGGAAATTTAAAGAGGATCAGAAAAGAAAACCAGAGGCAAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAAAGTGAC
ACAAAGTATAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCAT
GGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTGCGAGATGTCAGG
TACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGACAGATGATGGTTTCATGTGTGAGTTTGGCAGTCGCCAGTGTAAACTCAAGTTCGGATCC
CAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTATACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGATGGCGGTTAAAGCTGCAGATGGT
AGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTTCGATCAAGATCCTTCAGTTCAGAAACAATTGGGAAGTCTAGGAGAGAAGTTT
ATGGCTATCGTGAATCCCCAGTATTTTGAACCTTGGAGTTTACCAAGAAGATCAGAGATAACAGTTGTAGTGGGAGTAGATCCTTGGAGTTTGCCAAGATGTTTA
GAGTTTTGTTGCAGTGGGAGACGAGATCGTTGTTTTGTCTCCAAGTGGGAAATTGTTGGTGTGTGGAGACAAAATCCACAAAACTCCCTCACCTGCCGTGCATGA
Protein sequence
MTNCIYKRLEENWRRFIGMKKKQSRMNSPSDGRASERRSRAEDAAAWWELAFGSGVMGFVEPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKTLKERPKDMKE
EDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSDEGSNKESTVGSAL
VMTKGKDKVDEDNEPSSSRKKWKYRNEVECYYCHKKGHLKYQCRKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASVHIASDRSLFTSFTGGHH
GLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLTDDGFMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLNVAKGSERQWMAVKAADG
SCRGTVEPAARIANFDQFDQDPSVQKQLGSLGEKFMAIVNPQYFEPWSLPRRSEITVVVGVDPWSLPRCLEFCCSGRRDRCFVSKWEIVGVWRQNPQNSLTCRA
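
The protein sequence can be cross-checked against the CDS sequence by translating codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as examples.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last six codons of the CDS listed in this record.
CDS_START = "ATGACAAACTGTATTTACAAGCGA"
CDS_END = "CTCACCTGCCGTGCATGA"

print(translate(CDS_START))  # MTNCIYKR
print(translate(CDS_END))    # LTCRA
```

Both fragments match the start (MTNCIYKR) and end (LTCRA) of the protein sequence shown above, with TGA as the terminating codon.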