; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017653 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017653
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:6413602..6417492
RNA-Seq ExpressionLag0017653
SyntenyLag0017653
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TKR74765.1 hypothetical protein D5086_0000292320 [Populus alba]7.9e-6940.77Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L  +P+ M++E+W+ LD + +  IR+ LS  VA  V  E    KLME+L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD
        ++M E+ASV  ++N   T+ NQL SV IEF +E+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF
            D   S  R K +YR+        +VEC+   K GH    C K K  +     A    V++ ++  V S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF

Query:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL
         ++ GG  G+V + +G        GDV +KT  G    L++VR+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba]6.1e-6940.51Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E    KLME+L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF +E+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF
            D   S  R K +YR+        +VEC+   K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF

Query:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL
         ++ GG  G+V + +G       IGDV +KT  G    L+++R+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba]3.2e-7040.15Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E    KLME+L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF +E+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF
            D   S  R K +YR+        +VEC+   K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF

Query:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVA
         ++ GG  G+V + +G   K   IGDV +KT  G    L++VR+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL     +L V+
Subjt:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVA

Query:  KGSERQWMPVK
        + +E +W  +K
Subjt:  KGSERQWMPVK

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba]1.2e-6941.03Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E    KLME+L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF +E+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF
            D   S  R K +YR+        +VEC+   K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF

Query:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL
         ++ GG  G+V + +G   K   IGDV +KT  G    L++VR+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL

TKS13843.1 hypothetical protein D5086_0000049350 [Populus alba]7.9e-6940.51Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKM+++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E    KLME+L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF +E+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF
            D   S  R K +YR+        +VEC+   K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF

Query:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL
         ++ GG  G++ + +G       IGDV +KT  G    L++VR+VP +K  LIS+G+L D GY   F     K+  G+ V+A G +  TL
Subjt:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL

TrEMBL top hitse value%identityAlignment
A0A2N9FQY2 CCHC-type domain-containing protein7.7e-7039.46Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L ++P  M+D +W  LD + +  IR+ LS  VA  V  ET  V LM +L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV-------
        ++M+E A+V  ++NE  T+ NQL SV+IEF +E+  + +L SLP+SWE M+ AVSNS G   LK+ ++ DL+++EE+RR+ +  ++  G A V       
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV-------

Query:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASI
          K ++     N+ S SR K ++    EC++  KKGH++  C+ ++      ED K   E         EEVV+  V+        N   +W++DSAA+ 
Subjt:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASI

Query:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK   IGDV +KT  G  ++L++VR+VP++  NLIS   +   GY    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9G318 CCHC-type domain-containing protein2.7e-7039.71Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L ++P  M+D +W  LD + +  IR+ LS  VA  V  ET  V LM +L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV-------
        ++M+E A+V  Y+NE  T+ NQL SV+IEF +E+  + +L SLP+SWE M+ AVSNS G   LK+ ++ DL+++EE+RR+ +  ++  G A V       
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV-------

Query:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFKEDQ---------KRKPEANIV--EEVVLACVESD--TKYSNHSSDWILDSAASI
          K ++     N+ S SR K ++    EC++  KKGH++  C+ ++++Q           K    IV  EEVV+  V+        N   +W++DSAA+ 
Subjt:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFKEDQ---------KRKPEANIV--EEVVLACVESD--TKYSNHSSDWILDSAASI

Query:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK   IGDV +KT  G  ++L++VR+VP++  NLIS   +   GY    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A2N9G6J0 CCHC-type domain-containing protein7.7e-7039.46Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L ++P  M+D +W  LD + +  IR+ LS  VA  V  ET  V LM +L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV-------
        ++M+E A+V  ++NE  T+ NQL SV+IEF +E+  + +L SLP+SWE M+ AVSNS G   LK+ ++ DL+++EE+RR+ +  ++  G A V       
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV-------

Query:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASI
          K ++     N+ S SR K ++    EC++  KKGH++  C+ ++      ED K   E         EEVV+  V+        N   +W++DSAA+ 
Subjt:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFK------EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSAASI

Query:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY
        H+   + LFT++  G  G V+MGN   SK   IGDV +KT  G  ++L++VR+VP++  NLIS   +   GY    G+ + KL  G  VVA G     LY
Subjt:  HIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLY

Query:  RCQLNVAK
        + ++   K
Subjt:  RCQLNVAK

A0A4U5PY83 CCHC-type domain-containing protein1.6e-7040.15Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E    KLME+L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF +E+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF
            D   S  R K +YR+        +VEC+   K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF

Query:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVA
         ++ GG  G+V + +G   K   IGDV +KT  G    L++VR+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL     +L V+
Subjt:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRC--QLNVA

Query:  KGSERQWMPVK
        + +E +W  +K
Subjt:  KGSERQWMPVK

A0A4U5QGR0 Uncharacterized protein5.9e-7041.03Show/hide
Query:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN
        G+ KFDG +F YWKMQ++DYL  KK+H   L  +P+ M+ E+W+ LD + +  IR+ LS  VA  V  E    KLME+L+  YEKPSANNK +L+KK FN
Subjt:  GVMKFDGKNFRYWKMQVKDYLTCKKVH-KALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFN

Query:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD
        ++M E+ SV  ++N   T+ NQL SV+IEF +E+  + LL SLP SWE M+TAVSNS G + LK+ ++ DL++AEE+RR+ S + S+ GSAL + T+G+ 
Subjt:  MQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSAL-VMTKGKD

Query:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF
            D   S  R K +YR+        +VEC+   K GH    C K K+ +     A    V++ ++  V+S         +WILDS AS H      + 
Subjt:  KVDEDNEPSSSRKKWKYRN--------EVECYYIHKKGHLKYQCRKFKEDQKRKPEANI--VEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLF

Query:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL
         ++ GG  G+V + +G   K   IGDV +KT  G    L++VR+VP +K  LIS+G+L D G+   F     K+  G+ V+A G +  TL
Subjt:  TSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTL

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-4634.3Show/hide
Query:  VMKFDGKN-FRYWKMQVKDYLTCKKVHKAL---KERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKF
        V KF+G N F  W+ +++D L  + +HK L    ++P  MK EDW  LDE A + IR+ LS DV + +  E  A  +   L + Y   +  NK YL K+ 
Subjt:  VMKFDGKN-FRYWKMQVKDYLTCKKVHKAL---KERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKF

Query:  FNMQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LVIAEEIRRQGSNKESTVGSALVMTKG
        + + MSE  +  S++N    LI QL ++ ++   E   I LL SLP S++ + T + +  G  T++  +V   L++ E++R++  N+    G AL+ T+G
Subjt:  FNMQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCD-LVIAEEIRRQGSNKESTVGSALVMTKG

Query:  KDKVDEDNE----------PSSSRKKWKYRNEVECYYIHKKGHLKYQC---RKFK-EDQKRKPEANIV------EEVVLACVESD--TKYSNHSSDWILD
        + +  + +            S +R K + RN   CY  ++ GH K  C   RK K E   +K + N        + VVL   E +     S   S+W++D
Subjt:  KDKVDEDNE----------PSSSRKKWKYRNEVECYYIHKKGHLKYQC---RKFK-EDQKRKPEANIV------EEVVLACVESD--TKYSNHSSDWILD

Query:  SAASIHIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHR
        +AAS H    R LF  +  G  G V+MGN   SK   IGD+ +KT  G  LVL+DVR+VP+++MNLIS   L  DGY   F +++ +L  GS V+A G  
Subjt:  SAASIHIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHR

Query:  KSTLYRCQLNVAKG
        + TLYR    + +G
Subjt:  KSTLYRCQLNVAKG

P25601 Putative transposon Ty5-1 protein YCL075W4.5e-0636.78Show/hide
Query:  CVESDTKYSNHSSDWILDSAASIHIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISI
        C+ S T  +  SS+WI D+  + H+  DRS+F+SFT   R     G G +      G V++     G + L DV YVP++ +NLIS+
Subjt:  CVESDTKYSNHSSDWILDSAASIHIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.4e-0622.48Show/hide
Query:  WEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTS
        W+  D+   + +   +SM V   V+  T A ++ E+L   Y  PS  +   L +           +++ Y+  + T  +QL  +     ++  V ++L +
Subjt:  WEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTS

Query:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSRKKWKYRNE-----------
        LP+ ++ +             T +     N+  K   V    +        S++ +T  +        ++ D  N  ++S K W+  +            
Subjt:  LPDSWETM------------KTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSRKKWKYRNE-----------

Query:  --VECYYIHKKGHLKYQCRKFK----EDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASIHIASD---RSLFTSFTGGHRGLVRMGNGRTSKT
           +C     +GH   +C + +        ++P +        A +   + YS  S++W+LDS A+ HI SD    SL   +TGG    V + +G T   
Subjt:  --VECYYIHKKGHLKYQCRKFK----EDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASIHIASD---RSLFTSFTGGHRGLVRMGNGRTSKT

Query:  REIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLAD-DGYMCEF
           G  SL T+    L L ++ YVPNI  NLIS+ +L + +G   EF
Subjt:  REIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLAD-DGYMCEF

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein7.8e-0629.36Show/hide
Query:  NIVEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLFTSFTGGHRGLVRMGNGRTSKT-----REIGDVSLKTECGGKLVLRDVRYVPNIKMNLIS
        +++ EV     E  +KY+ H + W++ S  S H+      FT+     +  V+  +G  S+T       IGDV+  T  G K  +++V YVP I+ N +S
Subjt:  NIVEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLFTSFTGGHRGLVRMGNGRTSKT-----REIGDVSLKTECGGKLVLRDVRYVPNIKMNLIS

Query:  IGKLADDGY
        + +L  +G+
Subjt:  IGKLADDGY

AT3G21000.1 Gag-Pol-related retrotransposon family protein1.0e-0520.45Show/hide
Query:  LVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV
        L K+  +++M +  S +SY+++   ++ +L   K+E S+      + T+L  S++ + + +      + +    + +       R   S+ E  +   L 
Subjt:  LVKKFFNMQMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALV

Query:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLFTSFT
                 +D    S  +KW       C   +K  H +  C+      K + E  IV +  L  V +    +     WI+   A I++      FT+  
Subjt:  MTKGKDKVDEDNEPSSSRKKWKYRNEVECYYIHKKGHLKYQCRKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLFTSFT

Query:  GGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFG
           +  V   +G        GDV ++ + G K  +R+V +VP +  N++S GK+    Y    G
Subjt:  GGHRGLVRMGNGRTSKTREIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFG

AT3G25270.1 Ribonuclease H-like superfamily protein9.5e-0424.85Show/hide
Query:  FELVVIFWWSLWNLRNNL-------SWGG--QSDGRDL--WTYSSDYLSAFH--VGGGRFLAGDCLRTQTSDQEKRRVWRPPPVSELKLNIDASVRPDTR
        F L +   W LW  RN L       SW    Q    D+  W  ++ Y+ + +  V   R       RT+         W+ PP + +K N D +    TR
Subjt:  FELVVIFWWSLWNLRNNL-------SWGG--QSDGRDL--WTYSSDYLSAFH--VGGGRFLAGDCLRTQTSDQEKRRVWRPPPVSELKLNIDASVRPDTR

Query:  EAGGGCVLRGADGEVFMVVCLSLQRCWSVDL-AEAWAVYKGVKLARQLGFAEFVVETDSLRLVKILHGE
         A  G ++R  +G V+M    ++    S  L +E  A+   ++ A   G+ + + E DS ++ ++++ E
Subjt:  EAGGGCVLRGADGEVFMVVCLSLQRCWSVDL-AEAWAVYKGVKLARQLGFAEFVVETDSLRLVKILHGE

AT3G29785.1 unknown protein3.4e-0936.05Show/hide
Query:  KFDGKNFRYWKMQVKDYLTCKKVHKALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANN
        K DG ++ + +M+++DYL  KK+H+ L ++ + M  +DW  L  + +  IR+ +S ++A  VA E     LM+ L++ Y+KPS NN
Subjt:  KFDGKNFRYWKMQVKDYLTCKKVHKALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANN

AT4G29090.1 Ribonuclease H-like superfamily protein7.3e-1223.96Show/hide
Query:  SGY-RLAHILATQDCPGPSNSERMRVWWSALWKLNVPNKHRFFLWRLFHDRLPTKRTVSICSGTALWLRVCGWAPNLLSFTNPFLISCSRKSL-------
        SGY  L  I+  +  P   +   +   +  +WK     K + FLW+   + LP    ++    +      C   P+     N  L  C+   L       
Subjt:  SGY-RLAHILATQDCPGPSNSERMRVWWSALWKLNVPNKHRFFLWRLFHDRLPTKRTVSICSGTALWLRVCGWAPNLLSFTNPFLISCSRKSL-------

Query:  -----GVMKDKL------------AGPDFE----LVVIFWWSLWNLRNNLSWGGQS-DGRDLWTYSSDYLSAFHVGGGRFLAGDCLRTQTSDQEKRRVWR
             G   D +              P +E    LV    W LW  RN L + G+  + +++   + D L  + +   R  A  C      ++     WR
Subjt:  -----GVMKDKL------------AGPDFE----LVVIFWWSLWNLRNNLSWGGQS-DGRDLWTYSSDYLSAFHVGGGRFLAGDCLRTQTSDQEKRRVWR

Query:  PPPVSELKLNIDASVRPDTREAGGGCVLRGADGEVFMVVCLSLQRCWSVDLAEAWAVYKGVKLARQLGFAEFVVETDSLRLVKILHGE
        PPP   +K N DA+   D    G G VLR   GEV  +   +L +  SV  AE  A+   V    +  +   + E+DS  L++IL+ +
Subjt:  PPPVSELKLNIDASVRPDTREAGGGCVLRGADGEVFMVVCLSLQRCWSVDLAEAWAVYKGVKLARQLGFAEFVVETDSLRLVKILHGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGTAGAGCCAAAAATTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTAGATATTGGAAGATGCAAGTCAAAGATTACTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAGACCGAAAGATATGAAGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCTATGGATGTGGCAAGTC
TAGTAGCCCATGAGACAATTGCAGTCAAGTTGATGGAATCACTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGTTCTACCTAGTTAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTCTGTGAATTCCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAATTTTCTAATGAGGTGAATGTTATTCA
GTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGTCATAGCTG
AGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTGGTTATGACTAAGGGTAAAGATAAAGTTGATGAAGATAATGAACCGAGTAGCAGTAGG
AAAAAGTGGAAATATAGGAATGAGGTAGAATGTTATTACATCCATAAGAAAGGTCACTTGAAGTATCAATGTAGGAAATTTAAAGAGGATCAGAAAAGAAAACCAGAGGC
AAATATAGTGGAAGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTATTCACATAGCTTCAG
ATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCGTGGCCTTGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGAGATTGGAGATGTTAGTCTGAAGACAGAA
TGTGGAGGTAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAATTGGCAGATGATGGTTACATGTGTGAGTTTGGCAG
TCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGA
TGCCGGTTAAAGCTGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTCCGATCAAGATCCTTCAGTTCAGAAACAATTGGGA
AGTCCAGGAGAGAAAGTTCATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCACTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGGCA
GTTGCTAAGGTCAAAGGTCCACGTAGCTAGCCCATACCTCGGGGCGAGCACAGATGAGCTACAGTTATATTCTGATAGTTGCCTATTTTTGAGTGGGTATCGACTTGCTC
ATATACTGGCTACCCAGGATTGTCCTGGCCCCTCAAACTCCGAGAGAATGCGTGTGTGGTGGTCCGCCCTTTGGAAGCTGAATGTGCCCAATAAACATAGGTTCTTCCTC
TGGCGACTGTTCCATGACCGTCTGCCAACTAAGAGGACTGTCTCCATCTGTTCTGGAACTGCCCTGTGGTTAAGAGTATGTGGTTGGGCTCCAAATTTGCTCTCCTTCAC
CAATCCTTTTCTCATCTCATGTTCGAGGAAATCATTGGGGGTGATGAAGGACAAACTTGCAGGGCCAGATTTTGAGCTGGTGGTCATTTTTTGGTGGTCCCTGTGGAATC
TTCGAAACAACCTGAGTTGGGGTGGTCAGTCAGACGGTCGAGATCTCTGGACTTATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGGGGCGTTTCCTAGCAGGG
GACTGCTTACGAACCCAAACGAGTGACCAGGAGAAACGTCGTGTATGGAGGCCGCCCCCTGTTAGTGAGCTGAAGCTTAATATTGATGCTTCGGTCAGGCCTGATACAAG
GGAAGCTGGGGGTGGCTGTGTGCTGCGTGGGGCTGATGGGGAGGTCTTTATGGTGGTCTGTTTGAGCTTACAGAGGTGTTGGAGTGTGGATTTGGCTGAGGCTTGGGCTG
TGTATAAAGGGGTCAAACTTGCTCGACAGCTGGGGTTTGCAGAATTTGTGGTGGAGACTGATTCTCTGAGGTTGGTCAAAATCCTGCATGGTGAACTGCACGATGTGTCG
GAAGTGGGGCTACTGATGGATGACATCCGAGGAATCCTCCGTCCTTGGGACAACGGTAAGTGCGATAAAGGTGTTCCATCTAAGCATTTTCAAGTTCTCCCATCCACTGT
GCCTAATCAGGGAAGAGGTCATGCGTCTGTGCACTGGATTTGTCCTTTGAGCTTGCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTTGTAGAGCCAAAAATTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTAGATATTGGAAGATGCAAGTCAAAGATTACTTAACTTGCAAGAAAGTGCA
TAAGGCATTGAAGGAGAGACCGAAAGATATGAAGGACGAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCTATGGATGTGGCAAGTC
TAGTAGCCCATGAGACAATTGCAGTCAAGTTGATGGAATCACTTACAAACAGGTATGAAAAACCCTCTGCAAATAATAAGTTCTACCTAGTTAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTCTGTGAATTCCTATATTAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAATTTTCTAATGAGGTGAATGTTATTCA
GTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGTCATAGCTG
AGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTGGTTATGACTAAGGGTAAAGATAAAGTTGATGAAGATAATGAACCGAGTAGCAGTAGG
AAAAAGTGGAAATATAGGAATGAGGTAGAATGTTATTACATCCATAAGAAAGGTCACTTGAAGTATCAATGTAGGAAATTTAAAGAGGATCAGAAAAGAAAACCAGAGGC
AAATATAGTGGAAGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGATATTAGACAGTGCAGCTTCTATTCACATAGCTTCAG
ATAGGAGTTTGTTCACATCATTCACAGGAGGGCATCGTGGCCTTGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGAGATTGGAGATGTTAGTCTGAAGACAGAA
TGTGGAGGTAAATTGGTACTGCGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAATTGGCAGATGATGGTTACATGTGTGAGTTTGGCAG
TCGCCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAGAGAGACAGTGGA
TGCCGGTTAAAGCTGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTCCGATCAAGATCCTTCAGTTCAGAAACAATTGGGA
AGTCCAGGAGAGAAAGTTCATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCACTTAGGCGAGTTGAGGCATCAAAGTGGAAGGCCAGGCA
GTTGCTAAGGTCAAAGGTCCACGTAGCTAGCCCATACCTCGGGGCGAGCACAGATGAGCTACAGTTATATTCTGATAGTTGCCTATTTTTGAGTGGGTATCGACTTGCTC
ATATACTGGCTACCCAGGATTGTCCTGGCCCCTCAAACTCCGAGAGAATGCGTGTGTGGTGGTCCGCCCTTTGGAAGCTGAATGTGCCCAATAAACATAGGTTCTTCCTC
TGGCGACTGTTCCATGACCGTCTGCCAACTAAGAGGACTGTCTCCATCTGTTCTGGAACTGCCCTGTGGTTAAGAGTATGTGGTTGGGCTCCAAATTTGCTCTCCTTCAC
CAATCCTTTTCTCATCTCATGTTCGAGGAAATCATTGGGGGTGATGAAGGACAAACTTGCAGGGCCAGATTTTGAGCTGGTGGTCATTTTTTGGTGGTCCCTGTGGAATC
TTCGAAACAACCTGAGTTGGGGTGGTCAGTCAGACGGTCGAGATCTCTGGACTTATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGGGGCGTTTCCTAGCAGGG
GACTGCTTACGAACCCAAACGAGTGACCAGGAGAAACGTCGTGTATGGAGGCCGCCCCCTGTTAGTGAGCTGAAGCTTAATATTGATGCTTCGGTCAGGCCTGATACAAG
GGAAGCTGGGGGTGGCTGTGTGCTGCGTGGGGCTGATGGGGAGGTCTTTATGGTGGTCTGTTTGAGCTTACAGAGGTGTTGGAGTGTGGATTTGGCTGAGGCTTGGGCTG
TGTATAAAGGGGTCAAACTTGCTCGACAGCTGGGGTTTGCAGAATTTGTGGTGGAGACTGATTCTCTGAGGTTGGTCAAAATCCTGCATGGTGAACTGCACGATGTGTCG
GAAGTGGGGCTACTGATGGATGACATCCGAGGAATCCTCCGTCCTTGGGACAACGGTAAGTGCGATAAAGGTGTTCCATCTAAGCATTTTCAAGTTCTCCCATCCACTGT
GCCTAATCAGGGAAGAGGTCATGCGTCTGTGCACTGGATTTGTCCTTTGAGCTTGCCTTGA
Protein sequenceShow/hide protein sequence
MGFVEPKIFDGVMKFDGKNFRYWKMQVKDYLTCKKVHKALKERPKDMKDEDWEALDEEAVATIRMCLSMDVASLVAHETIAVKLMESLTNRYEKPSANNKFYLVKKFFNM
QMSEDASVNSYINEVTTLINQLKSVKIEFSNEVNVIQLLTSLPDSWETMKTAVSNSTGNNTLKFSEVCDLVIAEEIRRQGSNKESTVGSALVMTKGKDKVDEDNEPSSSR
KKWKYRNEVECYYIHKKGHLKYQCRKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSAASIHIASDRSLFTSFTGGHRGLVRMGNGRTSKTREIGDVSLKTE
CGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHRKSTLYRCQLNVAKGSERQWMPVKAADGSCRGTVEPAARIANFDQSDQDPSVQKQLG
SPGEKVHGYRESPVVRRSNELKKSLRRVEASKWKARQLLRSKVHVASPYLGASTDELQLYSDSCLFLSGYRLAHILATQDCPGPSNSERMRVWWSALWKLNVPNKHRFFL
WRLFHDRLPTKRTVSICSGTALWLRVCGWAPNLLSFTNPFLISCSRKSLGVMKDKLAGPDFELVVIFWWSLWNLRNNLSWGGQSDGRDLWTYSSDYLSAFHVGGGRFLAG
DCLRTQTSDQEKRRVWRPPPVSELKLNIDASVRPDTREAGGGCVLRGADGEVFMVVCLSLQRCWSVDLAEAWAVYKGVKLARQLGFAEFVVETDSLRLVKILHGELHDVS
EVGLLMDDIRGILRPWDNGKCDKGVPSKHFQVLPSTVPNQGRGHASVHWICPLSLP