CuGenDBv2

Lag0005943 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005943
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:34145300..34146091
RNA-Seq Expression: Lag0005943
Synteny: Lag0005943
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003824 - catalytic activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily
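
The genome location field can be read programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end = parse_location("chr6:34145300..34146091")
print(chrom, span_length(start, end))  # prints "chr6 792"
```

Under the inclusive-coordinate assumption, the gene spans 792 bp on chr6.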


Homology
GenBank top hits (e value, % identity, alignment)
TKR74765.1 hypothetical protein D5086_0000292320 [Populus alba] (e value: 3.2e-48; identity: 48.39%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M +E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ASV  + N   T+ NQL SV IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K  +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

TKR89927.1 hypothetical protein D5086_0000238200 [Populus alba] (e value: 1.2e-47; identity: 47.98%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M  E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ SV  + N   T+ NQL SV+IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K+ +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

TKR90717.1 hypothetical protein D5086_0000230290 [Populus alba] (e value: 1.9e-48; identity: 48.39%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M +E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ASV  + N   T+ NQL SV IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K  +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

TKS02608.1 hypothetical protein D5086_0000161380 [Populus alba] (e value: 1.2e-47; identity: 47.98%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M  E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ SV  + N   T+ NQL SV+IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K+ +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

TKS09800.1 hypothetical protein D5086_0000089010 [Populus alba] (e value: 1.2e-47; identity: 47.98%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M  E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ SV  + N   T+ NQL SV+IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K+ +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

TrEMBL top hits (e value, % identity, alignment)
A0A4U5P1P0 CCHC-type domain-containing protein (e value: 5.9e-48; identity: 47.98%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M  E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ SV  + N   T+ NQL SV+IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K+ +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

A0A4U5P397 CCHC-type domain-containing protein (e value: 9.1e-49; identity: 48.39%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M +E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ASV  + N   T+ NQL SV IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K  +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

A0A4U5PY83 CCHC-type domain-containing protein (e value: 5.9e-48; identity: 47.98%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M  E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ SV  + N   T+ NQL SV+IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K+ +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

A0A4U5QGR0 Uncharacterized protein (e value: 5.9e-48; identity: 47.98%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M  E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ SV  + N   T+ NQL SV+IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K+ +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

A0A4V6XW18 CCHC-type domain-containing protein (e value: 1.6e-48; identity: 48.39%)
Query:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN
        G+ KFDG  F YWKMQ++DYL  KK+H   L  K + M +E+W+ LD + +  IR+ LS  VA  VT E +  KLMEAL+  YEK SANNKV+L+KK FN
Subjt:  GVMKFDGKKFRYWKMQVKDYLTCKKVH-KALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFN

Query:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-
        ++M E+ASV  + N   T+ NQL SV IEF DE+ A+ LL SL  SWE M+T VSNS+G + LK+ ++ DL +AEE+RR+ S + S+ GSAL I T+G+ 
Subjt:  MQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSAL-IMTKGK-

Query:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ
         D+      S S   S++K+ +R +VEC+ C K GHF   C K K  +
Subjt:  -DKVDEENESSS---SRNKWKNRIEVECFYCHKKGHFKSQCRKFKEDQ

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.0e-17; identity: 31.56%)
Query:  VMKFDGKK-FRYWKMQVKDYLTCKKVHKAL---KEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKF
        V KF+G   F  W+ +++D L  + +HK L    +K   M  EDW  LDE+  + IR+ LS +V + +  E TA  +   L + Y   +  NK+YL K+ 
Subjt:  VMKFDGKK-FRYWKMQVKDYLTCKKVHKAL---KEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKF

Query:  FNMQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCD-LAIAEEIRRQSSNKESTVGSALIMTKG
        + + MSE  +   + N    LI QL ++ ++  +E  AI LL SL  S++ + TT+ +  G  T++  +V   L + E++R++  N+    G ALI T+G
Subjt:  FNMQMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCD-LAIAEEIRRQSSNKESTVGSALIMTKG

Query:  KDKVDEENES----SSSRNKWKNRIEV---ECFYCHKKGHFKSQC---RKFK-EDQKRRQEEN
        + +  + + +    S +R K KNR +     C+ C++ GHFK  C   RK K E   ++ ++N
Subjt:  KDKVDEENES----SSSRNKWKNRIEV---ECFYCHKKGHFKSQC---RKFK-EDQKRRQEEN

Arabidopsis top hits (e value, % identity, alignment)
AT3G29785.1 unknown protein (e value: 6.3e-10; identity: 36.36%)
Query:  KFDGKKFRYWKMQVKDYLTCKKVHKALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKV
        K DG  + + +M+++DYL  KK+H+ L +K++ M+ +DW  L  + +  IR+ +S N+A  V  E +   LM+ L++ Y+K S NN V
Subjt:  KFDGKKFRYWKMQVKDYLTCKKVHKALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKV


Sequences
CDS sequence
ATGGGATTTGTAGAGCCAAAAAGTTTTGATGGAGTCATGAAGTTCGATGGGAAAAAATTTAGATATTGGAAGATGCAAGTCAAAGATTATTTAACTTGCAAGAAAGTGCA
TAAAGCATTGAAGGAGAAACTGAAAGGGATGACTGACGAAGATTGGGAAGCTCTGGATGAAAAGACAGTTGCAACCATAAGGATGTGTTTGTCAATGAATGTGGCAAGTC
TAGTGACCCATGAGACAACTGCTGTTAAATTGATGGAAGCGCTTACAAACAGGTATGAAAAATCCTCTGCAAATAATAAGGTTTACCTAGTCAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTCTGTGAATTTCTATAATAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAGTTTACTGATGAGGTGAATGCTATTCA
GTTGTTAACGTCTTTATCTGATAGTTGGGAAACGATGAAGACAACAGTGTCTAATTCGTCTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTG
AGGAAATTCGTAGACAGAGTAGTAATAAAGAGTCTACAGTAGGGTCAGCTTTAATTATGACTAAGGGTAAAGATAAGGTTGATGAAGAAAATGAATCAAGTAGCAGTAGG
AACAAGTGGAAAAATAGGATTGAGGTAGAATGTTTTTACTGCCATAAGAAAGGTCACTTCAAGAGTCAGTGTAGGAAATTCAAAGAGGATCAGAAAAGAAGACAAGAGGA
AAATATAGTGCAGAAGTCTTAG
mRNA sequence
ATGGGATTTGTAGAGCCAAAAAGTTTTGATGGAGTCATGAAGTTCGATGGGAAAAAATTTAGATATTGGAAGATGCAAGTCAAAGATTATTTAACTTGCAAGAAAGTGCA
TAAAGCATTGAAGGAGAAACTGAAAGGGATGACTGACGAAGATTGGGAAGCTCTGGATGAAAAGACAGTTGCAACCATAAGGATGTGTTTGTCAATGAATGTGGCAAGTC
TAGTGACCCATGAGACAACTGCTGTTAAATTGATGGAAGCGCTTACAAACAGGTATGAAAAATCCTCTGCAAATAATAAGGTTTACCTAGTCAAGAAGTTTTTCAACATG
CAAATGTCTGAGGATGCTTCTGTGAATTTCTATAATAATGAGGTTACCACTTTGATTAATCAGTTAAAATCTGTTAAGATAGAGTTTACTGATGAGGTGAATGCTATTCA
GTTGTTAACGTCTTTATCTGATAGTTGGGAAACGATGAAGACAACAGTGTCTAATTCGTCTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCTG
AGGAAATTCGTAGACAGAGTAGTAATAAAGAGTCTACAGTAGGGTCAGCTTTAATTATGACTAAGGGTAAAGATAAGGTTGATGAAGAAAATGAATCAAGTAGCAGTAGG
AACAAGTGGAAAAATAGGATTGAGGTAGAATGTTTTTACTGCCATAAGAAAGGTCACTTCAAGAGTCAGTGTAGGAAATTCAAAGAGGATCAGAAAAGAAGACAAGAGGA
AAATATAGTGCAGAAGTCTTAG
Protein sequence
MGFVEPKSFDGVMKFDGKKFRYWKMQVKDYLTCKKVHKALKEKLKGMTDEDWEALDEKTVATIRMCLSMNVASLVTHETTAVKLMEALTNRYEKSSANNKVYLVKKFFNM
QMSEDASVNFYNNEVTTLINQLKSVKIEFTDEVNAIQLLTSLSDSWETMKTTVSNSSGNNTLKFSEVCDLAIAEEIRRQSSNKESTVGSALIMTKGKDKVDEENESSSSR
NKWKNRIEVECFYCHKKGHFKSQCRKFKEDQKRRQEENIVQKS
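
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard nuclear genetic code, and `translate` is a hypothetical helper name. The example input is the first 33 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence shown above.
print(translate("ATGGGATTTGTAGAGCCAAAAAGTTTTGATGGA"))  # prints "MGFVEPKSFDG"
```

The printed peptide matches the start of the protein sequence listed above (MGFVEPKSFDG...). Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.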