; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013022 (gene) of Snake gourd v1 genome

Gene IDTan0013022
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG10:10988005..10993541
RNA-Seq ExpressionTan0013022
SyntenyTan0013022
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis]2.8e-3634.27Show/hide
Query:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF
        + I ++   +  R  E  L+ K+ ++R V   V+ES  AKIW +   +         FI   +T ADK +V  G PW FD  L +     G   +SE++F
Subjt:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF

Query:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE
         +ASFW+ FHNLP +G++++    LG  +G  E V+++++    G+SLRV++ LD++KPL R     L  +  + W+P+ YEK+P  CF CG+I HG   
Subjt:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE

Query:  CKESHLAQGGDLPFKSWL-------------PEPSTWKGSSQNRGLEE
        C+   + +   + F SWL              E ST KGSS   G+E+
Subjt:  CKESHLAQGGDLPFKSWL-------------PEPSTWKGSSQNRGLEE

TXG57113.1 hypothetical protein EZV62_018426 [Acer yangbiense]4.0e-3534.35Show/hide
Query:  RGIVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISE
        R I+E+ ++   D     +  L+ KVL+ + V     + ++ +IWN    +  +  G  +F++      D+ +V   GPW F  +L++ E+  G   IS+
Subjt:  RGIVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISE

Query:  LEFSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHG
        L F+ A FW+  H++P + ++R+ A  L  ++G    + LE   C  GK +RV+VR+DI KPL+R +R KLG   E     + YE+LP  C+ CGKIGHG
Subjt:  LEFSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHG

Query:  IQECKESHLAQ----GGDLPFKSWLPEPST
        I+EC +    Q    G    F SWL  P T
Subjt:  IQECKESHLAQ----GGDLPFKSWLPEPST

XP_042964753.1 uncharacterized protein LOC122298977 [Carya illinoinensis]2.5e-3733.05Show/hide
Query:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF
        + I ++A+ +  +  +  L+ K+L++R +  +V+ES + KIW +  P +    G+  ++   +T ADK ++++G PW FD  L +     GI  I +L+F
Subjt:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF

Query:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE
          A FW+ FHNLP  G++R+V + LG  +G  + VD++D+    G SLRV++ +D++KPL R     L  +  + W+P+ YEKLP  CF CG+I H   +
Subjt:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE

Query:  CKESHLAQGGDLPFKSWLPEPST----WKGSSQ
        C+ +         + +WL   S+    W  SS+
Subjt:  CKESHLAQGGDLPFKSWLPEPST----WKGSSQ

XP_042979872.1 uncharacterized protein LOC122310049 [Carya illinoinensis]4.8e-3634.41Show/hide
Query:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF
        + I ++   +  R  E  L+ K+ ++R V   V+ES  AKIW +   +         FI   +T ADK +V  G PW FD  L +     G   +SE++F
Subjt:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF

Query:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE
         +ASFW+ FHNLP +G++++    LG  +G  E V+++++    G+SLRV++ LD++KPL R     L  +  + W+P+ YEK+P  CF CG+I HG   
Subjt:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE

Query:  CKESHLAQGGDLPFKSWL-------------PEPSTWKGSSQNRGLE
        C+   + +   + F SWL              E ST KGSS   G+E
Subjt:  CKESHLAQGGDLPFKSWL-------------PEPSTWKGSSQNRGLE

XP_042979975.1 uncharacterized protein LOC122310162 [Carya illinoinensis]6.2e-3630.46Show/hide
Query:  RRKKRGIVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIH
        R +++ ++EI D+   D     +  L+ K+ +NR +   V+ES +AKIW I     F       F       ADK KV  G PW+FD  LV+ +E  G  
Subjt:  RRKKRGIVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIH

Query:  KISELEFSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGK
         + ++ F+  SFW+ FHNLP   ++      +G  VG  E VD++ +    GK LRV++ +D+ +PL R     +     E W P +YEK+P +CF CG 
Subjt:  KISELEFSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGK

Query:  IGHGIQECKESHLAQGGDLPFKSWLPEPSTWKGS--SQNRGLEEVKGHARRVSTRVIEAGRGRGRGWSGWEVGNWSEGDAGKSRDVGRKKGLLSTKKVEE
        I HG+ ECKE     G +  +  WL     ++ +   + +  EE  G  +         G G      G  V   S  +  +  + G+++GL   +K +E
Subjt:  IGHGIQECKESHLAQGGDLPFKSWLPEPSTWKGS--SQNRGLEEVKGHARRVSTRVIEAGRGRGRGWSGWEVGNWSEGDAGKSRDVGRKKGLLSTKKVEE

Query:  GK
        G+
Subjt:  GK

TrEMBL top hitse value%identityAlignment
A0A2I4G596 uncharacterized protein LOC1090048227.0e-3330.42Show/hide
Query:  IVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELE
        ++E+ +D +          L+ K+L++R V   V+ S M KIW +     F       F+ +   + D+ +VL+G PW+FD  L + +   G+    ++ 
Subjt:  IVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELE

Query:  FSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQ
        F +  FW+H +NLP   +SR+V M +G  VG  +GVD+ ++    G  LRV++ +D++K + R     +  M  + W P++YEKLP +CF CGKI HG +
Subjt:  FSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQ

Query:  ECKESHLAQGGDLPFKSWLPEPSTWKGSSQNRGLEEVKGH
         C E+       L +  WL    + +G S      ++ G+
Subjt:  ECKESHLAQGGDLPFKSWLPEPSTWKGSSQNRGLEEVKGH

A0A2I4GBZ4 uncharacterized protein LOC1090065605.3e-3330.38Show/hide
Query:  IVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELE
        ++E+ +D +    +     L+ K+L++R V   V+ S M K W +     F       F+ +   + D+ +VL+G PW+FD  L +    +G+    ++ 
Subjt:  IVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELE

Query:  FSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQ
        F +  FW+H +NLP   +SR+V M +G  VG+ +GVDL ++    G  LRV++ +D++K + R     +  M  + W P++YEKLP +CF CGKI HG +
Subjt:  FSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQ

Query:  ECKESHLAQGGDLPFKSWLPEPSTWKGSSQNRGLEEV
         C E+         +  WL    + +G S      +V
Subjt:  ECKESHLAQGGDLPFKSWLPEPSTWKGSSQNRGLEEV

A0A5C7GU64 CCHC-type domain-containing protein2.4e-3333.03Show/hide
Query:  EIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEFS
        EI +D I+D +   +  L+ KVLT + V     + ++ +IWN    +  +  G+  F++  + K  + KV   GPW+F K+L++ E+ KG   I++L+F+
Subjt:  EIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEFS

Query:  YASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQEC
         A FW+  H++P + ++++    L   +G    +  E   C  GK +RV+V++DI KPL+R +R KLG   E     + YE+LP+ CF CG+IGH ++EC
Subjt:  YASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQEC

Query:  KESHLAQ----GGDLPFKSWL
         +    +    G    F SW+
Subjt:  KESHLAQ----GGDLPFKSWL

A0A5C7HJ97 CCHC-type domain-containing protein2.0e-3534.35Show/hide
Query:  RGIVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISE
        R I+E+ ++   D     +  L+ KVL+ + V     + ++ +IWN    +  +  G  +F++      D+ +V   GPW F  +L++ E+  G   IS+
Subjt:  RGIVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISE

Query:  LEFSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHG
        L F+ A FW+  H++P + ++R+ A  L  ++G    + LE   C  GK +RV+VR+DI KPL+R +R KLG   E     + YE+LP  C+ CGKIGHG
Subjt:  LEFSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHG

Query:  IQECKESHLAQ----GGDLPFKSWLPEPST
        I+EC +    Q    G    F SWL  P T
Subjt:  IQECKESHLAQ----GGDLPFKSWLPEPST

A0A6J1DU55 uncharacterized protein LOC1110231353.7e-3429.66Show/hide
Query:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF
        +++  DA++   +     L+ K+L  R +   VL  V+   W +E  ++ +  GK  F++    + D  +V+K GPW FDKAL++ ++      ISELEF
Subjt:  VEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELEF

Query:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE
        +  +FW+H  +LP   +++ +A+ LG  +G+F  VD  ++    G SLR+RV +DI KPLRR ++  +       W PI YE+LP+ C+ CG IGH   +
Subjt:  SYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQE

Query:  CKESHLAQGGDLPFKSWLPEPSTWKGSSQNRGLEEVKGHARRVSTRVIEAGRGRGRGWSGWEVGNWSEGDAGKSRDVGRKKGLLSTK-KVEEGKTQAAMI
        C   +LA   D                  +R   E     R V ++   AG  +GR           E   G S    +++G+  TK ++ E   Q    
Subjt:  CKESHLAQGGDLPFKSWLPEPSTWKGSSQNRGLEEVKGHARRVSTRVIEAGRGRGRGWSGWEVGNWSEGDAGKSRDVGRKKGLLSTK-KVEEGKTQAAMI

Query:  GVTPEEMVGPNGNIKVGPTLCGVNMES
             E  G  G  +   T C + MES
Subjt:  GVTPEEMVGPNGNIKVGPTLCGVNMES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding5.0e-0721.46Show/hide
Query:  IVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELE
        ++E+ D+A+   N +   +  C    +  VV R+LE     I  IE              +   ++     +L+ GPW F+  + + +    +H  S+ E
Subjt:  IVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKALVIFEELKGIHKISELE

Query:  FSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQ
        F    FW+    +P   ++ ++  ++G  +G F   +L       G+ + V             ++F+             YEKL N C  CG + H   
Subjt:  FSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCGKIGHGIQ

Query:  ECKES
        EC  S
Subjt:  ECKES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAGGGGCAACTCTTGGGAGTCGGATATCTTCAAGAAATTAGAGGGGTTGAACCTAGCAGAAGAAAGAAAAGGGGCATTGTAGAAATTGGAGATGATGCTATTGA
AGATTACAACCGCTCTTGTGAAACAAAACTGATGTGCAAAGTTTTAACGAATCGAAACGTTGTCCCGAGAGTGCTTGAGTCGGTCATGGCGAAAATCTGGAACATTGAGT
CGCCCATCTCCTTCGACCGAGCGGGTAAAGGTAAGTTCATCTACACCCTCTCTACAAAAGCGGACAAAATGAAGGTGCTCAAAGGAGGACCTTGGATTTTCGACAAGGCA
TTGGTGATTTTCGAAGAGCTGAAAGGTATTCATAAAATTTCTGAGCTGGAGTTCAGCTACGCTTCTTTTTGGCTTCACTTTCATAATCTCCCCTTTGTAGGAATTTCTAG
GAAGGTTGCGATGGCTTTAGGGGTTGAAGTAGGGCATTTTGAGGGAGTTGATCTAGAGGATGAACGGTGCAAAAGGGGTAAATCTCTTAGAGTGAGAGTTAGATTGGATA
TTCAAAAACCTCTTAGGCGTGCTGTGCGGTTTAAATTAGGATCCATGACAGAGGAAGCTTGGGCTCCAATCAATTATGAGAAGTTACCCAATTTGTGCTTTGGTTGTGGG
AAAATAGGACATGGAATCCAAGAGTGCAAGGAAAGTCATTTGGCCCAAGGAGGAGATCTTCCTTTCAAGAGTTGGCTACCCGAACCTTCGACATGGAAAGGAAGTTCTCA
GAATAGGGGTTTAGAAGAAGTTAAAGGCCATGCGAGAAGGGTTTCGACTCGAGTCATTGAAGCGGGAAGGGGCCGAGGACGAGGATGGTCAGGGTGGGAAGTAGGTAATT
GGAGTGAGGGGGATGCCGGAAAATCGAGGGATGTTGGTCGGAAAAAGGGCCTTCTTTCGACGAAAAAAGTCGAGGAAGGGAAAACCCAGGCGGCAATGATAGGGGTCACT
CCCGAGGAAATGGTTGGCCCCAACGGTAATATCAAGGTGGGCCCCACCCTCTGTGGTGTCAATATGGAATCAACAAGAAAAGCTTATGGGAATATGACGCTCCAAATGGG
TTTTCCGTTTCTGGGCCTGGACAAAGGTGAACAGAGTTCAAAGCCTACTAAATTGGATGAAATCGAGGCCCCCCGATTATTTAATCCCATCGGGCTTCCTGACAAACTTA
CTCTCTTGGGCCTGAATCCTAGTGGGCAGAGCCCAAATTCTGAAAAGGACCTCACTAAAGTCAGTAGAGGTGAAGTTTCATCTACTTTTCCAAATATGGCCTTAAAAGCA
AAAAGAAGGTACCAAATCCCAAGCGGATATTGCATCTCCGCTACGAGGTTTGTAGCAATTCCCGGATATCGTAATGATAATTCAAATGTGGTTATCTCCTCCTCTCGACG
CGATTTTCTCTTGGTTAAACAAAATTTGTGGGTTTCTCACGAAATCAAAAAGCGTTTGAGTCTGGATGCTTCTAGGTCAACTACAATTCCTTCGGTGGTTGGAGGAAAAA
GGTGGAGTCTGCCACCCATGCTTTCTGGACGTGCAAATGGGTCTCGAAAGGACTATTGGACTCCTTTTGAATTTTGGGAATGGATGACAAAGACTCTTCGTGAAGAGGAT
CTTTCCAAGGCTATCACTATAATGCAAACCAGAAAAAGAGACAGGGGCAAGGACGTGAGAACGATTGAAATTCAGAGAAGCGCACTGCAATCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAGGGGCAACTCTTGGGAGTCGGATATCTTCAAGAAATTAGAGGGGTTGAACCTAGCAGAAGAAAGAAAAGGGGCATTGTAGAAATTGGAGATGATGCTATTGA
AGATTACAACCGCTCTTGTGAAACAAAACTGATGTGCAAAGTTTTAACGAATCGAAACGTTGTCCCGAGAGTGCTTGAGTCGGTCATGGCGAAAATCTGGAACATTGAGT
CGCCCATCTCCTTCGACCGAGCGGGTAAAGGTAAGTTCATCTACACCCTCTCTACAAAAGCGGACAAAATGAAGGTGCTCAAAGGAGGACCTTGGATTTTCGACAAGGCA
TTGGTGATTTTCGAAGAGCTGAAAGGTATTCATAAAATTTCTGAGCTGGAGTTCAGCTACGCTTCTTTTTGGCTTCACTTTCATAATCTCCCCTTTGTAGGAATTTCTAG
GAAGGTTGCGATGGCTTTAGGGGTTGAAGTAGGGCATTTTGAGGGAGTTGATCTAGAGGATGAACGGTGCAAAAGGGGTAAATCTCTTAGAGTGAGAGTTAGATTGGATA
TTCAAAAACCTCTTAGGCGTGCTGTGCGGTTTAAATTAGGATCCATGACAGAGGAAGCTTGGGCTCCAATCAATTATGAGAAGTTACCCAATTTGTGCTTTGGTTGTGGG
AAAATAGGACATGGAATCCAAGAGTGCAAGGAAAGTCATTTGGCCCAAGGAGGAGATCTTCCTTTCAAGAGTTGGCTACCCGAACCTTCGACATGGAAAGGAAGTTCTCA
GAATAGGGGTTTAGAAGAAGTTAAAGGCCATGCGAGAAGGGTTTCGACTCGAGTCATTGAAGCGGGAAGGGGCCGAGGACGAGGATGGTCAGGGTGGGAAGTAGGTAATT
GGAGTGAGGGGGATGCCGGAAAATCGAGGGATGTTGGTCGGAAAAAGGGCCTTCTTTCGACGAAAAAAGTCGAGGAAGGGAAAACCCAGGCGGCAATGATAGGGGTCACT
CCCGAGGAAATGGTTGGCCCCAACGGTAATATCAAGGTGGGCCCCACCCTCTGTGGTGTCAATATGGAATCAACAAGAAAAGCTTATGGGAATATGACGCTCCAAATGGG
TTTTCCGTTTCTGGGCCTGGACAAAGGTGAACAGAGTTCAAAGCCTACTAAATTGGATGAAATCGAGGCCCCCCGATTATTTAATCCCATCGGGCTTCCTGACAAACTTA
CTCTCTTGGGCCTGAATCCTAGTGGGCAGAGCCCAAATTCTGAAAAGGACCTCACTAAAGTCAGTAGAGGTGAAGTTTCATCTACTTTTCCAAATATGGCCTTAAAAGCA
AAAAGAAGGTACCAAATCCCAAGCGGATATTGCATCTCCGCTACGAGGTTTGTAGCAATTCCCGGATATCGTAATGATAATTCAAATGTGGTTATCTCCTCCTCTCGACG
CGATTTTCTCTTGGTTAAACAAAATTTGTGGGTTTCTCACGAAATCAAAAAGCGTTTGAGTCTGGATGCTTCTAGGTCAACTACAATTCCTTCGGTGGTTGGAGGAAAAA
GGTGGAGTCTGCCACCCATGCTTTCTGGACGTGCAAATGGGTCTCGAAAGGACTATTGGACTCCTTTTGAATTTTGGGAATGGATGACAAAGACTCTTCGTGAAGAGGAT
CTTTCCAAGGCTATCACTATAATGCAAACCAGAAAAAGAGACAGGGGCAAGGACGTGAGAACGATTGAAATTCAGAGAAGCGCACTGCAATCGTGCTGA
Protein sequenceShow/hide protein sequence
MKKGQLLGVGYLQEIRGVEPSRRKKRGIVEIGDDAIEDYNRSCETKLMCKVLTNRNVVPRVLESVMAKIWNIESPISFDRAGKGKFIYTLSTKADKMKVLKGGPWIFDKA
LVIFEELKGIHKISELEFSYASFWLHFHNLPFVGISRKVAMALGVEVGHFEGVDLEDERCKRGKSLRVRVRLDIQKPLRRAVRFKLGSMTEEAWAPINYEKLPNLCFGCG
KIGHGIQECKESHLAQGGDLPFKSWLPEPSTWKGSSQNRGLEEVKGHARRVSTRVIEAGRGRGRGWSGWEVGNWSEGDAGKSRDVGRKKGLLSTKKVEEGKTQAAMIGVT
PEEMVGPNGNIKVGPTLCGVNMESTRKAYGNMTLQMGFPFLGLDKGEQSSKPTKLDEIEAPRLFNPIGLPDKLTLLGLNPSGQSPNSEKDLTKVSRGEVSSTFPNMALKA
KRRYQIPSGYCISATRFVAIPGYRNDNSNVVISSSRRDFLLVKQNLWVSHEIKKRLSLDASRSTTIPSVVGGKRWSLPPMLSGRANGSRKDYWTPFEFWEWMTKTLREED
LSKAITIMQTRKRDRGKDVRTIEIQRSALQSC