CuGenDBv2

Lag0031175 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031175
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr11:5536125..5537930
RNA-Seq Expression: Lag0031175
Synteny: Lag0031175
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
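
The genome location field uses a `chromosome:start..end` form. A minimal Python sketch for splitting it and computing the span length follows. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(loc):
    """Split 'chr:start..end' into (chrom, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr11:5536125..5537930")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption the span is 1806 bp, which is a multiple of three.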


Homology
GenBank top hits | e value | % identity | Alignment
MCH80348.1 hypothetical protein [Trifolium medium] | 5.3e-38 | 35.83
Query:  ADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGES
        AD+    + +ED+E+    E    +L  +I T    N   FK  + + W     I  + + +N +L KF   RE D V + GPW+FD+ L+I+    G  
Subjt:  ADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGES

Query:  RISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGL
        + S+    +V+FWV ++DLP+   +   A  LGN +G+FE++D+ E +R G+  LR+ V +D+ KPLKRG   ++  +G+E WV   YER+P+FC+ CG 
Subjt:  RISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGL

Query:  IGHVKSECESQISPEDEHQY----------GDWMRDAPIP
        IGH   +CE  I   DE QY          G W+R +P+P
Subjt:  IGHVKSECESQISPEDEHQY----------GDWMRDAPIP

TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense] | 1.5e-40 | 35.98
Query:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE
        ENL +  DE A +  I +D + +  ED+   L  ++ T K +N E FK  I +IWN+ G++  + +G NTF+  F N   ++ V   GPW F K+LI++E
Subjt:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE

Query:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF
        +PKG   I+K +F   +FWV IHD+P+  MN++    L   +G   ++  +  +  G+  +R+ V++DITKPLKR L I++G   E + V + YER+PDF
Subjt:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF

Query:  CYNCGLIGHVKSECESQISPE-----DEHQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGR
        C+ CG IGH   EC  + +        + ++G WMR  PI    +  ++    L  G +  RGR
Subjt:  CYNCGLIGHVKSECESQISPE-----DEHQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGR

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] | 1.8e-38 | 35.24
Query:  DERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGESR
        D+   I  I+    +   + L  SL  +  T+K+IN E FK+ I  IW  +  +  + MG N F  +F+N  ++  + +GGPW FDK L+++ E  G  +
Subjt:  DERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGESR

Query:  ISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGLI
        ++  +FRYV FW+ +H+LP+A +NR+    LG L+G  +++D  E        +RI V +D+  PLKRGL + +G   + + V + YER+P+FCY CG I
Subjt:  ISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGLI

Query:  GHVKSEC---ESQISPEDEHQYGDWMR
        GH+  +C     +I+     ++G WMR
Subjt:  GHVKSEC---ESQISPEDEHQYGDWMR

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] | 5.3e-38 | 29.58
Query:  MLKQMENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKA
        +L   +  K+ ++E    + ++ D +  A + L  SL  ++   ++I+ +     +   W  E ++  +++G+N FL  F    + + V K GPW FDKA
Subjt:  MLKQMENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKA

Query:  LIIIEEPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYE
        LI++++P     IS+ EF  V FW+H+ DLPM+++N+  A  LGN +G+F DVD +E       +LRI V +DITKPL+RG+ I +       W+ + YE
Subjt:  LIIIEEPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYE

Query:  RMPDFCYNCGLIGHVKSECESQ-ISPEDE----HQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGRGPGELRNFEWRKSVPSNRNQVRE------
        R+PDFCY CG+IGH   +C+++ ++ +D+     +YG W+R       F   ++   +   G +  R    G   + + E  + V   + Q+ E      
Subjt:  RMPDFCYNCGLIGHVKSECESQ-ISPEDE----HQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGRGPGELRNFEWRKSVPSNRNQVRE------

Query:  FQKTFAGETAGNRAGTSRDGQRENQKEGTGKFHVSQDKPTSTASK----LPDEKTSEKEKDDMTGDQKQVRKNDKGKEIQGI
        FQ   A E  G+  G +         E     H   D P S+       + D     K KD M  +    R  D  KE+  I
Subjt:  FQKTFAGETAGNRAGTSRDGQRENQKEGTGKFHVSQDKPTSTASK----LPDEKTSEKEKDDMTGDQKQVRKNDKGKEIQGI

XP_035546596.1 uncharacterized protein LOC118348645 [Juglans regia] | 3.1e-38 | 35.34
Query:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE
        ++LK+  +E+     ++++E+  + +     L   +   + +N   FK  + ++WN EG I FK +GRN FL +F NS  +  V  G PW+FD+ LI I+
Subjt:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE

Query:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF
        E KG   +   +F    FW+ + DLP A MN++ +  LG   G    VD DE  R     LR+ V +D++KPL RG VI VG     SW+   YE++P F
Subjt:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF

Query:  CYNCGLIGHVKSECESQISPEDEH-----QYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGR
        CY CG++ H    C    S    H     QYG W+R  P P N N ++   T  N    K  G GR
Subjt:  CYNCGLIGHVKSECESQISPEDEH-----QYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGR

TrEMBL top hits | e value | % identity | Alignment
A0A392M033 CCHC-type domain-containing protein (Fragment) | 2.6e-38 | 35.83
Query:  ADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGES
        AD+    + +ED+E+    E    +L  +I T    N   FK  + + W     I  + + +N +L KF   RE D V + GPW+FD+ L+I+    G  
Subjt:  ADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGES

Query:  RISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGL
        + S+    +V+FWV ++DLP+   +   A  LGN +G+FE++D+ E +R G+  LR+ V +D+ KPLKRG   ++  +G+E WV   YER+P+FC+ CG 
Subjt:  RISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGL

Query:  IGHVKSECESQISPEDEHQY----------GDWMRDAPIP
        IGH   +CE  I   DE QY          G W+R +P+P
Subjt:  IGHVKSECESQISPEDEHQY----------GDWMRDAPIP

A0A5C7GU64 CCHC-type domain-containing protein | 7.2e-41 | 35.98
Query:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE
        ENL +  DE A +  I +D + +  ED+   L  ++ T K +N E FK  I +IWN+ G++  + +G NTF+  F N   ++ V   GPW F K+LI++E
Subjt:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE

Query:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF
        +PKG   I+K +F   +FWV IHD+P+  MN++    L   +G   ++  +  +  G+  +R+ V++DITKPLKR L I++G   E + V + YER+PDF
Subjt:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF

Query:  CYNCGLIGHVKSECESQISPE-----DEHQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGR
        C+ CG IGH   EC  + +        + ++G WMR  PI    +  ++    L  G +  RGR
Subjt:  CYNCGLIGHVKSECESQISPE-----DEHQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGR

A0A5C7H9Y2 CCHC-type domain-containing protein | 8.8e-39 | 35.24
Query:  DERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGESR
        D+   I  I+    +   + L  SL  +  T+K+IN E FK+ I  IW  +  +  + MG N F  +F+N  ++  + +GGPW FDK L+++ E  G  +
Subjt:  DERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGESR

Query:  ISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGLI
        ++  +FRYV FW+ +H+LP+A +NR+    LG L+G  +++D  E        +RI V +D+  PLKRGL + +G   + + V + YER+P+FCY CG I
Subjt:  ISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGLI

Query:  GHVKSEC---ESQISPEDEHQYGDWMR
        GH+  +C     +I+     ++G WMR
Subjt:  GHVKSEC---ESQISPEDEHQYGDWMR

A0A6J1DU55 uncharacterized protein LOC111023135 | 2.6e-38 | 29.58
Query:  MLKQMENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKA
        +L   +  K+ ++E    + ++ D +  A + L  SL  ++   ++I+ +     +   W  E ++  +++G+N FL  F    + + V K GPW FDKA
Subjt:  MLKQMENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKA

Query:  LIIIEEPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYE
        LI++++P     IS+ EF  V FW+H+ DLPM+++N+  A  LGN +G+F DVD +E       +LRI V +DITKPL+RG+ I +       W+ + YE
Subjt:  LIIIEEPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYE

Query:  RMPDFCYNCGLIGHVKSECESQ-ISPEDE----HQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGRGPGELRNFEWRKSVPSNRNQVRE------
        R+PDFCY CG+IGH   +C+++ ++ +D+     +YG W+R       F   ++   +   G +  R    G   + + E  + V   + Q+ E      
Subjt:  RMPDFCYNCGLIGHVKSECESQ-ISPEDE----HQYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGRGPGELRNFEWRKSVPSNRNQVRE------

Query:  FQKTFAGETAGNRAGTSRDGQRENQKEGTGKFHVSQDKPTSTASK----LPDEKTSEKEKDDMTGDQKQVRKNDKGKEIQGI
        FQ   A E  G+  G +         E     H   D P S+       + D     K KD M  +    R  D  KE+  I
Subjt:  FQKTFAGETAGNRAGTSRDGQRENQKEGTGKFHVSQDKPTSTASK----LPDEKTSEKEKDDMTGDQKQVRKNDKGKEIQGI

A0A6P9EGW2 uncharacterized protein LOC118348645 | 1.5e-38 | 35.34
Query:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE
        ++LK+  +E+     ++++E+  + +     L   +   + +N   FK  + ++WN EG I FK +GRN FL +F NS  +  V  G PW+FD+ LI I+
Subjt:  ENLKIRADERANILVIEDDELDEATEDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIE

Query:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF
        E KG   +   +F    FW+ + DLP A MN++ +  LG   G    VD DE  R     LR+ V +D++KPL RG VI VG     SW+   YE++P F
Subjt:  EPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDF

Query:  CYNCGLIGHVKSECESQISPEDEH-----QYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGR
        CY CG++ H    C    S    H     QYG W+R  P P N N ++   T  N    K  G GR
Subjt:  CYNCGLIGHVKSECESQISPEDEH-----QYGDWMRDAPIPPNFNSFRSPDTRLNYGWNKGRGRGR

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT3G42140.1 zinc ion binding;nucleic acid binding | 3.8e-10 | 22.09
Query:  FRNSREKDWVAKGGPWNFDKALIIIEEPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLK
        F++      + + GPW+F+  + +I+  +     S AEF+ + FW+ I  +P+ ++  +  T++G  MG F + +   D                     
Subjt:  FRNSREKDWVAKGGPWNFDKALIIIEEPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWATALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLK

Query:  RGLVIRVGSKGEESWVKVSYERMPDFCYNCGLIGHVKSECES------QISPEDEHQYGDWMRDAPIPPNFN
                     S +K  YE++ +FC  CG++ H  SEC +          +D+    D   D P  P  N
Subjt:  RGLVIRVGSKGEESWVKVSYERMPDFCYNCGLIGHVKSECES------QISPEDEHQYGDWMRDAPIPPNFN


Sequences
CDS sequence
ATGCTGAGGGAACCGACAACGGAAATCCAATCCACCATGGACAAAGAAGCAGAAGAAATCTGTAAACCAATGGATTACCAACTCGAGTCCATTAAAACCAACATCGCAAT
AGGAGAAAGAACTTCAGATACTTCAAAGAACGAAAACACCCAGAAACTCTCAATGAGTAAAGAGAACCCAAGCGAAGGACAGAAGGAAAACAACAGCGACCAGCATGAAC
ACTCGACTGAGGCAAGAAGAATGCTCAAACAAATGGAAAACTTAAAAATCAGAGCAGACGAACGCGCAAACATTCTAGTCATTGAAGACGACGAACTGGATGAAGCAACA
GAGGACCTAAAAGATTCTTTGTTCTGCAGAATTCAAACGTCGAAATTGATAAATCCGGAGACATTCAAGACATTTATCCCAAAAATCTGGAACAAAGAAGGAAGAATCAA
CTTCAAAACGATGGGAAGGAACACCTTCCTCTGCAAGTTCAGAAACAGTAGAGAAAAAGACTGGGTCGCTAAGGGAGGACCCTGGAATTTTGATAAGGCCCTAATCATAA
TCGAGGAACCAAAAGGCGAAAGCAGAATTTCTAAAGCAGAATTCAGGTATGTAAACTTTTGGGTCCACATTCATGACTTACCTATGGCTTATATGAACAGGAAATGGGCA
ACAGCCCTAGGGAATCTGATGGGGAGCTTTGAAGACGTTGACTGGGACGAAGACGACAGAAGAGGGGAAAACACTTTAAGAATTCTTGTCAAAATGGACATAACTAAACC
TTTGAAAAGGGGCTTGGTGATCAGAGTGGGGTCGAAAGGAGAGGAATCTTGGGTCAAAGTCTCTTATGAAAGAATGCCAGACTTTTGTTACAACTGTGGTCTCATCGGTC
ATGTTAAATCTGAATGTGAAAGTCAGATTTCCCCTGAGGACGAGCATCAATACGGAGACTGGATGAGAGATGCGCCGATTCCACCAAACTTCAATAGTTTCAGAAGCCCA
GATACAAGACTGAACTACGGATGGAACAAAGGAAGGGGGAGAGGTAGAGGTCCCGGCGAACTTCGCAACTTTGAATGGAGAAAATCTGTGCCATCCAATAGAAACCAAGT
CCGGGAGTTCCAAAAAACGTTCGCCGGAGAAACAGCTGGAAACCGAGCCGGAACCTCCAGGGACGGCCAAAGAGAAAATCAGAAGGAGGGAACAGGAAAGTTCCACGTTT
CCCAAGACAAACCAACCTCAACGGCTAGTAAATTACCAGACGAAAAGACATCAGAGAAAGAAAAAGATGATATGACAGGTGACCAGAAACAAGTGAGGAAAAACGACAAA
GGAAAGGAAATTCAAGGAATACAGTCGAGAGAAAATACCAAAAATGACCGGAACTTTTGTCACTCAAATTTTAAAGGAGAACCTGAAAGAGGAAAAAACAACAAACAGGA
GAATGAAAGTCAGAACAACAGAAAGGAAAAAAAAGCAACCAGGACCCACGATGACTTTTGGACCAACGGCCCTGAAAGAGAAATGGAAAAAGCCATGGAAGTCGATCTTG
AACTAAAGGAAAAAAATATTAAAATGGAGCTTGCCACAAAAGAAGTATCAGGAAAAGATCAACAGAAGGTGGGAAAATGGAAAAGAAGGGCTAGGTCGGGCAAATTGGAA
ACAAGCTACGAGGAACCAGTGAGACTAGAAAAAAGAAAGAATCAGAGCGAGGCAGGGGAGAAAGTTAGAGAAAGCAAGAAACCAAACAACCTTTACTTCGAAGCGGTCGA
AGGGATATCGGCGGAGGCTGACCTACAGCCCCGCCGGAAGCCATGA
mRNA sequence
ATGCTGAGGGAACCGACAACGGAAATCCAATCCACCATGGACAAAGAAGCAGAAGAAATCTGTAAACCAATGGATTACCAACTCGAGTCCATTAAAACCAACATCGCAAT
AGGAGAAAGAACTTCAGATACTTCAAAGAACGAAAACACCCAGAAACTCTCAATGAGTAAAGAGAACCCAAGCGAAGGACAGAAGGAAAACAACAGCGACCAGCATGAAC
ACTCGACTGAGGCAAGAAGAATGCTCAAACAAATGGAAAACTTAAAAATCAGAGCAGACGAACGCGCAAACATTCTAGTCATTGAAGACGACGAACTGGATGAAGCAACA
GAGGACCTAAAAGATTCTTTGTTCTGCAGAATTCAAACGTCGAAATTGATAAATCCGGAGACATTCAAGACATTTATCCCAAAAATCTGGAACAAAGAAGGAAGAATCAA
CTTCAAAACGATGGGAAGGAACACCTTCCTCTGCAAGTTCAGAAACAGTAGAGAAAAAGACTGGGTCGCTAAGGGAGGACCCTGGAATTTTGATAAGGCCCTAATCATAA
TCGAGGAACCAAAAGGCGAAAGCAGAATTTCTAAAGCAGAATTCAGGTATGTAAACTTTTGGGTCCACATTCATGACTTACCTATGGCTTATATGAACAGGAAATGGGCA
ACAGCCCTAGGGAATCTGATGGGGAGCTTTGAAGACGTTGACTGGGACGAAGACGACAGAAGAGGGGAAAACACTTTAAGAATTCTTGTCAAAATGGACATAACTAAACC
TTTGAAAAGGGGCTTGGTGATCAGAGTGGGGTCGAAAGGAGAGGAATCTTGGGTCAAAGTCTCTTATGAAAGAATGCCAGACTTTTGTTACAACTGTGGTCTCATCGGTC
ATGTTAAATCTGAATGTGAAAGTCAGATTTCCCCTGAGGACGAGCATCAATACGGAGACTGGATGAGAGATGCGCCGATTCCACCAAACTTCAATAGTTTCAGAAGCCCA
GATACAAGACTGAACTACGGATGGAACAAAGGAAGGGGGAGAGGTAGAGGTCCCGGCGAACTTCGCAACTTTGAATGGAGAAAATCTGTGCCATCCAATAGAAACCAAGT
CCGGGAGTTCCAAAAAACGTTCGCCGGAGAAACAGCTGGAAACCGAGCCGGAACCTCCAGGGACGGCCAAAGAGAAAATCAGAAGGAGGGAACAGGAAAGTTCCACGTTT
CCCAAGACAAACCAACCTCAACGGCTAGTAAATTACCAGACGAAAAGACATCAGAGAAAGAAAAAGATGATATGACAGGTGACCAGAAACAAGTGAGGAAAAACGACAAA
GGAAAGGAAATTCAAGGAATACAGTCGAGAGAAAATACCAAAAATGACCGGAACTTTTGTCACTCAAATTTTAAAGGAGAACCTGAAAGAGGAAAAAACAACAAACAGGA
GAATGAAAGTCAGAACAACAGAAAGGAAAAAAAAGCAACCAGGACCCACGATGACTTTTGGACCAACGGCCCTGAAAGAGAAATGGAAAAAGCCATGGAAGTCGATCTTG
AACTAAAGGAAAAAAATATTAAAATGGAGCTTGCCACAAAAGAAGTATCAGGAAAAGATCAACAGAAGGTGGGAAAATGGAAAAGAAGGGCTAGGTCGGGCAAATTGGAA
ACAAGCTACGAGGAACCAGTGAGACTAGAAAAAAGAAAGAATCAGAGCGAGGCAGGGGAGAAAGTTAGAGAAAGCAAGAAACCAAACAACCTTTACTTCGAAGCGGTCGA
AGGGATATCGGCGGAGGCTGACCTACAGCCCCGCCGGAAGCCATGA
Protein sequence
MLREPTTEIQSTMDKEAEEICKPMDYQLESIKTNIAIGERTSDTSKNENTQKLSMSKENPSEGQKENNSDQHEHSTEARRMLKQMENLKIRADERANILVIEDDELDEAT
EDLKDSLFCRIQTSKLINPETFKTFIPKIWNKEGRINFKTMGRNTFLCKFRNSREKDWVAKGGPWNFDKALIIIEEPKGESRISKAEFRYVNFWVHIHDLPMAYMNRKWA
TALGNLMGSFEDVDWDEDDRRGENTLRILVKMDITKPLKRGLVIRVGSKGEESWVKVSYERMPDFCYNCGLIGHVKSECESQISPEDEHQYGDWMRDAPIPPNFNSFRSP
DTRLNYGWNKGRGRGRGPGELRNFEWRKSVPSNRNQVREFQKTFAGETAGNRAGTSRDGQRENQKEGTGKFHVSQDKPTSTASKLPDEKTSEKEKDDMTGDQKQVRKNDK
GKEIQGIQSRENTKNDRNFCHSNFKGEPERGKNNKQENESQNNRKEKKATRTHDDFWTNGPEREMEKAMEVDLELKEKNIKMELATKEVSGKDQQKVGKWKRRARSGKLE
TSYEEPVRLEKRKNQSEAGEKVRESKKPNNLYFEAVEGISAEADLQPRRKP
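
The InterPro entry IPR025836 listed for this gene spells out its zinc knuckle spacing as CX2CX4HX4C. As a rough illustration only, that spacing can be written as a regular expression and searched for in the protein sequence. The sketch below is in Python; the names `ZINC_KNUCKLE`, `FRAGMENT`, and `find_zinc_knuckles` are hypothetical, and `FRAGMENT` is a short excerpt copied from the third line of the protein sequence above. A regex hit only shows the residue spacing is present; it is not how InterPro assigns the domain.

```python
import re

# Spacing taken from the IPR025836 name: C-X2-C-X4-H-X4-C,
# where X stands for any residue.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# Excerpt from the protein sequence above (third line of the record).
FRAGMENT = "SYERMPDFCYNCGLIGHVKSECESQISPEDEHQY"

def find_zinc_knuckles(protein):
    """Return (0-based start, matched residues) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(protein)]

for start, residues in find_zinc_knuckles(FRAGMENT):
    print(start, residues)
```

To scan the whole protein, join the six sequence lines into one string and pass it to `find_zinc_knuckles`; offsets are then relative to the full-length sequence.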