; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017348 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017348
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:2559548..2561284
RNA-Seq ExpressionLag0017348
SyntenyLag0017348
Gene Ontology termsGO:0008152 - metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO61345.1 reverse transcriptase [Corchorus capsularis]2.8e-4435.83Show/hide
Query:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF
        L+++E+  +      + E  A+  NCL+GK++S+R +N++  R  M  VW    G+ +  IGENLF+   +S L ++R+ +++PW F+  L++LK+   +
Subjt:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF

Query:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM
        D  ED+  D   FW Q H LPLG  N  + + +G   G V+ +DT      WG FLR R ++ + +PL RG+ L     GKIL + +YE+LPDFCY CG 
Subjt:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM

Query:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR
        + H +  C     + + + K  K++G W+R A  PR       G G   TR GR
Subjt:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR

OMP03234.1 hypothetical protein COLO4_10561 [Corchorus olitorius]9.9e-4233.86Show/hide
Query:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF
        L+++E+  +  +   + E+ A+ + CL+GK++S+R +NID  R  +  VW    G+ +  IGE L++   +S + ++R+ ++ PW F+  L++LK    F
Subjt:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF

Query:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM
           E++  +   FW+Q+H LPLG     V + +G+  G V  +DT      WG FLR+R  + +N+PL RG+ L     GKIL + +YE+LPDFCY CG 
Subjt:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM

Query:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR
        ++H +  C     + +   K +K++G W+R A  PR       G G   TR G+
Subjt:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR

TXG60685.1 hypothetical protein EZV62_015258 [Acer yangbiense]3.9e-3838.94Show/hide
Query:  ENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQI
        E    + +CLVGKV++ + +N +AF+K + ++W     + I+ +G+N F+    S+  R  I    PW FD NLI+L+ PT       + F   +FWVQI
Subjt:  ENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQI

Query:  HHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAI-KYERLPDFCYYCGMIDHNDGSCP----LLK
        H+LPL   N ++A+ L   IG V  + T+   ECWG F+RV+V++ I++PL R L LN   SG++  AI  YERLP+FCY CG+I H    CP     L+
Subjt:  HHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAI-KYERLPDFCYYCGMIDHNDGSCP----LLK

Query:  TQAKPEKQFGSWMRAASPPRKYHSNA
               ++GSW+RAAS  +  + N+
Subjt:  TQAKPEKQFGSWMRAASPPRKYHSNA

TXG60826.1 hypothetical protein EZV62_012189 [Acer yangbiense]1.0e-3836.55Show/hide
Query:  KLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLP
        KL+ C+VGKV S + IN +AFR  M K+W   +G+ I+ + +N+FL   ++   R R +   PWCFD  L+ L+        + + F    FWVQIH+ P
Subjt:  KLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLP

Query:  LGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLN-DGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPLLK-TQAKP--
        L     ++   LG IIG ++ +D     EC+G +LRVRV + +N+PL R L ++  G   + L  ++Y++LP++C++CG + H+   C L K   A+P  
Subjt:  LGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLN-DGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPLLK-TQAKP--

Query:  -EKQFGSWMRAASPPRKYHSNASGRGFEGTRGGRGRFG
         + +FG+W+R  +PPR  H ++SGR F G+ G   + G
Subjt:  -EKQFGSWMRAASPPRKYHSNASGRGFEGTRGGRGRFG

XP_042980067.1 uncharacterized protein LOC122310252 [Carya illinoinensis]1.0e-3834.87Show/hide
Query:  SELKLSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKT
        S LKL++ E   +   E     ++ +   CL+  +++ RT N +AF+ TM KVW     IT    G N FL+  Q +  + +++   PW FD +LI LK 
Subjt:  SELKLSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKT

Query:  PTTFDQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCY
          +   P ++ F    FWVQ H+LP    N ++ + +G  IG V  V+ +E    WG +LR++V + + +PL RG  +N G   ++ +  KYERLP+FC+
Subjt:  PTTFDQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCY

Query:  YCGMIDHNDGSCPLLKTQAKPEKQFGSWMRAAS--PPR
         CG+  H +G CP      +P+ Q+G WMRA +  PP+
Subjt:  YCGMIDHNDGSCPLLKTQAKPEKQFGSWMRAAS--PPR

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase1.3e-4435.83Show/hide
Query:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF
        L+++E+  +      + E  A+  NCL+GK++S+R +N++  R  M  VW    G+ +  IGENLF+   +S L ++R+ +++PW F+  L++LK+   +
Subjt:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF

Query:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM
        D  ED+  D   FW Q H LPLG  N  + + +G   G V+ +DT      WG FLR R ++ + +PL RG+ L     GKIL + +YE+LPDFCY CG 
Subjt:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM

Query:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR
        + H +  C     + + + K  K++G W+R A  PR       G G   TR GR
Subjt:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR

A0A1R3K847 Uncharacterized protein4.8e-4233.86Show/hide
Query:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF
        L+++E+  +  +   + E+ A+ + CL+GK++S+R +NID  R  +  VW    G+ +  IGE L++   +S + ++R+ ++ PW F+  L++LK    F
Subjt:  LSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTF

Query:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM
           E++  +   FW+Q+H LPLG     V + +G+  G V  +DT      WG FLR+R  + +N+PL RG+ L     GKIL + +YE+LPDFCY CG 
Subjt:  DQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGM

Query:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR
        ++H +  C     + +   K +K++G W+R A  PR       G G   TR G+
Subjt:  IDHNDGSCP----LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGR

A0A2N9F689 Uncharacterized protein7.6e-4034.96Show/hide
Query:  MDIINKLSELKLSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVW--YFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCF
        +D+I  L +L L++ E+  I   E     +Q K + CL+ K+++QR  N    + TMT +W     RG+   +IG+NLFL        ++ ++  SPWCF
Subjt:  MDIINKLSELKLSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVW--YFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCF

Query:  DGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIK
        D  L++LK      Q  ++  D+  FWVQI+ LP+     +V + +G  +G V+ V+       WG FLRVRV++ + +PL R   +  G    IL   +
Subjt:  DGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIK

Query:  YERLPDFCYYCGMIDHNDGSCPL-LKTQAK-----PEKQFGSWMRA
        YERLP+FC+YCG +DH +  C + L+ + K      E Q+G W+RA
Subjt:  YERLPDFCYYCGMIDHNDGSCPL-LKTQAK-----PEKQFGSWMRA

A0A5C7HUW5 CCHC-type domain-containing protein1.9e-3838.94Show/hide
Query:  ENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQI
        E    + +CLVGKV++ + +N +AF+K + ++W     + I+ +G+N F+    S+  R  I    PW FD NLI+L+ PT       + F   +FWVQI
Subjt:  ENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQI

Query:  HHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAI-KYERLPDFCYYCGMIDHNDGSCP----LLK
        H+LPL   N ++A+ L   IG V  + T+   ECWG F+RV+V++ I++PL R L LN   SG++  AI  YERLP+FCY CG+I H    CP     L+
Subjt:  HHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAI-KYERLPDFCYYCGMIDHNDGSCP----LLK

Query:  TQAKPEKQFGSWMRAASPPRKYHSNA
               ++GSW+RAAS  +  + N+
Subjt:  TQAKPEKQFGSWMRAASPPRKYHSNA

A0A5C7HWW6 Glucan endo-1,3-beta-D-glucosidase4.9e-3936.55Show/hide
Query:  KLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLP
        KL+ C+VGKV S + IN +AFR  M K+W   +G+ I+ + +N+FL   ++   R R +   PWCFD  L+ L+        + + F    FWVQIH+ P
Subjt:  KLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLP

Query:  LGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLN-DGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPLLK-TQAKP--
        L     ++   LG IIG ++ +D     EC+G +LRVRV + +N+PL R L ++  G   + L  ++Y++LP++C++CG + H+   C L K   A+P  
Subjt:  LGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLN-DGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPLLK-TQAKP--

Query:  -EKQFGSWMRAASPPRKYHSNASGRGFEGTRGGRGRFG
         + +FG+W+R  +PPR  H ++SGR F G+ G   + G
Subjt:  -EKQFGSWMRAASPPRKYHSNASGRGFEGTRGGRGRFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein7.1e-1427.35Show/hide
Query:  NQAKLQN--CLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENL---FLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQF
        N A  +N   L G+ V  R  N+ +   +M ++W    G+    I E     F+   +  L  + ++   PW F+  +ILL+      +P+  +F    F
Subjt:  NQAKLQN--CLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENL---FLINPQSVLGRDRIVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQF

Query:  WVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPLLKT
        WVQI  +P    N  V + +G  +G V   D N        F RV +   I  PL              L   +YERL  FC  CGM+ H+ G+C +   
Subjt:  WVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCPLLKT

Query:  QAKPEKQFGSWMRAASPPRKYHS
        Q   E+Q          P+ YH+
Subjt:  QAKPEKQFGSWMRAASPPRKYHS

AT3G42140.1 zinc ion binding;nucleic acid binding9.3e-0621.37Show/hide
Query:  IVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGP
        I+   PW F+  + +++  T      D  F    FW+QI  +PL     ++  ++G  +G+               FL   +   ++             
Subjt:  IVEESPWCFDGNLILLKTPTTFDQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGP

Query:  SGKILAAIKYERLPDFCYYCGMIDHNDGSCP
            +   +YE+L +FC  CGM+ H+   CP
Subjt:  SGKILAAIKYERLPDFCYYCGMIDHNDGSCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATAATCAACAAACTTTCAGAGCTCAAGCTTTCACAGAAAGAATCTAGCGGAATAAACACTTCGGAGTTGGGCTTACACGAAAATCAGGCTAAGCTCCAAAATTG
TCTGGTAGGAAAGGTTGTTTCTCAACGAACTATCAACATTGACGCGTTCAGGAAAACTATGACAAAAGTATGGTATTTTTGTCGTGGGATAACCATCGATAATATTGGGG
AAAATCTCTTTCTTATCAATCCACAATCCGTTCTAGGAAGGGACAGAATTGTGGAGGAGAGCCCATGGTGTTTCGACGGTAATTTGATTCTACTCAAAACCCCAACAACG
TTCGATCAGCCAGAAGATATGATTTTCGATGATCCACAGTTCTGGGTTCAAATTCACCATTTACCTCTTGGATTACGAAATGGCAAAGTGGCACAAACGCTTGGGAACAT
AATAGGGATAGTACAAAGAGTAGATACAAATGAAGTGGATGAATGTTGGGGTCCTTTTCTCCGAGTACGGGTCAAGATGAAGATCAACGAGCCTTTATGTCGGGGGTTAA
CTCTAAATGACGGACCATCTGGAAAAATCTTGGCAGCGATTAAGTACGAGAGATTGCCTGATTTTTGTTATTACTGTGGAATGATCGATCACAATGATGGCTCATGCCCA
CTTCTAAAAACCCAGGCGAAACCAGAAAAACAATTCGGATCATGGATGCGAGCGGCTAGTCCTCCAAGAAAATACCATTCTAACGCAAGCGGCCGAGGATTTGAAGGAAC
CCGAGGAGGAAGAGGTCGATTTGGAAACCGACAAGGGGGAAAGCATTCGTGGCGATCTGAATCAATGGAGGAGGGGGAACACGATAACGATGAAGAACCGATTACAACTA
ATTCTCAACTCTCGAACGACGGCAATGGCGAACAACAGACAGGAACTGACCGCCGGACGTTTTCTGCGATCGCCTCCGACGATCCTCCACCGCAATCAAAGCTCCCAGCG
GCAAAGACGGGTATTAAATGTTCGATTTCCTTAGATAACGCTAATATTATGGATCCGTTCCCTATAATTGCAGATGCCGATAATTTAGGAAAGGAGGGAGATGGGTTTAA
TGGCTGCAACGGCCAGAAATCCCTAAAAATCAGCAACATAACTAATGGAGAAGGGAAAGGAAACGACGCTGATGAAAGCTTAGGGCATATGGAAACTAATGGGCTTGAAA
ATTATGATGGGCCCATTCCTTCTGGCAAAATTATTACAACCCAAATCAACATAAAACCAAATCAAATAATGGTGTATCAGCAACCCAATACGGAAGGGGTTGAAACCAAT
CACAACCCACTTGAAGGTAACAATATTGTAAGCCCAGGCCTTTTCCATGAAGCAACTATGATGGAAGCAGATCATTACTATGTCAAGCCCAAATCGACGGTGCAGGTCAA
AGAGAATCAGAAGAATAAACAGCCCATATCTGCAGACTCTGAAAGTGGAAAAAAGGATCGTTGGGAGCAGGGAAATATTTCTCTTGATAAGCTAAGGCTGTTAAAAGGCT
GGAAGAGGATTAACAGACAGGGAGATCACACTAAGGTCGATCAATCCACAGAACTCATGCAATTTGCTTCAAGGAAGAGGGTTGCAGAAAATGATAAATTGGAAGGTAGC
TCAGACAAGAAACTCAAATCACTAGTTCCACAAAAGGATGGAATTATATCGGCGGAGCCTGATCAGCAGGCCCGCCGAGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATAATCAACAAACTTTCAGAGCTCAAGCTTTCACAGAAAGAATCTAGCGGAATAAACACTTCGGAGTTGGGCTTACACGAAAATCAGGCTAAGCTCCAAAATTG
TCTGGTAGGAAAGGTTGTTTCTCAACGAACTATCAACATTGACGCGTTCAGGAAAACTATGACAAAAGTATGGTATTTTTGTCGTGGGATAACCATCGATAATATTGGGG
AAAATCTCTTTCTTATCAATCCACAATCCGTTCTAGGAAGGGACAGAATTGTGGAGGAGAGCCCATGGTGTTTCGACGGTAATTTGATTCTACTCAAAACCCCAACAACG
TTCGATCAGCCAGAAGATATGATTTTCGATGATCCACAGTTCTGGGTTCAAATTCACCATTTACCTCTTGGATTACGAAATGGCAAAGTGGCACAAACGCTTGGGAACAT
AATAGGGATAGTACAAAGAGTAGATACAAATGAAGTGGATGAATGTTGGGGTCCTTTTCTCCGAGTACGGGTCAAGATGAAGATCAACGAGCCTTTATGTCGGGGGTTAA
CTCTAAATGACGGACCATCTGGAAAAATCTTGGCAGCGATTAAGTACGAGAGATTGCCTGATTTTTGTTATTACTGTGGAATGATCGATCACAATGATGGCTCATGCCCA
CTTCTAAAAACCCAGGCGAAACCAGAAAAACAATTCGGATCATGGATGCGAGCGGCTAGTCCTCCAAGAAAATACCATTCTAACGCAAGCGGCCGAGGATTTGAAGGAAC
CCGAGGAGGAAGAGGTCGATTTGGAAACCGACAAGGGGGAAAGCATTCGTGGCGATCTGAATCAATGGAGGAGGGGGAACACGATAACGATGAAGAACCGATTACAACTA
ATTCTCAACTCTCGAACGACGGCAATGGCGAACAACAGACAGGAACTGACCGCCGGACGTTTTCTGCGATCGCCTCCGACGATCCTCCACCGCAATCAAAGCTCCCAGCG
GCAAAGACGGGTATTAAATGTTCGATTTCCTTAGATAACGCTAATATTATGGATCCGTTCCCTATAATTGCAGATGCCGATAATTTAGGAAAGGAGGGAGATGGGTTTAA
TGGCTGCAACGGCCAGAAATCCCTAAAAATCAGCAACATAACTAATGGAGAAGGGAAAGGAAACGACGCTGATGAAAGCTTAGGGCATATGGAAACTAATGGGCTTGAAA
ATTATGATGGGCCCATTCCTTCTGGCAAAATTATTACAACCCAAATCAACATAAAACCAAATCAAATAATGGTGTATCAGCAACCCAATACGGAAGGGGTTGAAACCAAT
CACAACCCACTTGAAGGTAACAATATTGTAAGCCCAGGCCTTTTCCATGAAGCAACTATGATGGAAGCAGATCATTACTATGTCAAGCCCAAATCGACGGTGCAGGTCAA
AGAGAATCAGAAGAATAAACAGCCCATATCTGCAGACTCTGAAAGTGGAAAAAAGGATCGTTGGGAGCAGGGAAATATTTCTCTTGATAAGCTAAGGCTGTTAAAAGGCT
GGAAGAGGATTAACAGACAGGGAGATCACACTAAGGTCGATCAATCCACAGAACTCATGCAATTTGCTTCAAGGAAGAGGGTTGCAGAAAATGATAAATTGGAAGGTAGC
TCAGACAAGAAACTCAAATCACTAGTTCCACAAAAGGATGGAATTATATCGGCGGAGCCTGATCAGCAGGCCCGCCGAGAGCCATGA
Protein sequenceShow/hide protein sequence
MDIINKLSELKLSQKESSGINTSELGLHENQAKLQNCLVGKVVSQRTINIDAFRKTMTKVWYFCRGITIDNIGENLFLINPQSVLGRDRIVEESPWCFDGNLILLKTPTT
FDQPEDMIFDDPQFWVQIHHLPLGLRNGKVAQTLGNIIGIVQRVDTNEVDECWGPFLRVRVKMKINEPLCRGLTLNDGPSGKILAAIKYERLPDFCYYCGMIDHNDGSCP
LLKTQAKPEKQFGSWMRAASPPRKYHSNASGRGFEGTRGGRGRFGNRQGGKHSWRSESMEEGEHDNDEEPITTNSQLSNDGNGEQQTGTDRRTFSAIASDDPPPQSKLPA
AKTGIKCSISLDNANIMDPFPIIADADNLGKEGDGFNGCNGQKSLKISNITNGEGKGNDADESLGHMETNGLENYDGPIPSGKIITTQINIKPNQIMVYQQPNTEGVETN
HNPLEGNNIVSPGLFHEATMMEADHYYVKPKSTVQVKENQKNKQPISADSESGKKDRWEQGNISLDKLRLLKGWKRINRQGDHTKVDQSTELMQFASRKRVAENDKLEGS
SDKKLKSLVPQKDGIISAEPDQQARREP