; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025555 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025555
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr10:15158991..15160525
RNA-Seq ExpressionLag0025555
SyntenyLag0025555
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015380691.1 uncharacterized protein LOC107174364 [Citrus sinensis]2.0e-3536.23Show/hide
Query:  KKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKL
        K L   +V K+   + ++ E F S + ++W+  + + I+ +G N F+ KF     K ++L  GPW +D+ALL+L EPKG        F + +FWI    +
Subjt:  KKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKL

Query:  PHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKR-GDKDKWIDVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEEDL
        P AC  ++    +G ++G VE ++ T+E     G   +I++ +++T+PLK+ +FLK+ G+ D  + V YE+LPDFCY CG +GH  KEC K  G  +E L
Subjt:  PHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKR-GDKDKWIDVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEEDL

Query:  PYDPWLR
        PY  W++
Subjt:  PYDPWLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.2e-3533.33Show/hide
Query:  AESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE--QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF
        A +++ ++ + K+T  E      +    ++ + K LE +++CK+ +++ IS  V  + +   WK +     +D +GFN+FL  F  +  + +IL MGPW 
Subjt:  AESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE--QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF

Query:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI
        +D+AL++++ P       DM+FR VS W+HF  L  AC ++   T +G+ +G  E V+ +      WG  L+++++ DV  PL RGI L         WI
Subjt:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI

Query:  DVTYEKLPDFCYGCGHLGHTLKECEKDS-GTNEEDLPYDPWLR
         + YE+LPDF Y CG L H LK+C      +  ++L Y PWLR
Subjt:  DVTYEKLPDFCYGCGHLGHTLKECEKDS-GTNEEDLPYDPWLR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]4.2e-3833.77Show/hide
Query:  KVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWK-QEQTIIDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEPK
        K T+ E  +V  +  G+  L+   ++  VV K+ T K+IS E   S+M  +W+    T  + +G N+++  F++   K ++L  GPW ++K+LL+L  P 
Subjt:  KVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWK-QEQTIIDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEPK

Query:  GDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRW-GCSLQIKIKVDVTIPLKRGIFLKRGD-KDKWIDVTYEKLPDFCY
              DM F + +FWI  H +P  C S +    +G+ LG VE  ++  +    W G  +++++K+DV+ PL+RGI LK  D KD W  + YEKLPDFCY
Subjt:  GDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRW-GCSLQIKIKVDVTIPLKRGIFLKRGD-KDKWIDVTYEKLPDFCY

Query:  GCGHLGHTLKECEKDSGTNEEDLP--YDPWLREP-VKLKVREFDSGRYTQFSYQGRG-RGRGREEGRGSWRNKGDQEVKDINTASQNLSGGGKEGVP-VE
         CG +GH+ +ECE+ S     + P  Y  WLR   +K  V   +   + +    GRG +  G   GRG WR + D+  +DI+    +     +EGV  V 
Subjt:  GCGHLGHTLKECEKDSGTNEEDLP--YDPWLREP-VKLKVREFDSGRYTQFSYQGRG-RGRGREEGRGSWRNKGDQEVKDINTASQNLSGGGKEGVP-VE

Query:  QPENMAAS
          E++ A+
Subjt:  QPENMAAS

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.1e-3331.71Show/hide
Query:  DAESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE-QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF
        D E+++  +   K+T  E      +    + ++ + L  ++V K+  ++ IS +V S ++   WK E Q  ++ +G N+FL  F       ++++ GPWF
Subjt:  DAESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE-QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF

Query:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI
        +DKAL++L++P       ++EF  V+FWIH   LP +  ++     +G+ +G    VD   E+   WG SL+I++ +D+T PL+RGI +         WI
Subjt:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI

Query:  DVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEED----LPYDPWLR
         + YE+LPDFCY CG +GH+  +C+      ++D      Y PWLR
Subjt:  DVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEED----LPYDPWLR

XP_035541689.1 uncharacterized protein LOC118344688 [Juglans regia]2.8e-3433.48Show/hide
Query:  LKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEP
        LK+T+ E+   Y+L E EI  S+    + +V  +   ++++   F + M ++W  E  I    +G N FL KF N  ++ ++L   PW +D+ L+ ++E 
Subjt:  LKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEP

Query:  KGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDKDKWIDVTYEKLPDFCYG
        KG +  +D++F    FW+  H LP A  ++ +   +G+  G V MVD+ +E+   WG  L++K+ ++++ PL RG  +  GD   WI   YE+LP FCY 
Subjt:  KGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDKDKWIDVTYEKLPDFCYG

Query:  CGHLGHTLKECEK---DSGTNEEDLP-YDPWLR
        CG + H+   C +   D  T+ E  P Y PWLR
Subjt:  CGHLGHTLKECEK---DSGTNEEDLP-YDPWLR

TrEMBL top hitse value%identityAlignment
A0A1S8AC25 CCHC-type domain-containing protein (Fragment)1.5e-3331.75Show/hide
Query:  DAESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWF
        D E ++++   ++++D E   V      +I+   K L   +V K+   +++S E     M ++W+  + + I+++G NVF+ KF +   K  I+  GPW 
Subjt:  DAESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWF

Query:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDKDK---W
        +D+AL+ L EP G    +  +F +VSFW+  H +P  C S+D    +G ++G VE V+ T+     +G  L+++I VD+T PLK+ I L++ ++D     
Subjt:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDKDK---W

Query:  IDVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEEDLPYDPWLREPVKLKVREFDSGRYTQFSYQGRGRGRGREE
        + V YE+LPDFC+ CG +GH  +EC      ++++L Y PWL+           +    +   QGRGR R   E
Subjt:  IDVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEEDLPYDPWLREPVKLKVREFDSGRYTQFSYQGRGRGRGREE

A0A6J1BSZ1 uncharacterized protein LOC1110054815.6e-3633.33Show/hide
Query:  AESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE--QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF
        A +++ ++ + K+T  E      +    ++ + K LE +++CK+ +++ IS  V  + +   WK +     +D +GFN+FL  F  +  + +IL MGPW 
Subjt:  AESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE--QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF

Query:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI
        +D+AL++++ P       DM+FR VS W+HF  L  AC ++   T +G+ +G  E V+ +      WG  L+++++ DV  PL RGI L         WI
Subjt:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI

Query:  DVTYEKLPDFCYGCGHLGHTLKECEKDS-GTNEEDLPYDPWLR
         + YE+LPDF Y CG L H LK+C      +  ++L Y PWLR
Subjt:  DVTYEKLPDFCYGCGHLGHTLKECEKDS-GTNEEDLPYDPWLR

A0A6J1D765 uncharacterized protein LOC1110179022.1e-3833.77Show/hide
Query:  KVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWK-QEQTIIDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEPK
        K T+ E  +V  +  G+  L+   ++  VV K+ T K+IS E   S+M  +W+    T  + +G N+++  F++   K ++L  GPW ++K+LL+L  P 
Subjt:  KVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWK-QEQTIIDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEPK

Query:  GDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRW-GCSLQIKIKVDVTIPLKRGIFLKRGD-KDKWIDVTYEKLPDFCY
              DM F + +FWI  H +P  C S +    +G+ LG VE  ++  +    W G  +++++K+DV+ PL+RGI LK  D KD W  + YEKLPDFCY
Subjt:  GDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRW-GCSLQIKIKVDVTIPLKRGIFLKRGD-KDKWIDVTYEKLPDFCY

Query:  GCGHLGHTLKECEKDSGTNEEDLP--YDPWLREP-VKLKVREFDSGRYTQFSYQGRG-RGRGREEGRGSWRNKGDQEVKDINTASQNLSGGGKEGVP-VE
         CG +GH+ +ECE+ S     + P  Y  WLR   +K  V   +   + +    GRG +  G   GRG WR + D+  +DI+    +     +EGV  V 
Subjt:  GCGHLGHTLKECEKDSGTNEEDLP--YDPWLREP-VKLKVREFDSGRYTQFSYQGRG-RGRGREEGRGSWRNKGDQEVKDINTASQNLSGGGKEGVP-VE

Query:  QPENMAAS
          E++ A+
Subjt:  QPENMAAS

A0A6J1DU55 uncharacterized protein LOC1110231352.0e-3331.71Show/hide
Query:  DAESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE-QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF
        D E+++  +   K+T  E      +    + ++ + L  ++V K+  ++ IS +V S ++   WK E Q  ++ +G N+FL  F       ++++ GPWF
Subjt:  DAESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQE-QTIIDQVGFNVFLCKFRNARIKGQILEMGPWF

Query:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI
        +DKAL++L++P       ++EF  V+FWIH   LP +  ++     +G+ +G    VD   E+   WG SL+I++ +D+T PL+RGI +         WI
Subjt:  YDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDK--DKWI

Query:  DVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEED----LPYDPWLR
         + YE+LPDFCY CG +GH+  +C+      ++D      Y PWLR
Subjt:  DVTYEKLPDFCYGCGHLGHTLKECEKDSGTNEED----LPYDPWLR

A0A6P9E2G0 uncharacterized protein LOC1183446881.4e-3433.48Show/hide
Query:  LKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEP
        LK+T+ E+   Y+L E EI  S+    + +V  +   ++++   F + M ++W  E  I    +G N FL KF N  ++ ++L   PW +D+ L+ ++E 
Subjt:  LKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTI-IDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLLLEEP

Query:  KGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDKDKWIDVTYEKLPDFCYG
        KG +  +D++F    FW+  H LP A  ++ +   +G+  G V MVD+ +E+   WG  L++K+ ++++ PL RG  +  GD   WI   YE+LP FCY 
Subjt:  KGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDKDKWIDVTYEKLPDFCYG

Query:  CGHLGHTLKECEK---DSGTNEEDLP-YDPWLR
        CG + H+   C +   D  T+ E  P Y PWLR
Subjt:  CGHLGHTLKECEK---DSGTNEEDLP-YDPWLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding3.5e-0623.74Show/hide
Query:  FRNARIKGQILEMGPWFYDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPL
        F++      IL  GPW ++  + +++  +      D EF+ + FWI    +P    +    T IG                 R G  L+  +  DV++  
Subjt:  FRNARIKGQILEMGPWFYDKALLLLEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPL

Query:  KRGIFLKRGDKDKWIDVTYEKLPDFCYGCGHLGHTLKEC
                      +   YEKL +FC  CG L H   EC
Subjt:  KRGIFLKRGDKDKWIDVTYEKLPDFCYGCGHLGHTLKEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACAATGATGCAGAGTCGATCGTTCGGAAGTTTGGTGACCTGAAAGTGACAGATGCAGAGAAATCAAGTGTTTATCATCTACAAGAAGGAGAAATTGATTTATC
GAGGAAGAAATTGGAAAATGCTGTTGTATGCAAAATATTTACCCAAAAGAAAATCTCTCCAGAGGTGTTTTCTTCAATGATGCCAAAGATCTGGAAGCAGGAGCAAACTA
TTATTGATCAGGTGGGATTTAATGTTTTTCTGTGCAAGTTCCGGAATGCCCGAATCAAAGGCCAAATCTTGGAGATGGGTCCTTGGTTCTATGATAAAGCGCTTTTGTTA
TTGGAGGAACCAAAAGGCGACATATACAGTGAAGATATGGAATTTCGGTACGTATCATTTTGGATTCATTTTCATAAACTACCCCATGCATGTTTTTCCAGGGATTCGAC
CACGGGGATTGGAAGCCTACTGGGCACGGTAGAGATGGTGGATCTAACGGAGGAGGAGGAGCATCGTTGGGGTTGTTCATTGCAGATTAAGATCAAGGTGGATGTAACTA
TTCCATTGAAACGAGGAATATTTCTTAAAAGAGGTGATAAGGATAAATGGATTGATGTGACCTACGAAAAACTTCCGGATTTTTGTTATGGATGTGGCCACTTAGGACAC
ACGTTGAAAGAATGTGAGAAGGATAGTGGCACCAATGAGGAAGATCTTCCATACGACCCATGGTTGAGAGAACCAGTGAAATTAAAAGTTCGCGAGTTTGACTCAGGACG
CTATACTCAGTTCAGCTACCAAGGGAGAGGAAGAGGACGGGGCCGGGAGGAAGGGAGAGGGAGCTGGAGGAATAAGGGAGATCAAGAGGTAAAAGATATTAACACTGCAA
GTCAAAATCTGAGTGGAGGTGGAAAGGAAGGTGTTCCGGTGGAGCAACCGGAAAACATGGCAGCTTCCGGCAAGCCGGTGACCTCTACCTTGGACAACCTTGTAACGGCT
AAGTTGGCTACTAAGGAAAGTGAGAAAAACACGGAGCAAATTAATTCTAATGGCGAGAATCAAGGTGATGTTGGCATTATGGCGTCAAACGGTCAGAAAATCCATATTAA
TGAACCTGTTGTACAACAAGAAGATGCTGAAAAAGGAAAAAATGGAAAAGAAATAACTTCATCTGATGTAATACAGCCCTATGCATCTGCCGAAAAAGGATTAATTTCAC
CTGTTGTAGTACAGACCCATGCAACTATTGGTAAGAGTAGGACATGGAAAAGAGCTGCTCGGATGCTGGAAGGGGAGAGATCCTGTGTGAACATAAAGTCAAGTCAACAA
TCTAAAGTGGGGCTCAAACGTAGTCTCGAAAGTGATGAGGACGGAGGAAGCTCAAAGAAAGTTATGGTTTCTAGAGAAATTGATATTGAAAGATCGGTGGAGGCTGCTGG
ACAGCCCCGCCGGACACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAACAATGATGCAGAGTCGATCGTTCGGAAGTTTGGTGACCTGAAAGTGACAGATGCAGAGAAATCAAGTGTTTATCATCTACAAGAAGGAGAAATTGATTTATC
GAGGAAGAAATTGGAAAATGCTGTTGTATGCAAAATATTTACCCAAAAGAAAATCTCTCCAGAGGTGTTTTCTTCAATGATGCCAAAGATCTGGAAGCAGGAGCAAACTA
TTATTGATCAGGTGGGATTTAATGTTTTTCTGTGCAAGTTCCGGAATGCCCGAATCAAAGGCCAAATCTTGGAGATGGGTCCTTGGTTCTATGATAAAGCGCTTTTGTTA
TTGGAGGAACCAAAAGGCGACATATACAGTGAAGATATGGAATTTCGGTACGTATCATTTTGGATTCATTTTCATAAACTACCCCATGCATGTTTTTCCAGGGATTCGAC
CACGGGGATTGGAAGCCTACTGGGCACGGTAGAGATGGTGGATCTAACGGAGGAGGAGGAGCATCGTTGGGGTTGTTCATTGCAGATTAAGATCAAGGTGGATGTAACTA
TTCCATTGAAACGAGGAATATTTCTTAAAAGAGGTGATAAGGATAAATGGATTGATGTGACCTACGAAAAACTTCCGGATTTTTGTTATGGATGTGGCCACTTAGGACAC
ACGTTGAAAGAATGTGAGAAGGATAGTGGCACCAATGAGGAAGATCTTCCATACGACCCATGGTTGAGAGAACCAGTGAAATTAAAAGTTCGCGAGTTTGACTCAGGACG
CTATACTCAGTTCAGCTACCAAGGGAGAGGAAGAGGACGGGGCCGGGAGGAAGGGAGAGGGAGCTGGAGGAATAAGGGAGATCAAGAGGTAAAAGATATTAACACTGCAA
GTCAAAATCTGAGTGGAGGTGGAAAGGAAGGTGTTCCGGTGGAGCAACCGGAAAACATGGCAGCTTCCGGCAAGCCGGTGACCTCTACCTTGGACAACCTTGTAACGGCT
AAGTTGGCTACTAAGGAAAGTGAGAAAAACACGGAGCAAATTAATTCTAATGGCGAGAATCAAGGTGATGTTGGCATTATGGCGTCAAACGGTCAGAAAATCCATATTAA
TGAACCTGTTGTACAACAAGAAGATGCTGAAAAAGGAAAAAATGGAAAAGAAATAACTTCATCTGATGTAATACAGCCCTATGCATCTGCCGAAAAAGGATTAATTTCAC
CTGTTGTAGTACAGACCCATGCAACTATTGGTAAGAGTAGGACATGGAAAAGAGCTGCTCGGATGCTGGAAGGGGAGAGATCCTGTGTGAACATAAAGTCAAGTCAACAA
TCTAAAGTGGGGCTCAAACGTAGTCTCGAAAGTGATGAGGACGGAGGAAGCTCAAAGAAAGTTATGGTTTCTAGAGAAATTGATATTGAAAGATCGGTGGAGGCTGCTGG
ACAGCCCCGCCGGACACAATGA
Protein sequenceShow/hide protein sequence
MANNDAESIVRKFGDLKVTDAEKSSVYHLQEGEIDLSRKKLENAVVCKIFTQKKISPEVFSSMMPKIWKQEQTIIDQVGFNVFLCKFRNARIKGQILEMGPWFYDKALLL
LEEPKGDIYSEDMEFRYVSFWIHFHKLPHACFSRDSTTGIGSLLGTVEMVDLTEEEEHRWGCSLQIKIKVDVTIPLKRGIFLKRGDKDKWIDVTYEKLPDFCYGCGHLGH
TLKECEKDSGTNEEDLPYDPWLREPVKLKVREFDSGRYTQFSYQGRGRGRGREEGRGSWRNKGDQEVKDINTASQNLSGGGKEGVPVEQPENMAASGKPVTSTLDNLVTA
KLATKESEKNTEQINSNGENQGDVGIMASNGQKIHINEPVVQQEDAEKGKNGKEITSSDVIQPYASAEKGLISPVVVQTHATIGKSRTWKRAARMLEGERSCVNIKSSQQ
SKVGLKRSLESDEDGGSSKKVMVSREIDIERSVEAAGQPRRTQ