; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002544 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002544
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:43720054..43721241
RNA-Seq ExpressionLag0002544
SyntenyLag0002544
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036875 - Zinc finger, CCHC-type superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG53482.1 hypothetical protein EZV62_022651 [Acer yangbiense]1.2e-3334.73Show/hide
Query:  VACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYC
        +A KIL+  +++ D F  ++ RIW + G V IE    N+Y   F+   D+ ++   GPW+FDD++++ E+ TG+  I+ L+F  V FWV   NLP +C  
Subjt:  VACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYC

Query:  RKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEGVE---EERKLSFG
        ++ AE L   IG   E ++   G   G+ +RVRV +D+ +PLRR  ++     E+   + + YE+LP FC+ CG LGH    C E         +   +G
Subjt:  RKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEGVE---EERKLSFG

Query:  VEMRETQSSKGIFK---NPKPN----MRPQYSRGRGRGR
          MR T   K +     NP+ N     RP Y    GRGR
Subjt:  VEMRETQSSKGIFK---NPKPN----MRPQYSRGRGRGR

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]1.5e-3331.1Show/hide
Query:  EEKAILSQMERLQIEEQKRVVD-IEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSF
        E   I  + E+L +++    ++ I+    E   + L  ++  K +T K+I+ + F   +  IW  +  V +E  G NI++ +F+   D+ RI+ GGPW F
Subjt:  EEKAILSQMERLQIEEQKRVVD-IEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSF

Query:  DDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISV
        D  +L+  + +GS  +  L+FRYV FW+   NLP  C  R+    L   +G  +E +  + G+  G+ +R+RV +DV  PL+RG  +  G  E    + +
Subjt:  DDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISV

Query:  TYEKLPDFCYYCGKLGHVHQEC--REEGVEEERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGRGRGRFLGGGREERSKSW
         YE+LP+FCYYCGK+GH+ ++C    + +       FG  MR    ++      K N  P+ SR  G    L   R + S  W
Subjt:  TYEKLPDFCYYCGKLGHVHQEC--REEGVEEERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGRGRGRFLGGGREERSKSW

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]1.3e-3234.24Show/hide
Query:  TNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFK
        T  ++K  V  K+ T K IS +    +M  +W +    R E  G NIY   FK   +K+R++  GPW+F+ ++L+    T +     + F + +FW+   
Subjt:  TNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFK

Query:  NLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQEC--REEGVEE
        N+P  C   + A  L   +G  EE E D      G  +RVRV++DV +PLRRG  +K    +D  W  + YEKLPDFCY CGK+GH  +EC  R + V  
Subjt:  NLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQEC--REEGVEE

Query:  ERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGR-GRGRFLGGGREERSKSWRQEE
             +G  +R T   K +     P     +  GR GRG  + GGR  R   WR+ +
Subjt:  ERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGR-GRGRFLGGGREERSKSWRQEE

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]1.4e-3435.51Show/hide
Query:  QMERLQIEEQKRVVDIEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFE
        Q  +L  EE +  +D++ D ++   + L  ++  K+L  +IIS D  S ++   W +E  + +E  G+N++   F  + D NR+M+ GPW FD A+++ +
Subjt:  QMERLQIEEQKRVVDIEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFE

Query:  KFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDF
        K   S +I  LEF  V+FW+H  +LP     +  A  L N+IG F + + +++G   G +LR+RV +D+ +PLRRG  I         WI + YE+LPDF
Subjt:  KFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDF

Query:  CYYCGKLGHVHQEC
        CY+CG +GH   +C
Subjt:  CYYCGKLGHVHQEC

XP_024955847.1 uncharacterized protein LOC112498636 [Citrus sinensis]1.4e-3134.78Show/hide
Query:  MPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAET
        M +IW     VRIE  G NI+  KF  + DK R+M GGPW FD A+++  +  G  DI    F +  FWVH +N+P     R   + L   IG+ EE +T
Subjt:  MPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAET

Query:  DDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEGVEEERKLSFGVEMRETQSSKGIFKNPKPNMRP
        D+EG   G+  R+R+ +D+ +PLR+   ++T   E +  I + YE+LPDFC+ CG +GH ++EC E   + ++ L +G  M+ T      F   K N R 
Subjt:  DDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEGVEEERKLSFGVEMRETQSSKGIFKNPKPNMRP

Query:  QYSRGRGRGRFLGGGREERSKSWRQEETRE
        ++ R + + +   G  + +S+   Q   R+
Subjt:  QYSRGRGRGRFLGGGREERSKSWRQEETRE

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein1.2e-3130.94Show/hide
Query:  EKAILSQMERLQIEEQKRVV-DIEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFD
        E  I    E L +E++   V +I +D I+D + D+   +  K+LT K ++ + F  ++ +IW   G V +E  G N +   F  +  +N++   GPW F 
Subjt:  EKAILSQMERLQIEEQKRVV-DIEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFD

Query:  DAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVT
         ++++ EK  G  DI  L+F    FWV   ++P +C  ++  + LA  IG   E  T+   +  G+ +RV+V++D+ +PL+R   IK G  E+   +++ 
Subjt:  DAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVT

Query:  YEKLPDFCYYCGKLGHVHQECREEGVE----EERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGRGRGRFLGGGRE
        YE+LPDFC+ CG++GH  +EC +E  +    + ++  FG  MR T     I K+   ++         RGR + G RE
Subjt:  YEKLPDFCYYCGKLGHVHQECREEGVE----EERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGRGRGRFLGGGRE

A0A5C7H9Y2 CCHC-type domain-containing protein7.3e-3431.1Show/hide
Query:  EEKAILSQMERLQIEEQKRVVD-IEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSF
        E   I  + E+L +++    ++ I+    E   + L  ++  K +T K+I+ + F   +  IW  +  V +E  G NI++ +F+   D+ RI+ GGPW F
Subjt:  EEKAILSQMERLQIEEQKRVVD-IEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSF

Query:  DDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISV
        D  +L+  + +GS  +  L+FRYV FW+   NLP  C  R+    L   +G  +E +  + G+  G+ +R+RV +DV  PL+RG  +  G  E    + +
Subjt:  DDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISV

Query:  TYEKLPDFCYYCGKLGHVHQEC--REEGVEEERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGRGRGRFLGGGREERSKSW
         YE+LP+FCYYCGK+GH+ ++C    + +       FG  MR    ++      K N  P+ SR  G    L   R + S  W
Subjt:  TYEKLPDFCYYCGKLGHVHQEC--REEGVEEERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGRGRGRFLGGGREERSKSW

A0A5C7HA98 CCHC-type domain-containing protein5.6e-3434.73Show/hide
Query:  VACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYC
        +A KIL+  +++ D F  ++ RIW + G V IE    N+Y   F+   D+ ++   GPW+FDD++++ E+ TG+  I+ L+F  V FWV   NLP +C  
Subjt:  VACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYC

Query:  RKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEGVE---EERKLSFG
        ++ AE L   IG   E ++   G   G+ +RVRV +D+ +PLRR  ++     E+   + + YE+LP FC+ CG LGH    C E         +   +G
Subjt:  RKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEGVE---EERKLSFG

Query:  VEMRETQSSKGIFK---NPKPN----MRPQYSRGRGRGR
          MR T   K +     NP+ N     RP Y    GRGR
Subjt:  VEMRETQSSKGIFK---NPKPN----MRPQYSRGRGRGR

A0A6J1D765 uncharacterized protein LOC1110179026.2e-3334.24Show/hide
Query:  TNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFK
        T  ++K  V  K+ T K IS +    +M  +W +    R E  G NIY   FK   +K+R++  GPW+F+ ++L+    T +     + F + +FW+   
Subjt:  TNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFK

Query:  NLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQEC--REEGVEE
        N+P  C   + A  L   +G  EE E D      G  +RVRV++DV +PLRRG  +K    +D  W  + YEKLPDFCY CGK+GH  +EC  R + V  
Subjt:  NLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQEC--REEGVEE

Query:  ERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGR-GRGRFLGGGREERSKSWRQEE
             +G  +R T   K +     P     +  GR GRG  + GGR  R   WR+ +
Subjt:  ERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGR-GRGRFLGGGREERSKSWRQEE

A0A6J1DU55 uncharacterized protein LOC1110231356.6e-3535.51Show/hide
Query:  QMERLQIEEQKRVVDIEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFE
        Q  +L  EE +  +D++ D ++   + L  ++  K+L  +IIS D  S ++   W +E  + +E  G+N++   F  + D NR+M+ GPW FD A+++ +
Subjt:  QMERLQIEEQKRVVDIEDDDIEDTNRDLKNTVACKILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFE

Query:  KFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDF
        K   S +I  LEF  V+FW+H  +LP     +  A  L N+IG F + + +++G   G +LR+RV +D+ +PLRRG  I         WI + YE+LPDF
Subjt:  KFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDF

Query:  CYYCGKLGHVHQEC
        CY+CG +GH   +C
Subjt:  CYYCGKLGHVHQEC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding8.0e-0925.79Show/hide
Query:  EKTGR--NIYQCKFKYQRDKN--RIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGE
        E  GR   I++ +F +Q +++   I+R GPWSF+D + + +++T     +A EF+ + FW+  + +P                                 
Subjt:  EKTGR--NIYQCKFKYQRDKN--RIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKMRGE

Query:  TLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEG
         L  R+   +G+  R G  ++T    D   +   YEKL +FC  CG L H   EC   G
Subjt:  TLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEG

AT5G36228.1 nucleic acid binding;zinc ion binding9.2e-1324.31Show/hide
Query:  KILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEF-RYVSFWVHFKNLPRVCYCRK
        +IL  +  S +   + +P  WG+   V         +Q +F+ + D    +R  PW F++  +  +++    D    +F  ++  WVH + +P      +
Subjt:  KILTFKIISEDYFSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEF-RYVSFWVHFKNLPRVCYCRK

Query:  YAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQEC
          E +A+++G     + ++E   +   +RV+VRMD  +PLR    ++  S+E    I   YEKL   C  C ++ H    C
Subjt:  YAEALANSIGTFEEAETDDEGKMRGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGACAGGGGTCCAAGAGTGACCGTAAAGGAGCTTCGGAGAAGATGGAGATATCCATCCCAGCGGGAAACCAAAGAGAACCAGTAGCACTCGTATCTCACATAGA
AAACGAGACAGAGGAACAAACCAACCAAGCGAAGGAAACGATGGAGCAGGATAACGATGAGGAAGAAAAGGCAATCCTAAGCCAGATGGAAAGACTACAGATAGAAGAGC
AAAAAAGGGTAGTAGACATCGAAGATGATGACATTGAAGACACCAACAGAGATCTAAAAAACACAGTGGCCTGCAAGATACTGACCTTCAAAATAATCAGTGAAGATTAT
TTCAGCATAATGATGCCGAGGATTTGGGGTATAGAAGGATTGGTCAGAATAGAAAAAACGGGTAGAAACATTTATCAATGCAAGTTCAAATATCAAAGAGACAAAAACAG
AATCATGAGAGGAGGACCATGGAGCTTTGATGACGCTATACTAATCTTCGAAAAGTTCACAGGAAGCTGCGACATCGAAGCCTTGGAATTCAGGTATGTGTCCTTCTGGG
TACACTTCAAAAACTTACCCAGAGTTTGCTATTGCAGGAAATACGCGGAAGCTCTGGCCAACTCGATAGGGACATTCGAAGAAGCAGAAACAGATGATGAAGGGAAGATG
AGAGGAGAGACTTTACGAGTTAGAGTCCGAATGGATGTGGGTCAACCGCTAAGAAGAGGAACGAACATCAAGACCGGATCAAAAGAGGACACAAAATGGATATCAGTTAC
ATATGAGAAACTACCTGACTTCTGTTATTATTGTGGAAAACTGGGGCACGTTCACCAGGAATGCAGAGAAGAAGGAGTAGAAGAGGAAAGGAAACTGAGCTTTGGTGTTG
AAATGAGAGAAACTCAATCGAGTAAAGGAATATTCAAAAACCCAAAACCAAACATGAGACCCCAATACTCCAGGGGAAGAGGCAGAGGCAGATTCCTTGGCGGAGGAAGA
GAGGAAAGAAGCAAATCTTGGAGGCAAGAGGAAACAAGGGAAGAACCAGTTATACCAAAAAAAGCCAAAGCCAGCTATAGACCCCAGACTAGAACCGCCGACCGGAAAAA
TGAGGACGAAACAAGAACCAAAAAACCTGAAGGGGAAAACCAAAAGGGGGCAGAAGCCAATAGCACCGACGACCATCGGAAACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGACAGGGGTCCAAGAGTGACCGTAAAGGAGCTTCGGAGAAGATGGAGATATCCATCCCAGCGGGAAACCAAAGAGAACCAGTAGCACTCGTATCTCACATAGA
AAACGAGACAGAGGAACAAACCAACCAAGCGAAGGAAACGATGGAGCAGGATAACGATGAGGAAGAAAAGGCAATCCTAAGCCAGATGGAAAGACTACAGATAGAAGAGC
AAAAAAGGGTAGTAGACATCGAAGATGATGACATTGAAGACACCAACAGAGATCTAAAAAACACAGTGGCCTGCAAGATACTGACCTTCAAAATAATCAGTGAAGATTAT
TTCAGCATAATGATGCCGAGGATTTGGGGTATAGAAGGATTGGTCAGAATAGAAAAAACGGGTAGAAACATTTATCAATGCAAGTTCAAATATCAAAGAGACAAAAACAG
AATCATGAGAGGAGGACCATGGAGCTTTGATGACGCTATACTAATCTTCGAAAAGTTCACAGGAAGCTGCGACATCGAAGCCTTGGAATTCAGGTATGTGTCCTTCTGGG
TACACTTCAAAAACTTACCCAGAGTTTGCTATTGCAGGAAATACGCGGAAGCTCTGGCCAACTCGATAGGGACATTCGAAGAAGCAGAAACAGATGATGAAGGGAAGATG
AGAGGAGAGACTTTACGAGTTAGAGTCCGAATGGATGTGGGTCAACCGCTAAGAAGAGGAACGAACATCAAGACCGGATCAAAAGAGGACACAAAATGGATATCAGTTAC
ATATGAGAAACTACCTGACTTCTGTTATTATTGTGGAAAACTGGGGCACGTTCACCAGGAATGCAGAGAAGAAGGAGTAGAAGAGGAAAGGAAACTGAGCTTTGGTGTTG
AAATGAGAGAAACTCAATCGAGTAAAGGAATATTCAAAAACCCAAAACCAAACATGAGACCCCAATACTCCAGGGGAAGAGGCAGAGGCAGATTCCTTGGCGGAGGAAGA
GAGGAAAGAAGCAAATCTTGGAGGCAAGAGGAAACAAGGGAAGAACCAGTTATACCAAAAAAAGCCAAAGCCAGCTATAGACCCCAGACTAGAACCGCCGACCGGAAAAA
TGAGGACGAAACAAGAACCAAAAAACCTGAAGGGGAAAACCAAAAGGGGGCAGAAGCCAATAGCACCGACGACCATCGGAAACCATGA
Protein sequenceShow/hide protein sequence
MDGQGSKSDRKGASEKMEISIPAGNQREPVALVSHIENETEEQTNQAKETMEQDNDEEEKAILSQMERLQIEEQKRVVDIEDDDIEDTNRDLKNTVACKILTFKIISEDY
FSIMMPRIWGIEGLVRIEKTGRNIYQCKFKYQRDKNRIMRGGPWSFDDAILIFEKFTGSCDIEALEFRYVSFWVHFKNLPRVCYCRKYAEALANSIGTFEEAETDDEGKM
RGETLRVRVRMDVGQPLRRGTNIKTGSKEDTKWISVTYEKLPDFCYYCGKLGHVHQECREEGVEEERKLSFGVEMRETQSSKGIFKNPKPNMRPQYSRGRGRGRFLGGGR
EERSKSWRQEETREEPVIPKKAKASYRPQTRTADRKNEDETRTKKPEGENQKGAEANSTDDHRKP