; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035787 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035787
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr3:30180143..30181975
RNA-Seq ExpressionLag0035787
SyntenyLag0035787
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PQQ10307.1 uncharacterized protein Pyn_17609 [Prunus yedoensis var. nudiflora]4.4e-3229.93Show/hide
Query:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED
        K  +  +++   + +E  DI++    L  S+  K+ T+     E F   M +IW     + V+  G N+FL    T+ D++K+++  P  FD A+VL E 
Subjt:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED

Query:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC
        P G     +M  +YA FWI  HN+P  C        +GN++G  +DV    +G   G  LRIR+ +D+++P+ RG  + + S  +  ++   YE+LP+FC
Subjt:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC

Query:  YSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNERG----GRSGYPDQWSRD
        + C ++GHV +ECA  ++      E  YG  L+ T+      RA +++     G N     +G++ RG    G S    +W  D
Subjt:  YSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNERG----GRSGYPDQWSRD

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]3.4e-3232.86Show/hide
Query:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED
        KLSL + ++  +  I+    E   + L  S+  K +T+K+I  E F   +  IW  +  +T+E  G NIF  + +   D+ +I++G P +FD  +++  +
Subjt:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED

Query:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC
         +G+  V +++FRY  FWI  HNLP  C  R+    LG  +G+  ++++  +G   G+ +RIR+ +D+  P+KRG  + +G       + I YE+LP+FC
Subjt:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC

Query:  YSCDKIGHVKQEC
        Y C KIGH+ ++C
Subjt:  YSCDKIGHVKQEC

VVA32948.1 PREDICTED: DUF4283 domain-containing [Prunus dulcis]5.0e-3630.18Show/hide
Query:  EMLNDQIGKLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFD
        E++N  + K  +  +++R  + +  +DI++    L  S+  K+LT+     E F   M +IW     + V+  G N+FL    T+ D++++++  P  FD
Subjt:  EMLNDQIGKLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFD

Query:  DAIVLFEDPTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPIT
         A+VL E P G+    +M  +YA FWI  HN+P  C        +GN+ G  +DV    +G+  G  LR+R+ VD+++P++RG  + + S  +  ++   
Subjt:  DAIVLFEDPTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPIT

Query:  YEKLPDFCYSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNER
        YE+LP+FC+ C ++GHV +EC+  ++     DE  YG+ L+ T   K F+  ++    R  G      G G  E+
Subjt:  YEKLPDFCYSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNER

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]2.6e-3233.05Show/hide
Query:  LAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGR-ITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFEDPT
        L  +ED+  V I+   +E T + L+ S+ CK+L+ + I   V  + +   W ++ +  +V+  G NIFL       D+ +I++  P  FD A+++ ++P 
Subjt:  LAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGR-ITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFEDPT

Query:  GNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFCYS
              +M+FR    W+HF +L   C  +  A  LGNAIG F DV S+ N    G  LR+R++ D+ +P+ RG  + +       WIPI YE+LPDF Y 
Subjt:  GNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFCYS

Query:  CDKIGHVKQECAEEKEGSLDEGV-YGTMLRETQGSK
        C ++ H+ ++C++    S+ + + YG  LR  QG K
Subjt:  CDKIGHVKQECAEEKEGSLDEGV-YGTMLRETQGSK

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]2.5e-3532.2Show/hide
Query:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED
        K  L  +ED   + ++ D ++   + L  S+  K+L  ++I  +V S ++   W VE ++TVE  G+N+FL     + D  +++K  P  FD A+++ + 
Subjt:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED

Query:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC
        P  +  + E+EF    FWIH  +LP     +  A  LGNAIG F+DV+ +  G   G SLRIR+ +DI +P++RG  I +       WIPI YE+LPDFC
Subjt:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC

Query:  YSCDKIGHVKQECAEEKEGSLDE----GVYGTMLR-------ETQGSKGFYRAKSNEFGRNRGFNPRGRG-----RGRNERGGRSGYPDQWSRDQ
        Y C  IGH   +C      + D+      YG  LR         +G KG   A+ +  G +   N + RG     +  +E+  + G+  Q + +Q
Subjt:  YSCDKIGHVKQECAEEKEGSLDE----GVYGTMLR-------ETQGSKGFYRAKSNEFGRNRGFNPRGRG-----RGRNERGGRSGYPDQWSRDQ

TrEMBL top hitse value%identityAlignment
A0A314YVX1 CCHC-type domain-containing protein2.1e-3229.93Show/hide
Query:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED
        K  +  +++   + +E  DI++    L  S+  K+ T+     E F   M +IW     + V+  G N+FL    T+ D++K+++  P  FD A+VL E 
Subjt:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED

Query:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC
        P G     +M  +YA FWI  HN+P  C        +GN++G  +DV    +G   G  LRIR+ +D+++P+ RG  + + S  +  ++   YE+LP+FC
Subjt:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC

Query:  YSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNERG----GRSGYPDQWSRD
        + C ++GHV +ECA  ++      E  YG  L+ T+      RA +++     G N     +G++ RG    G S    +W  D
Subjt:  YSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNERG----GRSGYPDQWSRD

A0A5C7H9Y2 CCHC-type domain-containing protein1.6e-3232.86Show/hide
Query:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED
        KLSL + ++  +  I+    E   + L  S+  K +T+K+I  E F   +  IW  +  +T+E  G NIF  + +   D+ +I++G P +FD  +++  +
Subjt:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED

Query:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC
         +G+  V +++FRY  FWI  HNLP  C  R+    LG  +G+  ++++  +G   G+ +RIR+ +D+  P+KRG  + +G       + I YE+LP+FC
Subjt:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC

Query:  YSCDKIGHVKQEC
        Y C KIGH+ ++C
Subjt:  YSCDKIGHVKQEC

A0A5E4G034 PREDICTED: DUF4283 domain-containing2.4e-3630.18Show/hide
Query:  EMLNDQIGKLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFD
        E++N  + K  +  +++R  + +  +DI++    L  S+  K+LT+     E F   M +IW     + V+  G N+FL    T+ D++++++  P  FD
Subjt:  EMLNDQIGKLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFD

Query:  DAIVLFEDPTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPIT
         A+VL E P G+    +M  +YA FWI  HN+P  C        +GN+ G  +DV    +G+  G  LR+R+ VD+++P++RG  + + S  +  ++   
Subjt:  DAIVLFEDPTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPIT

Query:  YEKLPDFCYSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNER
        YE+LP+FC+ C ++GHV +EC+  ++     DE  YG+ L+ T   K F+  ++    R  G      G G  E+
Subjt:  YEKLPDFCYSCDKIGHVKQECA--EEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNER

A0A6J1BSZ1 uncharacterized protein LOC1110054811.2e-3233.05Show/hide
Query:  LAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGR-ITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFEDPT
        L  +ED+  V I+   +E T + L+ S+ CK+L+ + I   V  + +   W ++ +  +V+  G NIFL       D+ +I++  P  FD A+++ ++P 
Subjt:  LAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGR-ITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFEDPT

Query:  GNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFCYS
              +M+FR    W+HF +L   C  +  A  LGNAIG F DV S+ N    G  LR+R++ D+ +P+ RG  + +       WIPI YE+LPDF Y 
Subjt:  GNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFCYS

Query:  CDKIGHVKQECAEEKEGSLDEGV-YGTMLRETQGSK
        C ++ H+ ++C++    S+ + + YG  LR  QG K
Subjt:  CDKIGHVKQECAEEKEGSLDEGV-YGTMLRETQGSK

A0A6J1DU55 uncharacterized protein LOC1110231351.2e-3532.2Show/hide
Query:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED
        K  L  +ED   + ++ D ++   + L  S+  K+L  ++I  +V S ++   W VE ++TVE  G+N+FL     + D  +++K  P  FD A+++ + 
Subjt:  KLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVFSDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFED

Query:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC
        P  +  + E+EF    FWIH  +LP     +  A  LGNAIG F+DV+ +  G   G SLRIR+ +DI +P++RG  I +       WIPI YE+LPDFC
Subjt:  PTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFC

Query:  YSCDKIGHVKQECAEEKEGSLDE----GVYGTMLR-------ETQGSKGFYRAKSNEFGRNRGFNPRGRG-----RGRNERGGRSGYPDQWSRDQ
        Y C  IGH   +C      + D+      YG  LR         +G KG   A+ +  G +   N + RG     +  +E+  + G+  Q + +Q
Subjt:  YSCDKIGHVKQECAEEKEGSLDE----GVYGTMLR-------ETQGSKGFYRAKSNEFGRNRGFNPRGRG-----RGRNERGGRSGYPDQWSRDQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein1.4e-0723.43Show/hide
Query:  MPRIWG----VEGRITVEKEGRNIFLCKLRTQKDKIK-IIKGAPLIFDDAIVLFEDPTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEF
        MPRIWG    V GRI   ++   IF     T ++ ++ +++  P  F+D ++L +       +    F +  FW+    +P     R   E +G A+G+ 
Subjt:  MPRIWG----VEGRITVEKEGRNIFLCKLRTQKDKIK-IIKGAPLIFDDAIVLFEDPTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEF

Query:  IDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFCYSCDKIGHVKQECAEEKEG
        +D + +       +  R+ +  DI  P++   + +  +    T +   YE+L  FC  C  + H    C  +  G
Subjt:  IDVNSDGNGRISGESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFCYSCDKIGHVKQECAEEKEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAGAAGGGGCAACCTCAGGTTTGCCAAAAGGGATCCATGAGTGATCTAATAGGAGGCTTGGGTAAGGAGAGCGGTGGGGAAATGGCCGAGGAATGGGCCACAAT
CCAGATGGTTGAAGCAGGGGACGCAGAAGGCACCGCAACAGAAGTAGGGGAGTCAACAGAAATGCTCAACGATCAGATAGGAAAACTAAGTTTGGCGGAACAAGAGGACA
GGAGGGTGGTGGCTATTGAGGATGACGATATAGAAGAAACTAATAGAGATTTGGACAACTCGATAGCCTGCAAAATTCTAACCTCAAAGCTGATTCAATGGGAGGTTTTC
TCTGATATTATGCCACGAATATGGGGTGTTGAAGGAAGAATTACCGTTGAAAAAGAGGGGAGAAATATCTTCCTTTGCAAACTTCGAACCCAAAAGGATAAAATCAAAAT
AATCAAGGGAGCACCATTGATATTTGATGATGCGATTGTGTTGTTCGAGGATCCAACAGGGAACTGCTGCGTCAAAGAGATGGAATTCAGGTACGCTATGTTTTGGATTC
ATTTCCATAACTTACCTCGAGTATGTTTCTGCAGGAAATATGCGGAAGCCCTGGGCAATGCGATTGGAGAGTTCATAGACGTAAACTCAGATGGGAATGGAAGGATTAGC
GGCGAAAGCTTGAGGATCAGAATCAAGGTGGATATCAATGAGCCAATCAAAAGAGGAACGAATATTAGAGTTGGGTCAAAGGCAAGCAAGACATGGATTCCTATAACCTA
TGAAAAACTCCCTGACTTCTGCTACTCGTGCGACAAGATTGGCCACGTTAAGCAAGAATGTGCGGAAGAGAAGGAGGGATCACTAGATGAAGGGGTGTATGGAACAATGC
TAAGGGAAACACAGGGAAGTAAAGGCTTCTATCGAGCCAAAAGCAATGAATTTGGAAGGAATAGAGGTTTTAATCCACGAGGTCGGGGCAGGGGCAGAAATGAAAGAGGA
GGAAGGAGCGGCTACCCAGATCAGTGGAGTAGAGATCAGTTCAGAGGAGGCAACAGTAGGGACGAAACAATGGCTAATTTTCCGGCGAGAAGAGCTGAAACAGCTGACCG
GCAGGAGGATGATAAAACGACAAAATATGAGCTGCGAAAGGAACGAGGTGGAAAGAACAAGGTTGATGGGACAAAGCAGAGTAGGTTGGAACCTCAGGCTGTCAGGGAGT
ACAGTGAACAAAAAGGTTACAACCGTAACATGACAGACAGGAAAGAGAATGGAAAAAAAGGGGACAGGCCAGAAAGTAATTATGGATCTGGGGAATATGGGCCCGAGGCA
GCTGGAATTATGAATGGGTTGGACTTAGGGACCCCAAAAACCAGAGAGATCACTACCAGCCCAAAAGGAAAAGGAAAACAAAACACGATCCAATGCGGGTTAAGATTTGA
GAGCATACCTTCAGGGATTATAAAAGACCAAGTAACCGACAAAGGCAAGGGAATAACAGAGGTGGGAACAGAGGAGAAAACAGAAGAAAGGACAGAAAATGTCAAGAATA
GTGGAAAGAGTCGGATTCAACAATCTAAGGGTAAGAGTCCGATAAAGGAAAAAAGGAATTCTACAAAGAAATGGAAACGGCTTGCGAGAGGAGAACTTGGATCAAAAAAG
GTGGTGAATGATATGACAGGGACTAAGGGGATGGAAATAGACAGTAGAAAAAGGAAAATTTATGACGACAAGGACATTACGGGGGAAAACGGATCAAAGAGAGTTCACAG
GGATAATGCTTACTCCGAGATGCACGGAGGGATATCGGTGGAGGTTGGATGCCAGCCCCGCCGGACGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAAGAAGGGGCAACCTCAGGTTTGCCAAAAGGGATCCATGAGTGATCTAATAGGAGGCTTGGGTAAGGAGAGCGGTGGGGAAATGGCCGAGGAATGGGCCACAAT
CCAGATGGTTGAAGCAGGGGACGCAGAAGGCACCGCAACAGAAGTAGGGGAGTCAACAGAAATGCTCAACGATCAGATAGGAAAACTAAGTTTGGCGGAACAAGAGGACA
GGAGGGTGGTGGCTATTGAGGATGACGATATAGAAGAAACTAATAGAGATTTGGACAACTCGATAGCCTGCAAAATTCTAACCTCAAAGCTGATTCAATGGGAGGTTTTC
TCTGATATTATGCCACGAATATGGGGTGTTGAAGGAAGAATTACCGTTGAAAAAGAGGGGAGAAATATCTTCCTTTGCAAACTTCGAACCCAAAAGGATAAAATCAAAAT
AATCAAGGGAGCACCATTGATATTTGATGATGCGATTGTGTTGTTCGAGGATCCAACAGGGAACTGCTGCGTCAAAGAGATGGAATTCAGGTACGCTATGTTTTGGATTC
ATTTCCATAACTTACCTCGAGTATGTTTCTGCAGGAAATATGCGGAAGCCCTGGGCAATGCGATTGGAGAGTTCATAGACGTAAACTCAGATGGGAATGGAAGGATTAGC
GGCGAAAGCTTGAGGATCAGAATCAAGGTGGATATCAATGAGCCAATCAAAAGAGGAACGAATATTAGAGTTGGGTCAAAGGCAAGCAAGACATGGATTCCTATAACCTA
TGAAAAACTCCCTGACTTCTGCTACTCGTGCGACAAGATTGGCCACGTTAAGCAAGAATGTGCGGAAGAGAAGGAGGGATCACTAGATGAAGGGGTGTATGGAACAATGC
TAAGGGAAACACAGGGAAGTAAAGGCTTCTATCGAGCCAAAAGCAATGAATTTGGAAGGAATAGAGGTTTTAATCCACGAGGTCGGGGCAGGGGCAGAAATGAAAGAGGA
GGAAGGAGCGGCTACCCAGATCAGTGGAGTAGAGATCAGTTCAGAGGAGGCAACAGTAGGGACGAAACAATGGCTAATTTTCCGGCGAGAAGAGCTGAAACAGCTGACCG
GCAGGAGGATGATAAAACGACAAAATATGAGCTGCGAAAGGAACGAGGTGGAAAGAACAAGGTTGATGGGACAAAGCAGAGTAGGTTGGAACCTCAGGCTGTCAGGGAGT
ACAGTGAACAAAAAGGTTACAACCGTAACATGACAGACAGGAAAGAGAATGGAAAAAAAGGGGACAGGCCAGAAAGTAATTATGGATCTGGGGAATATGGGCCCGAGGCA
GCTGGAATTATGAATGGGTTGGACTTAGGGACCCCAAAAACCAGAGAGATCACTACCAGCCCAAAAGGAAAAGGAAAACAAAACACGATCCAATGCGGGTTAAGATTTGA
GAGCATACCTTCAGGGATTATAAAAGACCAAGTAACCGACAAAGGCAAGGGAATAACAGAGGTGGGAACAGAGGAGAAAACAGAAGAAAGGACAGAAAATGTCAAGAATA
GTGGAAAGAGTCGGATTCAACAATCTAAGGGTAAGAGTCCGATAAAGGAAAAAAGGAATTCTACAAAGAAATGGAAACGGCTTGCGAGAGGAGAACTTGGATCAAAAAAG
GTGGTGAATGATATGACAGGGACTAAGGGGATGGAAATAGACAGTAGAAAAAGGAAAATTTATGACGACAAGGACATTACGGGGGAAAACGGATCAAAGAGAGTTCACAG
GGATAATGCTTACTCCGAGATGCACGGAGGGATATCGGTGGAGGTTGGATGCCAGCCCCGCCGGACGCAATGA
Protein sequenceShow/hide protein sequence
MDKKGQPQVCQKGSMSDLIGGLGKESGGEMAEEWATIQMVEAGDAEGTATEVGESTEMLNDQIGKLSLAEQEDRRVVAIEDDDIEETNRDLDNSIACKILTSKLIQWEVF
SDIMPRIWGVEGRITVEKEGRNIFLCKLRTQKDKIKIIKGAPLIFDDAIVLFEDPTGNCCVKEMEFRYAMFWIHFHNLPRVCFCRKYAEALGNAIGEFIDVNSDGNGRIS
GESLRIRIKVDINEPIKRGTNIRVGSKASKTWIPITYEKLPDFCYSCDKIGHVKQECAEEKEGSLDEGVYGTMLRETQGSKGFYRAKSNEFGRNRGFNPRGRGRGRNERG
GRSGYPDQWSRDQFRGGNSRDETMANFPARRAETADRQEDDKTTKYELRKERGGKNKVDGTKQSRLEPQAVREYSEQKGYNRNMTDRKENGKKGDRPESNYGSGEYGPEA
AGIMNGLDLGTPKTREITTSPKGKGKQNTIQCGLRFESIPSGIIKDQVTDKGKGITEVGTEEKTEERTENVKNSGKSRIQQSKGKSPIKEKRNSTKKWKRLARGELGSKK
VVNDMTGTKGMEIDSRKRKIYDDKDITGENGSKRVHRDNAYSEMHGGISVEVGCQPRRTQ