CuGenDBv2

Lag0037043 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0037043
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr2:2968539..2969948
RNA-Seq Expression: Lag0037043
Synteny: Lag0037043
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
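
The genome location string in this record can be converted into a span length with a few lines of Python. This is only a minimal sketch: `parse_location` and `span_length` are hypothetical helper names, and the code assumes the `seqid:start..end` form with 1-based, inclusive coordinates, a convention this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("chr2:2968539..2969948")
print(seqid, span_length(start, end))  # chr2 1410
```

Under that assumption the locus spans 1410 bp, which can be compared with the length of the CDS listed under Sequences.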


Homology
GenBank top hits (e value, %identity, alignment)
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]; e value 3.1e-38; %identity 34.66
Query:  EAVLTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYD
        EA +T++ E      E A V  I ++ +++  ED+   ++ KVLT K +N E FK  + +IWN+ G+V ++  G N F   F N+  ++++   GPW + 
Subjt:  EAVLTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYD

Query:  KALVVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKIT
        K+L+V+E  KG   I+   F    FW+ IHD+P++CM ++    L   +G   E+   E        +R+ V +DI++PLKR L +K+G   E T V + 
Subjt:  KALVVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKIT

Query:  YERLPEFCYDCGLVGHVQTECEETGQKE-----EEQLYGEWMRATPIIGGPSKQKFHENRTRNYWGRGRGRRGDFQI
        YERLP+FC+ CG +GH   EC +   K      ++  +G WMRATPI    SK K     T +   RGR   G  ++
Subjt:  YERLPEFCYDCGLVGHVQTECEETGQKE-----EEQLYGEWMRATPIIGGPSKQKFHENRTRNYWGRGRGRRGDFQI

TXG48811.1 hypothetical protein EZV62_024686 [Acer yangbiense]; e value 2.7e-37; %identity 38.02
Query:  AERAKVVAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALV
        A+  + ++I D+D E          +  ED+   ++ KVL  K +N E FK+ + +IW+  G V I+    NIF   FNN  ++DRI   GPW++D+ L+
Subjt:  AERAKVVAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALV

Query:  VIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER
        V E L+G   IS   F   KFW+ IHD+P++CM R+ A  L   +G  E +D+  E+ D     LR+ V IDIS PLKR L +K+  +     V + YER
Subjt:  VIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER

Query:  LPEFCYDCGLVGHVQTECEETGQKEEE-----QLYGEWMRAT
        L EFCY CG +GH  +EC +   K+E        +G W+RA+
Subjt:  LPEFCYDCGLVGHVQTECEETGQKEEE-----QLYGEWMRAT

TXG63523.1 hypothetical protein EZV62_010517 [Acer yangbiense]; e value 3.5e-37; %identity 37.29
Query:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK
        ++I DED E          +  ED+   ++ KVL+ + +N E FK+ + ++W+  G V I+  G NIF   FNN  ++DRI   GPW++D++L+V+E  +
Subjt:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK

Query:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCY
        G   IS   F   +FW+ IHD+P++CM R+ A  L   +G  E +D+  E+ D     L++ V IDIS PLKR L +K+  +     V + YERLPEFCY
Subjt:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCY

Query:  DCGLVGHVQTECEETGQKEE-----EQLYGEWMRAT
         CG +GH  ++  +   K+E        +G WMRA+
Subjt:  DCGLVGHVQTECEETGQKEE-----EQLYGEWMRAT

TXG68535.1 hypothetical protein EZV62_003470 [Acer yangbiense]; e value 2.4e-38; %identity 38.14
Query:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK
        ++I+DED E          +  ED+   ++ KVL+ K +N E FKT + ++W+  G V I+  G N F   FNN  ++DRI + GPW++D++L+V+E  +
Subjt:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK

Query:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCYD
        G   IS   F   +FW+ IHD+P++CM ++ A  L   +G   E+ + E        LR+ V IDIS+PLKR L +K+  +     V + YERLPEFCY 
Subjt:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCYD

Query:  CGLVGHVQTECEETGQKEE-----EQLYGEWMRATP
        CG VGH   +C +T  K+E     +  +G W+RA+P
Subjt:  CGLVGHVQTECEETGQKEE-----EQLYGEWMRATP

TXG72599.1 hypothetical protein EZV62_001178 [Acer yangbiense]; e value 5.9e-37; %identity 38.22
Query:  LTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKAL
        + K+ +    A E   V+ + +E + +  ED+   ++ KVLT K IN E FK  + +IW+  G+V ++  G N+F   F NR ++DR+ + GPW++  +L
Subjt:  LTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKAL

Query:  VVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER
        + +E   G   ++  +F    FWI IHD+P++CM R+ A  L   +G   E+ L E      N LR+ V IDIS+PLKR L +K+G + E   V + YER
Subjt:  VVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER

Query:  LPEFCYDCGLVGHVQTEC-EETGQK
        LPEFCY CG +GH   EC +E  +K
Subjt:  LPEFCYDCGLVGHVQTEC-EETGQK
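
The middle line of each alignment block marks identical residues with a letter and conservative substitutions with `+`, so identities and positives can be counted directly from it. The sketch below assumes that BLAST-style convention (the page does not document it), and `midline_stats` is a hypothetical helper name. It is applied to the final block of the TXG72599.1 alignment above.

```python
def midline_stats(query, midline):
    """Return (identities, positives, columns) for one alignment block."""
    # Trailing spaces are often stripped in text dumps, so pad the midline.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "LPEFCYDCGLVGHVQTEC-EETGQK"
midline = "LPEFCY CG +GH   EC +E  +K"
print(midline_stats(query, midline))  # (14, 17, 25)
```

A percent identity for a whole hit would be obtained by summing identities and columns over all blocks of that hit before dividing; the figure for a single block is not expected to match the value reported in the hit line.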

TrEMBL top hits (e value, %identity, alignment)
A0A5C7GU64 CCHC-type domain-containing protein; e value 1.5e-38; %identity 34.66
Query:  EAVLTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYD
        EA +T++ E      E A V  I ++ +++  ED+   ++ KVLT K +N E FK  + +IWN+ G+V ++  G N F   F N+  ++++   GPW + 
Subjt:  EAVLTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYD

Query:  KALVVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKIT
        K+L+V+E  KG   I+   F    FW+ IHD+P++CM ++    L   +G   E+   E        +R+ V +DI++PLKR L +K+G   E T V + 
Subjt:  KALVVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKIT

Query:  YERLPEFCYDCGLVGHVQTECEETGQKE-----EEQLYGEWMRATPIIGGPSKQKFHENRTRNYWGRGRGRRGDFQI
        YERLP+FC+ CG +GH   EC +   K      ++  +G WMRATPI    SK K     T +   RGR   G  ++
Subjt:  YERLPEFCYDCGLVGHVQTECEETGQKE-----EEQLYGEWMRATPIIGGPSKQKFHENRTRNYWGRGRGRRGDFQI

A0A5C7GW54 CCHC-type domain-containing protein; e value 1.3e-37; %identity 38.02
Query:  AERAKVVAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALV
        A+  + ++I D+D E          +  ED+   ++ KVL  K +N E FK+ + +IW+  G V I+    NIF   FNN  ++DRI   GPW++D+ L+
Subjt:  AERAKVVAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALV

Query:  VIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER
        V E L+G   IS   F   KFW+ IHD+P++CM R+ A  L   +G  E +D+  E+ D     LR+ V IDIS PLKR L +K+  +     V + YER
Subjt:  VIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER

Query:  LPEFCYDCGLVGHVQTECEETGQKEEE-----QLYGEWMRAT
        L EFCY CG +GH  +EC +   K+E        +G W+RA+
Subjt:  LPEFCYDCGLVGHVQTECEETGQKEEE-----QLYGEWMRAT

A0A5C7I1Z5 Uncharacterized protein; e value 1.7e-37; %identity 37.29
Query:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK
        ++I DED E          +  ED+   ++ KVL+ + +N E FK+ + ++W+  G V I+  G NIF   FNN  ++DRI   GPW++D++L+V+E  +
Subjt:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK

Query:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCY
        G   IS   F   +FW+ IHD+P++CM R+ A  L   +G  E +D+  E+ D     L++ V IDIS PLKR L +K+  +     V + YERLPEFCY
Subjt:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTD-EDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCY

Query:  DCGLVGHVQTECEETGQKEE-----EQLYGEWMRAT
         CG +GH  ++  +   K+E        +G WMRA+
Subjt:  DCGLVGHVQTECEETGQKEE-----EQLYGEWMRAT

A0A5C7IHI0 CCHC-type domain-containing protein; e value 1.2e-38; %identity 38.14
Query:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK
        ++I+DED E          +  ED+   ++ KVL+ K +N E FKT + ++W+  G V I+  G N F   FNN  ++DRI + GPW++D++L+V+E  +
Subjt:  VAIEDEDLE----------EATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLK

Query:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCYD
        G   IS   F   +FW+ IHD+P++CM ++ A  L   +G   E+ + E        LR+ V IDIS+PLKR L +K+  +     V + YERLPEFCY 
Subjt:  GASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCYD

Query:  CGLVGHVQTECEETGQKEE-----EQLYGEWMRATP
        CG VGH   +C +T  K+E     +  +G W+RA+P
Subjt:  CGLVGHVQTECEETGQKEE-----EQLYGEWMRATP

A0A5C7IU01 CCHC-type domain-containing protein; e value 2.9e-37; %identity 38.22
Query:  LTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKAL
        + K+ +    A E   V+ + +E + +  ED+   ++ KVLT K IN E FK  + +IW+  G+V ++  G N+F   F NR ++DR+ + GPW++  +L
Subjt:  LTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKAL

Query:  VVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER
        + +E   G   ++  +F    FWI IHD+P++CM R+ A  L   +G   E+ L E      N LR+ V IDIS+PLKR L +K+G + E   V + YER
Subjt:  VVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYER

Query:  LPEFCYDCGLVGHVQTEC-EETGQK
        LPEFCY CG +GH   EC +E  +K
Subjt:  LPEFCYDCGLVGHVQTEC-EETGQK

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT3G42140.1 zinc ion binding;nucleic acid binding; e value 1.8e-07; %identity 24.09
Query:  IKKGGPWNYDKALVVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGT
        I + GPW+++  + VI+  +     S  +F+ + FWI I  +P+  +  +  T++G  +G F E +L                 D+S             
Subjt:  IKKGGPWNYDKALVVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGT

Query:  NAEETWVKITYERLPEFCYDCGLVGHVQTECEETGQK
              +K  YE+L  FC  CG++ H  +EC  +G +
Subjt:  NAEETWVKITYERLPEFCYDCGLVGHVQTECEETGQK

AT5G36228.1 nucleic acid binding;zinc ion binding; e value 1.4e-12; %identity 22.28
Query:  SILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLKGASRISLTDF-RYVKFWIHIHDLPMVC
        S+L ++L P+  + E     +P  W    +V  +      FQ  F +  +     +  PW +++  + ++  +        DF  ++  W+HI  +P+  
Subjt:  SILCKVLTPKFINPEVFKTFMPRIWNKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLKGASRISLTDF-RYVKFWIHIHDLPMVC

Query:  MCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCYDCGLVGHVQTECEETGQKEE
        +  +    + ++LG    +D +EE T +   +R+ V +D +EPL+    V+  +  E   +   YE+L   C +C  V H  + C     +EE
Subjt:  MCRKWATALGNSLGGFEEVDLHEENTDEDNILRILVNIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCYDCGLVGHVQTECEETGQKEE


Sequences
CDS sequence
ATGGGCAGCGACATGCTGATAGAGGGAGCATCCAACTCTCAAGAGCGAGCTACAGAAGAAACAAAGGATAAGGAAAACATGGAAAACAAAGAAAACACTGAAGGATCTGG
CAAAGAAACGGTAACCAACCTGATCGAAGAAGAAGAAGCAGTCCTAACCAAGATGGAGGAGCTTAAAATAACAGCAGCAGAAAGGGCCAAAGTGGTGGCTATTGAAGACG
AGGACCTTGAAGAGGCAACAGAGGACCTCCAAAGTTCGATATTATGCAAAGTGCTAACACCCAAATTTATAAATCCCGAAGTCTTTAAAACTTTCATGCCAAGAATCTGG
AATAAAGAAGGGAAAGTGAGAATCAAATCAAAAGGAAGAAATATTTTCCAATGCACGTTCAATAACAGATGGGAAAAAGATAGAATAAAAAAAGGGGGTCCGTGGAACTA
CGACAAAGCCCTGGTGGTTATCGAAGATCTTAAAGGAGCTAGCAGAATATCGCTCACGGATTTTAGGTACGTAAAATTTTGGATTCATATCCACGACTTACCTATGGTAT
GTATGTGCAGGAAATGGGCCACAGCTCTGGGGAACTCTCTAGGGGGATTCGAGGAAGTGGACCTTCACGAAGAGAATACCGATGAGGATAACATCTTGAGAATCCTGGTG
AACATAGACATCTCTGAACCTCTAAAGCGAGGTTTAATGGTGAAAATAGGAACGAACGCTGAAGAAACATGGGTGAAGATTACCTACGAACGATTACCAGAATTCTGCTA
CGATTGCGGTCTTGTGGGGCATGTTCAGACAGAATGTGAAGAAACAGGACAAAAAGAAGAAGAACAATTGTATGGGGAGTGGATGCGGGCGACTCCGATAATAGGCGGCC
CGTCAAAACAGAAGTTCCATGAAAATAGAACGAGAAACTATTGGGGACGAGGAAGAGGACGAAGAGGGGATTTCCAAATATTCAGAAGAGATAGGAGAGAAGAATGGAAA
AATGAAAGAGGAAACACAGGAAGCTGGAGAAAGAAGGAGCAACCTAAAGGTGAAATTCAGACCAGAGACGGTGACAAACCGCCGGAAAAATCTTTAGGTGAAGGTCCTGT
TCCGGTGGACGACAACGGCTCAAACTCCAAGGAAAGTCCAACGGCTAGCAAGAAGTCAGATGACAAAAGAGTTTTAAAAGGAAAAGAAATCATAAAACCAACGGAGCATA
GTACAACCAGAGCCCAACTAAGTGAAAAAGTTCAAGATGACATGATGGAAATCAGCATGAGTTCTGGGCTTGTCATTAATGCACCGAATAAAGAAGGAAAAAGTGAGAAT
AAGAAGCTTCTGGGAAAGATAGAAAAAGACAAAAAAAGCTCACACGTTATGGGCCGACAGTGTGGGCCAGATAAAGAGATTAAACCCTAA
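
A CDS such as the one above can be checked against the protein sequence listed on this page by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code; `translate` is a hypothetical helper name, and the two short inputs are the first 24 and the last 15 nucleotides of the CDS.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS (line breaks allowed) into one-letter amino acids."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with ambiguous bases are reported as 'X'; stops as '*'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


print(translate("ATGGGCAGCGACATGCTGATAGAG"))  # MGSDMLIE
print(translate("GAGATTAAACCCTAA"))           # EIKP*
```

The two outputs agree with the start (`MGSDMLIE`) and end (`EIKP`) of the protein sequence given in this record, followed by the stop codon.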
mRNA sequence
ATGGGCAGCGACATGCTGATAGAGGGAGCATCCAACTCTCAAGAGCGAGCTACAGAAGAAACAAAGGATAAGGAAAACATGGAAAACAAAGAAAACACTGAAGGATCTGG
CAAAGAAACGGTAACCAACCTGATCGAAGAAGAAGAAGCAGTCCTAACCAAGATGGAGGAGCTTAAAATAACAGCAGCAGAAAGGGCCAAAGTGGTGGCTATTGAAGACG
AGGACCTTGAAGAGGCAACAGAGGACCTCCAAAGTTCGATATTATGCAAAGTGCTAACACCCAAATTTATAAATCCCGAAGTCTTTAAAACTTTCATGCCAAGAATCTGG
AATAAAGAAGGGAAAGTGAGAATCAAATCAAAAGGAAGAAATATTTTCCAATGCACGTTCAATAACAGATGGGAAAAAGATAGAATAAAAAAAGGGGGTCCGTGGAACTA
CGACAAAGCCCTGGTGGTTATCGAAGATCTTAAAGGAGCTAGCAGAATATCGCTCACGGATTTTAGGTACGTAAAATTTTGGATTCATATCCACGACTTACCTATGGTAT
GTATGTGCAGGAAATGGGCCACAGCTCTGGGGAACTCTCTAGGGGGATTCGAGGAAGTGGACCTTCACGAAGAGAATACCGATGAGGATAACATCTTGAGAATCCTGGTG
AACATAGACATCTCTGAACCTCTAAAGCGAGGTTTAATGGTGAAAATAGGAACGAACGCTGAAGAAACATGGGTGAAGATTACCTACGAACGATTACCAGAATTCTGCTA
CGATTGCGGTCTTGTGGGGCATGTTCAGACAGAATGTGAAGAAACAGGACAAAAAGAAGAAGAACAATTGTATGGGGAGTGGATGCGGGCGACTCCGATAATAGGCGGCC
CGTCAAAACAGAAGTTCCATGAAAATAGAACGAGAAACTATTGGGGACGAGGAAGAGGACGAAGAGGGGATTTCCAAATATTCAGAAGAGATAGGAGAGAAGAATGGAAA
AATGAAAGAGGAAACACAGGAAGCTGGAGAAAGAAGGAGCAACCTAAAGGTGAAATTCAGACCAGAGACGGTGACAAACCGCCGGAAAAATCTTTAGGTGAAGGTCCTGT
TCCGGTGGACGACAACGGCTCAAACTCCAAGGAAAGTCCAACGGCTAGCAAGAAGTCAGATGACAAAAGAGTTTTAAAAGGAAAAGAAATCATAAAACCAACGGAGCATA
GTACAACCAGAGCCCAACTAAGTGAAAAAGTTCAAGATGACATGATGGAAATCAGCATGAGTTCTGGGCTTGTCATTAATGCACCGAATAAAGAAGGAAAAAGTGAGAAT
AAGAAGCTTCTGGGAAAGATAGAAAAAGACAAAAAAAGCTCACACGTTATGGGCCGACAGTGTGGGCCAGATAAAGAGATTAAACCCTAA
Protein sequence
MGSDMLIEGASNSQERATEETKDKENMENKENTEGSGKETVTNLIEEEEAVLTKMEELKITAAERAKVVAIEDEDLEEATEDLQSSILCKVLTPKFINPEVFKTFMPRIW
NKEGKVRIKSKGRNIFQCTFNNRWEKDRIKKGGPWNYDKALVVIEDLKGASRISLTDFRYVKFWIHIHDLPMVCMCRKWATALGNSLGGFEEVDLHEENTDEDNILRILV
NIDISEPLKRGLMVKIGTNAEETWVKITYERLPEFCYDCGLVGHVQTECEETGQKEEEQLYGEWMRATPIIGGPSKQKFHENRTRNYWGRGRGRRGDFQIFRRDRREEWK
NERGNTGSWRKKEQPKGEIQTRDGDKPPEKSLGEGPVPVDDNGSNSKESPTASKKSDDKRVLKGKEIIKPTEHSTTRAQLSEKVQDDMMEISMSSGLVINAPNKEGKSEN
KKLLGKIEKDKKSSHVMGRQCGPDKEIKP
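
The InterPro entry IPR025836 names its zinc knuckle pattern as CX2CX4HX4C, which maps naturally onto a regular expression. The sketch below reads each `X` as "any residue" (an assumption about the notation), uses the hypothetical helper name `find_zinc_knuckles`, and is run on a fragment taken from the third line of the protein sequence above.

```python
import re

# C, any 2 residues, C, any 4 residues, H, any 4 residues, C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (1-based start, matched text) for each non-overlapping match."""
    protein = "".join(protein.split())  # drop line breaks from the dump
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


fragment = "ERLPEFCYDCGLVGHVQTECEETG"
print(find_zinc_knuckles(fragment))  # [(7, 'CYDCGLVGHVQTEC')]
```

A regex match of this kind only shows that the spacing of the cysteine and histidine residues fits the pattern; it is not a substitute for the profile-based InterPro assignment.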