CuGenDBv2

Sed0009410 (gene) of Chayote v1 genome

Gene ID: Sed0009410
Organism: Sechium edule (Chayote v1)
Description: CCHC-type domain-containing protein
Genome location: LG14:4891661..4897848
RNA-Seq Expression: Sed0009410
Synteny: Sed0009410
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
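The zinc knuckle spacing named in IPR025836 (CX2CX4HX4C) can be checked against the protein sequence of this record. The following is a minimal sketch, assuming the entry name is read literally as a regular expression; it is not the signature InterPro itself uses to assign the domain. The function name `find_zinc_knuckles` is hypothetical, and the fragment is copied from the protein sequence of Sed0009410 on this page.

```python
import re

# Literal reading of the IPR025836 entry name: C-X2-C-X4-H-X4-C.
# This is an illustrative pattern, not the InterPro detection model.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# Fragment copied from the Sed0009410 protein sequence shown on this page.
fragment = "VRLSYEKLPFFCRGCGRIGHRASDCSFVAIGEG"


def find_zinc_knuckles(seq):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(seq)]


hits = find_zinc_knuckles(fragment)
print(hits)
```

The same function can be applied to the full protein sequence; only the fragment is embedded here to keep the sketch short.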


Homology
GenBank top hits (e value, % identity, alignment)
TXG48504.1 hypothetical protein EZV62_024379 [Acer yangbiense]; e value 1.1e-16; identity 28.27%
Query:  MDPAQFAQMMENLDL--DDFEAQEIF-ELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN-------------------------------
        M+  + A++ ENL L  +D E  ++F  +E E V     ++   L  KVLSS +++ + FK ++ +IW+                               
Subjt:  MDPAQFAQMMENLDL--DDFEAQEIF-ELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN-------------------------------

Query:  ---------LENKAMAGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNR---GKFLRLKIAIDVRRPLRKGLKLKSEAM
                 L  + + G   +S++ F+  +FWV++HD+P  C     A+ L + +G  I    +   ++R   GKFLR+K+ ID+ RPL++ L+LK +  
Subjt:  ---------LENKAMAGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNR---GKFLRLKIAIDVRRPLRKGLKLKSEAM

Query:  AAEKWVRLSYEKLPFFCRGCGRIGHRASDCSFV----AIGEGISFPFGEDLR---------EPYGFSRGESGSPNRFAGRGRG
             V L YE+LP FC  CGR+GH   DC+ V    A  EG    FG  +R         +    S G S    R  G   G
Subjt:  AAEKWVRLSYEKLPFFCRGCGRIGHRASDCSFV----AIGEGISFPFGEDLR---------EPYGFSRGESGSPNRFAGRGRG

TXG70623.1 hypothetical protein EZV62_005558 [Acer yangbiense]; e value 1.1e-16; identity 32.12%
Query:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNLENKAMA-----GDKRVSEMEFAFGDFWVRLHDV
        M+  + A++ E L L + E   + +L+E      E  L   LA K+LSS  ++ D F  V+PKIW+ +   +      G + + +M F    FWV++H V
Subjt:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNLENKAMA-----GDKRVSEMEFAFGDFWVRLHDV

Query:  PPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWVRLSYEKLPFFCRGCGRIGHRASDCSFVA
        P  C T    R L +M+G+       G     GK++R+++ ID+ +P+R  L++       E  + L YE+L   C  CG IGH   DCS  A
Subjt:  PPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWVRLSYEKLPFFCRGCGRIGHRASDCSFVA

TXG73180.1 hypothetical protein EZV62_001759 [Acer yangbiense]; e value 1.4e-16; identity 27.63%
Query:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN--------------------LENKAMAGDKRVSE
        M  A  A + ENL L D E   + E+ EE     E +++  L  +VLS  +++ + FK ++ ++WN                    +  +   G + VS+
Subjt:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN--------------------LENKAMAGDKRVSE

Query:  MEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWVRLSYEKLPFFCRGCGRIGHR
        + F   +FWV++HD+P  C     A+ L   +G  +    D  +   G+F+R+K+ ID+ RPL++ L+LK         V L YE+LP FC  CGR+GH 
Subjt:  MEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWVRLSYEKLPFFCRGCGRIGHR

Query:  ASDCSFVAIGEGI----SFPFGEDLREPYGFSRGESGSPNRFAGRGRGCGKGRGRGE
          +C+     +G        +G+ L+ P     G     +RF  +  G    R + +
Subjt:  ASDCSFVAIGEGI----SFPFGEDLREPYGFSRGESGSPNRFAGRGRGCGKGRGRGE

XP_042958061.1 uncharacterized protein LOC122293581 [Carya illinoinensis]; e value 6.3e-17; identity 23.42%
Query:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNLENKAM----------------------------
        M+    + + E L L D E QEI E+  E+++   S+ +  +   ++S  + +       +PKIWN E K +                            
Subjt:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNLENKAM----------------------------

Query:  ------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV
                     G K + EM+F F  FW++ HD+P A  T +  +KL S LGK +    DG +   GKFLR+K+ +D+ +PL +G  +  +    + W+
Subjt:  ------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV

Query:  RLSYEKLPFFCRGCGRIGHRASDCSFVAIGEGISFPFGEDLREPYG-FSRGESGSPNRFAGRGRGCGKGRGRGEVSRSEDGGYFEKEVQPKSIEGGRREE
           YE++  FC  CG I H    C  +  G  +    G  L + YG + R  +    + AG+ R             ++DGG  E   +  S E  + ++
Subjt:  RLSYEKLPFFCRGCGRIGHRASDCSFVAIGEGISFPFGEDLREPYG-FSRGESGSPNRFAGRGRGCGKGRGRGEVSRSEDGGYFEKEVQPKSIEGGRREE

Query:  ARGITSFPAEGRHLFASNGSLETVLGEGEGKGEGQKLQEVVSRTGKS--MCTEVRDEGVVCLKKAMKGKEMGQVTTDSWCKTADKGKGKATDFGINELLF
          G        +    S  + E        +   + ++ V+   G+S  +  E    G V ++++ K +++   +  SW + A + +   T+ G+ E + 
Subjt:  ARGITSFPAEGRHLFASNGSLETVLGEGEGKGEGQKLQEVVSRTGKS--MCTEVRDEGVVCLKKAMKGKEMGQVTTDSWCKTADKGKGKATDFGINELLF

Query:  GPFEIKINHISHLKADGSGPGKGLSRGITIKENMECNDQEKDHL
             K NH S  KA     G     G+  +  +   D+E D L
Subjt:  GPFEIKINHISHLKADGSGPGKGLSRGITIKENMECNDQEKDHL

XP_042958241.1 uncharacterized protein LOC122293864 [Carya illinoinensis]; e value 6.3e-17; identity 23.42%
Query:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNLENKAM----------------------------
        M+    + + E L L D E QEI E+  E+++   S+ +  +   ++S  + +       +PKIWN E K +                            
Subjt:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNLENKAM----------------------------

Query:  ------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV
                     G K + EM+F F  FW++ HD+P A  T +  +KL S LGK +    DG +   GKFLR+K+ +D+ +PL +G  +  +    + W+
Subjt:  ------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV

Query:  RLSYEKLPFFCRGCGRIGHRASDCSFVAIGEGISFPFGEDLREPYG-FSRGESGSPNRFAGRGRGCGKGRGRGEVSRSEDGGYFEKEVQPKSIEGGRREE
           YE++  FC  CG I H    C  +  G  +    G  L + YG + R  +    + AG+ R             ++DGG  E   +  S E  + ++
Subjt:  RLSYEKLPFFCRGCGRIGHRASDCSFVAIGEGISFPFGEDLREPYG-FSRGESGSPNRFAGRGRGCGKGRGRGEVSRSEDGGYFEKEVQPKSIEGGRREE

Query:  ARGITSFPAEGRHLFASNGSLETVLGEGEGKGEGQKLQEVVSRTGKS--MCTEVRDEGVVCLKKAMKGKEMGQVTTDSWCKTADKGKGKATDFGINELLF
          G        +    S  + E        +   + ++ V+   G+S  +  E    G V ++++ K +++   +  SW + A + +   T+ G+ E + 
Subjt:  ARGITSFPAEGRHLFASNGSLETVLGEGEGKGEGQKLQEVVSRTGKS--MCTEVRDEGVVCLKKAMKGKEMGQVTTDSWCKTADKGKGKATDFGINELLF

Query:  GPFEIKINHISHLKADGSGPGKGLSRGITIKENMECNDQEKDHL
             K NH S  KA     G     G+  +  +   D+E D L
Subjt:  GPFEIKINHISHLKADGSGPGKGLSRGITIKENMECNDQEKDHL

TrEMBL top hits (e value, % identity, alignment)
A0A5C7GU64 CCHC-type domain-containing protein; e value 8.8e-17; identity 26.34%
Query:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNL----------ENKAM------------------
        M  A+  Q+ ENL L+D +A  + E+ E+ +   + +++  L  KVL+  +++ + FK ++ +IWN           EN  M                  
Subjt:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNL----------ENKAM------------------

Query:  ------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV
                     G   +++++F   DFWV++HD+P  C      + L   +G+ +    +  +   GK++R+K+ +D+ +PL++ L++K         V
Subjt:  ------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV

Query:  RLSYEKLPFFCRGCGRIGHRASDC
         L YE+LP FC  CGRIGH   +C
Subjt:  RLSYEKLPFFCRGCGRIGHRASDC

A0A5C7GUN2 CCHC-type domain-containing protein; e value 5.2e-17; identity 28.27%
Query:  MDPAQFAQMMENLDL--DDFEAQEIF-ELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN-------------------------------
        M+  + A++ ENL L  +D E  ++F  +E E V     ++   L  KVLSS +++ + FK ++ +IW+                               
Subjt:  MDPAQFAQMMENLDL--DDFEAQEIF-ELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN-------------------------------

Query:  ---------LENKAMAGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNR---GKFLRLKIAIDVRRPLRKGLKLKSEAM
                 L  + + G   +S++ F+  +FWV++HD+P  C     A+ L + +G  I    +   ++R   GKFLR+K+ ID+ RPL++ L+LK +  
Subjt:  ---------LENKAMAGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNR---GKFLRLKIAIDVRRPLRKGLKLKSEAM

Query:  AAEKWVRLSYEKLPFFCRGCGRIGHRASDCSFV----AIGEGISFPFGEDLR---------EPYGFSRGESGSPNRFAGRGRG
             V L YE+LP FC  CGR+GH   DC+ V    A  EG    FG  +R         +    S G S    R  G   G
Subjt:  AAEKWVRLSYEKLPFFCRGCGRIGHRASDCSFV----AIGEGISFPFGEDLR---------EPYGFSRGESGSPNRFAGRGRG

A0A5C7HJF0 Uncharacterized protein; e value 1.2e-16; identity 30.13%
Query:  MDPAQFAQMMENLDL--DDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNL----------ENKAM----------------
        M   + A++ ENL +  +D E  +I+E  E D      +    L  KVLSS +++ + FK V+ ++W+           EN  M                
Subjt:  MDPAQFAQMMENLDL--DDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNL----------ENKAM----------------

Query:  --------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEK
                       G   +S+  F+   FWV++HD+P  C     AR L   +G+ I    +  +   GKFL++K++ID+ RPL++ L+LK +      
Subjt:  --------------AGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEK

Query:  WVRLSYEKLPFFCRGCGRIGHRASDCSFV
         + L YE+LP FC  CGRIGH  SD S V
Subjt:  WVRLSYEKLPFFCRGCGRIGHRASDCSFV

A0A5C7HVP8 Uncharacterized protein; e value 8.8e-17; identity 28.57%
Query:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIW---------------------NLENK---------
        M   + A++ ENL + D E  +I ++ E+       ++E  L  KVLS  +++ + FK V+ ++W                     NLE++         
Subjt:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIW---------------------NLENK---------

Query:  ----------AMAGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV
                     G   +S++ F   +FWV++HD+P  C     A+ L   +GKFI    +  +   GKFLR+K+ ID+ +PL++ L+LK +       V
Subjt:  ----------AMAGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWV

Query:  RLSYEKLPFFCRGCGRIGHRASDC
         L YE+LP FC  C R+GH  ++C
Subjt:  RLSYEKLPFFCRGCGRIGHRASDC

A0A5C7IV30 CCHC-type domain-containing protein; e value 6.8e-17; identity 27.63%
Query:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN--------------------LENKAMAGDKRVSE
        M  A  A + ENL L D E   + E+ EE     E +++  L  +VLS  +++ + FK ++ ++WN                    +  +   G + VS+
Subjt:  MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWN--------------------LENKAMAGDKRVSE

Query:  MEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWVRLSYEKLPFFCRGCGRIGHR
        + F   +FWV++HD+P  C     A+ L   +G  +    D  +   G+F+R+K+ ID+ RPL++ L+LK         V L YE+LP FC  CGR+GH 
Subjt:  MEFAFGDFWVRLHDVPPACHTGNNARKLESMLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWVRLSYEKLPFFCRGCGRIGHR

Query:  ASDCSFVAIGEGI----SFPFGEDLREPYGFSRGESGSPNRFAGRGRGCGKGRGRGE
          +C+     +G        +G+ L+ P     G     +RF  +  G    R + +
Subjt:  ASDCSFVAIGEGI----SFPFGEDLREPYGFSRGESGSPNRFAGRGRGCGKGRGRGE

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGACCCAGCACAGTTTGCTCAAATGATGGAAAATCTTGATCTTGATGATTTTGAAGCACAAGAAATCTTTGAACTTGAAGAAGAAGATGTGACAGCTTATGAAAGTGA
ACTTGAAGGGGTGCTGGCGTGCAAGGTTCTCTCTAGCAATCAGATTAGTTCAGATGTGTTCAAGAAGGTTGTCCCAAAAATATGGAATCTGGAAAATAAAGCAATGGCAG
GGGACAAGCGAGTTTCAGAGATGGAGTTCGCATTTGGAGATTTTTGGGTCAGACTTCACGATGTGCCTCCAGCGTGTCATACAGGAAACAATGCAAGGAAGCTAGAGAGT
ATGTTAGGGAAGTTCATAGGATGGCATTGTGATGGAGATAAAAGGAACAGAGGAAAATTTTTAAGGCTCAAAATTGCAATCGATGTACGTAGACCTCTGAGAAAGGGATT
GAAGTTAAAAAGCGAAGCGATGGCGGCTGAGAAATGGGTAAGATTATCATATGAAAAACTTCCGTTCTTCTGTCGTGGATGTGGAAGAATAGGCCACAGGGCCTCTGATT
GTAGCTTCGTTGCGATTGGAGAGGGTATTTCGTTCCCTTTTGGGGAAGATTTGAGAGAACCATATGGTTTTTCTAGGGGAGAATCAGGGTCGCCTAATAGATTTGCTGGG
AGAGGAAGGGGTTGTGGAAAAGGAAGAGGAAGAGGGGAGGTATCTCGCAGCGAGGATGGCGGTTATTTTGAGAAAGAAGTCCAGCCGAAATCGATCGAAGGAGGGAGAAG
AGAGGAGGCTAGGGGGATCACGAGTTTTCCGGCGGAGGGGCGACACCTTTTTGCAAGTAACGGGTCGCTGGAGACGGTCTTGGGAGAAGGTGAGGGAAAGGGGGAAGGTC
AGAAGTTGCAGGAGGTTGTCTCAAGAACAGGTAAAAGCATGTGTACGGAGGTTAGAGATGAAGGGGTTGTGTGTTTGAAAAAAGCAATGAAAGGAAAAGAGATGGGACAA
GTGACAACTGATAGCTGGTGCAAGACGGCTGACAAAGGAAAAGGGAAGGCAACAGATTTCGGGATAAACGAACTCCTTTTTGGGCCTTTTGAAATTAAAATAAACCATAT
TAGTCATTTAAAGGCTGATGGAAGTGGGCCTGGGAAGGGGTTAAGTAGGGGAATTACTATTAAAGAAAATATGGAGTGTAATGATCAGGAAAAAGATCATTTATTTCATG
GGCCATTTGAAACTGATTCTAAAATCAACCCGGTGGCGTTCTCGATCAACAATGAAGTTGCTCATGTGAAGGCTGATCGTAAAAAAGGCGGCTATAATGGAAAATCGGCT
GATTTAAAATCAATTACGGTCCAAAAGGATGGGAATCCTGTGAGGTCTCAAGAGGAGCCGTCAGAAATTGCTTCCGATAAGGAGATTGAATCAAGTTTGGATAAAAAGAC
TGCAAAATCTATGTTGGAGGAAGGGGTGACCAAGAGGGAGGCTGCTTTGGGAAAAATACGAGAATTAGATAAGGAACCCGAAGTGTTTGGTGGAGGCAAAATTAAAGGGA
ATATGTCAGCTGCAAAATGGAGACGAATTGCGAGATTAGGAGGAAGGGGAGCATCGTCACTATCAGTGGAAATGGAGTCTTGCAATGAAAGATCAACGGGTAAAAAACAT
GATATCGTTGGAGAGGAAAGGTTGGAGGAAGAAAATTTCTGA
mRNA sequence
ATGGACCCAGCACAGTTTGCTCAAATGATGGAAAATCTTGATCTTGATGATTTTGAAGCACAAGAAATCTTTGAACTTGAAGAAGAAGATGTGACAGCTTATGAAAGTGA
ACTTGAAGGGGTGCTGGCGTGCAAGGTTCTCTCTAGCAATCAGATTAGTTCAGATGTGTTCAAGAAGGTTGTCCCAAAAATATGGAATCTGGAAAATAAAGCAATGGCAG
GGGACAAGCGAGTTTCAGAGATGGAGTTCGCATTTGGAGATTTTTGGGTCAGACTTCACGATGTGCCTCCAGCGTGTCATACAGGAAACAATGCAAGGAAGCTAGAGAGT
ATGTTAGGGAAGTTCATAGGATGGCATTGTGATGGAGATAAAAGGAACAGAGGAAAATTTTTAAGGCTCAAAATTGCAATCGATGTACGTAGACCTCTGAGAAAGGGATT
GAAGTTAAAAAGCGAAGCGATGGCGGCTGAGAAATGGGTAAGATTATCATATGAAAAACTTCCGTTCTTCTGTCGTGGATGTGGAAGAATAGGCCACAGGGCCTCTGATT
GTAGCTTCGTTGCGATTGGAGAGGGTATTTCGTTCCCTTTTGGGGAAGATTTGAGAGAACCATATGGTTTTTCTAGGGGAGAATCAGGGTCGCCTAATAGATTTGCTGGG
AGAGGAAGGGGTTGTGGAAAAGGAAGAGGAAGAGGGGAGGTATCTCGCAGCGAGGATGGCGGTTATTTTGAGAAAGAAGTCCAGCCGAAATCGATCGAAGGAGGGAGAAG
AGAGGAGGCTAGGGGGATCACGAGTTTTCCGGCGGAGGGGCGACACCTTTTTGCAAGTAACGGGTCGCTGGAGACGGTCTTGGGAGAAGGTGAGGGAAAGGGGGAAGGTC
AGAAGTTGCAGGAGGTTGTCTCAAGAACAGGTAAAAGCATGTGTACGGAGGTTAGAGATGAAGGGGTTGTGTGTTTGAAAAAAGCAATGAAAGGAAAAGAGATGGGACAA
GTGACAACTGATAGCTGGTGCAAGACGGCTGACAAAGGAAAAGGGAAGGCAACAGATTTCGGGATAAACGAACTCCTTTTTGGGCCTTTTGAAATTAAAATAAACCATAT
TAGTCATTTAAAGGCTGATGGAAGTGGGCCTGGGAAGGGGTTAAGTAGGGGAATTACTATTAAAGAAAATATGGAGTGTAATGATCAGGAAAAAGATCATTTATTTCATG
GGCCATTTGAAACTGATTCTAAAATCAACCCGGTGGCGTTCTCGATCAACAATGAAGTTGCTCATGTGAAGGCTGATCGTAAAAAAGGCGGCTATAATGGAAAATCGGCT
GATTTAAAATCAATTACGGTCCAAAAGGATGGGAATCCTGTGAGGTCTCAAGAGGAGCCGTCAGAAATTGCTTCCGATAAGGAGATTGAATCAAGTTTGGATAAAAAGAC
TGCAAAATCTATGTTGGAGGAAGGGGTGACCAAGAGGGAGGCTGCTTTGGGAAAAATACGAGAATTAGATAAGGAACCCGAAGTGTTTGGTGGAGGCAAAATTAAAGGGA
ATATGTCAGCTGCAAAATGGAGACGAATTGCGAGATTAGGAGGAAGGGGAGCATCGTCACTATCAGTGGAAATGGAGTCTTGCAATGAAAGATCAACGGGTAAAAAACAT
GATATCGTTGGAGAGGAAAGGTTGGAGGAAGAAAATTTCTGA
Protein sequence
MDPAQFAQMMENLDLDDFEAQEIFELEEEDVTAYESELEGVLACKVLSSNQISSDVFKKVVPKIWNLENKAMAGDKRVSEMEFAFGDFWVRLHDVPPACHTGNNARKLES
MLGKFIGWHCDGDKRNRGKFLRLKIAIDVRRPLRKGLKLKSEAMAAEKWVRLSYEKLPFFCRGCGRIGHRASDCSFVAIGEGISFPFGEDLREPYGFSRGESGSPNRFAG
RGRGCGKGRGRGEVSRSEDGGYFEKEVQPKSIEGGRREEARGITSFPAEGRHLFASNGSLETVLGEGEGKGEGQKLQEVVSRTGKSMCTEVRDEGVVCLKKAMKGKEMGQ
VTTDSWCKTADKGKGKATDFGINELLFGPFEIKINHISHLKADGSGPGKGLSRGITIKENMECNDQEKDHLFHGPFETDSKINPVAFSINNEVAHVKADRKKGGYNGKSA
DLKSITVQKDGNPVRSQEEPSEIASDKEIESSLDKKTAKSMLEEGVTKREAALGKIRELDKEPEVFGGGKIKGNMSAAKWRRIARLGGRGASSLSVEMESCNERSTGKKH
DIVGEERLEEENF