; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G003080 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G003080
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionWD_REPEATS_REGION domain-containing protein
Genome locationCmo_Chr05:1360664..1369337
RNA-Seq ExpressionCmoCh05G003080
SyntenyCmoCh05G003080
Gene Ontology termsGO:0006384 - transcription initiation from RNA polymerase III promoter (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0004402 - histone acetyltransferase activity (molecular function)
InterPro domainsIPR024761 - Transcription factor IIIC, 90kDa subunit, N-terminal
IPR044230 - General transcription factor 3C polypeptide 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598500.1 hypothetical protein SDJN03_08278, partial [Cucurbita argyrosperma subsp. sororia]3.4e-10497.41Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVI RKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN
        NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKL+DYFESLKFGELDVPSS+CSDNPVK+GGSALDVQEHFTKEDRKRRRKVAPNL N
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN

KAG7029436.1 hypothetical protein SDJN02_07775 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-10598.96Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVI RKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN
        NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNL N
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN

XP_022961658.1 uncharacterized protein LOC111462361 [Cucurbita moschata]3.7e-10699.48Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN
        NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNL N
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN

XP_022996975.1 uncharacterized protein LOC111492045 [Cucurbita maxima]3.8e-10396.37Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVI RKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN
        NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSS+CSDN VK+GGSA+DVQEHFTKEDRKRR+KVAPNL N
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN

XP_023547196.1 uncharacterized protein LOC111806078 [Cucurbita pepo subsp. pepo]1.0e-10396.37Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVI RKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN
        NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSS+CSDNPVK+GGSA+DVQ+HFT+EDRKRRRKVAPNL N
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN

TrEMBL top hitse value%identityAlignment
A0A0A0LPG2 WD_REPEATS_REGION domain-containing protein8.5e-8582.8Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVE++FQAV+L AAPNYPNAIAWSDENLIA+ASGPLVTILNPASPFGARGTITIPA+DPL IGVI RKDLF+ CLL TCLSRDD+PRAQS+AWSP+GMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRK
        NAGCLLAVCTSEGCVKLYRPPFCDF+AEWIEI+DISNKLYDY ES+K+GELDV SS+CSD PVK+ GSA DV EH TK+   +RRK
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRK

A0A1S3BB76 uncharacterized protein LOC103488044 isoform X11.4e-8779.8Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVE++FQAV+LVAAPNYPNAIAWSDENLIA+ASGPLVTI+NPASPFGARGTITIPA+DPL IG++ RKDLF+ CLL TCLSRDD+PRAQS+AWSP+GMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNLMFNS
        NAGCLLAVCTSEGCVKLYRPPFCDF+AEWIEI+DISNKLYDY ES+K+GELDV SS+ SD P K+ GSA+DVQE+FTK++ KRR+K     +NLMFNS
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNLMFNS

A0A1S4DVH0 uncharacterized protein LOC103488044 isoform X21.1e-8780.81Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVE++FQAV+LVAAPNYPNAIAWSDENLIA+ASGPLVTI+NPASPFGARGTITIPA+DPL IG++ RKDLF+ CLL TCLSRDD+PRAQS+AWSP+GMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNLMFNS
        NAGCLLAVCTSEGCVKLYRPPFCDF+AEWIEI+DISNKLYDY ES+K+GELDV SS+ SD P K+ GSA+DVQE+FTK++ KRR+K    LKNLMFNS
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNLMFNS

A0A6J1HCF6 uncharacterized protein LOC1114623611.8e-10699.48Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN
        NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNL N
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN

A0A6J1K896 uncharacterized protein LOC1114920451.8e-10396.37Show/hide
Query:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
        MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVI RKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP
Subjt:  MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAP

Query:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN
        NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSS+CSDN VK+GGSA+DVQEHFTKEDRKRR+KVAPNL N
Subjt:  NAGCLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G49400.1 Transducin/WD40 repeat-like superfamily protein1.1e-4751.31Show/hide
Query:  SYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAPNAG
        S FQ  +LV +P+YPNA+AWS ENLIAVA+G LV I+NPA P G RG ITI  ++   IG +  +DL  G LLP+ L R+  P  +S++WS +GM+PN G
Subjt:  SYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAPNAG

Query:  CLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNL
        CLLAVCT+EG VKLYRPP+ DF AEWIEI+DIS  LY+   S+ FGE   PS+  S + V +     D  E  +    ++RRK + N  NL
Subjt:  CLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNL

AT3G49400.2 Transducin/WD40 repeat-like superfamily protein1.1e-4751.31Show/hide
Query:  SYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAPNAG
        S FQ  +LV +P+YPNA+AWS ENLIAVA+G LV I+NPA P G RG ITI  ++   IG +  +DL  G LLP+ L R+  P  +S++WS +GM+PN G
Subjt:  SYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAPNAG

Query:  CLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNL
        CLLAVCT+EG VKLYRPP+ DF AEWIEI+DIS  LY+   S+ FGE   PS+  S + V +     D  E  +    ++RRK + N  NL
Subjt:  CLLAVCTSEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAATCATATTTCCAGGCCGTCACGTTGGTCGCTGCCCCAAACTACCCAAATGCTATCGCTTGGTCTGATGAGAATTTAATCGCCGTTGCCTCAGGCCCCCTTGT
CACTATACTGAATCCGGCATCGCCATTCGGAGCACGAGGCACTATTACGATCCCTGCAAGTGATCCACTTCCAATAGGGGTGATAGCGAGAAAAGATTTATTTGCTGGTT
GCTTGTTGCCAACTTGCTTATCTCGGGATGACCGACCTCGTGCTCAGTCCATAGCGTGGTCTCCTCTTGGAATGGCTCCTAATGCAGGGTGCTTGTTGGCTGTTTGCACG
TCCGAAGGATGTGTGAAGCTTTACCGTCCACCGTTCTGTGACTTTACTGCTGAATGGATTGAGATTATGGACATATCAAATAAACTTTATGATTATTTTGAAAGTCTTAA
ATTTGGGGAGTTGGATGTTCCTTCATCAGAGTGTTCTGATAATCCAGTGAAGCAAGGTGGCAGTGCTCTCGATGTCCAAGAGCATTTCACAAAGGAGGACCGTAAGCGAA
GAAGGAAAGTTGCGCCCAATTTAAAGAATTTGATGTTCAATTCTGGCCAGCAACGGAAGCGGTTTAAATCAATCATTGGAGAAATCAAAGCGTCCGAAGAGAACTGA
mRNA sequenceShow/hide mRNA sequence
TCAATCCCTAGCACCGCCGCACTAGCTTGCGTCTTCCTCCCCTCCGGCGACACTGCAGTGCGGTCCCACTCTTCTCAGTTCAGCCGATAGGCATAGTGCACCGACTCCGC
CTCCTCCGGCTTTGCTTTCGCCAGCCTCTTCGGCCTCCGCCGGTCGTCACCGGCACGTAATCGCAAACCTTCAAACGGAGCAGACCGAACCGTCAATCCGCGGTTCTTAT
TTCTGAACAAGCTCCTTTTACCCATTCGACATTGTGCTGAACTTCTATTGTCATAGTCAACTCGTCTCACAACTCTGGAAGAGCGAGCAATGGTGGAATCATATTTCCAG
GCCGTCACGTTGGTCGCTGCCCCAAACTACCCAAATGCTATCGCTTGGTCTGATGAGAATTTAATCGCCGTTGCCTCAGGCCCCCTTGTCACTATACTGAATCCGGCATC
GCCATTCGGAGCACGAGGCACTATTACGATCCCTGCAAGTGATCCACTTCCAATAGGGGTGATAGCGAGAAAAGATTTATTTGCTGGTTGCTTGTTGCCAACTTGCTTAT
CTCGGGATGACCGACCTCGTGCTCAGTCCATAGCGTGGTCTCCTCTTGGAATGGCTCCTAATGCAGGGTGCTTGTTGGCTGTTTGCACGTCCGAAGGATGTGTGAAGCTT
TACCGTCCACCGTTCTGTGACTTTACTGCTGAATGGATTGAGATTATGGACATATCAAATAAACTTTATGATTATTTTGAAAGTCTTAAATTTGGGGAGTTGGATGTTCC
TTCATCAGAGTGTTCTGATAATCCAGTGAAGCAAGGTGGCAGTGCTCTCGATGTCCAAGAGCATTTCACAAAGGAGGACCGTAAGCGAAGAAGGAAAGTTGCGCCCAATT
TAAAGAATTTGATGTTCAATTCTGGCCAGCAACGGAAGCGGTTTAAATCAATCATTGGAGAAATCAAAGCGTCCGAAGAGAACTGAAGATAGCTCCGTGCCTTCATTAAT
TAATGCCCAACAATATGCTTCTCGCAGTGCAATGTTGTTGTCTGTTGTCGTTGCCTGGTCCCCAGTAATGAAGCCGTCTCATAAGGTTCATTCGCACTGGAATTCATCTG
TCAGTGTTCTTGCAGTAGGAGGGAAGTCTGGTAAAGTTTCATTTTGGAAAGTTAACGTACCAGAATGCTACTCCCTTGCTGAGTGCACGGTCCCAACAAGAGCTCTGCTT
GTTGGGCTTCTTCAGGCACATAATTCATGGGTCAACTGTATCAATTGGATGATGTTCGATTCTGATTCATCAAATCCAAAGGTTTTATTGGCAACTGGGAGCACAGATGG
GAGTGTGAAGATCTGGCAATCTTCCTGTGAAGAGTTATTAGCATCTTCAGACACTAATTTTGCTTCGTTTTCCCTATTGAAGGAGGTCATCAGTGGTGGAGGAGTGCCAA
CTCTACTTTCACTCAATTTGCCCAATTCAGCCGTGCACAAGCTGTTTTTGGCCATTGGCAGAGGATCTGGATCACTTGAAATAAGGATATTTAACCTATCTAGCAGTGAA
TTTGATAGCGTTAGGTCGTATGAAGCACATGATCACGTCGTTACCGGTGCAGCTTGGGCATTTGATGGACGTTATTTGTTCACCTGCAGTGAGGATAATATTCTCCGAGG
TTGGAGTTTAGATGAGAGTTCTCTCCGCGAAGTACCCATTTCATCACATATCCCTGATCTTGGAAACTCCATTGATCTTCCAGATTCATTTCGGTCGTGCTTTGGCCTTG
CAGTGTCCCCGGGAAATCTTGTGGCTGCCGTGGTTCGCAACTTTGATCTCGAATCACTTGATCGAATGTACCAAGCAAGGTCTCAGAAAGCTGCTATTCAATTCTTCTGG
ATTGGAGAAGAAATACAAGCCGTGCCAAACAGTTCTTCTTACTTTTATACTGAAACAATTACAGACATTTCTAAGAAGGAATTGGTTCAATGGGAATCCAGTATGTTGTG
GTCGTTAAATCAGTTTAAAAATTTGAACAAGCCTATGGTTCTTTGGGATGTTGTAGCTGCTTTGCTGGCATTCAGGCAGTCCATACCGGAATTCGTTGACCACATTCTAC
TTAAGTGGTTTTCAACGTCATATCTCGAATGGAACGAGGAGCTCTCTGCTACAAAGATTTTGTCACACGTCTCGAGAAATGTGTCGACATTTTCTACTCGCCAACTTCAC
CTCCTTAACGTTATTTGTAGACGTGTAGTTCTGTCAGAGCTAGTACAGGATCAAGTGAACAATGACCTGCAGGATTTGGAGAGACTCAACGATGCTGAAAATGATAAACA
TATTTTGTGGAAGGAGTTGCTTTTAAGCAGTGAAAGAGAACTCCGTCAGAGGCTAATCGGTCTGTGTTTTTCTTCTTGTGCAAAGCTTCGTGCCCTGTCGAGTTCCGAAT
ATCGACCTGGGTTCTGGTATCCCATTGGATTAGTGGAAATGCAGCAGTGGATTAGATATAATCACGAACATTTACAGGAATCGCTAAATGTCATTGCATCGAAAGAGGAA
AAAGACCATTCGAGTGAACATTCAGCGACGGAGCAGTGCACCTACTGTTCAGCATCGGTTCCATTCGAGTCTCCAGAACTCGGGTTTTGCCAGGGCGATAAGCACAATCC
AAACGTCGGTCAGAGTCACAAGCTAGTAAGGTGTTCTGTATCAATGCAGGTCTGCCCTGTTACTGCTCCCTTATGGTTCTGCATGTGCTGTAGTAGAAGTGCCTTCAGAT
TAGCTCCTGATATACTTTTCCAGATGTCTGAGACTCCCGACTTTAGTTCTTTAACACTCTCCGATTCGGACATACCCTCGAAACCGCTATGCCCCTTTTGTGGTATACTG
CTGCAACGTCGACAGCCCGACTTCTTACTGTCAGCATGCCTGGTTTAAGTAAGCATGCCGTTTTGTCAGTGCCTTGAAGCTTGTTGTATATAGTTGGTGATAATAGCTAA
GAATCATACAAGGGCAAGCAGCATAAGTTAACCCAAATGATTTAGATTGTTGTTGTGATGGTAGTTTAGCCATAAGCTTGGAATAGTCAAATTATTTACAAATTATGTAG
GCTGAAACTTTAATCAGATATTTAGTGTACATACAGTACCGTTGAAATGATTAAATGTAAGTATAACTCTACTGATCGACTCATACTTTTTGGTTTAGTGGTGATTTAAG
TTGATATTTTCGTGA
Protein sequenceShow/hide protein sequence
MVESYFQAVTLVAAPNYPNAIAWSDENLIAVASGPLVTILNPASPFGARGTITIPASDPLPIGVIARKDLFAGCLLPTCLSRDDRPRAQSIAWSPLGMAPNAGCLLAVCT
SEGCVKLYRPPFCDFTAEWIEIMDISNKLYDYFESLKFGELDVPSSECSDNPVKQGGSALDVQEHFTKEDRKRRRKVAPNLKNLMFNSGQQRKRFKSIIGEIKASEEN