; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021067 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021067
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionWD repeat-containing protein 43
Genome locationscaffold9:542022..548553
RNA-Seq ExpressionSpg021067
SyntenySpg021067
Gene Ontology termsGO:0006364 - rRNA processing (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR015943 - WD40/YVTN repeat-like-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593430.1 WD repeat-containing protein 43, partial [Cucurbita argyrosperma subsp. sororia]5.3e-10481.25Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFA+KGRR+HTVGSNG+AF+M+ ETGSII EFKASKKSISSSAFSLDEKY+AVAGKKL ILST+NG ELM HPDKLGPVK+ SISDDAK IITSE GAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        H+QVWWCDMSAGKLSRGPVLSMKHPPFVSECRNI N ED+I VLSVSVSGVAYLWKLKFLSED+  PTK+TVK N+ +SAEENHGSAKKNRISV++S IQ
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKKIQILRMRK
        GL DNEVS+LVTHGSMDLPQH+VLNIGY AK+   + + K
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKKIQILRMRK

XP_022150139.1 WD repeat-containing protein 43 [Momordica charantia]4.5e-10383.62Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFANKGRR+ TVGSNGMA +MDTETG+IIKEFKASKKSISSSAFS DEKY+AVAGKKL ILST+NGDELM H DKLGPVK+VSISDDAKAIITSE GAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        HLQVWWCDMSA KLSRGPVLSMKHPPFVSEC+NI NEED+I VLSVSVSGVAY+W+LK LSEDE  P K+TVK ND QSAEENHGSAKKNRISVIASRI 
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK
        G  DNEVS+LVTHGSMD PQ S+ NIGYS K+
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK

XP_022964606.1 WD repeat-containing protein 43 isoform X1 [Cucurbita moschata]2.0e-10380.83Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFA+KGRR+HTVGSNG+AF+M+ ETGSII EFKASKKSISSSAFSLDEKY+AVAGKKL ILST++G ELM HPDKLGPVK+ SISDDAK IITSE GAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        H+QVWWCDMSAGKLSRGPVLSMKHPPFVSECRNI N ED+I VLSVSVSGVAYLWKLKFLSED+  PTK+TVK N+ +SAEENHGSAKKNRISV++S IQ
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKKIQILRMRK
        GL DNEVS+LVTHGSMDLPQH+VLNIGY AK+   + + K
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKKIQILRMRK

XP_023513462.1 WD repeat-containing protein 43 [Cucurbita pepo subsp. pepo]1.7e-10282.33Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFA+KGRR+HTVGSNGMAF+M+ ETGSII EFKASKKSISSSAFS DEKY+AVAGKKL ILST+NG ELM HPDKLGPVK+ SIS+DAK IITSE GAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        H+QVWWCD+SAGKLSRGPVLSMKHPPFVSECRNI N ED+I VLSVSVSGVAYLWKLKFLSE +  PTK+TVK N+ +SAEE+HGSAKKNRISVI+S IQ
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK
        GL DNEVS+LVTHGSMDLPQH+VLNIGY AK+
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK

XP_038899613.1 WD repeat-containing protein 43 [Benincasa hispida]9.7e-10682.76Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFANKGRR+H VGSNG   +MDTETG+IIKEFKASKKSISSS+FSLDEKY+AVAGKKL ILS ++GDELM HPDKLGPVK+VS+SDDAK IITSELGAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRN+ N+EDN+ VLSVSVSGVAYLWKLK LSEDE  PTK++VK ND QSAEENHGSAKKNR+SVIASRI 
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK
        G+ DNEVS+LVTHGSMDLPQHS+ +IGYS K+
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK

TrEMBL top hitse value%identityAlignment
A0A1S3CDX9 WD repeat-containing protein 434.1e-10279.74Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFAN+GRR+HTVGSNGMA +MDTETG+IIKEFKASKKSISSSAFSLDEKY+AVAGKKL ILS ++GDEL+ HPDKL PVK+VSISDDAK I+TSELGAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        HLQVWWCDMSAGK SRGPVLSM HPPFVSECRN+ N+ED++ VLSVSVSG AYLWKLK LSEDE  PTK++VK ND QSAEENHGSAKKNR+SV+AS+I 
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK
         + DNEVS+LVTHGS+DLPQHS+L+IGY+ K+
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK

A0A5A7UYX1 WD repeat-containing protein 431.1e-10280.6Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFAN+GRR+HTVGSNGMA +MDTETG+IIKEFKASKKSISSSAFSLDEKY+AVAGKKL ILS ++GDEL+ HPDKL PVK+VSISDDAK IITSELGAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        HLQVWWCDMSAGK SRGPVLSM HPPFVSECRN+ N+ED++ VLSVSVSG AYLWKLK LSEDE  PTK++VK ND QSAEENHGSAKKNR+SV+ASRI 
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK
         + DNEVS+LVTHGS+DLPQHS+L+IGY+ K+
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK

A0A6J1D938 WD repeat-containing protein 432.2e-10383.62Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFANKGRR+ TVGSNGMA +MDTETG+IIKEFKASKKSISSSAFS DEKY+AVAGKKL ILST+NGDELM H DKLGPVK+VSISDDAKAIITSE GAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        HLQVWWCDMSA KLSRGPVLSMKHPPFVSEC+NI NEED+I VLSVSVSGVAY+W+LK LSEDE  P K+TVK ND QSAEENHGSAKKNRISVIASRI 
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK
        G  DNEVS+LVTHGSMD PQ S+ NIGYS K+
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKK

A0A6J1HJF3 WD repeat-containing protein 43 isoform X19.8e-10480.83Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFA+KGRR+HTVGSNG+AF+M+ ETGSII EFKASKKSISSSAFSLDEKY+AVAGKKL ILST++G ELM HPDKLGPVK+ SISDDAK IITSE GAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        H+QVWWCDMSAGKLSRGPVLSMKHPPFVSECRNI N ED+I VLSVSVSGVAYLWKLKFLSED+  PTK+TVK N+ +SAEENHGSAKKNRISV++S IQ
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKKIQILRMRK
        GL DNEVS+LVTHGSMDLPQH+VLNIGY AK+   + + K
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNIGYSAKKIQILRMRK

A0A6J1KMD0 WD repeat-containing protein 431.2e-10183.19Show/hide
Query:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK
        +SFA+KGRR+HT+GSNGMAF+M+ ETGSII EFKASKKSISSSAFSLDEKY+AVAGKKL ILST+NG ELM HPDKLGPVK+ SISDDAK IITSE GAK
Subjt:  ISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAK

Query:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ
        H+QVWWCDMSAGKLSRGPVLSMKHPPF+SECRN+ N ED+I VLSVSVSGVAYLWKLKFLSED+  PTK+TVK N+ +SAEENHGSAKKNRISV++S IQ
Subjt:  HLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQ

Query:  GLRDNEVSILVTHGSMDLPQHSVLNI
        GL DNEVS+LVTHGSMDLPQH+VLNI
Subjt:  GLRDNEVSILVTHGSMDLPQHSVLNI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G11240.1 transducin family protein / WD-40 repeat family protein9.6e-1928.28Show/hide
Query:  VADCPTVEGTNPNISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISD
        ++DC    G N  +S + K   +++ G++GM  Q+D  +G++I++FKAS K++SS   S D K +  A  +L   + ++  ++       G V+ V+ ++
Subjt:  VADCPTVEGTNPNISFANKGRRVHTVGSNGMAFQMDTETGSIIKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISD

Query:  DAKAIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSE-DEAGPTKLTVKGNDFQSAEENHGS
        D K +++S +G +++ VW  D  A K S   VL+++HPP   +     NE+  + VL++S  GV Y W    + E   A PTK+ +      +A+ +   
Subjt:  DAKAIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEEDNIAVLSVSVSGVAYLWKLKFLSE-DEAGPTKLTVKGNDFQSAEENHGS

Query:  AKKNRISVIASRIQG-LRDNEVSILVTHGSMDLP--QHSVLNIG
         K +   + A+++QG L+       +  G +  P  Q  VL  G
Subjt:  AKKNRISVIASRIQG-LRDNEVSILVTHGSMDLP--QHSVLNIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATTCCAGCAGCAGTTTTTTTTCTTCTTCTATTCCAGAATTTTGGTTCGGTTCAGCTGCTGCCTTTTGGTCCGGTTCGGTCTGTTGCAGCCTCATTTCACTCATT
GCATGGCCGGTTCATTGCTTGGATGAGCGGTTCGGGATGGTTCGTGAAGCTCGGGCGTGCTGCAAATCCATGCTTGGCCTCGAGGCGCCCTGGGAGCGTCCCCCTACGGA
TGGTGTTTGCATGGGTTAATATCAAGGTGAATGGGGAAAGTGTTCATAGTAAGTGGGAGAAGGAAACGTGTCAAACGTATCCTGCGGTCTCCATCATTAGGTTGCACCGT
GAGATTCCTATGCGCTGCCTGCGAGTCGCCCTGGGAGCGATCACCCTACGGAGGGCTTGCGCATGGGATTCGGAACAACGCAAACTCCAGAAATGGATAGGGTTTCCTAG
GGCCATTCCCAACATTAGCTCTTCCCTACGATGGCATTGTTGGGGCCGTCCTCTGCGATCTGAAAATGATGGATTGTCTCGATTCAGAGGAGGAGTAGCTGACTGCCCTA
CGGTAGAGGGCACCAACCCCAACATTTCTTTTGCGAACAAAGGCCGTAGAGTGCATACGGTTGGAAGTAATGGAATGGCGTTTCAGATGGACACTGAAACAGGAAGCATT
ATCAAGGAGTTCAAAGCTTCTAAAAAATCAATCTCCTCTTCAGCCTTTTCACTTGATGAGAAGTACGTAGCTGTAGCTGGCAAAAAGTTGACGATTTTAAGCACAAATAA
TGGGGATGAGCTTATGGCACATCCTGATAAATTGGGTCCTGTGAAGATTGTTTCTATATCCGATGATGCTAAAGCAATAATTACATCAGAACTTGGAGCCAAACATCTTC
AAGTGTGGTGGTGTGATATGAGTGCTGGAAAACTTAGTAGAGGTCCGGTTCTGTCGATGAAGCATCCTCCATTTGTTTCGGAATGCAGAAATATTTGCAATGAAGAAGAT
AACATAGCTGTCTTGTCAGTATCAGTATCAGGTGTAGCTTATTTATGGAAATTAAAGTTCCTATCAGAAGACGAGGCTGGTCCAACTAAACTCACTGTTAAAGGTAATGA
CTTCCAATCAGCCGAGGAAAACCATGGAAGTGCTAAGAAAAATCGAATTTCTGTCATTGCTTCCAGAATACAAGGTTTAAGAGACAATGAAGTGTCAATTCTTGTTACTC
ATGGCTCCATGGACCTACCACAGCATAGTGTTCTTAATATTGGTTATTCCGCAAAGAAGATACAGATACTGCGCATGAGAAAAAGACTTTCCAACAAAATGATGATTCTT
CCGAGCAAGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATTCCAGCAGCAGTTTTTTTTCTTCTTCTATTCCAGAATTTTGGTTCGGTTCAGCTGCTGCCTTTTGGTCCGGTTCGGTCTGTTGCAGCCTCATTTCACTCATT
GCATGGCCGGTTCATTGCTTGGATGAGCGGTTCGGGATGGTTCGTGAAGCTCGGGCGTGCTGCAAATCCATGCTTGGCCTCGAGGCGCCCTGGGAGCGTCCCCCTACGGA
TGGTGTTTGCATGGGTTAATATCAAGGTGAATGGGGAAAGTGTTCATAGTAAGTGGGAGAAGGAAACGTGTCAAACGTATCCTGCGGTCTCCATCATTAGGTTGCACCGT
GAGATTCCTATGCGCTGCCTGCGAGTCGCCCTGGGAGCGATCACCCTACGGAGGGCTTGCGCATGGGATTCGGAACAACGCAAACTCCAGAAATGGATAGGGTTTCCTAG
GGCCATTCCCAACATTAGCTCTTCCCTACGATGGCATTGTTGGGGCCGTCCTCTGCGATCTGAAAATGATGGATTGTCTCGATTCAGAGGAGGAGTAGCTGACTGCCCTA
CGGTAGAGGGCACCAACCCCAACATTTCTTTTGCGAACAAAGGCCGTAGAGTGCATACGGTTGGAAGTAATGGAATGGCGTTTCAGATGGACACTGAAACAGGAAGCATT
ATCAAGGAGTTCAAAGCTTCTAAAAAATCAATCTCCTCTTCAGCCTTTTCACTTGATGAGAAGTACGTAGCTGTAGCTGGCAAAAAGTTGACGATTTTAAGCACAAATAA
TGGGGATGAGCTTATGGCACATCCTGATAAATTGGGTCCTGTGAAGATTGTTTCTATATCCGATGATGCTAAAGCAATAATTACATCAGAACTTGGAGCCAAACATCTTC
AAGTGTGGTGGTGTGATATGAGTGCTGGAAAACTTAGTAGAGGTCCGGTTCTGTCGATGAAGCATCCTCCATTTGTTTCGGAATGCAGAAATATTTGCAATGAAGAAGAT
AACATAGCTGTCTTGTCAGTATCAGTATCAGGTGTAGCTTATTTATGGAAATTAAAGTTCCTATCAGAAGACGAGGCTGGTCCAACTAAACTCACTGTTAAAGGTAATGA
CTTCCAATCAGCCGAGGAAAACCATGGAAGTGCTAAGAAAAATCGAATTTCTGTCATTGCTTCCAGAATACAAGGTTTAAGAGACAATGAAGTGTCAATTCTTGTTACTC
ATGGCTCCATGGACCTACCACAGCATAGTGTTCTTAATATTGGTTATTCCGCAAAGAAGATACAGATACTGCGCATGAGAAAAAGACTTTCCAACAAAATGATGATTCTT
CCGAGCAAGGTTTGA
Protein sequenceShow/hide protein sequence
MKIPAAVFFLLLFQNFGSVQLLPFGPVRSVAASFHSLHGRFIAWMSGSGWFVKLGRAANPCLASRRPGSVPLRMVFAWVNIKVNGESVHSKWEKETCQTYPAVSIIRLHR
EIPMRCLRVALGAITLRRACAWDSEQRKLQKWIGFPRAIPNISSSLRWHCWGRPLRSENDGLSRFRGGVADCPTVEGTNPNISFANKGRRVHTVGSNGMAFQMDTETGSI
IKEFKASKKSISSSAFSLDEKYVAVAGKKLTILSTNNGDELMAHPDKLGPVKIVSISDDAKAIITSELGAKHLQVWWCDMSAGKLSRGPVLSMKHPPFVSECRNICNEED
NIAVLSVSVSGVAYLWKLKFLSEDEAGPTKLTVKGNDFQSAEENHGSAKKNRISVIASRIQGLRDNEVSILVTHGSMDLPQHSVLNIGYSAKKIQILRMRKRLSNKMMIL
PSKV