; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021633 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021633
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUTP--glucose-1-phosphate uridylyltransferase
Genome locationscaffold2:16628702..16638682
RNA-Seq ExpressionSpg021633
SyntenySpg021633
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443650.1 PREDICTED: uncharacterized protein LOC103487197 [Cucumis melo]3.9e-12186.27Show/hide
Query:  IIFWGFSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGK
        ++F    LLAARPFASSSGNRKS KSSVFSLFNLKDKS+FWSETVIRGDFDDLESS+T+KMSVVNYTKAGN+ANYLKLLEVDSLYLPVPVNFIF+GFEGK
Subjt:  IIFWGFSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGK

Query:  SNHEFKLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALW
         NHEFKLHPEELERWF+KLDH+FEHTRIPQVREV TPFYK+S+DKV RHQLPL+SH NYNFS+HVIQTGEKVTSIFELARNVLSRKE +SNNGDGNDALW
Subjt:  SNHEFKLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALW

Query:  QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK
        QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRD++RA+YGYR G  ES ++  K
Subjt:  QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK

XP_022157070.1 uncharacterized protein LOC111023880 [Momordica charantia]6.6e-12189.11Show/hide
Query:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL
        LLAARP ASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSST+KMS VNYTKAGNIAN+LKLLEVDSLYLPVPVNFIF+GFEGK NHEFKL
Subjt:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL

Query:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM
        HPEELERWF+KLDH+FEHTRIPQVREV TPFYKISVDKV RHQLPLVSHINYNFS+H IQTGEKVTSIFELARNVL+RKE++S+NGDG+DALWQVDVDLM
Subjt:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM

Query:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK
        DVLFTSFVEYLQLENAYNIFILNLKRDA+RA+YGYR G  ES ++  K
Subjt:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK

XP_038880656.1 uncharacterized protein LOC120072284 isoform X1 [Benincasa hispida]5.0e-12189.2Show/hide
Query:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL
        LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESS+T+KMSVVNYTKAGN+ANYLKLLEVDSLYLPVPVNFIF+GFEGK NHEFKL
Subjt:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL

Query:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM
        HPEELERWFMKLDH+FEHTRIPQVREV TPFYKISVDKV RHQLPLVSHINYNFS+HVIQTGEKVTSIFELARNVLSRK+++SNNGD N ALWQVDVDLM
Subjt:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM

Query:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSKRE
        DVLFTSFVEYLQLENAYNIFILNLKRD +RA+YGYR G  ES ++  K +
Subjt:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSKRE

XP_038880657.1 uncharacterized protein LOC120072284 isoform X2 [Benincasa hispida]6.6e-12189.92Show/hide
Query:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL
        LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESS+T+KMSVVNYTKAGN+ANYLKLLEVDSLYLPVPVNFIF+GFEGK NHEFKL
Subjt:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL

Query:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM
        HPEELERWFMKLDH+FEHTRIPQVREV TPFYKISVDKV RHQLPLVSHINYNFS+HVIQTGEKVTSIFELARNVLSRK+++SNNGD N ALWQVDVDLM
Subjt:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM

Query:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK
        DVLFTSFVEYLQLENAYNIFILNLKRD +RA+YGYR G  ES ++  K
Subjt:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK

XP_038880658.1 uncharacterized protein LOC120072284 isoform X3 [Benincasa hispida]5.0e-12189.2Show/hide
Query:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL
        LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESS+T+KMSVVNYTKAGN+ANYLKLLEVDSLYLPVPVNFIF+GFEGK NHEFKL
Subjt:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL

Query:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM
        HPEELERWFMKLDH+FEHTRIPQVREV TPFYKISVDKV RHQLPLVSHINYNFS+HVIQTGEKVTSIFELARNVLSRK+++SNNGD N ALWQVDVDLM
Subjt:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM

Query:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSKRE
        DVLFTSFVEYLQLENAYNIFILNLKRD +RA+YGYR G  ES ++  K +
Subjt:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSKRE

TrEMBL top hitse value%identityAlignment
A0A0A0M0H0 Uncharacterized protein3.5e-12087.5Show/hide
Query:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL
        LLAARP ASSSGNRKS KSSVFSLFNLKDKS+FWSETVIRGDFDDLESS+T+KMSVVNYTKAGN+ANYLKLLEVDSLYLPVPVNFIF+GFEGK NHEFKL
Subjt:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL

Query:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM
        HPEELERWF+KLDH+FEHTRIPQ REV TPFYK+S+DKV RHQLPL+SH NYNFS+HVIQTGEKVTSIFELARNVLSRKE++SNNGDGNDALWQVDVDLM
Subjt:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM

Query:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK
        DVLFTSFVEYLQLENAYNIFILNLKRD +RA+YGYR G  ES ++  K
Subjt:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK

A0A1S3B823 uncharacterized protein LOC1034871971.9e-12186.27Show/hide
Query:  IIFWGFSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGK
        ++F    LLAARPFASSSGNRKS KSSVFSLFNLKDKS+FWSETVIRGDFDDLESS+T+KMSVVNYTKAGN+ANYLKLLEVDSLYLPVPVNFIF+GFEGK
Subjt:  IIFWGFSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGK

Query:  SNHEFKLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALW
         NHEFKLHPEELERWF+KLDH+FEHTRIPQVREV TPFYK+S+DKV RHQLPL+SH NYNFS+HVIQTGEKVTSIFELARNVLSRKE +SNNGDGNDALW
Subjt:  SNHEFKLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALW

Query:  QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK
        QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRD++RA+YGYR G  ES ++  K
Subjt:  QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK

A0A5A7T3Y1 Uncharacterized protein7.1e-12188.89Show/hide
Query:  IIFWGFSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGK
        ++F    LLAARPFASSSGNRKS KSSVFSLFNLKDKS+FWSETVIRGDFDDLESS+T+KMSVVNYTKAGN+ANYLKLLEVDSLYLPVPVNFIF+GFEGK
Subjt:  IIFWGFSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGK

Query:  SNHEFKLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALW
         NHEFKLHPEELERWF+KLDH+FEHTRIPQVREV TPFYK+S+DKV RHQLPL+SH NYNFS+HVIQTGEKVTSIFELARNVLSRKE +SNNGDGNDALW
Subjt:  SNHEFKLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALW

Query:  QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYR
        QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRD++RA+YGYR
Subjt:  QVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYR

A0A6J1DS42 uncharacterized protein LOC1110238803.2e-12189.11Show/hide
Query:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL
        LLAARP ASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSST+KMS VNYTKAGNIAN+LKLLEVDSLYLPVPVNFIF+GFEGK NHEFKL
Subjt:  LLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKL

Query:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM
        HPEELERWF+KLDH+FEHTRIPQVREV TPFYKISVDKV RHQLPLVSHINYNFS+H IQTGEKVTSIFELARNVL+RKE++S+NGDG+DALWQVDVDLM
Subjt:  HPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLM

Query:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK
        DVLFTSFVEYLQLENAYNIFILNLKRDA+RA+YGYR G  ES ++  K
Subjt:  DVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSK

A0A6J1H937 uncharacterized protein LOC1114616186.0e-12085.21Show/hide
Query:  FSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEF
        F LLAAR FASSSGNRKS KSSVFSLFNLKDKSRFWSETVIRGDFDDLESSS +KMSVVNYTKAGNIANYLKLLEV+SLYLPVPVNFIF+GFEGK NHEF
Subjt:  FSLLAARPFASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEF

Query:  KLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVD
        KLHPEELERWF KLDH+FEHTRIPQVREV TPFYKISVDKV +HQLPLVSHINYNFS+H IQTGEKVTSIFELARNVLSRKE++SNNGDGND LWQVDVD
Subjt:  KLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVD

Query:  LMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCGKESTVSVSKREEMDFES
        LMDVLFTSFVEYLQLENAYNIFILNLKRD +R +YGYR G   +     +E+++ +S
Subjt:  LMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCGKESTVSVSKREEMDFES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G58100.1 unknown protein1.1e-8962.06Show/hide
Query:  FASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKLHPEELE
        + +S GNRK+ KSSVFSLFNL+DKSRFWSE+V R DFDDLESS      V+NYTK+GNIA+YL+L+EVDS+YLPVPVNFIF+GFEGK N +FKL PEELE
Subjt:  FASSSGNRKSGKSSVFSLFNLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKLHPEELE

Query:  RWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLMDVLFTS
        RWF KLDH+FEHTR+PQ++EV  PFYKI+++K  +H LP++S +NYNFS+H IQ GEKVTS+ E A  VL+RK++++ N D   AL QVD ++M+ +FTS
Subjt:  RWFMKLDHVFEHTRIPQVREVFTPFYKISVDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLMDVLFTS

Query:  FVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSKREEMDFESLLK
         VEY  LE+AYN+FILN K D ++AKYGYR G  ES +S  K  +   ++LL+
Subjt:  FVEYLQLENAYNIFILNLKRDAQRAKYGYRCG-KESTVSVSKREEMDFESLLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATTCCATCCACGCGGCCAGCTACCGCATCGATCTTATCTGTCCTCGTCGCAATCTCTTCAAGACGAGATTCTAGGAAGCGAACCGAATCGGGGACTTCCCTCAG
TGGCAAAGGTACTTGTAGAGATTTTGTGAAAAGTGTGTGGAGTAAGAGAAAAGTGGAGTGGATGGGAATCGAGTCCTTGGGGGCTTCTGGTGGTATTCTCATTTTGTGGG
ATGGGAATTTGATTTCCGTCAAGGAGGTCATCCTGGGTCGAGTTAATACTTTTGAGAGGCTTTCTAGAATGATGCCCCCGATGGTTGACCCTTTTTGTTGCATTCTTTGT
CGAAAGGCGGAGGAAGATCTTGATCATTTGTTGTGTGATTGTTGTTTTACTCGCCCTGTTTGGAGCCTCTTCTTTGAGGTTTTCAGGTTTCAGACTGTTGGCCAGTGCAG
GTGTAGGAAGATGATCGAGGAGTTCCTCCTCCCTCCGCCGTTTCGGGAGAAAGGAAGATTTTGTGGCTTGTCGAAACACAAGGACACATTTTCAGCACTTGTCCCTTGGC
TTGATCCATTTGGAATTATTTTCTGGGGGTTTTCGCTATTAGCAGCAAGACCGTTTGCCTCTTCCTCTGGGAATCGTAAAAGTGGAAAGTCATCCGTATTCTCTTTGTTT
AACCTAAAAGATAAGAGTAGGTTTTGGAGTGAGACAGTCATACGTGGTGATTTTGATGATCTGGAATCATCCAGCACTGATAAAATGAGTGTTGTCAACTACACGAAGGC
AGGTAATATAGCAAATTACTTGAAGCTTCTTGAAGTTGATTCCCTGTACCTCCCAGTCCCTGTGAATTTTATTTTTGTAGGTTTTGAAGGGAAAAGTAACCATGAATTCA
AGCTGCATCCAGAAGAGCTTGAACGTTGGTTCATGAAACTTGATCATGTCTTTGAACATACACGGATTCCGCAAGTCAGGGAGGTGTTTACCCCTTTTTATAAGATCAGT
GTGGACAAAGTTTCGAGGCATCAACTACCTCTTGTCAGTCACATAAATTACAATTTTTCTATTCATGTAATACAAACGGGTGAGAAGGTTACTTCAATCTTTGAGCTTGC
AAGAAATGTCTTATCTCGCAAGGAAAATATATCCAATAATGGGGATGGGAATGATGCTCTTTGGCAAGTAGATGTGGACCTGATGGATGTACTTTTCACTAGCTTTGTGG
AGTACCTTCAACTTGAAAATGCTTATAACATTTTTATTCTAAATCTCAAGCGTGATGCACAAAGGGCTAAATATGGATACCGGTGTGGGAAGGAGAGCACTGTGTCGGTT
TCAAAGAGGGAAGAGATGGATTTTGAGTCTCTGTTGAAGAAGACAAGAAAAAGATTGAGTGACAAAAACAAGGGATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCATTCCATCCACGCGGCCAGCTACCGCATCGATCTTATCTGTCCTCGTCGCAATCTCTTCAAGACGAGATTCTAGGAAGCGAACCGAATCGGGGACTTCCCTCAG
TGGCAAAGGTACTTGTAGAGATTTTGTGAAAAGTGTGTGGAGTAAGAGAAAAGTGGAGTGGATGGGAATCGAGTCCTTGGGGGCTTCTGGTGGTATTCTCATTTTGTGGG
ATGGGAATTTGATTTCCGTCAAGGAGGTCATCCTGGGTCGAGTTAATACTTTTGAGAGGCTTTCTAGAATGATGCCCCCGATGGTTGACCCTTTTTGTTGCATTCTTTGT
CGAAAGGCGGAGGAAGATCTTGATCATTTGTTGTGTGATTGTTGTTTTACTCGCCCTGTTTGGAGCCTCTTCTTTGAGGTTTTCAGGTTTCAGACTGTTGGCCAGTGCAG
GTGTAGGAAGATGATCGAGGAGTTCCTCCTCCCTCCGCCGTTTCGGGAGAAAGGAAGATTTTGTGGCTTGTCGAAACACAAGGACACATTTTCAGCACTTGTCCCTTGGC
TTGATCCATTTGGAATTATTTTCTGGGGGTTTTCGCTATTAGCAGCAAGACCGTTTGCCTCTTCCTCTGGGAATCGTAAAAGTGGAAAGTCATCCGTATTCTCTTTGTTT
AACCTAAAAGATAAGAGTAGGTTTTGGAGTGAGACAGTCATACGTGGTGATTTTGATGATCTGGAATCATCCAGCACTGATAAAATGAGTGTTGTCAACTACACGAAGGC
AGGTAATATAGCAAATTACTTGAAGCTTCTTGAAGTTGATTCCCTGTACCTCCCAGTCCCTGTGAATTTTATTTTTGTAGGTTTTGAAGGGAAAAGTAACCATGAATTCA
AGCTGCATCCAGAAGAGCTTGAACGTTGGTTCATGAAACTTGATCATGTCTTTGAACATACACGGATTCCGCAAGTCAGGGAGGTGTTTACCCCTTTTTATAAGATCAGT
GTGGACAAAGTTTCGAGGCATCAACTACCTCTTGTCAGTCACATAAATTACAATTTTTCTATTCATGTAATACAAACGGGTGAGAAGGTTACTTCAATCTTTGAGCTTGC
AAGAAATGTCTTATCTCGCAAGGAAAATATATCCAATAATGGGGATGGGAATGATGCTCTTTGGCAAGTAGATGTGGACCTGATGGATGTACTTTTCACTAGCTTTGTGG
AGTACCTTCAACTTGAAAATGCTTATAACATTTTTATTCTAAATCTCAAGCGTGATGCACAAAGGGCTAAATATGGATACCGGTGTGGGAAGGAGAGCACTGTGTCGGTT
TCAAAGAGGGAAGAGATGGATTTTGAGTCTCTGTTGAAGAAGACAAGAAAAAGATTGAGTGACAAAAACAAGGGATTTTGA
Protein sequenceShow/hide protein sequence
MGIPSTRPATASILSVLVAISSRRDSRKRTESGTSLSGKGTCRDFVKSVWSKRKVEWMGIESLGASGGILILWDGNLISVKEVILGRVNTFERLSRMMPPMVDPFCCILC
RKAEEDLDHLLCDCCFTRPVWSLFFEVFRFQTVGQCRCRKMIEEFLLPPPFREKGRFCGLSKHKDTFSALVPWLDPFGIIFWGFSLLAARPFASSSGNRKSGKSSVFSLF
NLKDKSRFWSETVIRGDFDDLESSSTDKMSVVNYTKAGNIANYLKLLEVDSLYLPVPVNFIFVGFEGKSNHEFKLHPEELERWFMKLDHVFEHTRIPQVREVFTPFYKIS
VDKVSRHQLPLVSHINYNFSIHVIQTGEKVTSIFELARNVLSRKENISNNGDGNDALWQVDVDLMDVLFTSFVEYLQLENAYNIFILNLKRDAQRAKYGYRCGKESTVSV
SKREEMDFESLLKKTRKRLSDKNKGF