; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019552 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019552
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionglucose-induced degradation protein 8 homolog
Genome locationtig00153348:718914..722614
RNA-Seq ExpressionSgr019552
SyntenySgr019552
Gene Ontology termsGO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR006594 - LIS1 homology motif
IPR006595 - CTLH, C-terminal LisH motif
IPR024964 - CTLH/CRA C-terminal to LisH motif domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056956.1 LisH domain-containing protein/CLTH domain-containing protein [Cucumis melo var. makuwa]7.0e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

XP_004146589.1 protein GID8 homolog [Cucumis sativus]7.0e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

XP_022140081.1 glucose-induced degradation protein 8 homolog [Momordica charantia]2.0e-7395.45Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLND+KIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVK+AVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

XP_022954757.1 protein GID8 homolog [Cucurbita moschata]7.0e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

XP_038893683.1 protein GID8 homolog [Benincasa hispida]9.1e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

TrEMBL top hitse value%identityAlignment
A0A0A0LV19 Uncharacterized protein3.4e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

A0A1S3B548 glucose-induced degradation protein 8 homolog isoform X13.4e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

A0A5A7URR5 LisH domain-containing protein/CLTH domain-containing protein3.4e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

A0A6J1GS04 protein GID8 homolog3.4e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

A0A6J1JZ68 protein GID8 homolog3.4e-7496.75Show/hide
Query:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
        MSLFWIVIRQFAE E MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV
Subjt:  MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKV

Query:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEEN  +L
Subjt:  NDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

SwissProt top hitse value%identityAlignment
Q54X16 Glucose-induced degradation protein 8 homolog1.7e-4365.67Show/hide
Query:  KKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQ
        KKVI+  EW+ KL +V I K D+NKLVMN+LV EGY +AA KF+ ES  +  +DLA+I DRMA++ A+QCG+VE  IE VNDLNPEILDTNPQL+FHLQQ
Subjt:  KKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQ

Query:  QRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        Q+LIELIR G   EAL+FAQ+ELAP+GEEN  +L
Subjt:  QRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

Q5ZKQ7 Glucose-induced degradation protein 8 homolog3.5e-3650.67Show/hide
Query:  ITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQRL
        IT++EW +KLN++ I++ DMN+L+MN+LVTEG+ +AAEKFRMESG EP +DL T+ +R+ +++ +  G +++AI  +N L+PE+LDTN  L+FHLQQQ L
Subjt:  ITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQRL

Query:  IELIRNGKVEEALEFAQEELAPRGEENYFYLLTFGMSYKHLKLWIPSPSP
        IELIR  + E ALEFAQ +LA +GEE+   L     +   L    P  SP
Subjt:  IELIRNGKVEEALEFAQEELAPRGEENYFYLLTFGMSYKHLKLWIPSPSP

Q84WK5 Protein GID8 homolog5.2e-6484.52Show/hide
Query:  MSLFWIVIRQFAE-IEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEK
        MSLF I I Q  E  E+MATSKK+ITREEWEKKLN VK+RKEDMN LVMNFLVTEGYV+AAEKF+ ESG +PEIDLATITDRMAVKKAVQ GNVEDAIEK
Subjt:  MSLFWIVIRQFAE-IEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEK

Query:  VNDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        VNDLNPEILDTNP+LFFHLQQQRLIELIR GK EEALEFAQEELAPRGEEN  +L
Subjt:  VNDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

Q9D7M1 Glucose-induced degradation protein 8 homolog4.6e-3650Show/hide
Query:  ITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQRL
        IT++EW +KLN++ +++ DMN+L+MN+LVTEG+ +AAEKFRMESG EP +DL T+ +R+ +++ +  G +++AI  +N L+PE+LDTN  L+FHLQQQ L
Subjt:  ITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQRL

Query:  IELIRNGKVEEALEFAQEELAPRGEENYFYLLTFGMSYKHLKLWIPSPSP
        IELIR  + E ALEFAQ +LA +GEE+   L     +   L    P  SP
Subjt:  IELIRNGKVEEALEFAQEELAPRGEENYFYLLTFGMSYKHLKLWIPSPSP

Q9NWU2 Glucose-induced degradation protein 8 homolog4.6e-3650Show/hide
Query:  ITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQRL
        IT++EW +KLN++ +++ DMN+L+MN+LVTEG+ +AAEKFRMESG EP +DL T+ +R+ +++ +  G +++AI  +N L+PE+LDTN  L+FHLQQQ L
Subjt:  ITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQRL

Query:  IELIRNGKVEEALEFAQEELAPRGEENYFYLLTFGMSYKHLKLWIPSPSP
        IELIR  + E ALEFAQ +LA +GEE+   L     +   L    P  SP
Subjt:  IELIRNGKVEEALEFAQEELAPRGEENYFYLLTFGMSYKHLKLWIPSPSP

Arabidopsis top hitse value%identityAlignment
AT1G61150.1 LisH and RanBPM domains containing protein3.7e-6584.52Show/hide
Query:  MSLFWIVIRQFAE-IEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEK
        MSLF I I Q  E  E+MATSKK+ITREEWEKKLN VK+RKEDMN LVMNFLVTEGYV+AAEKF+ ESG +PEIDLATITDRMAVKKAVQ GNVEDAIEK
Subjt:  MSLFWIVIRQFAE-IEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEK

Query:  VNDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        VNDLNPEILDTNP+LFFHLQQQRLIELIR GK EEALEFAQEELAPRGEEN  +L
Subjt:  VNDLNPEILDTNPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

AT1G61150.2 LisH and RanBPM domains containing protein1.0e-6288.41Show/hide
Query:  MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFF
        MATSKK+ITREEWEKKLN VK+RKEDMN LVMNFLVTEGYV+AAEKF+ ESG +PEIDLATITDRMAVKKAVQ GNVEDAIEKVNDLNPEILDTNP+LFF
Subjt:  MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFF

Query:  HLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        HLQQQRLIELIR GK EEALEFAQEELAPRGEEN  +L
Subjt:  HLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

AT1G61150.4 LisH and RanBPM domains containing protein1.0e-6288.41Show/hide
Query:  MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFF
        MATSKK+ITREEWEKKLN VK+RKEDMN LVMNFLVTEGYV+AAEKF+ ESG +PEIDLATITDRMAVKKAVQ GNVEDAIEKVNDLNPEILDTNP+LFF
Subjt:  MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFF

Query:  HLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        HLQQQRLIELIR GK EEALEFAQEELAPRGEEN  +L
Subjt:  HLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL

AT1G61150.5 LisH and RanBPM domains containing protein1.4e-5987.88Show/hide
Query:  VITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQR
        +ITREEWEKKLN VK+RKEDMN LVMNFLVTEGYV+AAEKF+ ESG +PEIDLATITDRMAVKKAVQ GNVEDAIEKVNDLNPEILDTNP+LFFHLQQQR
Subjt:  VITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFFHLQQQR

Query:  LIELIRNGKVEEALEFAQEELAPRGEENYFYL
        LIELIR GK EEALEFAQEELAPRGEEN  +L
Subjt:  LIELIRNGKVEEALEFAQEELAPRGEENYFYL

AT1G61150.6 LisH and RanBPM domains containing protein1.0e-6288.41Show/hide
Query:  MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFF
        MATSKK+ITREEWEKKLN VK+RKEDMN LVMNFLVTEGYV+AAEKF+ ESG +PEIDLATITDRMAVKKAVQ GNVEDAIEKVNDLNPEILDTNP+LFF
Subjt:  MATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDTNPQLFF

Query:  HLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL
        HLQQQRLIELIR GK EEALEFAQEELAPRGEEN  +L
Subjt:  HLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTGTTCTGGATTGTGATTCGTCAATTTGCAGAGATCGAAGAAATGGCCACATCAAAGAAAGTTATTACAAGGGAAGAGTGGGAGAAGAAGCTGAATGACGTAAA
GATTAGGAAAGAAGACATGAATAAATTGGTAATGAATTTCCTTGTCACTGAAGGTTATGTTGATGCAGCTGAGAAATTCCGGATGGAGTCTGGGGCTGAACCAGAAATAG
ATCTTGCAACCATAACAGATAGAATGGCTGTTAAGAAGGCAGTACAATGCGGTAATGTTGAGGATGCAATTGAGAAAGTGAATGATTTAAATCCTGAGATATTGGATACG
AATCCCCAATTGTTTTTTCATCTCCAACAGCAAAGGTTGATAGAACTAATTCGTAATGGAAAAGTAGAAGAAGCTCTTGAATTTGCTCAGGAGGAGCTTGCACCGAGGGG
AGAAGAAAATTACTTCTATTTGTTGACATTTGGAATGTCTTATAAACACTTAAAACTATGGATTCCATCCCCTTCACCCGCTCTGCAAAGTAACCCTCGTACTCTTTTCA
GCAAAGCTTCTTGGAAGAGTTGGAGAGAACAGTTGCTTTACTTGCTTTTGAAGATGTTTCCAACTGTCCTGTGCGAGACCTTTGGACATCTCTCAGCGCCTGAAGACAGC
AAATCCGAAACTGCCAAGTTTGTTGAAGATGTTAATGGTTGGAGTTTTGAAGAGAGTGGCAAGTGTAGAGCGCAATCTTGTGAGGTGTTCGAAAGAAGCTACAGGCTGAG
GGAGGCGATTCTCCAACAGAATGGACAATTTAGGTTGAAAGGGTCCACTTTGGAAGGCTCAAATTCTGTCATTAAGAGGGGAGAGGTGGGAGAGAAGAGGGCATGGTTGG
CGCAGGCGCAGTGGGCAGAGAGCCTCTCAAATGTCGTATCACTATCAGAATTTGTTGCTGCTGGTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTGTTCTGGATTGTGATTCGTCAATTTGCAGAGATCGAAGAAATGGCCACATCAAAGAAAGTTATTACAAGGGAAGAGTGGGAGAAGAAGCTGAATGACGTAAA
GATTAGGAAAGAAGACATGAATAAATTGGTAATGAATTTCCTTGTCACTGAAGGTTATGTTGATGCAGCTGAGAAATTCCGGATGGAGTCTGGGGCTGAACCAGAAATAG
ATCTTGCAACCATAACAGATAGAATGGCTGTTAAGAAGGCAGTACAATGCGGTAATGTTGAGGATGCAATTGAGAAAGTGAATGATTTAAATCCTGAGATATTGGATACG
AATCCCCAATTGTTTTTTCATCTCCAACAGCAAAGGTTGATAGAACTAATTCGTAATGGAAAAGTAGAAGAAGCTCTTGAATTTGCTCAGGAGGAGCTTGCACCGAGGGG
AGAAGAAAATTACTTCTATTTGTTGACATTTGGAATGTCTTATAAACACTTAAAACTATGGATTCCATCCCCTTCACCCGCTCTGCAAAGTAACCCTCGTACTCTTTTCA
GCAAAGCTTCTTGGAAGAGTTGGAGAGAACAGTTGCTTTACTTGCTTTTGAAGATGTTTCCAACTGTCCTGTGCGAGACCTTTGGACATCTCTCAGCGCCTGAAGACAGC
AAATCCGAAACTGCCAAGTTTGTTGAAGATGTTAATGGTTGGAGTTTTGAAGAGAGTGGCAAGTGTAGAGCGCAATCTTGTGAGGTGTTCGAAAGAAGCTACAGGCTGAG
GGAGGCGATTCTCCAACAGAATGGACAATTTAGGTTGAAAGGGTCCACTTTGGAAGGCTCAAATTCTGTCATTAAGAGGGGAGAGGTGGGAGAGAAGAGGGCATGGTTGG
CGCAGGCGCAGTGGGCAGAGAGCCTCTCAAATGTCGTATCACTATCAGAATTTGTTGCTGCTGGTGCCTGA
Protein sequenceShow/hide protein sequence
MSLFWIVIRQFAEIEEMATSKKVITREEWEKKLNDVKIRKEDMNKLVMNFLVTEGYVDAAEKFRMESGAEPEIDLATITDRMAVKKAVQCGNVEDAIEKVNDLNPEILDT
NPQLFFHLQQQRLIELIRNGKVEEALEFAQEELAPRGEENYFYLLTFGMSYKHLKLWIPSPSPALQSNPRTLFSKASWKSWREQLLYLLLKMFPTVLCETFGHLSAPEDS
KSETAKFVEDVNGWSFEESGKCRAQSCEVFERSYRLREAILQQNGQFRLKGSTLEGSNSVIKRGEVGEKRAWLAQAQWAESLSNVVSLSEFVAAGA