CuGenDBv2

Moc01g21070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g21070
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: glucose-induced degradation protein 4 homolog
Genome location: chr1:14693526..14697020
RNA-Seq Expression: Moc01g21070
Synteny: Moc01g21070
Gene Ontology terms:
GO:0006468 - protein phosphorylation (biological process)
GO:0006623 - protein targeting to vacuole (biological process)
GO:0007039 - protein catabolic process in the vacuole (biological process)
GO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0045721 - negative regulation of gluconeogenesis (biological process)
GO:0005773 - vacuole (cellular component)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0019898 - extrinsic component of membrane (cellular component)
GO:0034657 - GID complex (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
InterPro domains: IPR018618 - Vacuolar import/degradation protein Vid24
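The genome location field uses a `seqid:start..end` string. As a minimal sketch, it can be split into its parts as shown below. The names `Locus` and `parse_location` are hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval parsed from a 'seqid:start..end' string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a location string such as 'chr1:14693526..14697020'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("chr1:14693526..14697020")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene span covers 3495 bases of chr1.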


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004142059.1 glucose-induced degradation protein 4 homolog [Cucumis sativus] | e-value: 6.4e-113 | identity: 89.86%
Query:  LPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNY
        +P ++E+ + SL      GAD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNY
Subjt:  LPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNY

Query:  NFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK
        NFFTGKWQAAPE+DIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELK
Subjt:  NFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK

Query:  STNEGRSGFSFSSYELR
        STNEGRSGFSFSSYEL+
Subjt:  STNEGRSGFSFSSYELR

XP_008448124.1 PREDICTED: glucose-induced degradation protein 4 homolog isoform X1 [Cucumis melo] | e-value: 4.1e-112 | identity: 95.98%
Query:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
        GAD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPE+DIRHW
Subjt:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW

Query:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        TKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNEGRSGFSFSSYEL+
Subjt:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

XP_022143888.1 glucose-induced degradation protein 4 homolog isoform X2 [Momordica charantia] | e-value: 2.3e-115 | identity: 80.99%
Query:  MPVRVESSAPSPISSNSSSYRSFSFISSLKIGFFLLRMHSWVNVYQLPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAW
        MPVRVESSAPSPISS                                                  ADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAW
Subjt:  MPVRVESSAPSPISSNSSSYRSFSFISSLKIGFFLLRMHSWVNVYQLPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAW

Query:  RVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQ
        RVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQ
Subjt:  RVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQ

Query:  YFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        YFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
Subjt:  YFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

XP_023517177.1 glucose-induced degradation protein 4 homolog [Cucurbita pepo subsp. pepo] | e-value: 1.8e-112 | identity: 96.48%
Query:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
        GAD+RQ++PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDL+HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
Subjt:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW

Query:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        TKFPSF PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYEL+
Subjt:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

XP_038880933.1 glucose-induced degradation protein 4 homolog [Benincasa hispida] | e-value: 4.1e-112 | identity: 96.48%
Query:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
        GAD RQT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
Subjt:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW

Query:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        TKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNEGRSGFSFSSY L+
Subjt:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KXJ4 Uncharacterized protein | e-value: 3.1e-113 | identity: 89.86%
Query:  LPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNY
        +P ++E+ + SL      GAD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNY
Subjt:  LPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNY

Query:  NFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK
        NFFTGKWQAAPE+DIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELK
Subjt:  NFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK

Query:  STNEGRSGFSFSSYELR
        STNEGRSGFSFSSYEL+
Subjt:  STNEGRSGFSFSSYELR

A0A1S3BJV3 glucose-induced degradation protein 4 homolog isoform X1 | e-value: 2.0e-112 | identity: 95.98%
Query:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
        GAD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPE+DIRHW
Subjt:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW

Query:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        TKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNEGRSGFSFSSYEL+
Subjt:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

A0A5A7SQ56 Glucose-induced degradation protein 4-like protein isoform X1 | e-value: 2.0e-112 | identity: 95.98%
Query:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
        GAD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPE+DIRHW
Subjt:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW

Query:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        TKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNEGRSGFSFSSYEL+
Subjt:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

A0A6J1CRU3 glucose-induced degradation protein 4 homolog isoform X2 | e-value: 1.1e-115 | identity: 80.99%
Query:  MPVRVESSAPSPISSNSSSYRSFSFISSLKIGFFLLRMHSWVNVYQLPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAW
        MPVRVESSAPSPISS                                                  ADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAW
Subjt:  MPVRVESSAPSPISSNSSSYRSFSFISSLKIGFFLLRMHSWVNVYQLPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAW

Query:  RVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQ
        RVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQ
Subjt:  RVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQ

Query:  YFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        YFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
Subjt:  YFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

A0A6J1HUE2 glucose-induced degradation protein 4 homolog | e-value: 7.6e-112 | identity: 95.98%
Query:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
        GAD+ Q++PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDL+HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW
Subjt:  GADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHW

Query:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        TKFPSF PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYEL+
Subjt:  TKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR

SwissProt top hits (e-value, % identity, alignment)
P53242 Uncharacterized protein YGR066C | e-value: 3.0e-12 | identity: 29.95%
Query:  LSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLE--------HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFF------------TGKWQAAPE
        L  G  F G+Q    + K   + V V+I  ++L           ++ GT    N+      VVT +EG ++   NYN F               + A  E
Subjt:  LSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLE--------HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFF------------TGKWQAAPE

Query:  DDIRHWTKFPSF-SPLMSQVEVDGG----KSLDLSNYPCIFMRWKEQYFV------NVGTDC-----GLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQK
         D  HW +F  F S   S  E + G    +S +  N   I+++WKE++ +      N+  D      G +  GFYYVC     GS+ G+YY P    FQK
Subjt:  DDIRHWTKFPSF-SPLMSQVEVDGG----KSLDLSNYPCIFMRWKEQYFV------NVGTDC-----GLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQK

Query:  LELKSTN
        LEL  TN
Subjt:  LELKSTN

Q10079 Uncharacterized protein C3H1.14 | e-value: 4.6e-13 | identity: 28.8%
Query:  QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT--GKWQAAPEDDIRHWTKFPSFS
        + C+ L  G  F G Q   ++   E   V+V I  ++L    LCG +        +T + T++E EI+ G  + F T   +W A+ E D RHW +  +  
Subjt:  QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT--GKWQAAPEDDIRHWTKFPSFS

Query:  PLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTD---------CGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK
             + +   +  D  +   ++MRWKE   ++   D          G++  GFYY+ FS S G I G+YY  +S P + L L+
Subjt:  PLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTD---------CGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK

Q8IVV7 Glucose-induced degradation protein 4 homolog | e-value: 7.5e-24 | identity: 36.31%
Query:  ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLM
        A +LL  G  F G Q    N  D    V V +Q +D  + YLCG ++   +      + TF+EGEI+  K + F T KW A  + D +HW KF +F    
Subjt:  ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLM

Query:  SQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
             D     +L N   +FMRWKEQ+ V    +    G + AGFYY+CF  S  SI G+YY  +S  +Q L L    E
Subjt:  SQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Q9CPY6 Glucose-induced degradation protein 4 homolog | e-value: 7.5e-24 | identity: 36.31%
Query:  ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLM
        A +LL  G  F G Q    N  D    V V +Q +D  + YLCG ++   +      + TF+EGEI+  K + F T KW A  + D +HW KF +F    
Subjt:  ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLM

Query:  SQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
             D     +L N   +FMRWKEQ+ V    +    G + AGFYY+CF  S  SI G+YY  +S  +Q L L    E
Subjt:  SQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Arabidopsis top hits (e-value, % identity, alignment)
AT2G37680.1 CONTAINS InterPro DOMAIN/s: Vacuolar import/degradation protein Vid24 (InterPro:IPR018618); Has 318 Blast hits to 317 proteins in 131 species: Archae - 0; Bacteria - 0; Metazoa - 80; Fungi - 184; Plants - 51; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). | e-value: 8.6e-100 | identity: 83%
Query:  GADARQTSP-QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRH
        G D    SP    +LL  GQAFSGTQNVSN QK+EAWRVNV+IQG+DLEHGYLCGTMEALNVPMADTPV+TFWEGEIVDGKNY F+TGKW+A  EDD+RH
Subjt:  GADARQTSP-QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRH

Query:  WTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
        W+KFPSFSPL  QVE DGG+ LDL+NYP IFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK+ NEGRSGFSFSSYEL+
Subjt:  WTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR


Sequences
CDS sequence
ATGCCGGTCCGAGTAGAGAGCTCAGCGCCTTCTCCGATTTCAAGTAATTCCTCTTCTTATCGATCCTTCTCCTTTATTTCCAGTCTGAAAATAGGGTTTTTTCTTCTGCG
AATGCATTCATGGGTCAATGTGTATCAATTACCCCACAAACTTGAGGCAAAATCTATTTCTCTCTTCTTCTTCGGGTTCCCCGGTGCTGATGCTAGGCAGACCTCTCCTC
AAGCTTGTACGCTGTTGAGTGTGGGGCAGGCGTTTTCCGGTACTCAGAACGTGTCTAATAATCAAAAGGATGAGGCATGGAGGGTGAATGTACGAATACAGGGGTTGGAC
CTTGAACATGGGTATCTCTGTGGCACGATGGAGGCTCTTAATGTTCCCATGGCGGATACACCGGTAGTAACCTTTTGGGAAGGAGAGATTGTGGATGGCAAGAATTATAA
TTTCTTCACTGGAAAATGGCAAGCAGCACCAGAAGATGATATAAGGCACTGGACCAAATTTCCGTCATTTTCGCCCCTGATGAGCCAGGTGGAAGTTGATGGTGGAAAAT
CTTTGGATCTTAGTAATTATCCATGCATATTTATGAGATGGAAAGAGCAATATTTCGTGAATGTTGGAACCGATTGTGGGTTAACCATAGCTGGCTTCTATTATGTTTGC
TTCTCTTGTAGTGATGGTTCCATCAGCGGCTTCTACTATGACCCTAATAGTAGCCCATTTCAGAAGCTTGAGCTCAAATCCACAAATGAGGGAAGATCGGGTTTCAGCTT
CTCATCGTACGAGTTGCGATGA
mRNA sequence
ATGCCGGTCCGAGTAGAGAGCTCAGCGCCTTCTCCGATTTCAAGTAATTCCTCTTCTTATCGATCCTTCTCCTTTATTTCCAGTCTGAAAATAGGGTTTTTTCTTCTGCG
AATGCATTCATGGGTCAATGTGTATCAATTACCCCACAAACTTGAGGCAAAATCTATTTCTCTCTTCTTCTTCGGGTTCCCCGGTGCTGATGCTAGGCAGACCTCTCCTC
AAGCTTGTACGCTGTTGAGTGTGGGGCAGGCGTTTTCCGGTACTCAGAACGTGTCTAATAATCAAAAGGATGAGGCATGGAGGGTGAATGTACGAATACAGGGGTTGGAC
CTTGAACATGGGTATCTCTGTGGCACGATGGAGGCTCTTAATGTTCCCATGGCGGATACACCGGTAGTAACCTTTTGGGAAGGAGAGATTGTGGATGGCAAGAATTATAA
TTTCTTCACTGGAAAATGGCAAGCAGCACCAGAAGATGATATAAGGCACTGGACCAAATTTCCGTCATTTTCGCCCCTGATGAGCCAGGTGGAAGTTGATGGTGGAAAAT
CTTTGGATCTTAGTAATTATCCATGCATATTTATGAGATGGAAAGAGCAATATTTCGTGAATGTTGGAACCGATTGTGGGTTAACCATAGCTGGCTTCTATTATGTTTGC
TTCTCTTGTAGTGATGGTTCCATCAGCGGCTTCTACTATGACCCTAATAGTAGCCCATTTCAGAAGCTTGAGCTCAAATCCACAAATGAGGGAAGATCGGGTTTCAGCTT
CTCATCGTACGAGTTGCGATGA
Protein sequence
MPVRVESSAPSPISSNSSSYRSFSFISSLKIGFFLLRMHSWVNVYQLPHKLEAKSISLFFFGFPGADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLD
LEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVC
FSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR
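
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is hypothetical and is not part of CuGenDB; the two test fragments are the first 42 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the CDS: should match the start of the protein sequence.
print(translate("ATGCCGGTCCGAGTAGAGAGCTCAGCGCCTTCTCCGATTTCA"))  # MPVRVESSAPSPIS
# End of the CDS, including the TGA stop codon.
print(translate("TACGAGTTGCGATGA"))  # YELR
```

Passing the full CDS, with its line breaks, to `translate` should give a string that can be compared directly against the protein sequence listed above.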