; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027084 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027084
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein OS-9 homolog
Genome locationtig00153048:903613..908391
RNA-Seq ExpressionSgr027084
SyntenySgr027084
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0030433 - ubiquitin-dependent ERAD pathway (biological process)
GO:0030968 - endoplasmic reticulum unfolded protein response (biological process)
GO:0030970 - retrograde protein transport, ER to cytosol (biological process)
GO:0005788 - endoplasmic reticulum lumen (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR009011 - Mannose-6-phosphate receptor binding domain superfamily
IPR012913 - Protein OS9-like domain
IPR044865 - MRH domain
IPR045149 - Protein OS-9-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587864.1 Protein OS-9-like protein, partial [Cucurbita argyrosperma subsp. sororia]6.0e-12886.14Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMPNKNGKNY+CFLPK+EKSKSGKPAIQ+N+SSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLEDDKVVQEFVLGV+DAEATANL+ENLSD+STLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+SITELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLP+DYKETEQ+EEL DD+IVMVTD+KYPKNES+E
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

KAG7021752.1 Protein OS-9-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-12886.14Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMPNKNGKNY+CFLPK+EKSKSGKPAIQ+N+SSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLEDDKVVQEFVLGV+DAEATANL+ENLSD+STLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+SITELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLP+DYKETEQ+EEL DD+IVMVTD+KYPKNES+E
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

XP_022932897.1 protein OS-9 homolog [Cucurbita moschata]7.8e-12886.14Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMPNKNGKNY+CFLPK+EKSKSGKPAIQ+N+SSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLEDDKVVQEFVLGV+DAEATANL+ENLSD+STLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+SITELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLP+DYKETEQ+EEL DD+IVMVTD+KYPKNES+E
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

XP_023530346.1 protein OS-9 homolog [Cucurbita pepo subsp. pepo]6.0e-12886.14Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMPNKNGKNY+CFLPK+EKSKSGKPAIQ+N+SSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLEDDKVVQEFVLGV+DAEATANL+ENLSD+STLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+SITELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLP+DYKETEQ+EEL DD+IVMVTD+KYPKNES+E
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

XP_038878473.1 protein OS-9 homolog isoform X1 [Benincasa hispida]1.7e-12786.52Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMPNKNGKNYLC+LPKVEKSKSGKP +QLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLED+KVVQEFVLGV+DAEATANLNENLSDISTLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+SITELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKHPLFQEERPVWY INCNVLPDDYKETE+SEEL  D+IVMVTD+KYPKNESE+
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

TrEMBL top hitse value%identityAlignment
A0A1S3B9X7 protein OS-9 homolog isoform X21.1e-12284.33Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  D+DQESVFMPNKNGKNYLC+LPKVEKSKSGKP+IQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSE-PRAMIHSITEL
        WTYEFCYQK LRQFHLED+KVVQEF+LGV+DAEATA LNENLSDISTLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSE PRAMI+SITEL
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSE-PRAMIHSITEL

Query:  STCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        STCKYALTVRCPTLCKH LF+EERPVWY INCN LPDDYKETE+SEE   D+IVMVTDIKYPKNESE+
Subjt:  STCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

A0A6J1C180 protein OS-9 homolog1.4e-12785.39Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMPNKNGKNYLCFLPKVEKSK+GKPA+Q+NMSSMIVESEK+VKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLEDD++VQ+FVLGV+DA+ATANLN+NLSDISTLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+S+TELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKHPLFQEERPVWYTINCNV+PDDYKETEQSEE+E DQIVMVTDIKYPKN+SEE
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

A0A6J1EY24 protein OS-9 homolog3.8e-12886.14Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMPNKNGKNY+CFLPK+EKSKSGKPAIQ+N+SSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLEDDKVVQEFVLGV+DAEATANL+ENLSD+STLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+SITELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLP+DYKETEQ+EEL DD+IVMVTD+KYPKNES+E
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

A0A6J1HCF9 protein OS-9 homolog8.7e-12585.77Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW
        FSR  R P       N+ F   D  +  DDDQESVFMP KNGKNYLC+LPKVEKSKS KPAIQ NM+SMI+ESEKRVKLKTPDELLEALKEQCFVRQEGW
Subjt:  FSRRRRRPDLPLPAENLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGW

Query:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS
        WTYEFCYQK LRQFHLEDDKVVQEFVLGV+DAE TANLNENLSDISTLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI SITELS
Subjt:  WTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELS

Query:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        TCKYALTVRCPTLCKH LFQEERPVWYTINCNVLPDDYKETE SE+LE D+IVMVTD+KYPKNESEE
Subjt:  TCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

A0A6J1HVT6 protein OS-9 homolog2.7e-12688.49Show/hide
Query:  NLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGWWTYEFCYQKKLRQFH
        N+ F   D  +  DDDQESVFMPNKNGKNY+CFLPK+EKSK+GKPAIQ+NMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGWWTYEFCYQK LRQFH
Subjt:  NLHFAAPD--FTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGWWTYEFCYQKKLRQFH

Query:  LEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELSTCKYALTVRCPTLCK
        LEDDKVVQEFVLGV+DA+ATANL+ENLSD+STLKDP SKDASQRYHAHHYTNGTMCDLTN PRETEVRFVCSEPRAMI+SITELSTCKY+LTVRCPTLCK
Subjt:  LEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELSTCKYALTVRCPTLCK

Query:  HPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE
        HPLFQEERPVWYTINCNVLP+ YKETEQ+EEL DD+IVMVTDIKYPKNES+E
Subjt:  HPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE

SwissProt top hitse value%identityAlignment
Q13438 Protein OS-94.3e-2037.76Show/hide
Query:  ELLEALKE-QCFVRQEGWWTYEFCYQKKLRQFHLEDDKVVQEFV-LGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETE
        ELL  +++  C ++ + WWTYEFCY + ++Q+H+ED ++  E + LG + +           D  T K  S +   +RYH+  Y NG+ CDL   PRE E
Subjt:  ELLEALKE-QCFVRQEGWWTYEFCYQKKLRQFHLEDDKVVQEFV-LGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETE

Query:  VRFVCSEPRAM----IHSITELSTCKYALTVRCPTLCKHPLFQ
        VRF+C E   +    I  + E  +C Y LT+R P LC HPL +
Subjt:  VRFVCSEPRAM----IHSITELSTCKYALTVRCPTLCKHPLFQ

Q3MHX6 Protein OS-91.9e-2038.46Show/hide
Query:  ELLEALKE-QCFVRQEGWWTYEFCYQKKLRQFHLEDDKVVQEFV-LGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETE
        ELL  +K+  C ++ + WWTYEFCY + ++Q+H+ED ++  E + LG + +           D  T K  S +   +RYH+  Y NG+ CDL   PRE E
Subjt:  ELLEALKE-QCFVRQEGWWTYEFCYQKKLRQFHLEDDKVVQEFV-LGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETE

Query:  VRFVCSEPRAM----IHSITELSTCKYALTVRCPTLCKHPLFQ
        VRF+C E   +    I  + E  +C Y LT+R P LC HPL +
Subjt:  VRFVCSEPRAM----IHSITELSTCKYALTVRCPTLCKHPLFQ

Q67WM9 Protein OS-9 homolog2.8e-6450.2Show/hide
Query:  FSRRRRRPDLPLPAENLHFAAPDFTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGWWT
        F R  R P   +     H     F  ++ QESV M +  GK+Y CFLP VE++K+ K  I  N +++I+ESE+RVK K PDELLE LK+QCF R EGWW+
Subjt:  FSRRRRRPDLPLPAENLHFAAPDFTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDELLEALKEQCFVRQEGWWT

Query:  YEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELSTC
        YEFCY  K+RQ H+E +KV+QE+VLG +DA+AT    EN +  S  +D +  D S+RYH H YTNGT+CDLT+ PRETEVRFVCSEP  +I SI E+S+C
Subjt:  YEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIHSITELSTC

Query:  KYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTD
        KY LTV+ P LCK+PLFQ+E+    +I+CN L  + + T   + L  +  +++ D
Subjt:  KYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTD

Q8GWH3 Protein OS-9 homolog2.3e-7455.02Show/hide
Query:  SAHCQYDAVTRVHEDATFFFSRRRRRPDLPLPAENLHFAAPDFTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPD
        S+H   D +   H   T  FSR  R P   +  E L   +P F   D+ ES+ M +K+G  +LC+LPK EK+ SG  + Q N+S++++E+++ VKLKTPD
Subjt:  SAHCQYDAVTRVHEDATFFFSRRRRRPDLPLPAENLHFAAPDFTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPD

Query:  ELLEALKEQCFVRQEGWWTYEFCYQKKLRQFHLEDD-KVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEV
        ELL+ L E+C  RQEGWW+YEFC+QK +RQ H+ED+ K+VQEF LG FD EATA  N+ +SD ST       DASQRYH+H YTNGT CDLT  PRE EV
Subjt:  ELLEALKEQCFVRQEGWWTYEFCYQKKLRQFHLEDD-KVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEV

Query:  RFVCSEPRAMIHSITELSTCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQ
        RFVC+E RAM+ SITELSTCKYALTV+CPTLCKHPLFQ E+PV +TI+CN +P +   T   EE   D+
Subjt:  RFVCSEPRAMIHSITELSTCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQ

Q8K2C7 Protein OS-92.8e-1931.52Show/hide
Query:  ELLEALKE-QCFVRQEGWWTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEV
        ELL  +++  C ++ + WWTYEFCY + ++Q+H+ED ++  + VL +   +++ N ++  +        S +   +RYH+  Y NG+ CDL   PRE EV
Subjt:  ELLEALKE-QCFVRQEGWWTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEV

Query:  RFVCSEPRAM----IHSITELSTCKYALTVRCPTLCKHPLFQ---EERPVWYTINCNVLPDDY-------KETEQSEELEDDQI
        RF+C E   +    I  + E  +C Y LT+R   LC HPL +      P     +  + PD+Y        E++Q EE   +++
Subjt:  RFVCSEPRAM----IHSITELSTCKYALTVRCPTLCKHPLFQ---EERPVWYTINCNVLPDDY-------KETEQSEELEDDQI

Arabidopsis top hitse value%identityAlignment
AT5G35080.1 INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Mannose-6-phosphate receptor, binding (InterPro:IPR009011), Glucosidase II beta subunit-like (InterPro:IPR012913); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.6e-7555.02Show/hide
Query:  SAHCQYDAVTRVHEDATFFFSRRRRRPDLPLPAENLHFAAPDFTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPD
        S+H   D +   H   T  FSR  R P   +  E L   +P F   D+ ES+ M +K+G  +LC+LPK EK+ SG  + Q N+S++++E+++ VKLKTPD
Subjt:  SAHCQYDAVTRVHEDATFFFSRRRRRPDLPLPAENLHFAAPDFTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPD

Query:  ELLEALKEQCFVRQEGWWTYEFCYQKKLRQFHLEDD-KVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEV
        ELL+ L E+C  RQEGWW+YEFC+QK +RQ H+ED+ K+VQEF LG FD EATA  N+ +SD ST       DASQRYH+H YTNGT CDLT  PRE EV
Subjt:  ELLEALKEQCFVRQEGWWTYEFCYQKKLRQFHLEDD-KVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEV

Query:  RFVCSEPRAMIHSITELSTCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQ
        RFVC+E RAM+ SITELSTCKYALTV+CPTLCKHPLFQ E+PV +TI+CN +P +   T   EE   D+
Subjt:  RFVCSEPRAMIHSITELSTCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACCAAACAAGAGCTTCCGTGGAAGTGCACACTGTCAATACGACGCCGTAACGCGAGTTCACGAAGACGCCACTTTCTTCTTCTCTCGTCGTCGCCGTCGTCCCGA
CCTCCCGCTCCCAGCTGAGAACCTCCACTTTGCTGCTCCCGATTTCACCGTGGATGATGACCAGGAATCTGTGTTTATGCCCAATAAAAATGGAAAGAATTACTTATGTT
TCTTGCCTAAGGTGGAGAAGTCCAAGAGTGGAAAGCCAGCTATTCAGCTGAACATGAGTAGCATGATTGTGGAATCTGAGAAGCGAGTCAAATTGAAGACTCCAGATGAG
CTGCTTGAAGCACTAAAAGAGCAATGCTTTGTTAGGCAAGAGGGTTGGTGGACATATGAATTTTGTTACCAGAAGAAGCTACGACAATTTCATTTGGAGGATGATAAGGT
AGTTCAGGAGTTTGTATTGGGCGTCTTTGATGCAGAGGCAACTGCTAATCTCAATGAGAATCTCTCTGATATATCAACTTTGAAGGATCCTAGCTCAAAAGATGCGTCCC
AAAGGTATCATGCTCATCATTACACAAATGGAACCATGTGTGATCTCACGAATTGGCCACGAGAAACTGAGGTTAGATTTGTTTGCTCGGAGCCCAGAGCCATGATTCAT
TCTATCACAGAACTTTCAACATGCAAGTATGCACTTACAGTGCGATGCCCGACCCTCTGCAAGCATCCATTATTCCAGGAAGAGAGACCAGTGTGGTACACCATTAACTG
CAACGTGCTCCCCGATGATTACAAGGAAACAGAGCAGAGTGAAGAATTAGAAGATGATCAGATTGTCATGGTTACAGACATCAAATATCCAAAAAATGAATCTGAAGAGT
AA
mRNA sequenceShow/hide mRNA sequence
ATGTTACCAAACAAGAGCTTCCGTGGAAGTGCACACTGTCAATACGACGCCGTAACGCGAGTTCACGAAGACGCCACTTTCTTCTTCTCTCGTCGTCGCCGTCGTCCCGA
CCTCCCGCTCCCAGCTGAGAACCTCCACTTTGCTGCTCCCGATTTCACCGTGGATGATGACCAGGAATCTGTGTTTATGCCCAATAAAAATGGAAAGAATTACTTATGTT
TCTTGCCTAAGGTGGAGAAGTCCAAGAGTGGAAAGCCAGCTATTCAGCTGAACATGAGTAGCATGATTGTGGAATCTGAGAAGCGAGTCAAATTGAAGACTCCAGATGAG
CTGCTTGAAGCACTAAAAGAGCAATGCTTTGTTAGGCAAGAGGGTTGGTGGACATATGAATTTTGTTACCAGAAGAAGCTACGACAATTTCATTTGGAGGATGATAAGGT
AGTTCAGGAGTTTGTATTGGGCGTCTTTGATGCAGAGGCAACTGCTAATCTCAATGAGAATCTCTCTGATATATCAACTTTGAAGGATCCTAGCTCAAAAGATGCGTCCC
AAAGGTATCATGCTCATCATTACACAAATGGAACCATGTGTGATCTCACGAATTGGCCACGAGAAACTGAGGTTAGATTTGTTTGCTCGGAGCCCAGAGCCATGATTCAT
TCTATCACAGAACTTTCAACATGCAAGTATGCACTTACAGTGCGATGCCCGACCCTCTGCAAGCATCCATTATTCCAGGAAGAGAGACCAGTGTGGTACACCATTAACTG
CAACGTGCTCCCCGATGATTACAAGGAAACAGAGCAGAGTGAAGAATTAGAAGATGATCAGATTGTCATGGTTACAGACATCAAATATCCAAAAAATGAATCTGAAGAGT
AA
Protein sequenceShow/hide protein sequence
MLPNKSFRGSAHCQYDAVTRVHEDATFFFSRRRRRPDLPLPAENLHFAAPDFTVDDDQESVFMPNKNGKNYLCFLPKVEKSKSGKPAIQLNMSSMIVESEKRVKLKTPDE
LLEALKEQCFVRQEGWWTYEFCYQKKLRQFHLEDDKVVQEFVLGVFDAEATANLNENLSDISTLKDPSSKDASQRYHAHHYTNGTMCDLTNWPRETEVRFVCSEPRAMIH
SITELSTCKYALTVRCPTLCKHPLFQEERPVWYTINCNVLPDDYKETEQSEELEDDQIVMVTDIKYPKNESEE