; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028541 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028541
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSAP30_Sin3_bdg domain-containing protein
Genome locationtig00153204:2152171..2159098
RNA-Seq ExpressionSgr028541
SyntenySgr028541
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009664 - plant-type cell wall organization (biological process)
GO:0000118 - histone deacetylase complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0005199 - structural constituent of cell wall (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR006706 - Extensin domain
IPR006769 - Calcium uniporter protein, C-terminal
IPR024145 - Histone deacetylase complex subunit SAP30/SAP30-like
IPR025718 - Histone deacetylase complex subunit SAP30, Sin3 binding domain
IPR038291 - SAP30, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152712.1 uncharacterized protein LOC101220556 isoform X1 [Cucumis sativus]3.8e-10296.02Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WN +DMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

XP_008444685.1 PREDICTED: uncharacterized protein LOC103487947 isoform X2 [Cucumis melo]2.7e-10095.52Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WN +DMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGST VDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

XP_022144390.1 uncharacterized protein LOC111014080 isoform X1 [Momordica charantia]7.7e-10396.52Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WNGLDMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGSTKVDL KLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

XP_022144391.1 uncharacterized protein LOC111014080 isoform X2 [Momordica charantia]5.5e-10196.02Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WNGLDMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGST VDL KLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

XP_038883974.1 uncharacterized protein LOC120074938 isoform X1 [Benincasa hispida]4.2e-10195.52Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WNG+DMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSA QGSTKVDLSKLEMAALWRYWRHFNLV AIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A0A0LNK2 SAP30_Sin3_bdg domain-containing protein1.8e-10296.02Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WN +DMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

A0A1S3BAV7 uncharacterized protein LOC103487947 isoform X11.8e-10296.02Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WN +DMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

A0A5A7VFZ9 Histone deacetylase complex subunit SAP30/SAP30-like protein1.8e-10296.02Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WN +DMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

A0A6J1CRH4 uncharacterized protein LOC111014080 isoform X13.7e-10396.52Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WNGLDMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGSTKVDL KLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

A0A6J1CT98 uncharacterized protein LOC111014080 isoform X22.7e-10196.02Show/hide
Query:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
        MIEAVESSINGGFS LQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE
Subjt:  MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE

Query:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
        NL+WNGLDMASDDA KSHKSRHKLHK SGSSHKTMSRSLSCDSQSK SVSAPQGST VDL KLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ
Subjt:  NLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQ

Query:  K
        +
Subjt:  K

SwissProt top hitse value%identityAlignment
O48809 Leucine-rich repeat extensin-like protein 22.6e-0538.32Show/hide
Query:  ISPSFHGRQPSPELKHSHKSPPPPAAFLFTSPPP--PSPPAAFLFASPPP----------PSPP-------AAFLFNSPPPPARRFFKSPPPPAAFLFNS
        +SPS     P P L     SPPPP  ++++SPPP  PSPP  ++++SPPP          P PP           + SP PP  ++  SPPPP  +   S
Subjt:  ISPSFHGRQPSPELKHSHKSPPPPAAFLFTSPPP--PSPPAAFLFASPPP----------PSPP-------AAFLFNSPPPPARRFFKSPPPPAAFLFNS

Query:  PPPPARP
        PPPP  P
Subjt:  PPPPARP

O65375 Leucine-rich repeat extensin-like protein 11.7e-0441.05Show/hide
Query:  ISPSFHGRQPSPELKHSHKSPPPPAAFLFTSPPPP-----SPPAAFLFASPPPP-----SPPAAFLFNSPPPPARRFFKSPPPPAAFLFNSPPPP
        +SPS     P P       SP PP  ++++SPPPP      PP  ++++SPPPP     SPP  ++++SPPPP    + SPPPP      SPPPP
Subjt:  ISPSFHGRQPSPELKHSHKSPPPPAAFLFTSPPPP-----SPPAAFLFASPPPP-----SPPAAFLFNSPPPPARRFFKSPPPPAAFLFNSPPPP

P06599 Extensin3.4e-0540Show/hide
Query:  SPSFHGRQPSPELKHSHKSPPPPAAF---------LFTSPPPPSPPAAFLFASPPPPSPPAAFLFNSPPPPARR-------FFKSPPPPAAFLFNSPPPP
        SP      P+PE  + +KSPPPP  F          + SPPPP+P   + + SPPPP+P   + + SPPPP           +KSPPPP   ++ SPPPP
Subjt:  SPSFHGRQPSPELKHSHKSPPPPAAF---------LFTSPPPPSPPAAFLFASPPPPSPPAAFLFNSPPPPARR-------FFKSPPPPAAFLFNSPPPP

Q84ZL0 Formin-like protein 53.4e-0543.9Show/hide
Query:  PSPELKHSHKSPPPP-AAFLFTSPPPPSPPAAFLFASPPPPSPPAAFLFNSPPPPARRFFKSPPPPAAFLFNSPPPPARPEG
        P P + HS+  PPPP  A  F +PPPP PP    F +PPPP PP      +PP P       PPPP       PPPP  P G
Subjt:  PSPELKHSHKSPPPP-AAFLFTSPPPPSPPAAFLFASPPPPSPPAAFLFNSPPPPARRFFKSPPPPAAFLFNSPPPPARPEG

Q9M1G9 Extensin-23.1e-0640.2Show/hide
Query:  SPELKHSHKSPPPPAAFLFTSPPPP-----------SPPAAFLFASPPPP-----------SPPAAFLFNSPPPP-----ARRFFKSPPPPAAFLFNSPP
        SP  K  +KSPPPP  ++++SPPPP           SPP  ++++SPPPP           SPP  ++++SPPPP      +  +KSPPPP  ++++SPP
Subjt:  SPELKHSHKSPPPPAAFLFTSPPPP-----------SPPAAFLFASPPPP-----------SPPAAFLFNSPPPP-----ARRFFKSPPPPAAFLFNSPP

Query:  PP
        PP
Subjt:  PP

Arabidopsis top hitse value%identityAlignment
AT1G19330.1 unknown protein2.8e-7976.67Show/hide
Query:  MIEAVESS--INGGFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD
        M+EAV+SS  +NGGF Q+QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDD
Subjt:  MIEAVESS--INGGFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD

Query:  LEFENLRWNGLDM-----ASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVD
        L+FEN + NG DM     AS+D  K HKS+ +  + S SSHKTMSRSLS DSQSK S   P  + KVDLSKLEM AL  YWRHFNLVDAIPNPSKEQL+D
Subjt:  LEFENLRWNGLDM-----ASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVD

Query:  LVQRHFMSQK
        +VQRHFMSQ+
Subjt:  LVQRHFMSQK

AT1G19330.2 unknown protein1.1e-8078.05Show/hide
Query:  MIEAVESS--INGGFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD
        M+EAV+SS  +NGGF Q+QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDD
Subjt:  MIEAVESS--INGGFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD

Query:  LEFENLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRH
        L+FEN + NG DM S+D  K HKS+ +  + S SSHKTMSRSLS DSQSK S   P  + KVDLSKLEM AL  YWRHFNLVDAIPNPSKEQL+D+VQRH
Subjt:  LEFENLRWNGLDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRH

Query:  FMSQK
        FMSQ+
Subjt:  FMSQK

AT1G19330.3 unknown protein1.1e-7876.3Show/hide
Query:  MIEAVESS--INGGFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD
        M+EAV+SS  +NGGF Q+QS  GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDD
Subjt:  MIEAVESS--INGGFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDD

Query:  LEFENLRWNGLDM-----ASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGS-VSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLV
        L+FEN + NG DM     AS+D  K HKS+ +  + S SSHKTMSRSLS DSQSK S  + P+   KVDLSKLEM AL  YWRHFNLVDAIPNPSKEQL+
Subjt:  LEFENLRWNGLDM-----ASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGS-VSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLV

Query:  DLVQRHFMSQK
        D+VQRHFMSQ+
Subjt:  DLVQRHFMSQK

AT1G75060.1 unknown protein2.0e-7273.33Show/hide
Query:  GGFSQLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLRWN-G
        GGFSQLQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE + + +WN  
Subjt:  GGFSQLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLRWN-G

Query:  LDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQK
         DM ++D  K HKS+ + H+ S  S K + R +SCDS SK S   P+ + KVDL+KL+MAAL RYWRHFNLVDA+PNP+KEQL+D++QRHFMSQ+
Subjt:  LDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQK

AT1G75060.2 unknown protein1.4e-7072.82Show/hide
Query:  GGFSQLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLRWN-G
        GGFSQLQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE + + +WN  
Subjt:  GGFSQLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFE-NLRWN-G

Query:  LDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQK
         DM ++D  K HKS+ + H+ S  S K + R +SCDS SK S   P+    VDL+KL+MAAL RYWRHFNLVDA+PNP+KEQL+D++QRHFMSQ+
Subjt:  LDMASDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAAGCTGTGGAGAGTTCTATCAATGGAGGTTTCTCGCAGTTGCAGAGCTGTGGGGACAGTAGCGAGGAGGAGCTCTCCGTGCTTCCTCGCCATACCAAGGTGGT
CGTTACCGGAAATAACCGTACCAAATCAGTGCTCGTCGGACTTCAAGGCGTCGTCAAGAAAGCCGTTGGCCTTGGCGGGTGGCATTGGCTGGTTCTAACAAATGGCATAG
AGGTGAAACTACAGCGGAATGCGCTTAGTGTGATCGAGGCTCCGACGGGTAATGAGGAAGACGACGACCTCGAATTTGAAAACTTGCGGTGGAATGGATTGGACATGGCA
TCCGATGACGCCCCAAAATCCCACAAATCAAGGCATAAATTACACAAGTTATCCGGGTCATCTCACAAGACTATGAGCAGATCCCTTTCCTGTGACTCACAGTCCAAGGG
CTCGGTTTCTGCACCGCAAGGATCCACGAAGGTTGACCTTAGTAAATTGGAGATGGCTGCATTGTGGAGATATTGGCGACACTTTAATCTTGTAGATGCTATTCCCAACC
CATCGAAAGAGCAATTGGTAGATCTAGTTCAGAGGCATTTCATGTCACAGAAACCTTACGATAGCTGTCTGTTTCTTTTTGGTTGCGAAGCAACTGGACGAGTTGCAGGT
CATTATGGGTTTTGTGAAGGCTGCAAAGAGACTGAAGACCGTGTGCAAATGAGAGGGAGAAACTGGGGATCTATAGAGTTGGTTTCTCATGCTAACACTGTTGCCATCTC
ACCCTCCTTCCATGGACGTCAACCGTCGCCGGAGCTGAAGCATTCACACAAATCTCCGCCGCCTCCTGCCGCATTCCTCTTCACCTCTCCGCCGCCTCCGAGTCCTCCTG
CTGCGTTCCTTTTCGCCTCTCCGCCACCTCCGAGTCCTCCTGCTGCATTCCTCTTCAACTCTCCTCCACCTCCTGCGCGACGCTTCTTCAAATCTCCGCCGCCTCCTGCT
GCGTTCCTCTTCAATTCACCACCGCCTCCTGCGCGGCCGGAAGGAATAATCGAAATTTTCATCCAATCTCGCCGGGCCGTGCGACGGAGATGCAGTGGACCAGCGGTGCC
GGCCGACGACGCCATATTTCCCTCAAATAAAATTAGGATGCTCGAGAAGAGTCAGAAGGCTCAGGCCCAAGAACAACTGCAACTTTACTGTGGCTTGGGCCTTATAATGG
CCCAGACTCTTGGGCTCACGAGGCTCACTTTCTGGGCCCTTAGTTGGGCCGTGATGGAGCCCATTTGTTTCTCTGTGTTTCTCTGTGACCTCCCTTCAACACGAAGCAAA
AGAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAAGCTGTGGAGAGTTCTATCAATGGAGGTTTCTCGCAGTTGCAGAGCTGTGGGGACAGTAGCGAGGAGGAGCTCTCCGTGCTTCCTCGCCATACCAAGGTGGT
CGTTACCGGAAATAACCGTACCAAATCAGTGCTCGTCGGACTTCAAGGCGTCGTCAAGAAAGCCGTTGGCCTTGGCGGGTGGCATTGGCTGGTTCTAACAAATGGCATAG
AGGTGAAACTACAGCGGAATGCGCTTAGTGTGATCGAGGCTCCGACGGGTAATGAGGAAGACGACGACCTCGAATTTGAAAACTTGCGGTGGAATGGATTGGACATGGCA
TCCGATGACGCCCCAAAATCCCACAAATCAAGGCATAAATTACACAAGTTATCCGGGTCATCTCACAAGACTATGAGCAGATCCCTTTCCTGTGACTCACAGTCCAAGGG
CTCGGTTTCTGCACCGCAAGGATCCACGAAGGTTGACCTTAGTAAATTGGAGATGGCTGCATTGTGGAGATATTGGCGACACTTTAATCTTGTAGATGCTATTCCCAACC
CATCGAAAGAGCAATTGGTAGATCTAGTTCAGAGGCATTTCATGTCACAGAAACCTTACGATAGCTGTCTGTTTCTTTTTGGTTGCGAAGCAACTGGACGAGTTGCAGGT
CATTATGGGTTTTGTGAAGGCTGCAAAGAGACTGAAGACCGTGTGCAAATGAGAGGGAGAAACTGGGGATCTATAGAGTTGGTTTCTCATGCTAACACTGTTGCCATCTC
ACCCTCCTTCCATGGACGTCAACCGTCGCCGGAGCTGAAGCATTCACACAAATCTCCGCCGCCTCCTGCCGCATTCCTCTTCACCTCTCCGCCGCCTCCGAGTCCTCCTG
CTGCGTTCCTTTTCGCCTCTCCGCCACCTCCGAGTCCTCCTGCTGCATTCCTCTTCAACTCTCCTCCACCTCCTGCGCGACGCTTCTTCAAATCTCCGCCGCCTCCTGCT
GCGTTCCTCTTCAATTCACCACCGCCTCCTGCGCGGCCGGAAGGAATAATCGAAATTTTCATCCAATCTCGCCGGGCCGTGCGACGGAGATGCAGTGGACCAGCGGTGCC
GGCCGACGACGCCATATTTCCCTCAAATAAAATTAGGATGCTCGAGAAGAGTCAGAAGGCTCAGGCCCAAGAACAACTGCAACTTTACTGTGGCTTGGGCCTTATAATGG
CCCAGACTCTTGGGCTCACGAGGCTCACTTTCTGGGCCCTTAGTTGGGCCGTGATGGAGCCCATTTGTTTCTCTGTGTTTCTCTGTGACCTCCCTTCAACACGAAGCAAA
AGAAGCTGA
Protein sequenceShow/hide protein sequence
MIEAVESSINGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENLRWNGLDMA
SDDAPKSHKSRHKLHKLSGSSHKTMSRSLSCDSQSKGSVSAPQGSTKVDLSKLEMAALWRYWRHFNLVDAIPNPSKEQLVDLVQRHFMSQKPYDSCLFLFGCEATGRVAG
HYGFCEGCKETEDRVQMRGRNWGSIELVSHANTVAISPSFHGRQPSPELKHSHKSPPPPAAFLFTSPPPPSPPAAFLFASPPPPSPPAAFLFNSPPPPARRFFKSPPPPA
AFLFNSPPPPARPEGIIEIFIQSRRAVRRRCSGPAVPADDAIFPSNKIRMLEKSQKAQAQEQLQLYCGLGLIMAQTLGLTRLTFWALSWAVMEPICFSVFLCDLPSTRSK
RS