CuGenDBv2

Sgr019644 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019644
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: root hair specific 4
Genome location: tig00153349:653666..654763
RNA-Seq Expression: Sgr019644
Synteny: Sgr019644
Gene Ontology terms:
  GO:0048564 - photosystem I assembly (biological process)
  GO:0080183 - response to photooxidative stress (biological process)
  GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domains: IPR040340 - Chloroplast enhancing stress tolerance protein
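The genome location in this record is written as contig:start..end. The following is a minimal sketch of how such a string could be split into its parts and the span length computed. It assumes the coordinates are 1-based and inclusive, which the record does not state; the names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type for illustration)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153349:653666..654763")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene is 1098 bp.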


Homology
GenBank top hits
XP_022924159.1 uncharacterized protein LOC111431688 isoform X1 [Cucurbita moschata] (e value: 2.4e-126; % identity: 76.22)
Query:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS
        +S   A A+P SDN NNERLMSGKL FVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+  DLHLPPPLPPTQFKFLSYSLPNSVNSS
Subjt:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS

Query:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP
        PRFG+  MKKKGK ENQES+LK+SNSTKLKSSVQ+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP
Subjt:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP

Query:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS
         DD FKCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN TGE+D S SLF+DLPMELIRNSVDANAP+S
Subjt:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS

Query:  AAFVFDKDQKGVTKNSSTKLIRKSHESS
        AAFVFDKDQKGVTKNSS+   +KSHE S
Subjt:  AAFVFDKDQKGVTKNSSTKLIRKSHESS

XP_022924160.1 uncharacterized protein LOC111431688 isoform X2 [Cucurbita moschata] (e value: 3.1e-126; % identity: 77.09)
Query:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA
        A A+P SDN NNERLMSGKL FVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+  DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG+
Subjt:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA

Query:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF
          MKKKGK ENQES+LK+SNSTKLKSSVQ+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP DD F
Subjt:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF

Query:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF
        KCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN TGE+D S SLF+DLPMELIRNSVDANAP+SAAFVF
Subjt:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF

Query:  DKDQKGVTKNSSTKLIRKSHESS
        DKDQKGVTKNSS+   +KSHE S
Subjt:  DKDQKGVTKNSSTKLIRKSHESS

XP_023000753.1 uncharacterized protein LOC111495110 isoform X1 [Cucurbita maxima] (e value: 7.4e-128; % identity: 76.83)
Query:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS
        +S   A A+P SDN NNERLMSGKLEFVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+V DLHLPPPLPPTQFKFLSYSLPNSVNSS
Subjt:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS

Query:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP
        PRFG+  MKKKGK ENQES+LK+SNSTKLKSS+Q+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP
Subjt:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP

Query:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS
         DD FKCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN+TGE+D S SLFFDLPMELIRNSVDANAP+S
Subjt:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS

Query:  AAFVFDKDQKGVTKNSSTKLIRKSHESS
        AAFVFDKDQKGVTKN+S+   +KSHESS
Subjt:  AAFVFDKDQKGVTKNSSTKLIRKSHESS

XP_023000754.1 uncharacterized protein LOC111495110 isoform X2 [Cucurbita maxima] (e value: 9.7e-128; % identity: 77.71)
Query:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA
        A A+P SDN NNERLMSGKLEFVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+V DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG+
Subjt:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA

Query:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF
          MKKKGK ENQES+LK+SNSTKLKSS+Q+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP DD F
Subjt:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF

Query:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF
        KCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN+TGE+D S SLFFDLPMELIRNSVDANAP+SAAFVF
Subjt:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF

Query:  DKDQKGVTKNSSTKLIRKSHESS
        DKDQKGVTKN+S+   +KSHESS
Subjt:  DKDQKGVTKNSSTKLIRKSHESS

XP_023520328.1 uncharacterized protein LOC111783644 [Cucurbita pepo subsp. pepo] (e value: 1.1e-126; % identity: 77.4)
Query:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA
        A A+P SDN NNERLMSGKLEFVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+V DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG+
Subjt:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA

Query:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF
          MKKKGK ENQES+LK+SNSTKLKSSVQ+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S   F RTESNK Y   A+ +EP DD F
Subjt:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF

Query:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF
        KCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN TGE+D S SLF+DLPMELIRNSVDANAP+SAAFVF
Subjt:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF

Query:  DKDQKGVTKNSSTKLIRKSHESS
        DKDQKGVTKNSS+   +KSHE S
Subjt:  DKDQKGVTKNSSTKLIRKSHESS

TrEMBL top hits
A0A6J1C9T5 uncharacterized protein LOC111008761 (e value: 2.0e-123; % identity: 81.58)
Query:  GLTYFADNGECVEKEKQILVDPISLRESS--AREDMVVVP-VMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGAGLMKKKGKNENQESQLKVSN
        GLTYF DNGEC EKEKQI VDPISLRESS  AREDM+VVP + +P+  DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FGAGLMKK+GK ENQES+L+VSN
Subjt:  GLTYFADNGECVEKEKQILVDPISLRESS--AREDMVVVP-VMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGAGLMKKKGKNENQESQLKVSN

Query:  STKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCG-GFSRTESNKAYHKSAKKIEPMDDRFKCGALCLFLPGFSKGKAVR
        STK+K SV +PQN L+E TQFRRSKSCGEGRASAPADDLDLWLNKP  V TK+ G GFSRTES K   KSAK IE  DD FKCGALCLFLPGFSKGKAVR
Subjt:  STKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCG-GFSRTESNKAYHKSAKKIEPMDDRFKCGALCLFLPGFSKGKAVR

Query:  SIRK-EETEIEN-ETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVFDKDQKGVTKNSSTKLIRKS
        SIRK EE EI N   SKTEIE+VISRTVSLEKFECGSWASS LPN+TG +DDS +LFFDLPMELIR+SVDANAPVSAAFVFDKDQK VTKNSSTKL+RKS
Subjt:  SIRK-EETEIEN-ETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVFDKDQKGVTKNSSTKLIRKS

Query:  HESS
        HESS
Subjt:  HESS

A0A6J1E8C7 uncharacterized protein LOC111431688 isoform X1 (e value: 1.2e-126; % identity: 76.22)
Query:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS
        +S   A A+P SDN NNERLMSGKL FVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+  DLHLPPPLPPTQFKFLSYSLPNSVNSS
Subjt:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS

Query:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP
        PRFG+  MKKKGK ENQES+LK+SNSTKLKSSVQ+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP
Subjt:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP

Query:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS
         DD FKCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN TGE+D S SLF+DLPMELIRNSVDANAP+S
Subjt:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS

Query:  AAFVFDKDQKGVTKNSSTKLIRKSHESS
        AAFVFDKDQKGVTKNSS+   +KSHE S
Subjt:  AAFVFDKDQKGVTKNSSTKLIRKSHESS

A0A6J1EE09 uncharacterized protein LOC111431688 isoform X2 (e value: 1.5e-126; % identity: 77.09)
Query:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA
        A A+P SDN NNERLMSGKL FVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+  DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG+
Subjt:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA

Query:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF
          MKKKGK ENQES+LK+SNSTKLKSSVQ+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP DD F
Subjt:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF

Query:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF
        KCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN TGE+D S SLF+DLPMELIRNSVDANAP+SAAFVF
Subjt:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF

Query:  DKDQKGVTKNSSTKLIRKSHESS
        DKDQKGVTKNSS+   +KSHE S
Subjt:  DKDQKGVTKNSSTKLIRKSHESS

A0A6J1KKV0 uncharacterized protein LOC111495110 isoform X2 (e value: 4.7e-128; % identity: 77.71)
Query:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA
        A A+P SDN NNERLMSGKLEFVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+V DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG+
Subjt:  AAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGA

Query:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF
          MKKKGK ENQES+LK+SNSTKLKSS+Q+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP DD F
Subjt:  GLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRF

Query:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF
        KCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN+TGE+D S SLFFDLPMELIRNSVDANAP+SAAFVF
Subjt:  KCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVF

Query:  DKDQKGVTKNSSTKLIRKSHESS
        DKDQKGVTKN+S+   +KSHESS
Subjt:  DKDQKGVTKNSSTKLIRKSHESS

A0A6J1KNI3 uncharacterized protein LOC111495110 isoform X1 (e value: 3.6e-128; % identity: 76.83)
Query:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS
        +S   A A+P SDN NNERLMSGKLEFVG TY +D+ ECVEKEKQI VDPISLR+SSAREDM+  P+ AP+V DLHLPPPLPPTQFKFLSYSLPNSVNSS
Subjt:  VSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSS

Query:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP
        PRFG+  MKKKGK ENQES+LK+SNSTKLKSS+Q+ Q ALQE+TQFRRSKSCGEGRASAPADDLDL LNK  F  T S G F RTESNK Y   A+ +EP
Subjt:  PRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEP

Query:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS
         DD FKCGALCLFLPGF K KAVRSIRK EE EI +   S+TEI +VISRTVS+EKFECGSWASSA+PN+TGE+D S SLFFDLPMELIRNSVDANAP+S
Subjt:  MDDRFKCGALCLFLPGFSKGKAVRSIRK-EETEI-ENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVS

Query:  AAFVFDKDQKGVTKNSSTKLIRKSHESS
        AAFVFDKDQKGVTKN+S+   +KSHESS
Subjt:  AAFVFDKDQKGVTKNSSTKLIRKSHESS

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G30850.1 root hair specific 4 (e value: 9.7e-17; % identity: 37.67)
Query:  AKKIEPMDDRFKCGALCLFLPGFSKGKAVRSIRKEETEIENETSKTE--IENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIR----
        A+KI   ++ FKC A CL LPGF K K +RS  K +  +E +  +      + +S   SLEKFECGSWAS+     T    D+  LFFD P+E+ +    
Subjt:  AKKIEPMDDRFKCGALCLFLPGFSKGKAVRSIRKEETEIENETSKTE--IENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIR----

Query:  ---NSVDANAPVSAAFVFDKDQ-----KGVTKNSSTKLIRKSHESS
              D   PV++ F+FD++      + V K  ST+  R+S ESS
Subjt:  ---NSVDANAPVSAAFVFDKDQ-----KGVTKNSSTKLIRKSHESS

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1) (e value: 3.4e-14; % identity: 38.3)
Query:  SRTESNKAYHKS---AKKIEPMDDRFKCGALCLFLPGFSKGKAVRSIRKEETEIENETSKTEIEN-VISRTVSLEKFECGSWAS-SALPNNTGEEDDSMS
        S  ES  +  KS    K     ++ FKC A CL LPGF K + VRS + E++  +     +   N  +S + SLEKFECGSWAS +AL    G       
Subjt:  SRTESNKAYHKS---AKKIEPMDDRFKCGALCLFLPGFSKGKAVRSIRKEETEIENETSKTEIEN-VISRTVSLEKFECGSWAS-SALPNNTGEEDDSMS

Query:  LFFDLPMELIR-NSVDANAPVSAAFVFDKDQKGVTKNSSTK
        L+ DLP+E+I+    D   PVS+ F FDK+   +   S  K
Subjt:  LFFDLPMELIR-NSVDANAPVSAAFVFDKDQKGVTKNSSTK

AT4G20190.1 unknown protein (e value: 2.7e-35; % identity: 38.3)
Query:  EKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQE
        E++I VDP SL   +   DM+V      ++ DL L   +   + KF+S SLPNS  +SPR  + +   K +   Q   L +         VQ+       
Subjt:  EKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGAGLMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQE

Query:  ETQFRRSKSCGEGRASAPADDLDLWLNK------------------PIFVGTKSCGG---FSRTESNKAYHK-----SAKKIEPMDDRFKCGALCLFLPG
         T FRRSKSCGEGRA  P+ D D+ L+K                     +  KS G    FS+TESNK+        ++K I   +D FKC ALCL+LPG
Subjt:  ETQFRRSKSCGEGRASAPADDLDLWLNK------------------PIFVGTKSCGG---FSRTESNKAYHK-----SAKKIEPMDDRFKCGALCLFLPG

Query:  FSKGKAVRSIRKEETEIENETSKTEIEN-----------VISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSV---DANAPVSAAFV
        FSKGK VRS RK ++     T+ T  ++           V+S   SLE+FECGSW SSA+  +  +  D    FFDLP ELI+      D + PVSAAFV
Subjt:  FSKGKAVRSIRKEETEIENETSKTEIEN-----------VISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSV---DANAPVSAAFV

Query:  FDKDQ------KGVTKNSSTKLIRKSHES
        FDK+       KGV K S +K  R+S ES
Subjt:  FDKDQ------KGVTKNSSTKLIRKSHES

AT5G44660.1 unknown protein (e value: 7.4e-25; % identity: 32.67)
Query:  VEKEKQILVDPISLRESSARE--------DMVVVPVMAPEVADLHLPPPLP---------PTQFKFLSYSLPN----SVNSSPRFGAGLMKKKGKNENQE
        ++ E++I +DP S+R  S           DMV +P M+P   DL  P PLP         P Q   L  +L N    S+ +SP+  +GLM+     +   
Subjt:  VEKEKQILVDPISLRESSARE--------DMVVVPVMAPEVADLHLPPPLP---------PTQFKFLSYSLPN----SVNSSPRFGAGLMKKKGKNENQE

Query:  SQLKVSNSTKLKSSV-----QNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKA------------YHKSAKKIEPM
             + S K +S +        Q++L   T     +  G  RA    +      +   +  +KSCG  S+T S+K+             +KS      +
Subjt:  SQLKVSNSTKLKSSV-----QNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKA------------YHKSAKKIEPM

Query:  DDRFKCGALCLFLPGFSKGKAVRSIRKEETEIENETS--------------------KTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFD
        +DRFKC ALCLFLPGFSKGK +RS +K+++     T+                     T    VIS   S+EKF+CGS+ S     + GEE  +   FFD
Subjt:  DDRFKCGALCLFLPGFSKGKAVRSIRKEETEIENETS--------------------KTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFD

Query:  LPMELIRNSV---DANAPVSAAFVFDKDQ-----KGVTKNSSTKLIRKSHES
        LP ELI++     D + PVSAAFVFDK+      KGV K S +K  RK+ ES
Subjt:  LPMELIRNSV---DANAPVSAAFVFDKDQ-----KGVTKNSSTKLIRKSHES


Sequences
CDS sequence
ATGGCGCTTGCAGTTTCTGGTGCTCCTGCAGCTGCAATGCCTGATTCGGATAACTGTAACAATGAGAGGTTGATGTCTGGGAAGCTCGAGTTCGTTGGTTTAACATATTT
TGCCGATAACGGCGAATGTGTGGAAAAGGAAAAGCAGATATTGGTCGATCCGATATCACTGAGAGAGTCATCGGCGAGAGAGGACATGGTCGTTGTTCCTGTCATGGCTC
CTGAAGTCGCTGACCTCCACCTGCCGCCGCCGCTACCTCCCACCCAGTTCAAGTTCTTAAGCTACAGCCTACCTAACTCGGTCAATTCGTCCCCCCGATTCGGTGCAGGT
TTAATGAAAAAGAAGGGGAAAAATGAAAATCAAGAATCGCAACTTAAAGTCTCCAATTCGACGAAGCTCAAATCGTCGGTGCAGAATCCGCAGAATGCTCTGCAGGAGGA
AACTCAATTTCGAAGAAGCAAGTCGTGTGGCGAAGGCAGAGCCAGTGCTCCAGCAGATGATTTGGATCTCTGGTTGAACAAACCAATTTTTGTAGGAACCAAGAGTTGCG
GCGGTTTCTCTAGAACTGAATCTAACAAAGCTTATCATAAGAGTGCAAAGAAAATTGAGCCCATGGATGACAGATTTAAATGTGGAGCACTCTGTCTGTTCTTGCCGGGC
TTCAGCAAAGGGAAGGCGGTTAGATCAATCAGAAAGGAAGAAACAGAGATTGAAAATGAAACATCCAAGACAGAGATTGAAAATGTGATATCGAGGACTGTTTCTTTGGA
GAAATTCGAATGTGGATCCTGGGCTTCATCCGCCCTACCAAATAATACTGGCGAAGAAGACGACTCCATGAGCCTTTTCTTTGATCTGCCAATGGAGTTAATAAGAAATA
GCGTGGACGCAAATGCACCAGTCAGTGCAGCTTTCGTCTTTGATAAAGACCAGAAGGGAGTTACCAAGAACAGCTCGACGAAATTAATCCGAAAATCACACGAATCATCC
ATCATGTTCGCTTTTCTGCATCGTCTCCTTCTTCAGGGCCATCCTCACCAGCTGCTTGCATCACACCCAGATTGCGCAAGGCAAGAGAGGAGTTCAATGCCTTTCTAG
mRNA sequence
ATGGCGCTTGCAGTTTCTGGTGCTCCTGCAGCTGCAATGCCTGATTCGGATAACTGTAACAATGAGAGGTTGATGTCTGGGAAGCTCGAGTTCGTTGGTTTAACATATTT
TGCCGATAACGGCGAATGTGTGGAAAAGGAAAAGCAGATATTGGTCGATCCGATATCACTGAGAGAGTCATCGGCGAGAGAGGACATGGTCGTTGTTCCTGTCATGGCTC
CTGAAGTCGCTGACCTCCACCTGCCGCCGCCGCTACCTCCCACCCAGTTCAAGTTCTTAAGCTACAGCCTACCTAACTCGGTCAATTCGTCCCCCCGATTCGGTGCAGGT
TTAATGAAAAAGAAGGGGAAAAATGAAAATCAAGAATCGCAACTTAAAGTCTCCAATTCGACGAAGCTCAAATCGTCGGTGCAGAATCCGCAGAATGCTCTGCAGGAGGA
AACTCAATTTCGAAGAAGCAAGTCGTGTGGCGAAGGCAGAGCCAGTGCTCCAGCAGATGATTTGGATCTCTGGTTGAACAAACCAATTTTTGTAGGAACCAAGAGTTGCG
GCGGTTTCTCTAGAACTGAATCTAACAAAGCTTATCATAAGAGTGCAAAGAAAATTGAGCCCATGGATGACAGATTTAAATGTGGAGCACTCTGTCTGTTCTTGCCGGGC
TTCAGCAAAGGGAAGGCGGTTAGATCAATCAGAAAGGAAGAAACAGAGATTGAAAATGAAACATCCAAGACAGAGATTGAAAATGTGATATCGAGGACTGTTTCTTTGGA
GAAATTCGAATGTGGATCCTGGGCTTCATCCGCCCTACCAAATAATACTGGCGAAGAAGACGACTCCATGAGCCTTTTCTTTGATCTGCCAATGGAGTTAATAAGAAATA
GCGTGGACGCAAATGCACCAGTCAGTGCAGCTTTCGTCTTTGATAAAGACCAGAAGGGAGTTACCAAGAACAGCTCGACGAAATTAATCCGAAAATCACACGAATCATCC
ATCATGTTCGCTTTTCTGCATCGTCTCCTTCTTCAGGGCCATCCTCACCAGCTGCTTGCATCACACCCAGATTGCGCAAGGCAAGAGAGGAGTTCAATGCCTTTCTAG
Protein sequence
MALAVSGAPAAAMPDSDNCNNERLMSGKLEFVGLTYFADNGECVEKEKQILVDPISLRESSAREDMVVVPVMAPEVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGAG
LMKKKGKNENQESQLKVSNSTKLKSSVQNPQNALQEETQFRRSKSCGEGRASAPADDLDLWLNKPIFVGTKSCGGFSRTESNKAYHKSAKKIEPMDDRFKCGALCLFLPG
FSKGKAVRSIRKEETEIENETSKTEIENVISRTVSLEKFECGSWASSALPNNTGEEDDSMSLFFDLPMELIRNSVDANAPVSAAFVFDKDQKGVTKNSSTKLIRKSHESS
IMFAFLHRLLLQGHPHQLLASHPDCARQERSSMPF
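
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene (an assumption, not stated in the record). The function name `translate` and the table names are hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGGCGCTTGCAGTTTCTGGTGCTCCTGCA"
print(translate(cds_start))  # MALAVSGAPA
```

The printed peptide matches the first ten residues of the protein sequence in this record. Passing the full CDS, joined into a single string, would allow the same comparison over the whole protein.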