CuGenDBv2

MC03g0930 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0930
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: root hair specific 4
Genome location: MC03:15947196..15948221
RNA-Seq Expression: MC03g0930
Synteny: MC03g0930
Gene Ontology terms: GO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domains: IPR040340 - Chloroplast enhancing stress tolerance protein
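
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed in Python. The names `GenomeLocation` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chromosome:start..end' location string."""
    chromosome: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'MC03:15947196..15948221'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chromosome, start, end = match.groups()
    return GenomeLocation(chromosome, int(start), int(end))


loc = parse_location("MC03:15947196..15948221")
print(loc.chromosome, loc.length)  # prints: MC03 1026
```

Under the inclusive-coordinate assumption the span is 1026 bp, which appears consistent with the 342-residue protein sequence listed for this gene (342 × 3 = 1026).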


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6584176.1 hypothetical protein SDJN03_20108, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.17e-170; identity: 77.84%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PDA DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K SV D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DDS++ LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS++   KSHE 
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

XP_022137258.1 uncharacterized protein LOC111008761 [Momordica charantia] (e-value: 4.08e-241; identity: 100%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESS
        KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESS
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESS

Query:  HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
Subjt:  HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

XP_022924159.1 uncharacterized protein LOC111431688 isoform X1 [Cucurbita moschata] (e-value: 1.17e-170; identity: 77.84%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PDA DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K SV D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DDS++ LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS++   KSHE 
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

XP_022924160.1 uncharacterized protein LOC111431688 isoform X2 [Cucurbita moschata] (e-value: 5.33e-171; identity: 77.84%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PDA DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K SV D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DDS++ LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS++   KSHE 
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

XP_023000754.1 uncharacterized protein LOC111495110 isoform X2 [Cucurbita maxima] (e-value: 1.07e-170; identity: 77.55%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PD  DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K S+ D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG + IS+TEI SVISRTVS+EKFECGSWASS +PNDTG+DDS++ LFFDLPMELIR+SVDANAP+SAAFVFDKDQK VTKN+S++   KSHES
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1C9T5 uncharacterized protein LOC111008761 (e-value: 1.97e-241; identity: 100%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESS
        KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESS
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESS

Query:  HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
Subjt:  HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1E8C7 uncharacterized protein LOC111431688 isoform X1 (e-value: 5.65e-171; identity: 77.84%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PDA DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K SV D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DDS++ LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS++   KSHE 
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1EE09 uncharacterized protein LOC111431688 isoform X2 (e-value: 2.58e-171; identity: 77.84%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PDA DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K SV D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DDS++ LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS++   KSHE 
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1KKV0 uncharacterized protein LOC111495110 isoform X2 (e-value: 5.20e-171; identity: 77.55%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PD  DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K S+ D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG + IS+TEI SVISRTVS+EKFECGSWASS +PNDTG+DDS++ LFFDLPMELIR+SVDANAP+SAAFVFDKDQK VTKN+S++   KSHES
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1KNI3 uncharacterized protein LOC111495110 isoform X1 (e-value: 1.14e-170; identity: 77.55%)
Query:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK
        Y  D+ EC EKEKQISVDPISLR+SSA  REDMI  P I++PD  DLHLPPPLPPTQ KFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNSTK
Subjt:  YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTK

Query:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR
        +K S+ D Q  L+E TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSIR
Subjt:  IKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIR

Query:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        KEEEPEIG + IS+TEI SVISRTVS+EKFECGSWASS +PNDTG+DDS++ LFFDLPMELIR+SVDANAP+SAAFVFDKDQK VTKN+S++   KSHES
Subjt:  KEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTN-LFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHV FSASSPS GP+SPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G30850.1 root hair specific 4 (e-value: 9.6e-27; identity: 35.45%)
Query:  FGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDL------------DLWLNKPNSVETKNYGDGFSRTESTK
        F  GL +K    E Q  R+    +T   P++     V K + Q  +S +    R+SA  D              D  +  P   E        S  ES  
Subjt:  FGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQNVLKEVTQFRRSKSCGEGRASAPADDL------------DLWLNKPNSVETKNYGDGFSRTESTK

Query:  DDRKS--AKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLPME
           KS  A+ I   ++ FKC A CL LPGF K K +RS  K +      ++ + +   S +S   SLEKFECGSWAS + L  D G      LFFD P+E
Subjt:  DDRKS--AKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLPME

Query:  LIRSSV-------DANAPVSAAFVFDKDQKV-----VTKNSSTKLVRKSHESS--HHVHFSASSPS---SGPASPASCITPRLRKAREEFNAFLEAQSS
        + + +        D   PV++ F+FD++ +      V K  ST+  R+S ESS    V FS SS S   S P SP +CITPRLRKAR++FN FL AQ++
Subjt:  LIRSSV-------DANAPVSAAFVFDKDQKV-----VTKNSSTKLVRKSHESS--HHVHFSASSPS---SGPASPASCITPRLRKAREEFNAFLEAQSS

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1) (e-value: 1.8e-25; identity: 41.54%)
Query:  STKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI-ESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLP
        S +  R   K     ++ FKC A CL LPGF K + VRS + E+   I   MI  +    S +S + SLEKFECGSWAS + L  + G      L+ DLP
Subjt:  STKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI-ESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLP

Query:  MELIR-SSVDANAPVSAAFVFDKD------QKVVTKNSST--KLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSS
        +E+I+    D   PVS+ F FDK+      + V+ K+SS   + +R   E+S    V FS ++  S PASP +CITPRL KAR++FN FL AQ++
Subjt:  MELIR-SSVDANAPVSAAFVFDKD------QKVVTKNSST--KLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSS

AT4G20190.1 unknown protein (e-value: 4.0e-49; identity: 41.55%)
Query:  EKQISVDPISLRESSATAREDMIVVPAISSP-DAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQN
        E++ISVDP SL   S     DMIV    S P D  DL L   +   ++KF+S SLPNS  +SP        +   + N + R            VLD   
Subjt:  EKQISVDPISLRESSATAREDMIVVPAISSP-DAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQN

Query:  VLKEVTQFRRSKSCGEGRASAPADDLDLWLNKP-NSVETKNYGDG-------------------FSRTESTKDDRK-----SAKGIESTDDGFKCGALCL
        V    T FRRSKSCGEGRA  P+ D D+ L+K  N+   +N+  G                   FS+TES K +R      ++K I S +DGFKC ALCL
Subjt:  VLKEVTQFRRSKSCGEGRASAPADDLDLWLNKP-NSVETKNYGDG-------------------FSRTESTKDDRK-----SAKGIESTDDGFKCGALCL

Query:  FLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI---------ESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSV---DANAPVSA
        +LPGFSKGK VRS RK +        ++ ++           +V+S   SLE+FECGSW SS +  D   D   + FFDLP ELI+      D + PVSA
Subjt:  FLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI---------ESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSV---DANAPVSA

Query:  AFVFDKDQ------KVVTKNSSTKLVRKSHESSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQS
        AFVFDK+       K V K S +K  R+S ES  HV FS SSP S P SP   ITPRL +A E+F++FLEAQ+
Subjt:  AFVFDKDQ------KVVTKNSSTKLVRKSHESSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQS

AT5G44660.1 unknown protein (e-value: 7.6e-32; identity: 33.5%)
Query:  EKEKQISVDPISLRESSATA------REDMIVVPAISSPDAGDLHLPPPLPPTQSKFL-------------------------------------SYSLP
        + E++IS+DP S+R  S +         DM+ +PA+S P   D  +P PL P Q+                                          SLP
Subjt:  EKEKQISVDPISLRESSATA------REDMIVVPAISSPDAGDLHLPPPLPPTQSKFL-------------------------------------SYSLP

Query:  NSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPS-------VLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTE
        NS   SP+  +GLM+    L N+E    + NST   P         L  +        ++RSKSCG               +K  S ++    + F    
Subjt:  NSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPS-------VLDPQNVLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTE

Query:  STKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEI-----------GNVMISK-------TEIESVISRTVSLEKFECGSWASSVLP
           D  KS     + +D FKC ALCLFLPGFSKGK +RS +K++                 + +S+       T   +VIS   S+EKF+CGS+ S    
Subjt:  STKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEI-----------GNVMISK-------TEIESVISRTVSLEKFECGSWASSVLP

Query:  NDTGDDDSTNLFFDLPMELIRSSV---DANAPVSAAFVFDKDQ-----KVVTKNSSTKLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREE
         ++  ++  N FFDLP ELI+S     D + PVSAAFVFDK+      K V K S +K  RK+ ES     V FS SSP S P SPA  I+PRL +A + 
Subjt:  NDTGDDDSTNLFFDLPMELIRSSV---DANAPVSAAFVFDKDQ-----KVVTKNSSTKLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREE

Query:  FNAFLEAQS
        FNAFLEAQ+
Subjt:  FNAFLEAQS
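
The alignment blocks above use the usual BLAST-style midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, assuming that convention holds for this page, identity can be tallied from the midline alone. The function name `block_identity` is hypothetical, and the query line is passed in only to recover the column count when trailing spaces have been stripped from the midline.

```python
from typing import Tuple


def block_identity(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, positives, columns) for one alignment block.

    Assumes a BLAST-style midline: letters are identities, '+' marks a
    conservative substitution, anything else is a mismatch or gap.
    """
    columns = len(query)
    # Restore trailing spaces that may have been lost in extraction.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    positives = identical + padded.count("+")
    return identical, positives, columns


# Last block of the first GenBank hit, copied from the alignment above.
query = "SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA"
midline = "SHHV FSASSPS GP+SPASCITPRLRKAREEFNAFLEAQS+A"
identical, positives, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical, {positives}/{columns} positive")
```

Summing the three counts over every block of a hit and dividing identical by columns gives an overall figure that can be compared with the reported % identity. Small differences are possible if the source tool counts gap columns differently.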


Sequences
CDS sequence
TATTTTGACGATAACGGCGAATGTGCGGAAAAGGAAAAGCAGATATCGGTGGATCCGATATCATTGAGAGAGTCATCGGCGACGGCGAGAGAGGACATGATCGTCGTTCC
TGCCATTTCTTCTCCCGACGCCGGTGATCTTCATCTGCCGCCGCCGCTGCCTCCGACGCAGTCCAAGTTCTTAAGCTATAGCCTACCTAATTCGGTCAATTCATCCCCCC
AATTTGGCGCGGGATTAATGAAAAAGAGAGGGAAACTTGAAAATCAAGAATCGCGGCTTAGAGTCTCCAATTCGACGAAGATCAAACCGTCGGTGCTGGATCCACAGAAT
GTTTTGAAGGAGGTGACTCAATTTCGAAGAAGCAAGTCGTGTGGCGAAGGCAGAGCCAGTGCTCCAGCAGATGATTTGGATCTGTGGCTGAACAAACCAAATTCTGTAGA
AACCAAGAATTACGGTGATGGTTTCTCCAGAACTGAATCTACCAAAGATGATCGAAAGAGTGCAAAGGGAATAGAGTCCACGGATGACGGATTTAAATGTGGAGCTCTCT
GCCTGTTCTTGCCGGGCTTCAGCAAAGGGAAGGCGGTCAGATCAATCAGAAAGGAAGAAGAACCAGAGATTGGAAATGTGATGATATCCAAGACCGAAATTGAAAGTGTG
ATATCAAGAACCGTTTCTCTGGAGAAATTCGAATGCGGATCATGGGCTTCGTCCGTCCTGCCAAATGATACTGGCGACGACGACTCCACGAACCTTTTCTTCGATCTGCC
TATGGAGTTAATAAGAAGCAGCGTAGACGCAAATGCACCGGTCAGTGCAGCTTTCGTTTTCGATAAAGATCAGAAGGTAGTTACCAAGAACAGCTCGACGAAATTAGTCC
GAAAATCGCATGAATCATCTCATCATGTTCACTTTTCTGCATCGTCTCCTTCTTCAGGGCCAGCCTCACCAGCTTCTTGCATCACACCTAGATTGCGCAAGGCAAGAGAG
GAGTTCAATGCCTTTCTAGAAGCTCAGAGCAGTGCT
mRNA sequence
TATTTTGACGATAACGGCGAATGTGCGGAAAAGGAAAAGCAGATATCGGTGGATCCGATATCATTGAGAGAGTCATCGGCGACGGCGAGAGAGGACATGATCGTCGTTCC
TGCCATTTCTTCTCCCGACGCCGGTGATCTTCATCTGCCGCCGCCGCTGCCTCCGACGCAGTCCAAGTTCTTAAGCTATAGCCTACCTAATTCGGTCAATTCATCCCCCC
AATTTGGCGCGGGATTAATGAAAAAGAGAGGGAAACTTGAAAATCAAGAATCGCGGCTTAGAGTCTCCAATTCGACGAAGATCAAACCGTCGGTGCTGGATCCACAGAAT
GTTTTGAAGGAGGTGACTCAATTTCGAAGAAGCAAGTCGTGTGGCGAAGGCAGAGCCAGTGCTCCAGCAGATGATTTGGATCTGTGGCTGAACAAACCAAATTCTGTAGA
AACCAAGAATTACGGTGATGGTTTCTCCAGAACTGAATCTACCAAAGATGATCGAAAGAGTGCAAAGGGAATAGAGTCCACGGATGACGGATTTAAATGTGGAGCTCTCT
GCCTGTTCTTGCCGGGCTTCAGCAAAGGGAAGGCGGTCAGATCAATCAGAAAGGAAGAAGAACCAGAGATTGGAAATGTGATGATATCCAAGACCGAAATTGAAAGTGTG
ATATCAAGAACCGTTTCTCTGGAGAAATTCGAATGCGGATCATGGGCTTCGTCCGTCCTGCCAAATGATACTGGCGACGACGACTCCACGAACCTTTTCTTCGATCTGCC
TATGGAGTTAATAAGAAGCAGCGTAGACGCAAATGCACCGGTCAGTGCAGCTTTCGTTTTCGATAAAGATCAGAAGGTAGTTACCAAGAACAGCTCGACGAAATTAGTCC
GAAAATCGCATGAATCATCTCATCATGTTCACTTTTCTGCATCGTCTCCTTCTTCAGGGCCAGCCTCACCAGCTTCTTGCATCACACCTAGATTGCGCAAGGCAAGAGAG
GAGTTCAATGCCTTTCTAGAAGCTCAGAGCAGTGCT
Protein sequence
YFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQSKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQN
VLKEVTQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEIESV
ISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESSHHVHFSASSPSSGPASPASCITPRLRKARE
EFNAFLEAQSSA
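
The protein sequence should correspond to a frame-1 translation of the CDS. A minimal sketch of that check in Python follows, assuming the standard genetic code (NCBI translation table 1). The page does not say which table was used, and `translate` is a hypothetical helper rather than a CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1.

    Unknown codons become 'X'; a trailing partial codon is ignored.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 42 nucleotides of the CDS listed above.
prefix = "TATTTTGACGATAACGGCGAATGTGCGGAAAAGGAAAAGCAG"
print(translate(prefix))  # prints: YFDDNGECAEKEKQ
```

The translated prefix matches the first 14 residues of the protein sequence. Note that the listed CDS begins with TAT (Y) rather than an ATG start codon and does not end in a stop codon, so the gene model may be partial.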