CuGenDBv2

MS019902 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019902
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: root hair specific 4
Genome location: scaffold22:399348..400376
RNA-Seq Expression: MS019902
Synteny: MS019902
Gene Ontology terms:
GO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domains: IPR040340 - Chloroplast enhancing stress tolerance protein
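As a quick consistency check on the fields above, the span of the genome location can be compared with the length of the protein sequence listed in this record (343 residues). The Python sketch below is illustrative only: the helper name `span_length` is hypothetical, and it assumes the `start..end` coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def span_length(location: str) -> int:
    """Return the length in bp of a 'scaffold:start..end' location string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


location = "scaffold22:399348..400376"  # genome location from this record
protein_length = 343                     # residues in this record's protein sequence

length_bp = span_length(location)
print(length_bp)                         # 1029
print(length_bp == 3 * protein_length)   # True
```

Under that assumption the span is exactly three times the protein length, which would be consistent with an uninterrupted coding region that carries no stop codon.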


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584176.1 hypothetical protein SDJN03_20108, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.1e-136; % identity: 78.2)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PDA DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K SV D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DD S++LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
         SHHV FSASSP SGP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

XP_022137258.1 uncharacterized protein LOC111008761 [Momordica charantia] (e value: 1.9e-186; % identity: 99.42)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQ KFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        KIKPSVLDPQNVLKE TQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
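
The % identity reported for a hit can be related to the middle line of its alignment, where an identical residue pair is shown as the residue letter, a conservative substitution as `+`, and any other column as a space. The Python sketch below is a minimal illustration under that reading; `percent_identity` is a hypothetical helper, not part of the database, and it treats every column of the middle line (including gap columns) as part of the alignment length.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Letters mark identical residues; '+' and spaces mark non-identical columns.
    """
    if not midline:
        raise ValueError("empty midline")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / len(midline), 2)


# Ten columns taken from the first row of the alignment above; the space is a mismatch.
print(percent_identity("PPTQ KFLSY"))       # 90.0

# Whole alignment above: 343 columns, 2 of them non-identical.
print(round(100.0 * (343 - 2) / 343, 2))    # 99.42
```

The second figure agrees with the 99.42 listed for this hit.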

XP_022924159.1 uncharacterized protein LOC111431688 isoform X1 [Cucurbita moschata] (e value: 4.1e-136; % identity: 78.2)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PDA DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K SV D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DD S++LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
         SHHV FSASSP SGP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

XP_022924160.1 uncharacterized protein LOC111431688 isoform X2 [Cucurbita moschata] (e value: 4.1e-136; % identity: 78.2)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PDA DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K SV D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DD S++LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
         SHHV FSASSP SGP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

XP_023000754.1 uncharacterized protein LOC111495110 isoform X2 [Cucurbita maxima] (e value: 7.0e-136; % identity: 77.91)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PD  DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K S+ D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG + IS+TEI SVISRTVS+EKFECGSWASS +PNDTG+DD S++LFFDLPMELIR+SVDANAP+SAAFVFDKDQK VTKN+S+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SSHHV FSASSP SGP+SPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C9T5 uncharacterized protein LOC111008761 (e value: 9.3e-187; % identity: 99.42)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQ KFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        KIKPSVLDPQNVLKE TQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
        RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHES

Query:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
Subjt:  SHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1E8C7 uncharacterized protein LOC111431688 isoform X1 (e value: 2.0e-136; % identity: 78.2)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PDA DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K SV D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DD S++LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
         SHHV FSASSP SGP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1EE09 uncharacterized protein LOC111431688 isoform X2 (e value: 2.0e-136; % identity: 78.2)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PDA DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K SV D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG V IS+TEI SVISRTVS+EKFECGSWASS +PN+TG+DD S++LF+DLPMELIR+SVDANAP+SAAFVFDKDQK VTKNSS+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
         SHHV FSASSP SGP+SPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1KKV0 uncharacterized protein LOC111495110 isoform X2 (e value: 3.4e-136; % identity: 77.91)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PD  DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K S+ D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG + IS+TEI SVISRTVS+EKFECGSWASS +PNDTG+DD S++LFFDLPMELIR+SVDANAP+SAAFVFDKDQK VTKN+S+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SSHHV FSASSP SGP+SPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1KNI3 uncharacterized protein LOC111495110 isoform X1 (e value: 3.4e-136; % identity: 77.91)
Query:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST
        TY  D+ EC EKEKQISVDPISLR+SS  AREDMI  P I++PD  DLHLPPPLPPTQFKFLSYSLPNSVNSSP+FG+  MKK+GKLENQES+L++SNST
Subjt:  TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNST

Query:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI
        K+K S+ D Q  L+E+TQFRRSKSCGEGRASAPADDLDL LNK    ET +YGD F RTES KD R  A+ +E TDDGFKCGALCLFLPGF K KAVRSI
Subjt:  KIKPSVLDPQNVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSI

Query:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE
        RKEEEPEIG + IS+TEI SVISRTVS+EKFECGSWASS +PNDTG+DD S++LFFDLPMELIR+SVDANAP+SAAFVFDKDQK VTKN+S+   +KSHE
Subjt:  RKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWASSVLPNDTGDDD-STNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHE

Query:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA
        SSHHV FSASSP SGP+SPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  SSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSSA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G30850.1 root hair specific 4 (e value: 9.7e-27; % identity: 35.22)
Query:  FGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQNVLKEETQFRRSKSCGE--GRASAPADDL------------DLWLNKPNSVETKNYGDGFSRTES
        F  GL +K    E Q  R+    +T   P++    N L ++   +  KS      R+SA  D              D  +  P   E        S  ES
Subjt:  FGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQNVLKEETQFRRSKSCGE--GRASAPADDL------------DLWLNKPNSVETKNYGDGFSRTES

Query:  TKDDRKS--AKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLP
             KS  A+ I   ++ FKC A CL LPGF K K +RS  K +      ++ + +   S +S   SLEKFECGSWAS + L  D G      LFFD P
Subjt:  TKDDRKS--AKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEIESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLP

Query:  MELIRSSV-------DANAPVSAAFVFDKDQKV-----VTKNSSTKLVRKSHESS--HHVHFSASSPS---SGPASPASCITPRLRKAREEFNAFLEAQS
        +E+ + +        D   PV++ F+FD++ +      V K  ST+  R+S ESS    V FS SS S   S P SP +CITPRLRKAR++FN FL AQ+
Subjt:  MELIRSSV-------DANAPVSAAFVFDKDQKV-----VTKNSSTKLVRKSHESS--HHVHFSASSPS---SGPASPASCITPRLRKAREEFNAFLEAQS

Query:  S
        +
Subjt:  S

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1) (e value: 1.8e-25; % identity: 41.54)
Query:  STKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI-ESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLP
        S +  R   K     ++ FKC A CL LPGF K + VRS + E+   I   MI  +    S +S + SLEKFECGSWAS + L  + G      L+ DLP
Subjt:  STKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI-ESVISRTVSLEKFECGSWAS-SVLPNDTGDDDSTNLFFDLP

Query:  MELIR-SSVDANAPVSAAFVFDKD------QKVVTKNSST--KLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSS
        +E+I+    D   PVS+ F FDK+      + V+ K+SS   + +R   E+S    V FS ++  S PASP +CITPRL KAR++FN FL AQ++
Subjt:  MELIR-SSVDANAPVSAAFVFDKD------QKVVTKNSST--KLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQSS

AT4G20190.1 unknown protein (e value: 1.2e-48; % identity: 41.55)
Query:  EKQISVDPISLRESSATAREDMIVVPAISSP-DAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQN
        E++ISVDP SL   S     DMIV    S P D  DL L   +   + KF+S SLPNS  +SP        +   + N + R            VLD   
Subjt:  EKQISVDPISLRESSATAREDMIVVPAISSP-DAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQN

Query:  VLKEETQFRRSKSCGEGRASAPADDLDLWLNKP-NSVETKNYGDG-------------------FSRTESTKDDRK-----SAKGIESTDDGFKCGALCL
        V    T FRRSKSCGEGRA  P+ D D+ L+K  N+   +N+  G                   FS+TES K +R      ++K I S +DGFKC ALCL
Subjt:  VLKEETQFRRSKSCGEGRASAPADDLDLWLNKP-NSVETKNYGDG-------------------FSRTESTKDDRK-----SAKGIESTDDGFKCGALCL

Query:  FLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI---------ESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSV---DANAPVSA
        +LPGFSKGK VRS RK +        ++ ++           +V+S   SLE+FECGSW SS +  D   D   + FFDLP ELI+      D + PVSA
Subjt:  FLPGFSKGKAVRSIRKEEEPEIGNVMISKTEI---------ESVISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSV---DANAPVSA

Query:  AFVFDKDQ------KVVTKNSSTKLVRKSHESSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQS
        AFVFDK+       K V K S +K  R+S ES  HV FS SSP S P SP   ITPRL +A E+F++FLEAQ+
Subjt:  AFVFDKDQ------KVVTKNSSTKLVRKSHESSHHVHFSASSPSSGPASPASCITPRLRKAREEFNAFLEAQS

AT5G44660.1 unknown protein (e value: 3.4e-32; % identity: 32.84)
Query:  EKEKQISVDPISLRESSATA------REDMIVVPAISSPDAGDLHLPPPLPPTQ------------------------------------FKFLSY-SLP
        + E++IS+DP S+R  S +         DM+ +PA+S P   D  +P PL P Q                                    F+     SLP
Subjt:  EKEKQISVDPISLRESSATA------REDMIVVPAISSPDAGDLHLPPPLPPTQ------------------------------------FKFLSY-SLP

Query:  NSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQNVLKEET---QFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKD
        NS   SP+  +GLM+     E        + S K +  ++      ++++    ++RSKSCG               +K  S ++    + F       D
Subjt:  NSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQNVLKEET---QFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKD

Query:  DRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEI-----------GNVMISK-------TEIESVISRTVSLEKFECGSWASSVLPNDTG
          KS     + +D FKC ALCLFLPGFSKGK +RS +K++                 + +S+       T   +VIS   S+EKF+CGS+ S     ++ 
Subjt:  DRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEI-----------GNVMISK-------TEIESVISRTVSLEKFECGSWASSVLPNDTG

Query:  DDDSTNLFFDLPMELIRSSV---DANAPVSAAFVFDKDQ-----KVVTKNSSTKLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREEFNAF
         ++  N FFDLP ELI+S     D + PVSAAFVFDK+      K V K S +K  RK+ ES     V FS SSP S P SPA  I+PRL +A + FNAF
Subjt:  DDDSTNLFFDLPMELIRSSV---DANAPVSAAFVFDKDQ-----KVVTKNSSTKLVRKSHESS--HHVHFSASSPSSGPASPASCITPRLRKAREEFNAF

Query:  LEAQS
        LEAQ+
Subjt:  LEAQS


Sequences
CDS sequence
ACATATTTTGACGATAACGGCGAATGTGCGGAAAAGGAAAAGCAGATATCGGTGGATCCGATATCATTGAGAGAGTCATCGGCGACGGCGAGAGAGGACATGATCGTCGT
TCCTGCCATTTCTTCTCCCGACGCCGGTGATCTTCATCTGCCGCCGCCGCTGCCTCCGACGCAGTTCAAGTTCTTAAGCTATAGCCTACCTAATTCGGTCAATTCATCCC
CCCAATTTGGCGCGGGATTAATGAAAAAGAGAGGGAAACTTGAAAATCAAGAATCGCGGCTTAGAGTCTCCAATTCGACGAAGATCAAACCGTCGGTGCTGGATCCACAG
AATGTTTTGAAGGAGGAGACTCAATTTCGAAGAAGCAAGTCGTGTGGCGAAGGCAGAGCCAGTGCTCCAGCAGATGATTTGGATCTGTGGCTGAACAAACCAAATTCTGT
AGAAACCAAGAATTACGGTGATGGTTTCTCCAGAACTGAATCTACCAAAGATGATCGAAAGAGTGCAAAGGGAATAGAGTCCACGGATGACGGATTTAAATGTGGAGCTC
TCTGCCTGTTCTTGCCGGGCTTCAGCAAAGGGAAGGCGGTCAGATCAATCAGAAAGGAAGAAGAACCAGAGATTGGAAATGTGATGATATCCAAGACCGAAATTGAAAGT
GTGATATCGAGAACCGTTTCTCTGGAGAAATTCGAATGCGGATCATGGGCTTCGTCCGTCCTGCCAAATGATACTGGCGACGACGACTCCACGAACCTTTTCTTCGATCT
GCCTATGGAGTTAATAAGAAGCAGCGTAGACGCAAATGCACCGGTCAGTGCAGCTTTCGTTTTCGATAAAGATCAGAAGGTAGTTACCAAGAACAGCTCGACGAAATTAG
TCCGAAAATCGCATGAATCATCTCATCATGTTCACTTTTCTGCATCGTCTCCTTCTTCAGGGCCAGCCTCACCAGCTTCTTGCATCACACCTAGATTGCGCAAGGCAAGA
GAGGAGTTCAATGCCTTTCTAGAAGCCCAGAGCAGTGCT
mRNA sequence
ACATATTTTGACGATAACGGCGAATGTGCGGAAAAGGAAAAGCAGATATCGGTGGATCCGATATCATTGAGAGAGTCATCGGCGACGGCGAGAGAGGACATGATCGTCGT
TCCTGCCATTTCTTCTCCCGACGCCGGTGATCTTCATCTGCCGCCGCCGCTGCCTCCGACGCAGTTCAAGTTCTTAAGCTATAGCCTACCTAATTCGGTCAATTCATCCC
CCCAATTTGGCGCGGGATTAATGAAAAAGAGAGGGAAACTTGAAAATCAAGAATCGCGGCTTAGAGTCTCCAATTCGACGAAGATCAAACCGTCGGTGCTGGATCCACAG
AATGTTTTGAAGGAGGAGACTCAATTTCGAAGAAGCAAGTCGTGTGGCGAAGGCAGAGCCAGTGCTCCAGCAGATGATTTGGATCTGTGGCTGAACAAACCAAATTCTGT
AGAAACCAAGAATTACGGTGATGGTTTCTCCAGAACTGAATCTACCAAAGATGATCGAAAGAGTGCAAAGGGAATAGAGTCCACGGATGACGGATTTAAATGTGGAGCTC
TCTGCCTGTTCTTGCCGGGCTTCAGCAAAGGGAAGGCGGTCAGATCAATCAGAAAGGAAGAAGAACCAGAGATTGGAAATGTGATGATATCCAAGACCGAAATTGAAAGT
GTGATATCGAGAACCGTTTCTCTGGAGAAATTCGAATGCGGATCATGGGCTTCGTCCGTCCTGCCAAATGATACTGGCGACGACGACTCCACGAACCTTTTCTTCGATCT
GCCTATGGAGTTAATAAGAAGCAGCGTAGACGCAAATGCACCGGTCAGTGCAGCTTTCGTTTTCGATAAAGATCAGAAGGTAGTTACCAAGAACAGCTCGACGAAATTAG
TCCGAAAATCGCATGAATCATCTCATCATGTTCACTTTTCTGCATCGTCTCCTTCTTCAGGGCCAGCCTCACCAGCTTCTTGCATCACACCTAGATTGCGCAAGGCAAGA
GAGGAGTTCAATGCCTTTCTAGAAGCCCAGAGCAGTGCT
Protein sequence
TYFDDNGECAEKEKQISVDPISLRESSATAREDMIVVPAISSPDAGDLHLPPPLPPTQFKFLSYSLPNSVNSSPQFGAGLMKKRGKLENQESRLRVSNSTKIKPSVLDPQ
NVLKEETQFRRSKSCGEGRASAPADDLDLWLNKPNSVETKNYGDGFSRTESTKDDRKSAKGIESTDDGFKCGALCLFLPGFSKGKAVRSIRKEEEPEIGNVMISKTEIES
VISRTVSLEKFECGSWASSVLPNDTGDDDSTNLFFDLPMELIRSSVDANAPVSAAFVFDKDQKVVTKNSSTKLVRKSHESSHHVHFSASSPSSGPASPASCITPRLRKAR
EEFNAFLEAQSSA