; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018798 (gene) of Snake gourd v1 genome

Gene IDTan0018798
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBEST Arabidopsis thaliana protein match is: root hair specific 4 .
Genome locationLG05:82067452..82068577
RNA-Seq ExpressionTan0018798
SyntenyTan0018798
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022924159.1 uncharacterized protein LOC111431688 isoform X1 [Cucurbita moschata]4.4e-16584.38Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK  FVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APD  D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K SVQDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSA+P +TGEDDSSS+LF+DLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KNSS+   QK HE SHHVRFSASSP SGPSSPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

XP_022924160.1 uncharacterized protein LOC111431688 isoform X2 [Cucurbita moschata]4.4e-16584.38Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK  FVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APD  D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K SVQDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSA+P +TGEDDSSS+LF+DLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KNSS+   QK HE SHHVRFSASSP SGPSSPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

XP_023000753.1 uncharacterized protein LOC111495110 isoform X1 [Cucurbita maxima]1.4e-16684.93Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK EFVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APDV D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K S+QDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGK+R+SRTEIGSVISRTVSMEKFECGSWASSA+P DTGEDDSSS+LFFDLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KN+S+   QK HESSHHVRFSASSP SGPSSPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

XP_023000754.1 uncharacterized protein LOC111495110 isoform X2 [Cucurbita maxima]1.4e-16684.93Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK EFVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APDV D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K S+QDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGK+R+SRTEIGSVISRTVSMEKFECGSWASSA+P DTGEDDSSS+LFFDLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KN+S+   QK HESSHHVRFSASSP SGPSSPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

XP_023520328.1 uncharacterized protein LOC111783644 [Cucurbita pepo subsp. pepo]2.3e-16685.21Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK EFVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APDV D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K SVQDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SY DF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSA+P +TGEDDSSS+LF+DLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KNSS+   QK HE SHHVRFSASSP SGPSSPASCITPRLRKAREEFNAFLEAQSSA
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

TrEMBL top hitse value%identityAlignment
A0A6J1C9T5 uncharacterized protein LOC1110087611.4e-14882.37Show/hide
Query:  GSTYLADNGECVEKEKQISVDPISLRESS--AREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF--SLMKKKGKIENQESQLKVSN
        G TY  DNGEC EKEKQISVDPISLRESS  AREDMI+VP IS+PD  D+HLPPPLPPTQ KFLSYSLPNS NSSP+F   LMKK+GK+ENQES+L+VSN
Subjt:  GSTYLADNGECVEKEKQISVDPISLRESS--AREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF--SLMKKKGKIENQESQLKVSN

Query:  STKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGD-FSRTESNKDYRNSAKNFEPTDDGFKCGALCLFLPGFSKGKVVR
        STKIKPSV D Q  L E  QFRRSKSCGEGRASAPADDLDLWLNK    ETK+YGD FSRTES KD R SAK  E TDDGFKCGALCLFLPGFSKGK VR
Subjt:  STKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGD-FSRTESNKDYRNSAKNFEPTDDGFKCGALCLFLPGFSKGKVVR

Query:  SIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVFDKDQKGVPKNSSTKLVQKP
        SIRKEEEPEIG V +S+TEI SVISRTVS+EKFECGSWASS LP DTG+DD S+NLFFDLPMELIR+SVDANAPV+AAFVFDKDQK V KNSSTKLV+K 
Subjt:  SIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVFDKDQKGVPKNSSTKLVQKP

Query:  HESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        HESSHHV FSASSPSSGP+SPASCITPRLRKAREEFNAFLEAQSSA
Subjt:  HESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1E8C7 uncharacterized protein LOC111431688 isoform X12.1e-16584.38Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK  FVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APD  D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K SVQDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSA+P +TGEDDSSS+LF+DLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KNSS+   QK HE SHHVRFSASSP SGPSSPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1EE09 uncharacterized protein LOC111431688 isoform X22.1e-16584.38Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK  FVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APD  D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K SVQDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSA+P +TGEDDSSS+LF+DLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KNSS+   QK HE SHHVRFSASSP SGPSSPASCITPRLRKAREEFNAFLEAQS+A
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1KKV0 uncharacterized protein LOC111495110 isoform X26.6e-16784.93Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK EFVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APDV D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K S+QDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGK+R+SRTEIGSVISRTVSMEKFECGSWASSA+P DTGEDDSSS+LFFDLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KN+S+   QK HESSHHVRFSASSP SGPSSPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

A0A6J1KNI3 uncharacterized protein LOC111495110 isoform X16.6e-16784.93Show/hide
Query:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF
        MAPAL   DN NNERLMSGK EFVGSTY +D+ ECVEKEKQISVDPISLR+SSAREDMI  P I+APDV D+HLPPPLPPTQFKFLSYSLPNS NSSPRF
Subjt:  MAPAL---DNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRF

Query:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF
          MKKKGK+ENQES+LK+SNSTK+K S+QDIQ+AL E+ QFRRSKSCGEGRASAPADDLDL LNK KFPET SYGDF RTESNKDYRN A+N EPTDDGF
Subjt:  SLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGF

Query:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF
        KCGALCLFLPGF K K VRSIRKEEEPEIGK+R+SRTEIGSVISRTVSMEKFECGSWASSA+P DTGEDDSSS+LFFDLPMELIRNSVDANAP++AAFVF
Subjt:  KCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVF

Query:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA
        DKDQKGV KN+S+   QK HESSHHVRFSASSP SGPSSPASCITPRLRKAREEFNAF+EAQSSA
Subjt:  DKDQKGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSSA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 46.4e-2943.55Show/hide
Query:  DDGFKCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWAS-SALPTDTGEDDSSSNLFFDLPMELIR-------NSV
        ++ FKC A CL LPGF K K++RS  K +     K+  + +  GS +S   S+EKFECGSWAS +AL  D G       LFFD P+E+ +          
Subjt:  DDGFKCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWAS-SALPTDTGEDDSSSNLFFDLPMELIR-------NSV

Query:  DANAPVTAAFVFDKDQ-----KGVPKNSSTKLVQKPHESS--HHVRFSASSPS---SGPSSPASCITPRLRKAREEFNAFLEAQSS
        D   PVT+ F+FD++      + V K  ST+  ++  ESS    VRFS SS S   S P+SP +CITPRLRKAR++FN FL AQ++
Subjt:  DANAPVTAAFVFDKDQ-----KGVPKNSSTKLVQKPHESS--HHVRFSASSPS---SGPSSPASCITPRLRKAREEFNAFLEAQSS

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)1.5e-2534.25Show/hide
Query:  YSLPNSANSSPRFSLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYR
        +S+    N +P  + + +K  ++NQ         T  +PS+         E  FR         A +P  D  + L     P  K     S  ES    R
Subjt:  YSLPNSANSSPRFSLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYR

Query:  NS---AKNFEPTDDGFKCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWAS-SALPTDTGEDDSSSNLFFDLPMEL
         S    KNF   ++ FKC A CL LPGF K + VRS + E+  +   ++ S     S +S + S+EKFECGSWAS +AL  + G       L+ DLP+E+
Subjt:  NS---AKNFEPTDDGFKCGALCLFLPGFSKGKVVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWAS-SALPTDTGEDDSSSNLFFDLPMEL

Query:  IR-NSVDANAPVTAAFVFDKDQ-----KGVPKNSST---KLVQKPHESS--HHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSS
        I+    D   PV++ F FDK+      + V K SS+   + ++   E+S    VRFS ++  S P+SP +CITPRL KAR++FN FL AQ++
Subjt:  IR-NSVDANAPVTAAFVFDKDQ-----KGVPKNSST---KLVQKPHESS--HHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQSS

AT4G20190.1 unknown protein5.0e-5041.08Show/hide
Query:  EKQISVDPISLRESSAREDMIIVPVISAP-DVADIHLPPPLPPTQFKFLSYSLPNSANSSPRFSLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHE
        E++ISVDP SL   +   DMI    +S P D+ D+ L   +   + KF+S SLPNSA +SPR S       I N +         +    V D+ +    
Subjt:  EKQISVDPISLRESSAREDMIIVPVISAP-DVADIHLPPPLPPTQFKFLSYSLPNSANSSPRFSLMKKKGKIENQESQLKVSNSTKIKPSVQDIQIALHE

Query:  EAQFRRSKSCGEGRASAPADDLDLWLNK------------------TKFPETKSYGD---FSRTESNKDYRN-----SAKNFEPTDDGFKCGALCLFLPG
           FRRSKSCGEGRA  P+ D D+ L+K                  +K    KS G+   FS+TESNK  R+     ++K+    +DGFKC ALCL+LPG
Subjt:  EAQFRRSKSCGEGRASAPADDLDLWLNK------------------TKFPETKSYGD---FSRTESNKDYRN-----SAKNFEPTDDGFKCGALCLFLPG

Query:  FSKGKVVRSIRKEEEPEIGKVRMSRTEI---------GSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSV---DANAPVTAAFV
        FSKGK VRS RK +        M+ ++           +V+S   S+E+FECGSW SSA+  D   D      FFDLP ELI+      D + PV+AAFV
Subjt:  FSKGKVVRSIRKEEEPEIGKVRMSRTEI---------GSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSV---DANAPVTAAFV

Query:  FDKDQ------KGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQS
        FDK+       KGV K S +K  ++  ES  HVRFS SSP S P+SP   ITPRL +A E+F++FLEAQ+
Subjt:  FDKDQ------KGVPKNSSTKLVQKPHESSHHVRFSASSPSSGPSSPASCITPRLRKAREEFNAFLEAQS

AT5G44660.1 unknown protein4.3e-3334.94Show/hide
Query:  VEKEKQISVDPISLRESSARE--------DMIIVPVISAPDVADIHLPPPLPPTQ------------------------------------FKFLSY-SL
        ++ E++IS+DP S+R  S           DM+ +P +S P   D  +P PL P Q                                    F+     SL
Subjt:  VEKEKQISVDPISLRESSARE--------DMIIVPVISAPDVADIHLPPPLPPTQ------------------------------------FKFLSY-SL

Query:  PNSANSSP--RFSLMKKKGKIENQESQLKVSNSTKIKPSVQD-IQIALHEE------AQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYG----DF
        PNS   SP  R  LM+    + N+E Q  + NST   P  +  +  AL  +      A ++RSKSCG               + +K    KS G     F
Subjt:  PNSANSSP--RFSLMKKKGKIENQESQLKVSNSTKIKPSVQD-IQIALHEE------AQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYG----DF

Query:  SRTESNKDYRNSAKNFEPTDDGFKCGALCLFLPGFSKGKVVRSIRKEEEPEIGK-----------VRMSR-------TEIGSVISRTVSMEKFECGSWAS
         +T+SNK    S  N    +D FKC ALCLFLPGFSKGK +RS +K++     +           + +SR       T   +VIS   SMEKF+CGS+ S
Subjt:  SRTESNKDYRNSAKNFEPTDDGFKCGALCLFLPGFSKGKVVRSIRKEEEPEIGK-----------VRMSR-------TEIGSVISRTVSMEKFECGSWAS

Query:  SALPTDTGEDDSSSNLFFDLPMELIRNSV---DANAPVTAAFVFDKDQ-----KGVPKNSST---KLVQKPHESSHHVRFSASSPSSGPSSPASCITPRL
         +   + G      N FFDLP ELI++     D + PV+AAFVFDK+      KGV K S +   K ++ P  S   VRFS SSP S P+SPA  I+PRL
Subjt:  SALPTDTGEDDSSSNLFFDLPMELIRNSV---DANAPVTAAFVFDKDQ-----KGVPKNSST---KLVQKPHESSHHVRFSASSPSSGPSSPASCITPRL

Query:  RKAREEFNAFLEAQS
         +A + FNAFLEAQ+
Subjt:  RKAREEFNAFLEAQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCGGCACTGGATAACTGTAATAATGAGAGGTTGATGTCTGGGAAGTTCGAGTTCGTAGGTTCAACATATTTAGCTGATAATGGCGAATGTGTGGAAAAGGAAAA
GCAGATATCGGTGGATCCGATATCGTTGAGAGAATCATCAGCGAGAGAGGACATGATCATCGTTCCGGTCATTTCAGCTCCTGACGTCGCTGATATTCATCTGCCCCCAC
CGCTACCTCCGACGCAGTTCAAGTTCTTAAGCTACAGCCTACCTAACTCCGCCAATTCATCCCCCCGATTCAGTTTAATGAAAAAGAAAGGAAAAATTGAAAATCAAGAA
TCGCAGCTTAAAGTCTCCAATTCGACGAAGATCAAGCCGTCGGTGCAGGATATACAGATTGCTCTGCATGAGGAAGCTCAATTTCGAAGAAGCAAGTCGTGTGGCGAGGG
CAGAGCCAGTGCTCCGGCCGATGATTTGGACCTGTGGTTGAACAAAACAAAATTTCCAGAAACAAAGAGTTACGGCGATTTCTCCAGGACTGAATCGAACAAAGATTATC
GTAACAGTGCAAAGAATTTTGAGCCTACAGATGATGGATTTAAATGTGGAGCCCTCTGTTTGTTCTTACCAGGCTTCAGCAAAGGGAAGGTCGTTAGGTCAATCAGAAAG
GAAGAAGAACCAGAGATTGGAAAAGTGAGGATGTCGAGGACCGAGATTGGAAGTGTGATATCGAGGACAGTTTCCATGGAGAAATTCGAATGTGGATCATGGGCTTCATC
AGCCTTGCCAACTGATACTGGCGAAGATGATTCCAGTAGTAACCTGTTCTTTGATCTGCCAATGGAGTTAATAAGAAATAGTGTGGATGCAAATGCACCAGTCACTGCAG
CTTTTGTGTTCGATAAAGATCAGAAGGGAGTTCCAAAGAACAGCTCGACGAAATTAGTCCAAAAACCGCACGAATCATCTCATCATGTTCGATTTTCGGCATCATCTCCT
TCATCAGGGCCATCCTCGCCAGCTTCTTGCATCACACCTAGACTACGCAAGGCAAGAGAGGAGTTCAATGCCTTCCTAGAAGCCCAGAGCAGTGCTTAA
mRNA sequenceShow/hide mRNA sequence
TTCAAATTCTTCATTTCATTCACCGATTTCTTCGGCCATGGCGCCGGCACTGGATAACTGTAATAATGAGAGGTTGATGTCTGGGAAGTTCGAGTTCGTAGGTTCAACAT
ATTTAGCTGATAATGGCGAATGTGTGGAAAAGGAAAAGCAGATATCGGTGGATCCGATATCGTTGAGAGAATCATCAGCGAGAGAGGACATGATCATCGTTCCGGTCATT
TCAGCTCCTGACGTCGCTGATATTCATCTGCCCCCACCGCTACCTCCGACGCAGTTCAAGTTCTTAAGCTACAGCCTACCTAACTCCGCCAATTCATCCCCCCGATTCAG
TTTAATGAAAAAGAAAGGAAAAATTGAAAATCAAGAATCGCAGCTTAAAGTCTCCAATTCGACGAAGATCAAGCCGTCGGTGCAGGATATACAGATTGCTCTGCATGAGG
AAGCTCAATTTCGAAGAAGCAAGTCGTGTGGCGAGGGCAGAGCCAGTGCTCCGGCCGATGATTTGGACCTGTGGTTGAACAAAACAAAATTTCCAGAAACAAAGAGTTAC
GGCGATTTCTCCAGGACTGAATCGAACAAAGATTATCGTAACAGTGCAAAGAATTTTGAGCCTACAGATGATGGATTTAAATGTGGAGCCCTCTGTTTGTTCTTACCAGG
CTTCAGCAAAGGGAAGGTCGTTAGGTCAATCAGAAAGGAAGAAGAACCAGAGATTGGAAAAGTGAGGATGTCGAGGACCGAGATTGGAAGTGTGATATCGAGGACAGTTT
CCATGGAGAAATTCGAATGTGGATCATGGGCTTCATCAGCCTTGCCAACTGATACTGGCGAAGATGATTCCAGTAGTAACCTGTTCTTTGATCTGCCAATGGAGTTAATA
AGAAATAGTGTGGATGCAAATGCACCAGTCACTGCAGCTTTTGTGTTCGATAAAGATCAGAAGGGAGTTCCAAAGAACAGCTCGACGAAATTAGTCCAAAAACCGCACGA
ATCATCTCATCATGTTCGATTTTCGGCATCATCTCCTTCATCAGGGCCATCCTCGCCAGCTTCTTGCATCACACCTAGACTACGCAAGGCAAGAGAGGAGTTCAATGCCT
TCCTAGAAGCCCAGAGCAGTGCTTAA
Protein sequenceShow/hide protein sequence
MAPALDNCNNERLMSGKFEFVGSTYLADNGECVEKEKQISVDPISLRESSAREDMIIVPVISAPDVADIHLPPPLPPTQFKFLSYSLPNSANSSPRFSLMKKKGKIENQE
SQLKVSNSTKIKPSVQDIQIALHEEAQFRRSKSCGEGRASAPADDLDLWLNKTKFPETKSYGDFSRTESNKDYRNSAKNFEPTDDGFKCGALCLFLPGFSKGKVVRSIRK
EEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSALPTDTGEDDSSSNLFFDLPMELIRNSVDANAPVTAAFVFDKDQKGVPKNSSTKLVQKPHESSHHVRFSASSP
SSGPSSPASCITPRLRKAREEFNAFLEAQSSA