; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024663 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024663
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBEST Arabidopsis thaliana protein match is: root hair specific 4 .
Genome locationchr10:4749942..4751030
RNA-Seq ExpressionLag0024663
SyntenyLag0024663
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022924159.1 uncharacterized protein LOC111431688 isoform X1 [Cucurbita moschata]1.3e-17286.5Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKL FVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPD  DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K SVQDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSA+PN+ GED+SSS+LF+DLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHE SHHVRFSASSP GPSSPA+CITPRLRKAREEFNAFLEAQS+A
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

XP_022924160.1 uncharacterized protein LOC111431688 isoform X2 [Cucurbita moschata]1.3e-17286.5Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKL FVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPD  DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K SVQDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSA+PN+ GED+SSS+LF+DLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHE SHHVRFSASSP GPSSPA+CITPRLRKAREEFNAFLEAQS+A
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

XP_023000753.1 uncharacterized protein LOC111495110 isoform X1 [Cucurbita maxima]1.8e-17487.33Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKLEFVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPDV DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K S+QDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGK+RISRTEIGSVISRTVSMEKFECGSWASSA+PND GED+SSS+LFFDLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHESSHHVRFSASSP GPSSPA+CITPRLRKAREEFNAF+EAQSSA
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

XP_023000754.1 uncharacterized protein LOC111495110 isoform X2 [Cucurbita maxima]1.8e-17487.33Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKLEFVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPDV DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K S+QDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGK+RISRTEIGSVISRTVSMEKFECGSWASSA+PND GED+SSS+LFFDLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHESSHHVRFSASSP GPSSPA+CITPRLRKAREEFNAF+EAQSSA
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

XP_023520328.1 uncharacterized protein LOC111783644 [Cucurbita pepo subsp. pepo]4.4e-17386.78Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKLEFVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPDV DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K SVQDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SY +F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSA+PN+ GED+SSS+LF+DLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHE SHHVRFSASSP GPSSPA+CITPRLRKAREEFNAFLEAQSSA
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

TrEMBL top hitse value%identityAlignment
A0A0A0LWT5 Uncharacterized protein1.3e-15177.99Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTS NFN  R +SGKLEF+ STY  DN EC EKEKQISVDPISLRESSARED++VD +TAPDVADLHLPPPLPPTQFKFLSYSLPNS NSSP+ G
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGN-FSRTESNKDYRNSPKNLESTDDGF
        L KKKGK ENQ SLLKVSNSTK+  SV DIQ   QE+ QFRRSKSCGEGRASAPADDLDLWLNKAK PETKSY + FS+TESN       K LE+ DDGF
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGN-FSRTESNKDYRNSPKNLESTDDGF

Query:  KCGALCLFLPGFSKGKAIRSIRKEEE-PEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFI
         CGALCLFLPGF KGK+++SIRKEEE  E+ KVRIS+TEIGSVISRTVS+EKFECGSWASS LPN+ GEDE+ ++LF+DLP+EL+R+SVDANAPV+AAF+
Subjt:  KCGALCLFLPGFSKGKAIRSIRKEEE-PEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFI

Query:  FDKDQKGVTK-NGSTKIVQKSHES-SHHVRFSASSP--GPSSPAACITPRLRKAREEFNAFLEAQSSA
        FDKD KGV K N STK+VQKSHES SH  RFSASSP  GPSSPA+CITP+LRKAREEFNAFLEAQSSA
Subjt:  FDKDQKGVTK-NGSTKIVQKSHES-SHHVRFSASSP--GPSSPAACITPRLRKAREEFNAFLEAQSSA

A0A6J1E8C7 uncharacterized protein LOC111431688 isoform X16.2e-17386.5Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKL FVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPD  DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K SVQDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSA+PN+ GED+SSS+LF+DLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHE SHHVRFSASSP GPSSPA+CITPRLRKAREEFNAFLEAQS+A
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

A0A6J1EE09 uncharacterized protein LOC111431688 isoform X26.2e-17386.5Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKL FVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPD  DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K SVQDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSA+PN+ GED+SSS+LF+DLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHE SHHVRFSASSP GPSSPA+CITPRLRKAREEFNAFLEAQS+A
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

A0A6J1KKV0 uncharacterized protein LOC111495110 isoform X28.7e-17587.33Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKLEFVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPDV DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K S+QDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGK+RISRTEIGSVISRTVSMEKFECGSWASSA+PND GED+SSS+LFFDLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHESSHHVRFSASSP GPSSPA+CITPRLRKAREEFNAF+EAQSSA
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

A0A6J1KNI3 uncharacterized protein LOC111495110 isoform X18.7e-17587.33Show/hide
Query:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
        MAPALPTSDNFN ERLMSGKLEFVGSTY +D+ ECVEKEKQISVDPISLR+SSARED+I D +TAPDV DLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG
Subjt:  MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFG

Query:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK
          KKKGKLENQES LK+SNSTK+K S+QDIQ+ALQE+TQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYG+F RTESNKDYRN  +NLE TDDGFK
Subjt:  LTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFK

Query:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD
        CGALCLFLPGF K KA+RSIRKEEEPEIGK+RISRTEIGSVISRTVSMEKFECGSWASSA+PND GED+SSS+LFFDLPMELIRNSVDANAP+SAAF+FD
Subjt:  CGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFD

Query:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA
        KDQKGVTKN S+   QKSHESSHHVRFSASSP GPSSPA+CITPRLRKAREEFNAF+EAQSSA
Subjt:  KDQKGVTKNGSTKIVQKSHESSHHVRFSASSP-GPSSPAACITPRLRKAREEFNAFLEAQSSA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 41.9e-2843.01Show/hide
Query:  DDGFKCGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWAS-SALPNDAGEDESSSNLFFDLPMELIR-------NSV
        ++ FKC A CL LPGF K K IRS  K +     K+  + +  GS +S   S+EKFECGSWAS +AL  D G       LFFD P+E+ +          
Subjt:  DDGFKCGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWAS-SALPNDAGEDESSSNLFFDLPMELIR-------NSV

Query:  DANAPVSAAFIFDKDQ-----KGVTKNGSTKIVQKSHESS--HHVRFSASSPG-----PSSPAACITPRLRKAREEFNAFLEAQSS
        D   PV++ F+FD++      + V K  ST+  ++S ESS    VRFS SS       P+SP  CITPRLRKAR++FN FL AQ++
Subjt:  DANAPVSAAFIFDKDQ-----KGVTKNGSTKIVQKSHESS--HHVRFSASSPG-----PSSPAACITPRLRKAREEFNAFLEAQSS

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)1.1e-2333.9Show/hide
Query:  YSLPNSVNSSPRFGLTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYR
        +S+    N +P      +K  L+NQ         T  +PS+         E  FR         A +P  D  + L     P  K     S  ES    R
Subjt:  YSLPNSVNSSPRFGLTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYR

Query:  NS---PKNLESTDDGFKCGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWAS-SALPNDAGEDESSSNLFFDLPMEL
         S    KN    ++ FKC A CL LPGF K + +RS + E+  +   ++ S     S +S + S+EKFECGSWAS +AL  + G       L+ DLP+E+
Subjt:  NS---PKNLESTDDGFKCGALCLFLPGFSKGKAIRSIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWAS-SALPNDAGEDESSSNLFFDLPMEL

Query:  IR-NSVDANAPVSAAFIFDKDQ-----KGVTKNGST---KIVQKSHESS--HHVRFS--ASSPGPSSPAACITPRLRKAREEFNAFLEAQSS
        I+    D   PVS+ F FDK+      + V K  S+   + ++   E+S    VRFS   S   P+SP  CITPRL KAR++FN FL AQ++
Subjt:  IR-NSVDANAPVSAAFIFDKDQ-----KGVTKNGST---KIVQKSHESS--HHVRFS--ASSPGPSSPAACITPRLRKAREEFNAFLEAQSS

AT4G20190.1 unknown protein3.9e-5040.49Show/hide
Query:  EKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGLTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEET
        E++ISVDP SL   +   D+IV      D+ DL L   +   + KF+S SLPNS  +SPR         + N +         +    V D+ +     T
Subjt:  EKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGLTKKKGKLENQESLLKVSNSTKIKPSVQDIQIALQEET

Query:  QFRRSKSCGEGRASAPADDLDLWLNK------------------AKFPETKSYGN---FSRTESNKDYRN-----SPKNLESTDDGFKCGALCLFLPGFS
         FRRSKSCGEGRA  P+ D D+ L+K                  +K    KS GN   FS+TESNK  R+     + K++ S +DGFKC ALCL+LPGFS
Subjt:  QFRRSKSCGEGRASAPADDLDLWLNK------------------AKFPETKSYGN---FSRTESNKDYRN-----SPKNLESTDDGFKCGALCLFLPGFS

Query:  KGKAIRSIRKEEE---------PEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSV---DANAPVSAAFIFD
        KGK +RS RK +                R +     +V+S   S+E+FECGSW SSA+  D   D      FFDLP ELI+      D + PVSAAF+FD
Subjt:  KGKAIRSIRKEEE---------PEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSV---DANAPVSAAFIFD

Query:  KDQ------KGVTKNGSTKIVQKSHESSHHVRFSASSP--GPSSPAACITPRLRKAREEFNAFLEAQS
        K+       KGV K   +K  ++S ES  HVRFS SSP   P+SP   ITPRL +A E+F++FLEAQ+
Subjt:  KDQ------KGVTKNGSTKIVQKSHESSHHVRFSASSP--GPSSPAACITPRLRKAREEFNAFLEAQS

AT5G44660.1 unknown protein1.1e-3135.11Show/hide
Query:  VEKEKQISVDPISLR----ESSAREDLIVDAVTAPDVA---DLHLPPPLP-------------------------------PTQFKFL--------SYSL
        ++ E++IS+DP S+R      S R +   D V  P ++   DL  P PLP                               P Q   L          SL
Subjt:  VEKEKQISVDPISLR----ESSAREDLIVDAVTAPDVA---DLHLPPPLP-------------------------------PTQFKFL--------SYSL

Query:  PNSVNSSP--RFGLTKKKGKLENQESLLKVSNSTKIKPSVQD-IQIALQEETQ------FRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYG----NF
        PNS   SP  R GL +     E Q+SL    NST   P  +  +  AL+ + Q      ++RSKSCG               + +K    KS G     F
Subjt:  PNSVNSSP--RFGLTKKKGKLENQESLLKVSNSTKIKPSVQD-IQIALQEETQ------FRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYG----NF

Query:  SRTESNKDYRNSPKNLESTDDGFKCGALCLFLPGFSKGKAIRSIRKEEEPEIGK-----------VRISR-------TEIGSVISRTVSMEKFECGSWAS
         +T+SNK   N+     + +D FKC ALCLFLPGFSKGK IRS +K++     +           + +SR       T   +VIS   SMEKF+CGS+ S
Subjt:  SRTESNKDYRNSPKNLESTDDGFKCGALCLFLPGFSKGKAIRSIRKEEEPEIGK-----------VRISR-------TEIGSVISRTVSMEKFECGSWAS

Query:  SALPNDAGEDESSSNLFFDLPMELIRNSV---DANAPVSAAFIFDKDQ-----KGVTK-NGSTKIVQKSHESSHHVRFSASSP--GPSSPAACITPRLRK
         +   + G      N FFDLP ELI++     D + PVSAAF+FDK+      KGV K +GS         S   VRFS SSP   P+SPA  I+PRL +
Subjt:  SALPNDAGEDESSSNLFFDLPMELIRNSV---DANAPVSAAFIFDKDQ-----KGVTK-NGSTKIVQKSHESSHHVRFSASSP--GPSSPAACITPRLRK

Query:  AREEFNAFLEAQS
        A + FNAFLEAQ+
Subjt:  AREEFNAFLEAQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACCTGCACTGCCTACGTCAGATAACTTCAACCGTGAGAGGTTGATGTCCGGCAAGCTCGAGTTCGTTGGTTCAACATATTTTGCCGATAATGGCGAATGTGTGGA
AAAGGAAAAGCAGATTTCGGTGGATCCAATATCGTTGAGAGAGTCATCGGCGAGAGAGGACCTAATCGTCGATGCTGTCACTGCTCCTGACGTCGCCGATCTTCATCTGC
CGCCGCCGCTACCTCCGACGCAGTTCAAGTTCTTAAGCTACAGCCTACCTAACTCCGTCAATTCATCCCCCCGATTCGGTTTAACGAAAAAGAAAGGGAAACTTGAAAAT
CAAGAATCGCTGCTTAAAGTCTCCAATTCGACGAAGATCAAACCGTCGGTGCAGGATATACAGATTGCTCTGCAAGAGGAAACTCAATTTCGAAGAAGCAAGTCGTGTGG
CGAAGGTAGAGCCAGTGCTCCGGCGGACGATTTGGATCTGTGGTTGAACAAAGCAAAGTTTCCAGAAACGAAGAGTTACGGCAATTTCTCCAGGACGGAATCGAACAAAG
ATTATCGTAATAGTCCAAAGAATTTAGAGTCTACAGATGACGGATTTAAATGTGGAGCTCTCTGTTTGTTCTTACCAGGCTTCAGCAAAGGGAAGGCCATTAGGTCAATC
AGAAAGGAAGAAGAACCAGAGATTGGAAAAGTGAGGATATCGAGGACCGAGATTGGAAGTGTGATATCGAGGACAGTTTCTATGGAGAAATTCGAATGTGGATCGTGGGC
TTCATCTGCTCTGCCAAATGATGCCGGCGAAGACGAATCCAGCAGTAACCTTTTCTTTGATCTGCCAATGGAGTTAATAAGAAATAGTGTGGATGCAAATGCACCAGTCA
GTGCAGCTTTCATCTTTGACAAAGATCAGAAAGGAGTTACGAAGAACGGCTCGACGAAAATAGTCCAAAAATCGCACGAATCGTCTCATCATGTTCGATTTTCGGCATCG
TCTCCAGGGCCATCTTCACCAGCTGCTTGCATCACACCTAGATTGCGCAAGGCAAGAGAGGAGTTCAATGCCTTTCTAGAAGCCCAGAGCAGTGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACCTGCACTGCCTACGTCAGATAACTTCAACCGTGAGAGGTTGATGTCCGGCAAGCTCGAGTTCGTTGGTTCAACATATTTTGCCGATAATGGCGAATGTGTGGA
AAAGGAAAAGCAGATTTCGGTGGATCCAATATCGTTGAGAGAGTCATCGGCGAGAGAGGACCTAATCGTCGATGCTGTCACTGCTCCTGACGTCGCCGATCTTCATCTGC
CGCCGCCGCTACCTCCGACGCAGTTCAAGTTCTTAAGCTACAGCCTACCTAACTCCGTCAATTCATCCCCCCGATTCGGTTTAACGAAAAAGAAAGGGAAACTTGAAAAT
CAAGAATCGCTGCTTAAAGTCTCCAATTCGACGAAGATCAAACCGTCGGTGCAGGATATACAGATTGCTCTGCAAGAGGAAACTCAATTTCGAAGAAGCAAGTCGTGTGG
CGAAGGTAGAGCCAGTGCTCCGGCGGACGATTTGGATCTGTGGTTGAACAAAGCAAAGTTTCCAGAAACGAAGAGTTACGGCAATTTCTCCAGGACGGAATCGAACAAAG
ATTATCGTAATAGTCCAAAGAATTTAGAGTCTACAGATGACGGATTTAAATGTGGAGCTCTCTGTTTGTTCTTACCAGGCTTCAGCAAAGGGAAGGCCATTAGGTCAATC
AGAAAGGAAGAAGAACCAGAGATTGGAAAAGTGAGGATATCGAGGACCGAGATTGGAAGTGTGATATCGAGGACAGTTTCTATGGAGAAATTCGAATGTGGATCGTGGGC
TTCATCTGCTCTGCCAAATGATGCCGGCGAAGACGAATCCAGCAGTAACCTTTTCTTTGATCTGCCAATGGAGTTAATAAGAAATAGTGTGGATGCAAATGCACCAGTCA
GTGCAGCTTTCATCTTTGACAAAGATCAGAAAGGAGTTACGAAGAACGGCTCGACGAAAATAGTCCAAAAATCGCACGAATCGTCTCATCATGTTCGATTTTCGGCATCG
TCTCCAGGGCCATCTTCACCAGCTGCTTGCATCACACCTAGATTGCGCAAGGCAAGAGAGGAGTTCAATGCCTTTCTAGAAGCCCAGAGCAGTGCTTAA
Protein sequenceShow/hide protein sequence
MAPALPTSDNFNRERLMSGKLEFVGSTYFADNGECVEKEKQISVDPISLRESSAREDLIVDAVTAPDVADLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGLTKKKGKLEN
QESLLKVSNSTKIKPSVQDIQIALQEETQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYGNFSRTESNKDYRNSPKNLESTDDGFKCGALCLFLPGFSKGKAIRSI
RKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSALPNDAGEDESSSNLFFDLPMELIRNSVDANAPVSAAFIFDKDQKGVTKNGSTKIVQKSHESSHHVRFSAS
SPGPSSPAACITPRLRKAREEFNAFLEAQSSA