; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006457 (gene) of Snake gourd v1 genome

Gene IDTan0006457
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein BUNDLE SHEATH DEFECTIVE 2, chloroplastic
Genome locationLG01:101428875..101429790
RNA-Seq ExpressionTan0006457
SyntenyTan0006457
Gene Ontology termsGO:0061077 - chaperone-mediated protein folding (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016020 - membrane (cellular component)
GO:0101031 - chaperone complex (cellular component)
GO:0044183 - protein folding chaperone (molecular function)
InterPro domainsIPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038807.1 chaperone protein DnaJ-like [Cucumis melo var. makuwa]1.2e-5383.21Show/hide
Query:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA
        MASS F  SA+CF+S TT+ AI  CSNQKLNLI N F  YS  ARFPHL  KAA NDRNTKPNSVICGDCDGNGAVLCSQCKG+GVNAVDFFNGQFKAG 
Subjt:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA

Query:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        SCWLCGG+KEMLCGNCNGAGFVGGFLSTYDQ
Subjt:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

XP_004136477.1 protein BUNDLE SHEATH DEFECTIVE 2, chloroplastic isoform X2 [Cucumis sativus]6.2e-5381.06Show/hide
Query:  MASSFFPASASCFNSTTV--PAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAG
        MASS F  SA+CF+STT     I  CSNQKLNLI NGF  YS  ARFPHL  KAA NDRNTKPNSVICGDCDGNGAV+CSQCKG GVNAVDFFNGQFKAG
Subjt:  MASSFFPASASCFNSTTV--PAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAG

Query:  ASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
         SCWLCGG+KEMLCGNCNGAGF+GGFLSTYDQ
Subjt:  ASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

XP_008466412.1 PREDICTED: chaperone protein DnaJ-like [Cucumis melo]3.6e-5382.44Show/hide
Query:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA
        MASS F  SA+CF+S TT+ A+  CSNQKLNLI NGF  YS  ARFPHL  KAA NDRNTKPNSVICGDCDGNGAVLCSQCKG+GVNAVDFFNGQFKAG 
Subjt:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA

Query:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        SCWLCGG+ EMLCGNCNGAGFVGGFLSTYDQ
Subjt:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

XP_022148277.1 uncharacterized protein LOC111016947 [Momordica charantia]3.4e-5986.05Show/hide
Query:  MASSFFPASASCFNSTTVPAIGGCSNQKLNLISNGFRYSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASC
        MA S F ASA CFNSTTV AIGGCSN KLNLI NG  YSPAARFPHLN KAA NDRNTKPNS+ICGDCDGNGAVLCSQCKGSGVN  D FNGQFKAG SC
Subjt:  MASSFFPASASCFNSTTVPAIGGCSNQKLNLISNGFRYSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASC

Query:  WLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        WLCGGKK+MLCGNCNGAGF+GGFLSTYDQ
Subjt:  WLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

XP_038899028.1 protein BUNDLE SHEATH DEFECTIVE 2, chloroplastic-like [Benincasa hispida]3.5e-5684.73Show/hide
Query:  MASSFFPASASCF-NSTTVPAIGGCSNQKLNLISNGF-RYSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA
        MASS F ASA+CF +STT+ AI G SNQKLNL++NGF  YS  ARFPHLN KAA NDRNTKPNSVICGDCDGNGAVLCSQCKG+GVNAVDFFNGQFKAGA
Subjt:  MASSFFPASASCF-NSTTVPAIGGCSNQKLNLISNGF-RYSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA

Query:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        SCWLCGG+KEMLCGNCNGAGF+GGFLSTYDQ
Subjt:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

TrEMBL top hitse value%identityAlignment
A0A0A0LDP8 Uncharacterized protein3.0e-5381.06Show/hide
Query:  MASSFFPASASCFNSTTV--PAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAG
        MASS F  SA+CF+STT     I  CSNQKLNLI NGF  YS  ARFPHL  KAA NDRNTKPNSVICGDCDGNGAV+CSQCKG GVNAVDFFNGQFKAG
Subjt:  MASSFFPASASCFNSTTV--PAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAG

Query:  ASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
         SCWLCGG+KEMLCGNCNGAGF+GGFLSTYDQ
Subjt:  ASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

A0A1S3CSH4 chaperone protein DnaJ-like1.8e-5382.44Show/hide
Query:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA
        MASS F  SA+CF+S TT+ A+  CSNQKLNLI NGF  YS  ARFPHL  KAA NDRNTKPNSVICGDCDGNGAVLCSQCKG+GVNAVDFFNGQFKAG 
Subjt:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA

Query:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        SCWLCGG+ EMLCGNCNGAGFVGGFLSTYDQ
Subjt:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

A0A5A7TB44 Chaperone protein DnaJ-like6.0e-5483.21Show/hide
Query:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA
        MASS F  SA+CF+S TT+ AI  CSNQKLNLI N F  YS  ARFPHL  KAA NDRNTKPNSVICGDCDGNGAVLCSQCKG+GVNAVDFFNGQFKAG 
Subjt:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA

Query:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        SCWLCGG+KEMLCGNCNGAGFVGGFLSTYDQ
Subjt:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

A0A5D3E725 Chaperone protein DnaJ-like1.8e-5382.44Show/hide
Query:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA
        MASS F  SA+CF+S TT+ A+  CSNQKLNLI NGF  YS  ARFPHL  KAA NDRNTKPNSVICGDCDGNGAVLCSQCKG+GVNAVDFFNGQFKAG 
Subjt:  MASSFFPASASCFNS-TTVPAIGGCSNQKLNLISNGFR-YSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGA

Query:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        SCWLCGG+ EMLCGNCNGAGFVGGFLSTYDQ
Subjt:  SCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

A0A6J1D4M9 uncharacterized protein LOC1110169471.6e-5986.05Show/hide
Query:  MASSFFPASASCFNSTTVPAIGGCSNQKLNLISNGFRYSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASC
        MA S F ASA CFNSTTV AIGGCSN KLNLI NG  YSPAARFPHLN KAA NDRNTKPNS+ICGDCDGNGAVLCSQCKGSGVN  D FNGQFKAG SC
Subjt:  MASSFFPASASCFNSTTVPAIGGCSNQKLNLISNGFRYSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASC

Query:  WLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        WLCGGKK+MLCGNCNGAGF+GGFLSTYDQ
Subjt:  WLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

SwissProt top hitse value%identityAlignment
Q9SN73 Protein BUNDLE SHEATH DEFECTIVE 2, chloroplastic6.9e-3171.6Show/hide
Query:  KAANND-RNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        KAANN+ + TKPNS++C +C+G G V CSQCKG GVN +D FNGQFKAGA CWLC GKKE+LCG+CNGAGF+GGFLST+D+
Subjt:  KAANND-RNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

Q9XF14 Protein BUNDLE SHEATH DEFECTIVE 2, chloroplastic3.2e-2858.62Show/hide
Query:  LNSKAANNDRNTKPN----SVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        + +KA  ND++ K +    S++C DC+GNGA++C++C+G+GVN+VD+F G+FKAG+ CWLC GK+E+LCGNCNGAGF+GGFLST+D+
Subjt:  LNSKAANNDRNTKPN----SVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

Arabidopsis top hitse value%identityAlignment
AT3G47650.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein4.9e-3271.6Show/hide
Query:  KAANND-RNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ
        KAANN+ + TKPNS++C +C+G G V CSQCKG GVN +D FNGQFKAGA CWLC GKKE+LCG+CNGAGF+GGFLST+D+
Subjt:  KAANND-RNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASCWLCGGKKEMLCGNCNGAGFVGGFLSTYDQ

AT5G17840.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein3.3e-0436.21Show/hide
Query:  CGDCDGNGAVLCSQCKGSGVNAVDFFNGQ-FKAGASCWLCGGKKEMLCGNCNGAGFVG
        C  C+  GA+LCS C G+G+        Q       C  CGG   ++C  C G G VG
Subjt:  CGDCDGNGAVLCSQCKGSGVNAVDFFNGQ-FKAGASCWLCGGKKEMLCGNCNGAGFVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTCATTTTTCCCGGCATCGGCAAGTTGCTTCAATTCCACTACAGTTCCAGCAATTGGAGGTTGTAGCAATCAGAAGCTCAACCTGATTAGCAATGGCTTCCG
TTATTCTCCAGCTGCTCGATTCCCTCATCTAAATAGCAAGGCTGCGAATAATGATCGGAACACAAAACCTAATAGCGTGATTTGTGGCGATTGTGATGGAAATGGTGCTG
TTCTTTGCTCGCAATGCAAAGGAAGTGGAGTTAATGCTGTTGATTTCTTCAATGGACAGTTCAAAGCCGGAGCATCTTGTTGGCTGTGCGGGGGGAAAAAGGAAATGCTG
TGTGGGAATTGCAATGGGGCTGGCTTCGTTGGAGGTTTTCTCAGCACTTATGATCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTTCATTTTTCCCGGCATCGGCAAGTTGCTTCAATTCCACTACAGTTCCAGCAATTGGAGGTTGTAGCAATCAGAAGCTCAACCTGATTAGCAATGGCTTCCG
TTATTCTCCAGCTGCTCGATTCCCTCATCTAAATAGCAAGGCTGCGAATAATGATCGGAACACAAAACCTAATAGCGTGATTTGTGGCGATTGTGATGGAAATGGTGCTG
TTCTTTGCTCGCAATGCAAAGGAAGTGGAGTTAATGCTGTTGATTTCTTCAATGGACAGTTCAAAGCCGGAGCATCTTGTTGGCTGTGCGGGGGGAAAAAGGAAATGCTG
TGTGGGAATTGCAATGGGGCTGGCTTCGTTGGAGGTTTTCTCAGCACTTATGATCAATAG
Protein sequenceShow/hide protein sequence
MASSFFPASASCFNSTTVPAIGGCSNQKLNLISNGFRYSPAARFPHLNSKAANNDRNTKPNSVICGDCDGNGAVLCSQCKGSGVNAVDFFNGQFKAGASCWLCGGKKEML
CGNCNGAGFVGGFLSTYDQ