; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014715 (gene) of Snake gourd v1 genome

Gene IDTan0014715
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionChlorophyll A-B binding protein
Genome locationLG08:10194232..10201875
RNA-Seq ExpressionTan0014715
SyntenyTan0014715
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR023329 - Chlorophyll a/b binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451793.1 PREDICTED: uncharacterized protein LOC103492970 isoform X1 [Cucumis melo]3.1e-5776.36Show/hide
Query:  ASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAK
        AS +LIL I GGNLP SQYLSFRHSHPSATF                  SR GWSRD+D GRSTHRTRGQAFRI    NVSP KDGLIK+VIMVDPLEAK
Subjt:  ASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAK

Query:  RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADY + ++NFF++
Subjt:  RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

XP_022937783.1 uncharacterized protein LOC111444076 isoform X1 [Cucurbita moschata]9.7e-5977.11Show/hide
Query:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA
        MASTALIL I GGN      LSFRH+HPSATF                  SRWGW+RD+D G+STHRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEA
Subjt:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        KR+AAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA YLAAVVNFFV+
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

XP_022969695.1 uncharacterized protein LOC111468645 isoform X1 [Cucurbita maxima]3.6e-6179.52Show/hide
Query:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA
        MASTALIL I GGN  SSQ LSFRH+H SATF                  SRWGWSRDRD G STHRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEA
Subjt:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        KR+AAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA YLAAVVNFFV+
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

XP_023538319.1 uncharacterized protein LOC111799137 isoform X1 [Cucurbita pepo subsp. pepo]5.7e-5977.11Show/hide
Query:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA
        MASTALIL I GGN      LSFRH+HPSATF                  SRWGW+RD+D G+STHRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEA
Subjt:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        KR+AAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA YLAAVVNFFV+
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

XP_038889814.1 uncharacterized protein LOC120079625 [Benincasa hispida]4.5e-5675.15Show/hide
Query:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA
        MAST+LIL IKGGNLP SQYLSFRH+HPSATF                  SR GWSRD+D GRSTHRTRGQAF+I    NVSPGKD LIK+VIMVDPLEA
Subjt:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFV
        KR+AAKEMEKIKAKEKFKR+RQIEAINGAWAMIGLTAGLVIEGQTGKGILAQL  Y + V+NFF+
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFV

TrEMBL top hitse value%identityAlignment
A0A0A0LH95 Uncharacterized protein7.8e-5475.15Show/hide
Query:  ASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAK
        ASTALIL I GGNLP SQYLSFRH+ PSATF                  SR GWS  RDAGRST RTRGQAFRI    NVSPG+DGLIK+VIMVDPLEAK
Subjt:  ASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAK

Query:  RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEG+TGKGILAQLADY + ++NFF++
Subjt:  RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

A0A1S3BSE0 uncharacterized protein LOC103492970 isoform X11.5e-5776.36Show/hide
Query:  ASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAK
        AS +LIL I GGNLP SQYLSFRHSHPSATF                  SR GWSRD+D GRSTHRTRGQAFRI    NVSP KDGLIK+VIMVDPLEAK
Subjt:  ASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAK

Query:  RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADY + ++NFF++
Subjt:  RMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

A0A6J1FC77 uncharacterized protein LOC111444076 isoform X14.7e-5977.11Show/hide
Query:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA
        MASTALIL I GGN      LSFRH+HPSATF                  SRWGW+RD+D G+STHRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEA
Subjt:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        KR+AAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA YLAAVVNFFV+
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

A0A6J1HYI7 uncharacterized protein LOC111468645 isoform X21.6e-5483.45Show/hide
Query:  PSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAIN
        P+A  S+I +LL       V FPSRWGWSRDRD G STHRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEAKR+AAKEMEKIKAKEKFKRRRQIEAIN
Subjt:  PSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAIN

Query:  GAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        GAWAMIGLTAGL++EGQTGKGILAQLA YLAAVVNFFV+
Subjt:  GAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

A0A6J1I0N1 uncharacterized protein LOC111468645 isoform X11.7e-6179.52Show/hide
Query:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA
        MASTALIL I GGN  SSQ LSFRH+H SATF                  SRWGWSRDRD G STHRTRGQAFRILANPNVSPGKD LIK+VIMVDPLEA
Subjt:  MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEA

Query:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ
        KR+AAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGL++EGQTGKGILAQLA YLAAVVNFFV+
Subjt:  KRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28025.1 unknown protein1.2e-3064.35Show/hide
Query:  GWSRDRDAGRSTHRTRGQAFRILANPNVS----PGKDGLIKEVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGI
        G  R +DA    +R R    R+LANPNVS    PGK  + KEVIMVDPLEAKR+A+K+ME+IK +EK +RRR+IEAINGAWA+IGL  GLVIE QTGKGI
Subjt:  GWSRDRDAGRSTHRTRGQAFRILANPNVS----PGKDGLIKEVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGI

Query:  LAQLADYLAAVVNFF
        LAQLA Y +AVV+ F
Subjt:  LAQLADYLAAVVNFF

AT4G28025.2 unknown protein.7.0e-3164.1Show/hide
Query:  RWGWSRDRDAGRSTHRTRGQAFRILANPNVS----PGKDGLIKEVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGK
        R G  R +DA    +R R    R+LANPNVS    PGK  + KEVIMVDPLEAKR+A+K+ME+IK +EK +RRR+IEAINGAWA+IGL  GLVIE QTGK
Subjt:  RWGWSRDRDAGRSTHRTRGQAFRILANPNVS----PGKDGLIKEVIMVDPLEAKRMAAKEMEKIKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGK

Query:  GILAQLADYLAAVVNFF
        GILAQLA Y +AVV+ F
Subjt:  GILAQLADYLAAVVNFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTGCGCTGATTCTCTCCATCAAAGGAGGAAATCTCCCGTCTTCGCAATACCTCTCTTTCCGCCATAGCCATCCTTCTGCAACTTTCTCCAGTATTTCGTC
CCTGCTAATGGCTGCATCATTCTTGATAGTTCCTTTCCCTTCCAGGTGGGGTTGGAGTAGGGATCGAGATGCAGGCAGAAGTACGCACAGAACGAGGGGTCAAGCGTTTC
GAATCTTGGCTAACCCTAATGTCTCTCCTGGAAAAGATGGCTTAATTAAGGAGGTGATTATGGTTGATCCTCTGGAAGCCAAACGTATGGCTGCGAAAGAAATGGAAAAA
ATCAAAGCAAAAGAGAAGTTCAAGAGACGACGTCAAATAGAAGCGATAAATGGAGCATGGGCAATGATTGGTCTCACGGCAGGGCTCGTCATCGAAGGTCAAACTGGAAA
AGGCATTCTAGCACAGTTGGCGGACTACTTAGCCGCGGTCGTGAACTTCTTTGTACAGTAG
mRNA sequenceShow/hide mRNA sequence
CTTCCGCCAAACAGCCCCTTTTGAGTCAAATATGTCCGAATGGATCAAAACAAGCTCTGTTTCAGCGATGAGAGCCATTACAAAAGTGTCATTCTCTGGAAGATAAGTGT
GGATGTGGGCAACTGAAATCTATAAATTAAAACCATTTATTTCCATATAAAATGATTTACTCAACAAAATGGATAAGGGTCGTTGTGACGAAGCAGTAGATTTTGTAGAT
GGCTTCCACTGCGCTGATTCTCTCCATCAAAGGAGGAAATCTCCCGTCTTCGCAATACCTCTCTTTCCGCCATAGCCATCCTTCTGCAACTTTCTCCAGTATTTCGTCCC
TGCTAATGGCTGCATCATTCTTGATAGTTCCTTTCCCTTCCAGGTGGGGTTGGAGTAGGGATCGAGATGCAGGCAGAAGTACGCACAGAACGAGGGGTCAAGCGTTTCGA
ATCTTGGCTAACCCTAATGTCTCTCCTGGAAAAGATGGCTTAATTAAGGAGGTGATTATGGTTGATCCTCTGGAAGCCAAACGTATGGCTGCGAAAGAAATGGAAAAAAT
CAAAGCAAAAGAGAAGTTCAAGAGACGACGTCAAATAGAAGCGATAAATGGAGCATGGGCAATGATTGGTCTCACGGCAGGGCTCGTCATCGAAGGTCAAACTGGAAAAG
GCATTCTAGCACAGTTGGCGGACTACTTAGCCGCGGTCGTGAACTTCTTTGTACAGTAGACATGTTGAATGGCAAAAGGAGAAGTCTTTGAATGAATGAATTATTCTCTT
CTCGACCATTGAAAGTTGAGTATGTTAAATATTTTCGAGCCAAGTCTTATTGCTTGTTCTTCGTTTTATATATTAAAAAAAAAAGCTGGTGTTGAAGATTCTGTACTCTG
ATATTCTTTTCAAGTCTGATTTCTTCACATTTCACAATACAAAAAATTCACATTTAATTGTGAAAAGCAGTGTATTCATATGGTTTTGTTTTGGAAGATTTTATTACCAA
TGCAATCAATTTCTTGCC
Protein sequenceShow/hide protein sequence
MASTALILSIKGGNLPSSQYLSFRHSHPSATFSSISSLLMAASFLIVPFPSRWGWSRDRDAGRSTHRTRGQAFRILANPNVSPGKDGLIKEVIMVDPLEAKRMAAKEMEK
IKAKEKFKRRRQIEAINGAWAMIGLTAGLVIEGQTGKGILAQLADYLAAVVNFFVQ