; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003049 (gene) of Snake gourd v1 genome

Gene IDTan0003049
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1118)
Genome locationLG05:74927425..74928579
RNA-Seq ExpressionTan0003049
SyntenyTan0003049
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010196 - nonphotochemical quenching (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0090391 - granum assembly (biological process)
GO:0009515 - granal stacked thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009500 - Protein of unknown function DUF1118


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451074.1 PREDICTED: uncharacterized protein LOC103492437 [Cucumis melo]9.0e-8388.12Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        MEV+SFCSSS+R  S +YSYPS T S ++      PLH+H+MAAEK PPSA+KTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQ AVALVS+VGGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

XP_022137623.1 uncharacterized protein LOC111009021 [Momordica charantia]2.8e-8493.07Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        MEV+SFC SSSRGIS VYSYP KTKS   NPST SPL V +MAAEK  PSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSP IKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

XP_022954525.1 uncharacterized protein LOC111456768 [Cucurbita moschata]3.1e-8389.11Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        ME++S  +SS+R  S +YSYP KTKS  +NPSTFSPLHVHAMAAEK P SA KT+ SKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAA ALVSVVGGSAAFAASNLVS LQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

XP_022994700.1 uncharacterized protein LOC111490350 [Cucurbita maxima]4.8e-8489.6Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        MEV+S  +SS+R  S +YSYP KTKS  +NPSTFSPLHVHAMAAEK P SA KT+GSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQAA ALVSV+GGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

XP_023541355.1 uncharacterized protein LOC111801557 [Cucurbita pepo subsp. pepo]2.8e-8489.6Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        ME++S  +SS+R  S +YSYP KTKS  +NPSTFSPLHVHAMAAEK P SA KT+GSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQL+LLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAA ALVSVVGGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

TrEMBL top hitse value%identityAlignment
A0A1S3BQ40 uncharacterized protein LOC1034924374.3e-8388.12Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        MEV+SFCSSS+R  S +YSYPS T S ++      PLH+H+MAAEK PPSA+KTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQ AVALVS+VGGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

A0A5D3BGQ3 Uncharacterized protein4.3e-8388.12Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        MEV+SFCSSS+R  S +YSYPS T S ++      PLH+H+MAAEK PPSA+KTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQ AVALVS+VGGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

A0A6J1CAU4 uncharacterized protein LOC1110090211.4e-8493.07Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        MEV+SFC SSSRGIS VYSYP KTKS   NPST SPL V +MAAEK  PSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSP IKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

A0A6J1GSN1 uncharacterized protein LOC1114567681.5e-8389.11Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        ME++S  +SS+R  S +YSYP KTKS  +NPSTFSPLHVHAMAAEK P SA KT+ SKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAA ALVSVVGGSAAFAASNLVS LQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

A0A6J1K230 uncharacterized protein LOC1114903502.3e-8489.6Show/hide
Query:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
        MEV+S  +SS+R  S +YSYP KTKS  +NPSTFSPLHVHAMAAEK P SA KT+GSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE
Subjt:  MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAE

Query:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR
        KAGLLSAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQAA ALVSV+GGSAAFAASNLVSNLQR
Subjt:  KAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQR

Query:  SN
        SN
Subjt:  SN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G74730.1 Protein of unknown function (DUF1118)2.2e-1039.67Show/hide
Query:  IKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATD-PGT-PGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVS
        + +  ++E+ K+LS  EK+GLLS AE  GL+LSS+EKL + SKAE+LG+LS   +  GT P  L S +L  L      V L+P+DS   +V QA +A   
Subjt:  IKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVLSAATD-PGT-PGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVS

Query:  VVGGSAAFAASNLVSNLQRSN
         + G      S ++  LQ ++
Subjt:  VVGGSAAFAASNLVSNLQRSN

AT5G08050.1 Protein of unknown function (DUF1118)2.5e-4362.72Show/hide
Query:  SPLHVHAMAAEKAPPSAA-KTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVL
        S L VH+MA +K  PSAA +T+ SKK  +T                  P +KLLTRVEQLKLL+KAEKAGLLS AEK+G SLS+IE+LGLL+KAEE GVL
Subjt:  SPLHVHAMAAEKAPPSAA-KTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGVL

Query:  SAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQRSN
        SAAT+P TPG L +LSLGLLLLGP   Y+VPED  WE+V+Q  VAL+SV+GGSAAFAAS  VSNLQ+S+
Subjt:  SAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQRSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTCAGTTCTTTCTGCAGCAGCAGCAGCAGAGGAATTTCAATCGTCTACTCCTATCCATCGAAAACGAAATCAATAAAACGAAATCCTTCCACCTTTTCGCCTCT
CCATGTACACGCCATGGCCGCCGAGAAAGCTCCTCCTTCTGCCGCTAAAACCGTCGGCTCCAAGAAGATAAACTCGACGGTGTTCCCTCTCGGCGAGAAAGGACCGAGGA
GTAGCATCTCGCTGTCGACCTCGCCGCCGATTAAGCTACTGACGAGAGTGGAGCAATTGAAGCTGCTGAGCAAGGCGGAGAAGGCCGGGCTGCTGTCTGCGGCGGAGAAG
GCTGGATTGTCTCTGTCGTCGATTGAGAAATTAGGGCTTCTGTCGAAGGCTGAGGAGTTGGGGGTTCTATCGGCAGCGACGGATCCGGGAACTCCCGGGGCGCTGCTGAG
CCTGAGCTTAGGGCTGTTGCTTTTAGGGCCTTCGTGTGTGTATTTGGTACCGGAGGACAGTGTTTGGGAAATCGTGCTGCAGGCGGCGGTGGCTCTGGTCTCCGTCGTGG
GCGGCTCTGCGGCTTTTGCTGCGTCGAATTTGGTGTCCAATTTGCAGAGATCGAATTGA
mRNA sequenceShow/hide mRNA sequence
AATAAATAGAAAGTTAGAAACATGGATTTTTTTTTTTATATATAAAAAAATCAGATTATAAACCATTCCAACTGCGTGGCACCGATAAAATGATGACACGTGGATAATAC
AGTATCTCAATTCTCAAAACTAATCCAACCACCTTCCTTCTCCATCATTCATTTCCATGGTGCGGGCCTCTGCAATTCTCTCATTTTGCCCTAACCATCGCCATTATTCA
CAGTTCTCCTCTGCTAGAGATTCACAGGGATGGAGGTCAGTTCTTTCTGCAGCAGCAGCAGCAGAGGAATTTCAATCGTCTACTCCTATCCATCGAAAACGAAATCAATA
AAACGAAATCCTTCCACCTTTTCGCCTCTCCATGTACACGCCATGGCCGCCGAGAAAGCTCCTCCTTCTGCCGCTAAAACCGTCGGCTCCAAGAAGATAAACTCGACGGT
GTTCCCTCTCGGCGAGAAAGGACCGAGGAGTAGCATCTCGCTGTCGACCTCGCCGCCGATTAAGCTACTGACGAGAGTGGAGCAATTGAAGCTGCTGAGCAAGGCGGAGA
AGGCCGGGCTGCTGTCTGCGGCGGAGAAGGCTGGATTGTCTCTGTCGTCGATTGAGAAATTAGGGCTTCTGTCGAAGGCTGAGGAGTTGGGGGTTCTATCGGCAGCGACG
GATCCGGGAACTCCCGGGGCGCTGCTGAGCCTGAGCTTAGGGCTGTTGCTTTTAGGGCCTTCGTGTGTGTATTTGGTACCGGAGGACAGTGTTTGGGAAATCGTGCTGCA
GGCGGCGGTGGCTCTGGTCTCCGTCGTGGGCGGCTCTGCGGCTTTTGCTGCGTCGAATTTGGTGTCCAATTTGCAGAGATCGAATTGACTTCCGGAGTTGGAGTTTTCCG
GCGATGGGAGTTGTAGCATTGGCACTGTACATCAGTATCTTTTGTTATCTCACGTCGTCGTTTTGTATTCTGCAATTATTGAATTTATCATTTTCCATCGGATTAAAAAA
ATGGCGATATGTGAACGCAATTATTCTGATCTCAATTTTATACCAAAATCGATATAGGGAC
Protein sequenceShow/hide protein sequence
MEVSSFCSSSSRGISIVYSYPSKTKSIKRNPSTFSPLHVHAMAAEKAPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSAAEK
AGLSLSSIEKLGLLSKAEELGVLSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDSVWEIVLQAAVALVSVVGGSAAFAASNLVSNLQRSN