; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005836 (gene) of Snake gourd v1 genome

Gene IDTan0005836
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionycf20-like protein
Genome locationLG07:5524646..5526365
RNA-Seq ExpressionTan0005836
SyntenyTan0005836
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007572 - Uncharacterised protein family Ycf20


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579671.1 hypothetical protein SDJN03_24119, partial [Cucurbita argyrosperma subsp. sororia]1.1e-7292.9Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
        MAQSASLVST  LNY FT ++NA YWRYDMIKSR SPRSL +KAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL

Query:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Subjt:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

XP_004143642.1 uncharacterized protein ycf20 [Cucumis sativus]9.3e-7292.95Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA
        MAQSASLV TS LNYGFT K N  YWRYDM+KSR S PRS RVKAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA

Query:  LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
Subjt:  LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

XP_022928979.1 uncharacterized protein ycf20 [Cucurbita moschata]5.5e-7291.61Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
        MAQSASLVST  LNY FT ++NA YWRYD+IKSR SPRSL +KAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL

Query:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Subjt:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

XP_022969949.1 uncharacterized protein ycf20 [Cucurbita maxima]4.2e-7291.61Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
        MAQSASLVST  LNY FT ++NA YWRYD+IKSR SPRSL +KAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL

Query:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Subjt:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

XP_023549885.1 uncharacterized protein LOC111808248 [Cucurbita pepo subsp. pepo]3.2e-7292.26Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
        MAQSASLVST  LNY FT ++NA YWRYD+IKSR SPRSL +KAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL

Query:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Subjt:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

TrEMBL top hitse value%identityAlignment
A0A0A0KPS1 Uncharacterized protein4.5e-7292.95Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA
        MAQSASLV TS LNYGFT K N  YWRYDM+KSR S PRS RVKAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA

Query:  LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
Subjt:  LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

A0A1S3CT33 uncharacterized protein ycf20 isoform X18.5e-7191.03Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA
        MAQS SLV T  LNYGF+ K N+ YWRYDM+KSR S PRS RVKAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGA

Query:  LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
Subjt:  LGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

A0A5D3BMH1 Ycf20-like protein isoform X12.9e-6394.81Show/hide
Query:  NALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTR
        N  YWRYDM+KSR S PRS RVKAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTR
Subjt:  NALYWRYDMIKSR-SSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTR

Query:  FYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        FYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
Subjt:  FYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

A0A6J1ESY7 uncharacterized protein ycf202.6e-7291.61Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
        MAQSASLVST  LNY FT ++NA YWRYD+IKSR SPRSL +KAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL

Query:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Subjt:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

A0A6J1I440 uncharacterized protein ycf202.0e-7291.61Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
        MAQSASLVST  LNY FT ++NA YWRYD+IKSR SPRSL +KAVQDTGGPRRLVDIIRLVPE+SRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGAL

Query:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFP+ALLNNFKMGFTYGLFIDAFKLAS
Subjt:  GVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

SwissProt top hitse value%identityAlignment
P51214 Uncharacterized protein ycf205.8e-0847.44Show/hide
Query:  GGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYS-RPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        G ISLL GF+++  +S   G  G   +IAA + V   E V++  YS + K    I LLNN K+G TYGLF+DAFKL S
Subjt:  GGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYS-RPKVTFPIALLNNFKMGFTYGLFIDAFKLAS

P72983 Ycf20-like protein3.4e-0835.24Show/hide
Query:  RLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSR--PKVTFPIALLNNFKMGFTYGLFIDA
        RL  I+ +  +    +FR+P RR     +S L GF+V   ++ + G     DV+ A   +L  E V R++Y R          +LN FKMG +Y LF++A
Subjt:  RLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSR--PKVTFPIALLNNFKMGFTYGLFIDA

Query:  FKLAS
        FKL S
Subjt:  FKLAS

Q1XDS2 Uncharacterized protein ycf201.2e-0847.44Show/hide
Query:  GGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSRPK-VTFPIALLNNFKMGFTYGLFIDAFKLAS
        G ISLL GF+++  +S   G  G   +IAA + V  TE  ++  YS  K +   I L NNFK+G TYGLF+DAFKL S
Subjt:  GGISLLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSRPK-VTFPIALLNNFKMGFTYGLFIDAFKLAS

Q9MUL5 Uncharacterized protein ycf203.0e-0437.8Show/hide
Query:  LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSR----PKVTFPIALL-----NNFKMGFTYGLFIDAFKLAS
        LL GF++A  ++  FG  G  DV+ A + V + E +    YS+     K  F I+ L     N  K+G  +GLF+DAFKL S
Subjt:  LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSR----PKVTFPIALL-----NNFKMGFTYGLFIDAFKLAS

Arabidopsis top hitse value%identityAlignment
AT1G65420.1 Protein of unknown function (DUF565)1.8e-0435Show/hide
Query:  LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSRP-------KVTFPIALLNNFKMGFTYGLFIDAFKLAS
        LL GFY A  ++   G  G  DV+ A + V   E +    Y +P       K+   +  +N +K G   GLF+DAFKL S
Subjt:  LLGGFYVAQTISLSFGALGVNDVIAAVVCVLLTEYVTRFYYSRP-------KVTFPIALLNNFKMGFTYGLFIDAFKLAS

AT5G43050.1 Protein of unknown function (DUF565)4.3e-5168.99Show/hide
Query:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLR-VKAVQDTGG--PRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSF
        MA S S +ST+  +      +   +  +     +   R  R ++A+Q+T G  PRRL+DIIR VPEISRNYF+ PSRR LFGGISLLGGFYVAQTISLSF
Subjt:  MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLR-VKAVQDTGG--PRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSF

Query:  GALGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS
        GALGVNDVIAAV+CVLLTEYVTRFYYSR  VTFPIALLNNFKMGFTYGLFIDAFKLAS
Subjt:  GALGVNDVIAAVVCVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACAGTCTGCTAGTTTAGTCTCTACTAGCTTCCTAAACTATGGCTTTACTTCCAAAAGGAATGCTTTATATTGGAGATATGATATGATAAAATCACGTTCATCCCC
TCGTAGTCTTCGTGTCAAAGCTGTGCAAGATACTGGAGGTCCTCGTAGGTTAGTTGATATAATTAGACTGGTGCCCGAGATCTCAAGAAATTACTTTCGAAGTCCTTCAA
GGAGGGCCCTTTTTGGAGGAATCTCATTGTTGGGTGGCTTTTATGTAGCACAGACTATATCATTGTCATTTGGAGCTTTAGGAGTAAATGATGTGATTGCTGCTGTGGTG
TGTGTTCTGCTTACAGAGTATGTTACTCGATTTTATTACAGTCGACCAAAAGTAACTTTCCCCATTGCTCTACTCAACAACTTCAAAATGGGTTTCACTTATGGTCTCTT
CATTGATGCTTTCAAACTTGCTAGTTAA
mRNA sequenceShow/hide mRNA sequence
TTCGGCTTTGGATATGTGCTCTACATTTTCCAATCCCAATTCCCAAACTCGCGATTCTCTTTGCTCCGTCGACGAATAGCAGTTCTCCGGAGCGGAACCTGGAATCGAAT
TCTAGTGATCTCAGGTTCTCAACATTTTATTCAGCCATATAGGAAGTGAAACTTCAATGGCACAGTCTGCTAGTTTAGTCTCTACTAGCTTCCTAAACTATGGCTTTACT
TCCAAAAGGAATGCTTTATATTGGAGATATGATATGATAAAATCACGTTCATCCCCTCGTAGTCTTCGTGTCAAAGCTGTGCAAGATACTGGAGGTCCTCGTAGGTTAGT
TGATATAATTAGACTGGTGCCCGAGATCTCAAGAAATTACTTTCGAAGTCCTTCAAGGAGGGCCCTTTTTGGAGGAATCTCATTGTTGGGTGGCTTTTATGTAGCACAGA
CTATATCATTGTCATTTGGAGCTTTAGGAGTAAATGATGTGATTGCTGCTGTGGTGTGTGTTCTGCTTACAGAGTATGTTACTCGATTTTATTACAGTCGACCAAAAGTA
ACTTTCCCCATTGCTCTACTCAACAACTTCAAAATGGGTTTCACTTATGGTCTCTTCATTGATGCTTTCAAACTTGCTAGTTAACTGCACTTGGTGAAGCCATTCTGGTG
GCTAGCATATTTACATTTGTAAAGTTCTACCCTTCTCTTTTCTTTACTAATTTTGTTCCATATCTTTAAAGCTTGTTTTTATTGTTTTTGATGAACCCTGCCCTCCCTTC
TTCATTGTTTTCTTCTGTATATTTTCATCTACAATTTACACTGAAGAGTTAATTCTGAACCTATTCACCCTCTTCAAATGCAATTGAGTAACAGATTTGAATGTAATTGA
TATCTTTTTTTAGTTTAGTTCTTGTTCATGTATTGAAATGCACCACTGTCAATGGAGAATTGTTTGATTCAACTGTTTCTTTCCAAAATACGTTCTGAG
Protein sequenceShow/hide protein sequence
MAQSASLVSTSFLNYGFTSKRNALYWRYDMIKSRSSPRSLRVKAVQDTGGPRRLVDIIRLVPEISRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVV
CVLLTEYVTRFYYSRPKVTFPIALLNNFKMGFTYGLFIDAFKLAS