; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021684 (gene) of Snake gourd v1 genome

Gene IDTan0021684
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG02:4935844..4936865
RNA-Seq ExpressionTan0021684
SyntenyTan0021684
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059605.1 hypothetical protein E6C27_scaffold54G00270 [Cucumis melo var. makuwa]2.9e-6583.44Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPD--SAQTLRRRLSSISFKIQPISSPVTSW-PFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMN--
        M++IP R FLPT PLNDALP+SQPD  S+QTLRRRLSSISFKI PISSP+TSW  FRRSKSVSSMRD+T +SLRKWWDWGWSWILSRK+AFARDLEMN  
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPD--SAQTLRRRLSSISFKIQPISSPVTSW-PFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMN--

Query:  --DEETKALGSN--CRGS--LRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK
          DEETKALGSN  CRGS   +HLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK
Subjt:  --DEETKALGSN--CRGS--LRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK

KAG7022775.1 hypothetical protein SDJN02_16511, partial [Cucurbita argyrosperma subsp. argyrosperma]8.0e-7692.81Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK
        M++IPRRSFLPTRPLNDALPVSQPDS QTLRRRLSSISFKIQPISSPVTSW F RSKSVSS+RD+TG SLRKWWDWGWSWILSRKAAFARDLEMNDEETK
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK

Query:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS
        +L SNCRGS RHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS
Subjt:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS

XP_022135907.1 uncharacterized protein LOC111007745 [Momordica charantia]8.9e-7590.2Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK
        M+ +PRRSFLPTRPLNDALPVS PDS QTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMR+Y+GSSLRKWWDWGWSWILSRKA FA+DLE+ND+ETK
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK

Query:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS
        +LGSNCRGS RHLFYKVRSEFRKL+RSDRVGLPQTFKYDSVNYSKNFDDGVK+
Subjt:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS

XP_022928235.1 uncharacterized protein LOC111435129 [Cucurbita moschata]6.8e-7590.85Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK
        M++IPRRSFLPTRPLNDALPVSQPDS QTLRRRLSSISFKIQPISSPVTSW F RSKSVSS+RD+TG SLRKWWDWGWSWILSRKAAFARDLEMNDEETK
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK

Query:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS
        +L SNCRGS RHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDG+++
Subjt:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS

XP_038888269.1 uncharacterized protein LOC120078122 [Benincasa hispida]2.3e-7894.77Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK
        M++IPRRSFLPTRPLNDALPVSQPDSAQT+RRRLSSISFKIQPISSPV SWPFRRSKSVSSMRD+TGSSLRKWWDWGWSWILSRKAAF RDLEMNDEETK
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK

Query:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS
        ALGSNCRGS RHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGV+S
Subjt:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS

TrEMBL top hitse value%identityAlignment
A0A0A0K1L8 Uncharacterized protein1.4e-6280.37Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPD--SAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMND--
        M++IP R FLPT PLNDALP+SQP   S QTLRRRLSSISFKI PISSP+TSW F RSKS+SSMRD+T +SLRKWWDWGWSWILSRKAAFARDLEMND  
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPD--SAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMND--

Query:  ---EETKALGS--NCRGS--LRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK
           E TKALGS  + RGS   +HLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK
Subjt:  ---EETKALGS--NCRGS--LRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK

A0A2I4FBL9 uncharacterized protein LOC1089972842.5e-5471.92Show/hide
Query:  RSFLPTRPLNDALPVSQPDS-AQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETKALGSN
        RSF+P  P   ALPVSQ DS  Q LRRRLSS+S KIQPI+SP TSW FRRSKSVSSM +Y GSS+R+WWDWGWSWILSRK  FA+DLE+N+EE K LGS+
Subjt:  RSFLPTRPLNDALPVSQPDS-AQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETKALGSN

Query:  CRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGV
         +GS RH+FYKVRSE RKL+ SD VGLPQT++YDS NYS+NFDDG+
Subjt:  CRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGV

A0A5A7V189 Uncharacterized protein1.4e-6583.44Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPD--SAQTLRRRLSSISFKIQPISSPVTSW-PFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMN--
        M++IP R FLPT PLNDALP+SQPD  S+QTLRRRLSSISFKI PISSP+TSW  FRRSKSVSSMRD+T +SLRKWWDWGWSWILSRK+AFARDLEMN  
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPD--SAQTLRRRLSSISFKIQPISSPVTSW-PFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMN--

Query:  --DEETKALGSN--CRGS--LRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK
          DEETKALGSN  CRGS   +HLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK
Subjt:  --DEETKALGSN--CRGS--LRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVK

A0A6J1C433 uncharacterized protein LOC1110077454.3e-7590.2Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK
        M+ +PRRSFLPTRPLNDALPVS PDS QTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMR+Y+GSSLRKWWDWGWSWILSRKA FA+DLE+ND+ETK
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK

Query:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS
        +LGSNCRGS RHLFYKVRSEFRKL+RSDRVGLPQTFKYDSVNYSKNFDDGVK+
Subjt:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS

A0A6J1ER56 uncharacterized protein LOC1114351293.3e-7590.85Show/hide
Query:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK
        M++IPRRSFLPTRPLNDALPVSQPDS QTLRRRLSSISFKIQPISSPVTSW F RSKSVSS+RD+TG SLRKWWDWGWSWILSRKAAFARDLEMNDEETK
Subjt:  MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETK

Query:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS
        +L SNCRGS RHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDG+++
Subjt:  ALGSNCRGSLRHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G17300.1 unknown protein2.1e-2145.6Show/hide
Query:  ALPVSQ--PDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRK-AAFARDLEMNDEETK-ALGSNCRGSLRHL
        +LP+S    D    LRRRLSS+S  +        S    RSKSVS M +  GSS+++WW+W WSWIL +K   F  DLE+N  ETK +LG+  RGS  H+
Subjt:  ALPVSQ--PDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRK-AAFARDLEMNDEETK-ALGSNCRGSLRHL

Query:  FYKVRSEFRKLIRSDRVGLPQTFKY
        F+K+RSE R+L+R     LP + K+
Subjt:  FYKVRSEFRKLIRSDRVGLPQTFKY

AT4G35320.1 unknown protein2.1e-2647.69Show/hide
Query:  ALPVSQ-PDSAQT-------LRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETKALGSNCRGS
        +LPVSQ P++A T       +RRRLSS+S  +    + + +  F RSKSVS+M +  GSS+++WW+WGWSWILSRK  F RDLE+N +E K++GS  RGS
Subjt:  ALPVSQ-PDSAQT-------LRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETKALGSNCRGS

Query:  LRHLFYKVRSEFRKLI-RSDRVGLPQTFKY
        + H+F+K+RS+ R  +  S    LP + KY
Subjt:  LRHLFYKVRSEFRKLI-RSDRVGLPQTFKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTACCATTCCACGGCGGAGCTTTCTGCCGACGCGGCCTCTCAACGACGCGCTCCCTGTCTCGCAGCCGGACTCCGCTCAAACCCTACGCCGAAGGCTCTCTTCGAT
TTCCTTCAAGATCCAGCCGATTTCTTCGCCTGTTACTTCCTGGCCGTTTCGTAGATCCAAATCGGTGTCGTCTATGCGCGATTACACCGGCAGTTCGCTCCGGAAGTGGT
GGGATTGGGGATGGTCCTGGATCCTCTCTCGCAAGGCTGCCTTCGCTCGAGATCTGGAGATGAACGATGAAGAAACTAAAGCTCTTGGATCTAATTGCAGAGGTAGTTTG
AGGCATCTTTTCTATAAGGTTAGGTCTGAGTTCCGCAAACTGATTCGCTCCGATCGCGTCGGTCTTCCTCAAACTTTCAAATACGATTCGGTTAATTACTCGAAGAATTT
CGACGACGGAGTGAAATCTTAG
mRNA sequenceShow/hide mRNA sequence
CAAACACACCTCTCACTTCTCTCTCTCTCTCTTCACCGGCGCCGCTCCCTCCTCCGCCACCGCCGCCGCAGTTTCATCACCATGAGTACCATTCCACGGCGGAGCTTTCT
GCCGACGCGGCCTCTCAACGACGCGCTCCCTGTCTCGCAGCCGGACTCCGCTCAAACCCTACGCCGAAGGCTCTCTTCGATTTCCTTCAAGATCCAGCCGATTTCTTCGC
CTGTTACTTCCTGGCCGTTTCGTAGATCCAAATCGGTGTCGTCTATGCGCGATTACACCGGCAGTTCGCTCCGGAAGTGGTGGGATTGGGGATGGTCCTGGATCCTCTCT
CGCAAGGCTGCCTTCGCTCGAGATCTGGAGATGAACGATGAAGAAACTAAAGCTCTTGGATCTAATTGCAGAGGTAGTTTGAGGCATCTTTTCTATAAGGTTAGGTCTGA
GTTCCGCAAACTGATTCGCTCCGATCGCGTCGGTCTTCCTCAAACTTTCAAATACGATTCGGTTAATTACTCGAAGAATTTCGACGACGGAGTGAAATCTTAGGCTGCTT
CCACTGTAATCGATTTTATGTGTTCGAAGGAATTCAGACTTTCGTATTGAATAATATGATCTGTTTCAAGCTAGATTCAGGACAAGGATATATAGACAAAGAAGCTTCCG
ATTCCGGCCATGGAGGTCTCTGGTCTTCACTGCAATTGTTGCTGGTAATCGATTTGGCAAAATCAACCTGTTCATCGTGTTCATTGCTTCGTTCTTCGCTCGATCGGAAC
TCAGAACGCACATTTGATTGCTCGCCGGCCCTGAGGTGAAGATGAACAGAGAGATTGCAGAGATCGAATTCGTCGCAAATTATGAGATCAAAGAGTTTGAAGTACATAAG
AATTTTCTTCTTCTTTTTGCTGATTTTGTTTTTTTTTGTTTGTATAAGCAACGTAAAATTCCTTTTGCTCTTATTGTTTGCATAGTATTTCTTGCGTCATCCGATGTAAT
TCCAACACTACTTTTTCTTTACTGTTCTTTGA
Protein sequenceShow/hide protein sequence
MSTIPRRSFLPTRPLNDALPVSQPDSAQTLRRRLSSISFKIQPISSPVTSWPFRRSKSVSSMRDYTGSSLRKWWDWGWSWILSRKAAFARDLEMNDEETKALGSNCRGSL
RHLFYKVRSEFRKLIRSDRVGLPQTFKYDSVNYSKNFDDGVKS