; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003092 (gene) of Snake gourd v1 genome

Gene IDTan0003092
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG09:66031027..66031719
RNA-Seq ExpressionTan0003092
SyntenyTan0003092
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589876.1 hypothetical protein SDJN03_15299, partial [Cucurbita argyrosperma subsp. sororia]5.6e-4678.29Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKT-SSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT
        MAT+L TF    IRACA+SA    D NRRKT SSSSS+NWWAPVFGWSSE DYIDS NK++P+ LA G+S SDPE KSARNRFSPGCFTE+KARQLRLMT
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKT-SSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT

Query:  TETESFHDVMYHSAIASRLASDFKNRADS
         ETESFHDVMYHSAIASRLA+DFK+RADS
Subjt:  TETESFHDVMYHSAIASRLASDFKNRADS

XP_011656185.1 uncharacterized protein LOC105435656 [Cucumis sativus]4.9e-5083.59Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT
        MATNLLTFP AGIRACA+SADRR D +RRKTSSSS  NWWAPVFGWSSE DYIDS NKA+PQ LAGG S  D E KS R RFSPGCFTEAKARQLR+MTT
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT

Query:  ETESFHDVMYHSAIASRLASDFKNRADS
        ETESFHDVMYHSAIASRLASDFK+R DS
Subjt:  ETESFHDVMYHSAIASRLASDFKNRADS

XP_016903293.1 PREDICTED: uncharacterized protein LOC103502747 [Cucumis melo]5.0e-4781.4Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDS-DNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT
        MATNLLTF  AGIRACAASADRR D NRRK  +SSS NWWAPVFGWSSE DYIDS  NKA+PQ LAG  S  D E KS R RFSPGCFTEAKARQLR+MT
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDS-DNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT

Query:  TETESFHDVMYHSAIASRLASDFKNRADS
        TETESFHDVMYHSAIASRLASDFK+R DS
Subjt:  TETESFHDVMYHSAIASRLASDFKNRADS

XP_022960569.1 uncharacterized protein LOC111461308 [Cucurbita moschata]3.6e-4576.15Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRK--TSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLM
        MATNL TF    IRACA+SA    D NRRK  +SSSSS+NWWAPVFGWSSE DYIDS NK++P+ LA G+S SDPE KS+RNRFSPGCFTE+KARQLRLM
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRK--TSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLM

Query:  TTETESFHDVMYHSAIASRLASDFKNRADS
        T ETESFHDVMYHSAIASRLA+DFK+R DS
Subjt:  TTETESFHDVMYHSAIASRLASDFKNRADS

XP_038880363.1 uncharacterized protein LOC120072011 [Benincasa hispida]2.1e-5387.5Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT
        MATNLLTFP AGIRACAASADRR D NRRKTSSSS  NWWAPVFGWSSE DYIDS NKA+PQ LAGG+S SDPE KS R RFSPGCFTEAKARQLR+MTT
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT

Query:  ETESFHDVMYHSAIASRLASDFKNRADS
        ETESFHDVMYHSAIASRLASDFK RADS
Subjt:  ETESFHDVMYHSAIASRLASDFKNRADS

TrEMBL top hitse value%identityAlignment
A0A0A0LTN7 Uncharacterized protein2.4e-5083.59Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT
        MATNLLTFP AGIRACA+SADRR D +RRKTSSSS  NWWAPVFGWSSE DYIDS NKA+PQ LAGG S  D E KS R RFSPGCFTEAKARQLR+MTT
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT

Query:  ETESFHDVMYHSAIASRLASDFKNRADS
        ETESFHDVMYHSAIASRLASDFK+R DS
Subjt:  ETESFHDVMYHSAIASRLASDFKNRADS

A0A1S4E4Y4 uncharacterized protein LOC1035027472.4e-4781.4Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDS-DNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT
        MATNLLTF  AGIRACAASADRR D NRRK  +SSS NWWAPVFGWSSE DYIDS  NKA+PQ LAG  S  D E KS R RFSPGCFTEAKARQLR+MT
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDS-DNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT

Query:  TETESFHDVMYHSAIASRLASDFKNRADS
        TETESFHDVMYHSAIASRLASDFK+R DS
Subjt:  TETESFHDVMYHSAIASRLASDFKNRADS

A0A5D3CSV6 Uncharacterized protein2.4e-4781.4Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDS-DNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT
        MATNLLTF  AGIRACAASADRR D NRRK  +SSS NWWAPVFGWSSE DYIDS  NKA+PQ LAG  S  D E KS R RFSPGCFTEAKARQLR+MT
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDS-DNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMT

Query:  TETESFHDVMYHSAIASRLASDFKNRADS
        TETESFHDVMYHSAIASRLASDFK+R DS
Subjt:  TETESFHDVMYHSAIASRLASDFKNRADS

A0A6J1HBD8 uncharacterized protein LOC1114613081.7e-4576.15Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRK--TSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLM
        MATNL TF    IRACA+SA    D NRRK  +SSSSS+NWWAPVFGWSSE DYIDS NK++P+ LA G+S SDPE KS+RNRFSPGCFTE+KARQLRLM
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRK--TSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLM

Query:  TTETESFHDVMYHSAIASRLASDFKNRADS
        T ETESFHDVMYHSAIASRLA+DFK+R DS
Subjt:  TTETESFHDVMYHSAIASRLASDFKNRADS

A0A6J1JBM9 uncharacterized protein LOC1114853093.0e-4578.12Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT
        MATNL TF    IRACA+SA    D NRRKTSSSSS NWWAPVFGWSSE DYIDS NK++P+  A G+S SDPE KSARNRFSPGCFTE+KARQLRLMT 
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT

Query:  ETESFHDVMYHSAIASRLASDFKNRADS
        ETESFHDVMYHSAIASRLA+DFK+R DS
Subjt:  ETESFHDVMYHSAIASRLASDFKNRADS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52720.1 unknown protein4.3e-2045.22Show/hide
Query:  IRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTTETESFHDVMYHS
        IRA + S    PDQNR+K     SA WWAP+FG  S+ DY++ ++          ++    +   +  +F  GCFTE KA+QLR  T E  +FHDVMYHS
Subjt:  IRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTTETESFHDVMYHS

Query:  AIASRLASDFKNRAD
        AIASRLASD   R +
Subjt:  AIASRLASDFKNRAD

AT3G15630.1 unknown protein9.2e-1544.72Show/hide
Query:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT
        MAT +     + IRA + S     D  R+K  SS S  WWAP+FG SSE DY++     +          SD +   A  R    C TE KA+QLR  T 
Subjt:  MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTT

Query:  ETESFHDVMYHSAIASRLASDFK
        E  +FHDVMYHSAIASRLASD +
Subjt:  ETESFHDVMYHSAIASRLASDFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACCAATCTCCTCACTTTTCCCTCCGCCGGCATCCGTGCCTGCGCCGCCTCCGCCGACCGCCGCCCCGACCAGAATCGCCGGAAAACCTCCTCTTCTTCCTCCGC
CAACTGGTGGGCCCCGGTCTTCGGCTGGTCCTCGGAGGCGGACTACATCGACTCTGACAACAAGGCCGATCCTCAAAAACTCGCCGGCGGAATTTCATATTCCGACCCGG
AGCCAAAATCGGCGAGGAATCGATTTTCCCCCGGCTGCTTCACGGAGGCTAAGGCAAGGCAGCTCCGCCTGATGACGACGGAAACGGAGTCGTTTCACGACGTTATGTAT
CACTCGGCAATCGCATCTCGTCTCGCCTCCGACTTCAAAAATCGCGCCGATTCCTGA
mRNA sequenceShow/hide mRNA sequence
ACAACCCCATACCCCATTTCATCTCTCTAAAAAAAAAAGCACTTTCCGATCTGCTCTCTCTAAAAATGGCGACCAATCTCCTCACTTTTCCCTCCGCCGGCATCCGTGCC
TGCGCCGCCTCCGCCGACCGCCGCCCCGACCAGAATCGCCGGAAAACCTCCTCTTCTTCCTCCGCCAACTGGTGGGCCCCGGTCTTCGGCTGGTCCTCGGAGGCGGACTA
CATCGACTCTGACAACAAGGCCGATCCTCAAAAACTCGCCGGCGGAATTTCATATTCCGACCCGGAGCCAAAATCGGCGAGGAATCGATTTTCCCCCGGCTGCTTCACGG
AGGCTAAGGCAAGGCAGCTCCGCCTGATGACGACGGAAACGGAGTCGTTTCACGACGTTATGTATCACTCGGCAATCGCATCTCGTCTCGCCTCCGACTTCAAAAATCGC
GCCGATTCCTGATGTGATCTCCTCCGGTACCGGCCGACGGCCGCCGATCCCTCTCATCTACTTGTAATTATGTCGAATTGTATAAATTAACTGCCAATTCCTGGGATTAT
GAGTAGTAAACTGTTTTTTTTCTTTTTCCTTTTATTTTTGTTCGTTGTTTTGGCATTGTTCTTCCTTGAGCAATCGTCGGATTGAGAGGTGTTTGGTGAAGAAGAGAATG
ATGAAATAATTTTATTATTATTTTAGAACGAAA
Protein sequenceShow/hide protein sequence
MATNLLTFPSAGIRACAASADRRPDQNRRKTSSSSSANWWAPVFGWSSEADYIDSDNKADPQKLAGGISYSDPEPKSARNRFSPGCFTEAKARQLRLMTTETESFHDVMY
HSAIASRLASDFKNRADS