; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003068 (gene) of Snake gourd v1 genome

Gene IDTan0003068
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG04:88034972..88037601
RNA-Seq ExpressionTan0003068
SyntenyTan0003068
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018211.1 hypothetical protein SDJN02_20079, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-5984.47Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI
        MSLSLSKS    LTLTLNLL PR S  LQSL  R FSSS D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHLPPI
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI

Query:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEE
        ANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQGAHLPSLE EVET SAEC+ASQ  DHEE
Subjt:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEE

XP_022956126.1 uncharacterized protein LOC111457911 isoform X1 [Cucurbita moschata]7.0e-5984.38Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI
        MSLSLSKS    LTLTLNLL PR S  LQSL  R FSSS D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHLPPI
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI

Query:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE
        ANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQGAHLPSLE EVET SAEC+ASQ  DHE
Subjt:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE

XP_022956127.1 uncharacterized protein LOC111457911 isoform X2 [Cucurbita moschata]3.1e-5984.47Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI
        MSLSLSKS    LTLTLNLL PR S  LQSL  R FSSS D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHLPPI
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI

Query:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEE
        ANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQGAHLPSLE EVET SAEC+ASQ  DHEE
Subjt:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEE

XP_022980733.1 uncharacterized protein LOC111480015 isoform X1 [Cucurbita maxima]1.2e-5882.21Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL
        MSLSLSKS    LTLTLNLL PR   R   LQSL  R FSSS+D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHL
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL

Query:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE
        PPIANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQG+HLPSLE EVET SAEC+ASQ  DHE
Subjt:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE

XP_022980734.1 uncharacterized protein LOC111480015 isoform X2 [Cucurbita maxima]1.2e-5882.21Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL
        MSLSLSKS    LTLTLNLL PR   R   LQSL  R FSSS+D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHL
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL

Query:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE
        PPIANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQG+HLPSLE EVET SAEC+ASQ  DHE
Subjt:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE

TrEMBL top hitse value%identityAlignment
A0A0A0K9L9 Uncharacterized protein5.6e-5478.34Show/hide
Query:  SLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPIANLLH
        +LS  KSLTLTLNLL P        LF R FSS SD+DLTDL ES S SSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHLPPIAN+L 
Subjt:  SLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPIANLLH

Query:  TFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEEG
          ANPLSPEQSLSTTTVRGWP+SHYFIQG HLPSL+ EV+TTS ECDAS   DHEEG
Subjt:  TFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEEG

A0A6J1GVG9 uncharacterized protein LOC111457911 isoform X21.5e-5984.47Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI
        MSLSLSKS    LTLTLNLL PR S  LQSL  R FSSS D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHLPPI
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI

Query:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEE
        ANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQGAHLPSLE EVET SAEC+ASQ  DHEE
Subjt:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEE

A0A6J1GVQ5 uncharacterized protein LOC111457911 isoform X13.4e-5984.38Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI
        MSLSLSKS    LTLTLNLL PR S  LQSL  R FSSS D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHLPPI
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHLPPI

Query:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE
        ANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQGAHLPSLE EVET SAEC+ASQ  DHE
Subjt:  ANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE

A0A6J1J031 uncharacterized protein LOC111480015 isoform X25.8e-5982.21Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL
        MSLSLSKS    LTLTLNLL PR   R   LQSL  R FSSS+D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHL
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL

Query:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE
        PPIANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQG+HLPSLE EVET SAEC+ASQ  DHE
Subjt:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE

A0A6J1J062 uncharacterized protein LOC111480015 isoform X15.8e-5982.21Show/hide
Query:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL
        MSLSLSKS    LTLTLNLL PR   R   LQSL  R FSSS+D DLTDLSE  SPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP  SHL
Subjt:  MSLSLSKS----LTLTLNLLLPRSSSR---LQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP--SHL

Query:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE
        PPIANLLH FANPLSPEQSLSTTTVRGWP+SH+FIQG+HLPSLE EVET SAEC+ASQ  DHE
Subjt:  PPIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16840.1 unknown protein5.7e-2748.67Show/hide
Query:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP
        M+ S S+SL+L+       SSS L S  L +F S S     D    D S +S   SDPL++ LEDA+ RI VRR+ PDWLPFVPGASYWVP P   S   
Subjt:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP

Query:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTS
         IA L+   ANPL+ E+SLST +  GWP+S YF++G     +E++ ETTS
Subjt:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTS

AT1G16840.2 unknown protein9.2e-2549.28Show/hide
Query:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP
        M+ S S+SL+L+       SSS L S  L +F S S     D    D S +S   SDPL++ LEDA+ RI VRR+ PDWLPFVPGASYWVP P   S   
Subjt:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP

Query:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAH
         IA L+   ANPL+ E+SLST +  GWP+S YF++G++
Subjt:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAH

AT1G16840.3 unknown protein5.7e-2748.67Show/hide
Query:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP
        M+ S S+SL+L+       SSS L S  L +F S S     D    D S +S   SDPL++ LEDA+ RI VRR+ PDWLPFVPGASYWVP P   S   
Subjt:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP

Query:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTS
         IA L+   ANPL+ E+SLST +  GWP+S YF++G     +E++ ETTS
Subjt:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTS

AT1G16840.4 unknown protein5.7e-2748.67Show/hide
Query:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP
        M+ S S+SL+L+       SSS L S  L +F S S     D    D S +S   SDPL++ LEDA+ RI VRR+ PDWLPFVPGASYWVP P   S   
Subjt:  MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSS-----DYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP---SHLP

Query:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTS
         IA L+   ANPL+ E+SLST +  GWP+S YF++G     +E++ ETTS
Subjt:  PIANLLHTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTS

AT1G78890.1 unknown protein1.3e-2648.28Show/hide
Query:  SLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSS--SDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP-SHLPPIANLL
        SLSLS+      +LLLP S    Q++F+R+ SS+  S+ +   +      ++DPL+  LEDA+ RI+VRRSAPDWLPFVPGAS+WVP P S    IA L+
Subjt:  SLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSS--SDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLP-SHLPPIANLL

Query:  HTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSA
           ANP+S E+S+S ++VRGWP S YFI+G    S+E+E+ + +A
Subjt:  HTFANPLSPEQSLSTTTVRGWPASHYFIQGAHLPSLESEVETTSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCTCTCTATCCAAATCCCTAACCCTAACCCTAAATCTCCTTCTTCCTCGTTCTTCTTCTCGTCTTCAATCCCTCTTCCTCCGTAATTTCTCCTCTTCTTCCGA
TTACGATCTCACTGACCTCTCTGAATCTTCTTCTCCTTCTTCCGATCCCCTTCTTCGCAACCTCGAGGACGCTATTCAACGGATCCTCGTTCGCCGATCTGCCCCCGACT
GGCTCCCCTTCGTTCCCGGCGCTTCTTATTGGGTTCCACTGCCTTCTCATTTACCTCCCATTGCCAACCTTCTTCACACCTTCGCTAACCCTCTCTCTCCAGAACAATCC
TTGTCTACCACTACCGTCCGTGGCTGGCCCGCTTCTCATTACTTTATTCAAGGTGCCCATCTGCCTTCTCTTGAGTCGGAAGTCGAGACAACGTCCGCTGAATGTGATGC
TTCCCAACCTTTTGACCACGAGGAAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATTAAAATAAACTAAATTAATTCTCACTCCCCCTCTCTCTCTTTGTTTCTCCTCTTCTTCTCTGCAAAAGCGAAACCGACCCACTTAACCTTCCTTATGTCTCTCTCTCT
ATCCAAATCCCTAACCCTAACCCTAAATCTCCTTCTTCCTCGTTCTTCTTCTCGTCTTCAATCCCTCTTCCTCCGTAATTTCTCCTCTTCTTCCGATTACGATCTCACTG
ACCTCTCTGAATCTTCTTCTCCTTCTTCCGATCCCCTTCTTCGCAACCTCGAGGACGCTATTCAACGGATCCTCGTTCGCCGATCTGCCCCCGACTGGCTCCCCTTCGTT
CCCGGCGCTTCTTATTGGGTTCCACTGCCTTCTCATTTACCTCCCATTGCCAACCTTCTTCACACCTTCGCTAACCCTCTCTCTCCAGAACAATCCTTGTCTACCACTAC
CGTCCGTGGCTGGCCCGCTTCTCATTACTTTATTCAAGGTGCCCATCTGCCTTCTCTTGAGTCGGAAGTCGAGACAACGTCCGCTGAATGTGATGCTTCCCAACCTTTTG
ACCACGAGGAAGGATGAGAACCTCTACGTTCCCAAAATAAATTGCCATCTGATCTGGCTCGTGCTTCCCTCTAAACTTCATGTATAGAAGAAGAAGACCACCAATAGCAG
ACCAGCTAGATGGCCGAGCAAGGGGAGGAGCTTGTACATATGTGAGCTTTGGCGAGTGTTCTTTATCCAATGGGAACCTCTTCGTCCGTCATTGCCAGATTGCACATAAT
ATCTATGCCGCAGTGTACTCTTGAATTCGACCACATATGTATATTATTCTCACCTTTGCTTTTAAATGTTTTTAACTTTTGGGTAATTTTGTTTCTTCCAACACAACTGC
TTCACGAGTATTAATATGGTTAGAAAAGCTGTGGACTTAGTTTCATATTAGAATATCAAGCCTACATATGGTGCGTGGTGCGTAGGGTAGATTTGGGTTTTGGAACTTGA
CAATGCAGGTTGGTCGCCTATCAAGCCTACATATGGTTTAGCTTCATAGTTGCGGTGTTTCCTCTCCATTTTCTTTCGGGTCAAACAAACCTTAATTAATTGTTTAAGTC
ACAACGGGGTCGTCGAGTTTGAATTCTAGGATACTGAGGTCATGGGAGGACAAAGTGGGTGTGGGGACAAGGTCCTCTTTAAGTTTAACTATAACTATGTTCTACCAAAT
ATCGAC
Protein sequenceShow/hide protein sequence
MSLSLSKSLTLTLNLLLPRSSSRLQSLFLRNFSSSSDYDLTDLSESSSPSSDPLLRNLEDAIQRILVRRSAPDWLPFVPGASYWVPLPSHLPPIANLLHTFANPLSPEQS
LSTTTVRGWPASHYFIQGAHLPSLESEVETTSAECDASQPFDHEEG