CuGenDBv2

Tan0006145 (gene) of Snake gourd v1 genome

Gene ID: Tan0006145
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Cupin type-1 domain-containing protein
Genome location: LG05:76515357..76516100
RNA-Seq Expression: Tan0006145
Synteny: Tan0006145
Gene Ontology terms: NA
InterPro domains: NA
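
The genome location string above can be split into a linkage group and a coordinate range. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as LG05:76515357..76516100."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse 'SEQID:START..END' into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG05:76515357..76516100")
print(loc.seqid, loc.length)  # LG05 744
```

Under that assumption the gene spans 744 bp on LG05.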


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6583945.1 hypothetical protein SDJN03_19877, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.3e-48; identity: 64.57%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENH--ETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRS
        ME QQVL++ DSLWFFATV SNR PPP       + PTQND  EPPG EIATPIFQN+QNV         ++ E GGGLG  +      E+ RKSR RR 
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENH--ETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRS

Query:  CCLNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR
         CL QRRKIVGEMDLS AVKEICECWL E+RIG G+ ++R      KKKMPPFEDSMAMKEHI+SWAYAVACTVR
Subjt:  CCLNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR

KAG7019562.1 hypothetical protein SDJN02_18523, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 1.5e-48; identity: 65.32%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCC
        ME QQVL++ DSLWFFATV SNR PPP     + PTQ+D VEPPG EIATPIFQN+QNV         ++ E GGGLG  +      E+ RKSR RR  C
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCC

Query:  LNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR
        L QRRKIVGEMDLS AVKEICECWL E+RIG G+ ++R      KKKMPPFEDSMAMKEHI+SWAYAVACTVR
Subjt:  LNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR

XP_022927166.1 uncharacterized protein LOC111434099 [Cucurbita moschata] (e-value: 2.7e-50; identity: 66.47%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCC
        ME QQVL++ DSLWFFATV SNR PPP V+    PTQ+D VEPPG EIATPIFQN+QNV         ++ E GGGLG  +      E+ RKSR RR  C
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCC

Query:  LNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR
        L QRRKIVGEMDLS AVKEICECWL E+RIG G+ +QR      KKKMPPFEDSMAMKEHI+SWAYAVACTVR
Subjt:  LNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR

XP_023520136.1 uncharacterized protein LOC111783440 [Cucurbita pepo subsp. pepo] (e-value: 4.3e-48; identity: 63.84%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENH----ETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRR
        ME QQVL++ DSLWFFATV SNR PPP         + PTQND VEPPG EIATPIFQN+QNV         ++ E GGGLG  +      E+ RKSR R
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENH----ETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRR

Query:  RSCCLNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR
        R  CL QRRKIVGEMDLS AVKEICECWL E+RIG G+ +QR      KKKMPPFEDSMAMKEHI+SWA+AVAC VR
Subjt:  RSCCLNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR

XP_038895160.1 uncharacterized protein LOC120083464 [Benincasa hispida] (e-value: 2.0e-53; identity: 72.35%)
Query:  MEGQQVLESFDSLWFFATVLSNRR----PPPAVENHETPTQNDAVEPPGEEIATPIFQNEQN----VESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCL
        MEGQQVLE++DSLWFFATV SNR     PPP VEN   PTQND VEPPGEEIATP+  NEQN     + IE+GGG G  +KTE+KE++ RK+R RR   L
Subjt:  MEGQQVLESFDSLWFFATVLSNRR----PPPAVENHETPTQNDAVEPPGEEIATPIFQNEQN----VESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCL

Query:  NQRRKIVGEMDLSYAVKEICECWLFEE-RIGNGHNFQRK---KKMPPFEDSMAMKEHIKSWAYAVACTVR
         QRRKIVGEMDLSYAVKEICECWLFEE RIG G+ +QRK   KKMPPFEDSMAMKEHI+SWAYAVACTVR
Subjt:  NQRRKIVGEMDLSYAVKEICECWLFEE-RIGNGHNFQRK---KKMPPFEDSMAMKEHIKSWAYAVACTVR
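
The % identity values in the hit list are derived from alignments like those above. As a rough illustration, the sketch below counts identities and positives from the match line (the unlabeled middle row) of a BLAST-style alignment, where a letter marks an identical residue and `+` marks a conservative substitution. The function name `alignment_stats` is hypothetical, and the input is only the first 27 columns of the first GenBank alignment, so the result is not expected to reproduce the full-alignment figure reported on the page.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    In BLAST pairwise output the match line shows a letter where query and
    subject agree, '+' for a positive-scoring substitution, and a space
    otherwise (mismatch or gap).
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must cover the same columns")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 27 columns of the alignment to KAG6583945.1 (illustrative fragment).
query_part = "MEGQQVLESFDSLWFFATVLSNRRPPP"
match_part = "ME QQVL++ DSLWFFATV SNR PPP"

ident, pos, cols = alignment_stats(query_part, match_part)
print(f"{ident}/{cols} identities ({100 * ident / cols:.1f}%), {pos}/{cols} positives")
```

BLAST reports identity over the whole alignment length, gap columns included, so applying this to every block of a hit and summing the counts would be the closer analogue of the tabulated value.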

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LX63 Uncharacterized protein (e-value: 3.8e-42; identity: 65.62%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNVESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCLNQRRK-IV
        MEGQQVLE+FDSLWFFATV SNR PP   +    P QND V   GEEIATPI +NE+N            EM+TE++EEK RK+R  RS  L QR+K IV
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNVESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCLNQRRK-IV

Query:  GEMDLSYAVKEICECWLFEE-RIGNGHNFQRKKKMPPFEDSMAMKEHIKSWAYAVACTVR
        GE+DLSYAVKEICECW FEE RIG G   ++ KKMP FEDSMAMKEHI+SWAYAVACTVR
Subjt:  GEMDLSYAVKEICECWLFEE-RIGNGHNFQRKKKMPPFEDSMAMKEHIKSWAYAVACTVR

A0A1S3B8X4 uncharacterized protein LOC103487056 (e-value: 3.1e-44; identity: 66.88%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNVESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCLNQRRK-IV
        MEGQQV E+FDSLWFF+TV SNR PP      E P QND VEP GEEIATPI +NE+N        G   EM+TE+ EEK RK+R RRS  L QRRK +V
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNVESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCLNQRRK-IV

Query:  GEMDLSYAVKEICECWLFE-ERIGNGHNFQRKKKMPPFEDSMAMKEHIKSWAYAVACTVR
        GE+DLSYAVKEICECW FE  RIG G   ++ KKMP FEDSMAMKEHI+SWAYAVACTVR
Subjt:  GEMDLSYAVKEICECWLFE-ERIGNGHNFQRKKKMPPFEDSMAMKEHIKSWAYAVACTVR

A0A5A7UQZ2 Uncharacterized protein (e-value: 3.1e-44; identity: 66.88%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNVESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCLNQRRK-IV
        MEGQQV E+FDSLWFF+TV SNR PP      E P QND VEP GEEIATPI +NE+N        G   EM+TE+ EEK RK+R RRS  L QRRK +V
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNVESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCLNQRRK-IV

Query:  GEMDLSYAVKEICECWLFE-ERIGNGHNFQRKKKMPPFEDSMAMKEHIKSWAYAVACTVR
        GE+DLSYAVKEICECW FE  RIG G   ++ KKMP FEDSMAMKEHI+SWAYAVACTVR
Subjt:  GEMDLSYAVKEICECWLFE-ERIGNGHNFQRKKKMPPFEDSMAMKEHIKSWAYAVACTVR

A0A6J1EGY0 uncharacterized protein LOC111434099 (e-value: 1.3e-50; identity: 66.47%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCC
        ME QQVL++ DSLWFFATV SNR PPP V+    PTQ+D VEPPG EIATPIFQN+QNV         ++ E GGGLG  +      E+ RKSR RR  C
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCC

Query:  LNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR
        L QRRKIVGEMDLS AVKEICECWL E+RIG G+ +QR      KKKMPPFEDSMAMKEHI+SWAYAVACTVR
Subjt:  LNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR

A0A6J1KF61 uncharacterized protein LOC111495257 (e-value: 5.1e-47; identity: 62.15%)
Query:  MEGQQVLESFDSLWFFATVLSNRRPP----PAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRR
        ME QQVL++ DSLWFFATV SN+ PP    P     + PT+ND VE PG EIATPIFQN+QNV         ++ E GGG GS +      E+ RKSR R
Subjt:  MEGQQVLESFDSLWFFATVLSNRRPP----PAVENHETPTQNDAVEPPGEEIATPIFQNEQNV---------ESIEDGGGLGSEMKTEDKEEKSRKSRRR

Query:  RSCCLNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR
        R  CL QRR+IVGEMDLS AVKEICECWL E+RIG G+ +QR      KKK+PPFEDSMAMKEHI+SWAYAVACTVR
Subjt:  RSCCLNQRRKIVGEMDLSYAVKEICECWLFEERIGNGHNFQR------KKKMPPFEDSMAMKEHIKSWAYAVACTVR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGAAGGCCAGCAGGTTCTAGAGAGTTTTGATTCACTCTGGTTCTTCGCCACCGTACTTTCCAACAGAAGGCCGCCGCCGGCCGTCGAAAACCACGAAACTCCCACTCA
AAACGACGCCGTTGAGCCTCCCGGAGAAGAAATCGCGACACCCATTTTTCAAAATGAGCAAAATGTTGAGTCAATTGAAGATGGCGGTGGGTTAGGCTCTGAAATGAAAA
CAGAGGATAAGGAAGAGAAGAGCAGAAAAAGCAGAAGAAGGAGATCTTGTTGTTTGAACCAAAGGAGAAAAATTGTGGGAGAGATGGATCTGAGTTATGCTGTGAAGGAG
ATTTGTGAATGTTGGTTGTTTGAAGAGAGAATTGGAAATGGGCATAATTTTCAGAGGAAGAAGAAGATGCCTCCGTTTGAAGATAGTATGGCGATGAAAGAACATATTAA
ATCGTGGGCTTATGCAGTGGCTTGTACTGTTAGATAA
mRNA sequence
AACTTGGTTATTAATTATTTTGACGCTCCATTGGGTTTTTGGGCGTCGAGTACAATGGAAGGCCAGCAGGTTCTAGAGAGTTTTGATTCACTCTGGTTCTTCGCCACCGT
ACTTTCCAACAGAAGGCCGCCGCCGGCCGTCGAAAACCACGAAACTCCCACTCAAAACGACGCCGTTGAGCCTCCCGGAGAAGAAATCGCGACACCCATTTTTCAAAATG
AGCAAAATGTTGAGTCAATTGAAGATGGCGGTGGGTTAGGCTCTGAAATGAAAACAGAGGATAAGGAAGAGAAGAGCAGAAAAAGCAGAAGAAGGAGATCTTGTTGTTTG
AACCAAAGGAGAAAAATTGTGGGAGAGATGGATCTGAGTTATGCTGTGAAGGAGATTTGTGAATGTTGGTTGTTTGAAGAGAGAATTGGAAATGGGCATAATTTTCAGAG
GAAGAAGAAGATGCCTCCGTTTGAAGATAGTATGGCGATGAAAGAACATATTAAATCGTGGGCTTATGCAGTGGCTTGTACTGTTAGATAAATCAAAATTAGATAGAGAT
TTCGATGACGATTTTGTTTATGAAGTTTTAATCTAAATTTTGTAGTTGGGGGGTTTGATGATTTGTAATTTGGTGAATAAGTTTTGTAAAGTTGGAAACTTTTTGGTTCC
ATCTCTATATTTTTTTTTTTGTTTTGGTAATTCTTTGCAGATGGATCTGATTTTTTGTCAGAAATGGTGGTAATAATTTCTTCA
Protein sequence
MEGQQVLESFDSLWFFATVLSNRRPPPAVENHETPTQNDAVEPPGEEIATPIFQNEQNVESIEDGGGLGSEMKTEDKEEKSRKSRRRRSCCLNQRRKIVGEMDLSYAVKE
ICECWLFEERIGNGHNFQRKKKMPPFEDSMAMKEHIKSWAYAVACTVR