; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000752 (gene) of Snake gourd v1 genome

Gene IDTan0000752
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG02:27883229..27888051
RNA-Seq ExpressionTan0000752
SyntenyTan0000752
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011540.1 hypothetical protein SDJN02_26446, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-4859.39Show/hide
Query:  MSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS
        MS  SG+FWS TI LRFRT LLQL +K + S  +GG RR   S+PPQ SSVA++PY+           FRSFG SS      L+S+ PR           
Subjt:  MSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS

Query:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT
                   AKWIFGSLLSLL+P    SWNK Q  EDEAE +IEEAE VAEVVEKVAELTEKVS+EI EK+ E+SK+KEAA +VE YSKEIAH A L 
Subjt:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT

Query:  QDILHKVEEWKQKLDKSETAVNEQIKKKE
        Q ILHKVEEWKQKLDKSE  +NEQIKKKE
Subjt:  QDILHKVEEWKQKLDKSETAVNEQIKKKE

XP_011658811.1 uncharacterized protein LOC105436091 [Cucumis sativus]1.0e-4859.28Show/hide
Query:  LLQLFHKTDASGGSGGGRRPSQPPQV------SSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSS-LKSYFPR-------VKLSQEKDQPSTTPFFFFF
        LLQLF KTD S  +   RR S+P Q       SSVAM+P+                   STT MSS L+SY  R        + S+++D+PS + FFFF 
Subjt:  LLQLFHKTDASGGSGGGRRPSQPPQV------SSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSS-LKSYFPR-------VKLSQEKDQPSTTPFFFFF

Query:  -PGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLTQDILHKV
         P WAKWIFG+LLSLL+P    +WNKLQ +EDEAEMVIEEAE VAEVVEKVAELTEKVS++I EKLPEKSKLKEAA +VE+YSKEIAHDAHLTQDILHKV
Subjt:  -PGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLTQDILHKV

Query:  EEWKQKLDKSETAVNEQIKKK
        EEWK K+DKS+   NE  K++
Subjt:  EEWKQKLDKSETAVNEQIKKK

XP_022141966.1 uncharacterized protein LOC111012212 [Momordica charantia]2.3e-7773.28Show/hide
Query:  MSFKSGNFWSSTIMLRFRTLLQLFHKTD-ASGGSGGGRRPSQPPQVSSVAMIPYT-CYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPST
        MS KSG+FWSST++LR R+LLQLFHKT+   GG GGGRR S PPQ SSVAM+PYT CYN Q  I  MFR FGS  T+           +KL++EKDQ ST
Subjt:  MSFKSGNFWSSTIMLRFRTLLQLFHKTD-ASGGSGGGRRPSQPPQVSSVAMIPYT-CYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPST

Query:  TPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLTQ
           +FFFPGW KWIFGSLLSLLIP+WKQS NKLQTLE EAEMVIEEAE+VAEVVEK AE+ EK S+EIA+KLPEKSKLKEAA +VE YSK+IAHDAHLTQ
Subjt:  TPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLTQ

Query:  DILHKVEEWKQKLDKSETAVNEQIKKKEGIAN
        DILHKVEEWKQKLDKSETA+NEQI+KKEG AN
Subjt:  DILHKVEEWKQKLDKSETAVNEQIKKKEGIAN

XP_022972002.1 uncharacterized protein LOC111470651 [Cucurbita maxima]1.6e-4959.91Show/hide
Query:  MSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS
        MS  SG+FWS TI LRFR+ LLQL HK + S  +GG RR   S+PPQ SSVA++PY+           FRSFG SS      L+SY PR           
Subjt:  MSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS

Query:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT
                   AKWIFGSLLSL +P    SWNK Q LEDEAE  IEEAE VAEVVEKVAELTEKVS+EI EKLPEKS++K+AA  VE YSKEIAHDA L 
Subjt:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT

Query:  QDILHKVEEWKQKLDKSETAVNEQIKK
        Q ILHKVEEWKQKLDKSE  +NEQ+KK
Subjt:  QDILHKVEEWKQKLDKSETAVNEQIKK

XP_038888803.1 uncharacterized protein LOC120078589 [Benincasa hispida]2.5e-6366.23Show/hide
Query:  MSFKSGNFWSSTIMLRFRTLL-QLFHKTDASGGSGG--GRRPSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS
        MS K G+FW        RTLL QLFHK D S   GG   RR S+PPQ SS+ M+ +               F  S+T   S L+ Y  R++ SQEKDQPS
Subjt:  MSFKSGNFWSSTIMLRFRTLL-QLFHKTDASGGSGG--GRRPSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS

Query:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT
        T+  FFFFPGWAKW+FGSLLSLL+P    SWN+L+TLEDEAEMVIEEAE VA+VVE+VAELTEKVS+EIAEKLPEKSKLKEAA++VENYSKE+AHDAHLT
Subjt:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT

Query:  QDILHKVEEWKQKLDKSETAVNEQIKKK
        QDILHKVEEWKQKLD SET VNEQIKKK
Subjt:  QDILHKVEEWKQKLDKSETAVNEQIKKK

TrEMBL top hitse value%identityAlignment
A0A0A0K622 Uncharacterized protein1.7e-4956.85Show/hide
Query:  DMSFKSGNFWSSTIMLRFRTLLQLFHKTDASGGSGGGRRPSQPPQV------SSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSS-LKSYFPR------
        DMS K G+ W +        LLQLF KTD S  +   RR S+P Q       SSVAM+P+                   STT MSS L+SY  R      
Subjt:  DMSFKSGNFWSSTIMLRFRTLLQLFHKTDASGGSGGGRRPSQPPQV------SSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSS-LKSYFPR------

Query:  -VKLSQEKDQPSTTPFFFFF-PGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVE
          + S+++D+PS + FFFF  P WAKWIFG+LLSLL+P    +WNKLQ +EDEAEMVIEEAE VAEVVEKVAELTEKVS++I EKLPEKSKLKEAA +VE
Subjt:  -VKLSQEKDQPSTTPFFFFF-PGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVE

Query:  NYSKEIAHDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKK
        +YSKEIAHDAHLTQDILHKVEEWK K+DKS+   NE  K++
Subjt:  NYSKEIAHDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKK

A0A6J1CKS7 uncharacterized protein LOC1110122121.1e-7773.28Show/hide
Query:  MSFKSGNFWSSTIMLRFRTLLQLFHKTD-ASGGSGGGRRPSQPPQVSSVAMIPYT-CYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPST
        MS KSG+FWSST++LR R+LLQLFHKT+   GG GGGRR S PPQ SSVAM+PYT CYN Q  I  MFR FGS  T+           +KL++EKDQ ST
Subjt:  MSFKSGNFWSSTIMLRFRTLLQLFHKTD-ASGGSGGGRRPSQPPQVSSVAMIPYT-CYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPST

Query:  TPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLTQ
           +FFFPGW KWIFGSLLSLLIP+WKQS NKLQTLE EAEMVIEEAE+VAEVVEK AE+ EK S+EIA+KLPEKSKLKEAA +VE YSK+IAHDAHLTQ
Subjt:  TPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLTQ

Query:  DILHKVEEWKQKLDKSETAVNEQIKKKEGIAN
        DILHKVEEWKQKLDKSETA+NEQI+KKEG AN
Subjt:  DILHKVEEWKQKLDKSETAVNEQIKKKEGIAN

A0A6J1GLX4 uncharacterized protein LOC111455450 isoform X16.5e-4957.87Show/hide
Query:  RGRPIDMSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQ
        R  P  MS  SG+FWS TI LRFRT LLQL +K + S  +GG RR   S+PPQ SSVA++PY+           FRSFG SS      L+S+ PR     
Subjt:  RGRPIDMSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQ

Query:  EKDQPSTTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIA
                         AKWIFGSLLSLL+P    SWNK Q  EDEAE +IEEAE VAEVVEKVAELTEKVS+EI EK+ E+SK+KEAA +VE YSKEIA
Subjt:  EKDQPSTTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIA

Query:  HDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKKE
        H A L Q ILHKVEEWKQKLDKS+  +NEQ+KKKE
Subjt:  HDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKKE

A0A6J1GN56 uncharacterized protein LOC111455450 isoform X26.5e-4957.87Show/hide
Query:  RGRPIDMSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQ
        R  P  MS  SG+FWS TI LRFRT LLQL +K + S  +GG RR   S+PPQ SSVA++PY+           FRSFG SS      L+S+ PR     
Subjt:  RGRPIDMSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQ

Query:  EKDQPSTTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIA
                         AKWIFGSLLSLL+P    SWNK Q  EDEAE +IEEAE VAEVVEKVAELTEKVS+EI EK+ E+SK+KEAA +VE YSKEIA
Subjt:  EKDQPSTTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIA

Query:  HDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKKE
        H A L Q ILHKVEEWKQKLDKS+  +NEQ+KKKE
Subjt:  HDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKKE

A0A6J1I3G6 uncharacterized protein LOC1114706517.6e-5059.91Show/hide
Query:  MSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS
        MS  SG+FWS TI LRFR+ LLQL HK + S  +GG RR   S+PPQ SSVA++PY+           FRSFG SS      L+SY PR           
Subjt:  MSFKSGNFWSSTIMLRFRT-LLQLFHKTDASGGSGGGRR--PSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS

Query:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT
                   AKWIFGSLLSL +P    SWNK Q LEDEAE  IEEAE VAEVVEKVAELTEKVS+EI EKLPEKS++K+AA  VE YSKEIAHDA L 
Subjt:  TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLT

Query:  QDILHKVEEWKQKLDKSETAVNEQIKK
        Q ILHKVEEWKQKLDKSE  +NEQ+KK
Subjt:  QDILHKVEEWKQKLDKSETAVNEQIKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14095.1 unknown protein9.0e-2747.22Show/hide
Query:  YFPRVKLSQEKDQPSTTPFFFFFPGWAKWIFGSLLSLLIPSW-KQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAAR
        YF  V  +  K QPS    +F FP W +W+ GS +SL++  W  +   KL+ +E EAE+V+E  E VAE+VEKVA  T++++ E+AEKLPEK+KLK+ A 
Subjt:  YFPRVKLSQEKDQPSTTPFFFFFPGWAKWIFGSLLSLLIPSW-KQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAAR

Query:  LVENYSKEIAHDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKK
        ++E+ S+  AH+AHLTQD LHKVE+  Q +D  E  +   I KK
Subjt:  LVENYSKEIAHDAHLTQDILHKVEEWKQKLDKSETAVNEQIKKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCTTTGGAGATCCTGAGAGGCCGTCCAATCGACATGTCGTTCAAATCTGGTAATTTCTGGAGTTCTACAATAATGCTCAGATTCCGAACCTTGCTTCAACTTTT
TCATAAGACAGACGCCTCCGGCGGCAGCGGCGGCGGTCGTCGGCCATCTCAGCCGCCGCAAGTCTCCTCCGTTGCAATGATTCCGTACACGTGCTATAATAAGCAATATT
CGATATGTCAAATGTTTCGGTCATTTGGTTCCAGTTCCACTACTGATATGTCGTCCCTCAAATCTTATTTTCCCAGGGTGAAACTTAGCCAGGAAAAGGATCAGCCATCA
ACAACTCCATTTTTCTTCTTCTTCCCTGGTTGGGCAAAATGGATTTTTGGATCCCTGTTGTCCCTCTTGATACCCAGTTGGAAGCAAAGTTGGAATAAATTGCAAACTCT
TGAAGACGAAGCTGAAATGGTAATTGAAGAGGCAGAAACTGTAGCAGAAGTAGTAGAAAAGGTAGCAGAATTAACAGAGAAGGTATCATCAGAAATTGCTGAGAAACTTC
CTGAGAAAAGTAAGCTCAAAGAAGCTGCTCGACTTGTTGAAAACTATTCCAAGGAAATTGCCCATGATGCTCACCTAACACAAGATATTCTACACAAGGTGGAAGAGTGG
AAGCAAAAACTAGATAAGTCAGAGACAGCTGTTAATGAACAGATTAAGAAGAAAGAGGGCATAGCAAACAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCTTTGGAGATCCTGAGAGGCCGTCCAATCGACATGTCGTTCAAATCTGGTAATTTCTGGAGTTCTACAATAATGCTCAGATTCCGAACCTTGCTTCAACTTTT
TCATAAGACAGACGCCTCCGGCGGCAGCGGCGGCGGTCGTCGGCCATCTCAGCCGCCGCAAGTCTCCTCCGTTGCAATGATTCCGTACACGTGCTATAATAAGCAATATT
CGATATGTCAAATGTTTCGGTCATTTGGTTCCAGTTCCACTACTGATATGTCGTCCCTCAAATCTTATTTTCCCAGGGTGAAACTTAGCCAGGAAAAGGATCAGCCATCA
ACAACTCCATTTTTCTTCTTCTTCCCTGGTTGGGCAAAATGGATTTTTGGATCCCTGTTGTCCCTCTTGATACCCAGTTGGAAGCAAAGTTGGAATAAATTGCAAACTCT
TGAAGACGAAGCTGAAATGGTAATTGAAGAGGCAGAAACTGTAGCAGAAGTAGTAGAAAAGGTAGCAGAATTAACAGAGAAGGTATCATCAGAAATTGCTGAGAAACTTC
CTGAGAAAAGTAAGCTCAAAGAAGCTGCTCGACTTGTTGAAAACTATTCCAAGGAAATTGCCCATGATGCTCACCTAACACAAGATATTCTACACAAGGTGGAAGAGTGG
AAGCAAAAACTAGATAAGTCAGAGACAGCTGTTAATGAACAGATTAAGAAGAAAGAGGGCATAGCAAACAACTGAAAATTTATTTTAAGGGTCAGCATGTAAAAAAAGAT
CATATGTAGAAACAAATTTTATTTTTATTTTAAAAAATAAAATTTATTTAAGTTAAATTATAA
Protein sequenceShow/hide protein sequence
MSSLEILRGRPIDMSFKSGNFWSSTIMLRFRTLLQLFHKTDASGGSGGGRRPSQPPQVSSVAMIPYTCYNKQYSICQMFRSFGSSSTTDMSSLKSYFPRVKLSQEKDQPS
TTPFFFFFPGWAKWIFGSLLSLLIPSWKQSWNKLQTLEDEAEMVIEEAETVAEVVEKVAELTEKVSSEIAEKLPEKSKLKEAARLVENYSKEIAHDAHLTQDILHKVEEW
KQKLDKSETAVNEQIKKKEGIANN