; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022373 (gene) of Snake gourd v1 genome

Gene IDTan0022373
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionglycine-rich cell wall structural protein 1-like
Genome locationLG07:66803066..66804173
RNA-Seq ExpressionTan0022373
SyntenyTan0022373
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042689.1 uncharacterized protein E6C27_scaffold44G001870 [Cucumis melo var. makuwa]2.3e-0548.35Show/hide
Query:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEG
        M+  QC+KP + +  Q QQKH Q    HCF  HV+DKIKGVF  GHHH Q   A+    H      +AN  HCK +   KKKEH  K KEG
Subjt:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEG

XP_008437383.1 PREDICTED: uncharacterized protein LOC103482815 [Cucumis melo]4.2e-2056.8Show/hide
Query:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDA
        M+  QC+KP + +  Q QQKH Q    HCF  HV+DKIKGVF  GHHH Q   A+    H      +AN  HCK +   KKKEH  K KEGGLLHKIK+A
Subjt:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDA

Query:  FSDHSSDSSDSDSDKECHKAHHKKK
        FSDHSSDSSDS++  ECHK HH KK
Subjt:  FSDHSSDSSDSDSDKECHKAHHKKK

XP_022146004.1 uncharacterized protein LOC111015315 [Momordica charantia]1.2e-1957.38Show/hide
Query:  MSSLQCSKPGEQSQQQQKHVQHQQDHCFSHVTDKIKGVF-GH-HHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDAFSD
        M+SLQCSKP +Q   Q +  +H Q HCF HV+DKIKGVF GH HHGQ P AA  H      + NA+  H        K++  HKNK+G LLHKIKDAFSD
Subjt:  MSSLQCSKPGEQSQQQQKHVQHQQDHCFSHVTDKIKGVF-GH-HHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDAFSD

Query:  HSSDSSDSDSDKECHKAHHKKK
        HSSDSSDSD+  E HKAH K K
Subjt:  HSSDSSDSDSDKECHKAHHKKK

XP_022958513.1 glycine-rich cell wall structural protein 1-like [Cucurbita moschata]1.8e-0745.92Show/hide
Query:  GHHHGQPPVAAHGHEHGHGH------------SGNANAGHCKPA--------AGLKKKEHGHKNKEGGLLHKIKDAFSDHSSDSSDSDSDKECHKAHH
        GH HGQ     HG   GHGH              N N GHC+PA        AG  +K+  HKNKEGG L+KIKDAFSDH   S +SDSD +C +  H
Subjt:  GHHHGQPPVAAHGHEHGHGH------------SGNANAGHCKPA--------AGLKKKEHGHKNKEGGLLHKIKDAFSDHSSDSSDSDSDKECHKAHH

XP_038875375.1 uncharacterized protein LOC120067846 [Benincasa hispida]1.1e-2564.75Show/hide
Query:  MSSLQCSKPGEQSQQQQKHVQHQQDHCF-SHVTDKIKGVF-GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDAFSD
        M+SLQC+KP +    QQKH Q Q  HCF  HV+DKIKGVF GHHHGQ P+A+    H      +ANA HCKP    KKKEH HKNKEGGLLHKIKDAFSD
Subjt:  MSSLQCSKPGEQSQQQQKHVQHQQDHCF-SHVTDKIKGVF-GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDAFSD

Query:  HSSDSSDSDSDKECHKAHHKKK
        HSSDSSDS++  EC K HH KK
Subjt:  HSSDSSDSDSDKECHKAHHKKK

TrEMBL top hitse value%identityAlignment
A0A1S3AUH3 uncharacterized protein LOC1034828152.0e-2056.8Show/hide
Query:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDA
        M+  QC+KP + +  Q QQKH Q    HCF  HV+DKIKGVF  GHHH Q   A+    H      +AN  HCK +   KKKEH  K KEGGLLHKIK+A
Subjt:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDA

Query:  FSDHSSDSSDSDSDKECHKAHHKKK
        FSDHSSDSSDS++  ECHK HH KK
Subjt:  FSDHSSDSSDSDSDKECHKAHHKKK

A0A5A7TMV4 Uncharacterized protein1.1e-0548.35Show/hide
Query:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEG
        M+  QC+KP + +  Q QQKH Q    HCF  HV+DKIKGVF  GHHH Q   A+    H      +AN  HCK +   KKKEH  K KEG
Subjt:  MSSLQCSKPGEQS--QQQQKHVQHQQDHCF-SHVTDKIKGVF--GHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEG

A0A6J1CWW2 uncharacterized protein LOC1110153156.0e-2057.38Show/hide
Query:  MSSLQCSKPGEQSQQQQKHVQHQQDHCFSHVTDKIKGVF-GH-HHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDAFSD
        M+SLQCSKP +Q   Q +  +H Q HCF HV+DKIKGVF GH HHGQ P AA  H      + NA+  H        K++  HKNK+G LLHKIKDAFSD
Subjt:  MSSLQCSKPGEQSQQQQKHVQHQQDHCFSHVTDKIKGVF-GH-HHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDAFSD

Query:  HSSDSSDSDSDKECHKAHHKKK
        HSSDSSDSD+  E HKAH K K
Subjt:  HSSDSSDSDSDKECHKAHHKKK

A0A6J1H399 glycine-rich cell wall structural protein 1-like8.9e-0845.92Show/hide
Query:  GHHHGQPPVAAHGHEHGHGH------------SGNANAGHCKPA--------AGLKKKEHGHKNKEGGLLHKIKDAFSDHSSDSSDSDSDKECHKAHH
        GH HGQ     HG   GHGH              N N GHC+PA        AG  +K+  HKNKEGG L+KIKDAFSDH   S +SDSD +C +  H
Subjt:  GHHHGQPPVAAHGHEHGHGH------------SGNANAGHCKPA--------AGLKKKEHGHKNKEGGLLHKIKDAFSDHSSDSSDSDSDKECHKAHH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCTCTGCAATGCAGCAAACCAGGTGAGCAGAGCCAGCAGCAACAGAAGCACGTCCAACACCAACAAGACCATTGCTTCAGCCATGTCACCGACAAGATCAAAGG
CGTGTTTGGCCACCATCATGGACAGCCTCCGGTGGCGGCGCACGGACACGAACACGGACACGGCCACTCGGGCAATGCCAATGCAGGTCATTGCAAGCCTGCAGCAGGGT
TGAAGAAGAAGGAACATGGTCACAAAAACAAAGAAGGAGGTTTGTTGCACAAGATCAAGGATGCCTTTTCTGACCACAGCAGCGATAGCAGCGACAGTGACAGTGACAAA
GAGTGTCACAAAGCCCACCACAAAAAGAAGGCAAGTTTTCTATCTCTCCCTCTCAAGATCATTAGTGCAAACATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCTCTGCAATGCAGCAAACCAGGTGAGCAGAGCCAGCAGCAACAGAAGCACGTCCAACACCAACAAGACCATTGCTTCAGCCATGTCACCGACAAGATCAAAGG
CGTGTTTGGCCACCATCATGGACAGCCTCCGGTGGCGGCGCACGGACACGAACACGGACACGGCCACTCGGGCAATGCCAATGCAGGTCATTGCAAGCCTGCAGCAGGGT
TGAAGAAGAAGGAACATGGTCACAAAAACAAAGAAGGAGGTTTGTTGCACAAGATCAAGGATGCCTTTTCTGACCACAGCAGCGATAGCAGCGACAGTGACAGTGACAAA
GAGTGTCACAAAGCCCACCACAAAAAGAAGGCAAGTTTTCTATCTCTCCCTCTCAAGATCATTAGTGCAAACATTTAGTTTTTAGAGGGTTCTTACAACAAATGGTTCGA
TGTTCTCATTTTGATTTTGATTTGAGAATTAACTATAGTTTTTCAAATTTGTTAAACATATTGATATAATTAAATTTAGCGGTCTGATTGGTGATTTAAGATGGTATCAG
AGCAGGTGATCCAGAGAGGTCCTTCGTTCGAACCTCTCTGTAAAATCGTTTGCTCCTCAATTAATATTGATGTCCATTTCAAGTTTTTGATGTCGGGTTGAAATAGAATT
TACCAAGTGTATTTATTGGATTGTTTACTCACAAACCGTGAACATCAAGATGGGATTTTTTTTTTTTCTTTTTTTCTTTAGAATAACTAAAGGGGTGCAATAATATAAGA
ATTTTCAAAATGGATACTCTTGTATAGGTACGCTCAACTACAGAGTTTTGATACGATCTAGTACATTAGATCGTTAGTATGATTGCACCCAAGTTAGCGATCTTTTTGTT
AAAGATAATTTTAAGGGTAGTGGTCTAAGCTAGTGGTCAGAAGCTAATTATTTATTTGCCTATAAATACTCTTGTAATGTTTTCATTTTAATAAATAGGAAGATTTATCA
TTTCAAACGATTTGTATTTGTATTTTGATTTTGTATTTTCTCTTTGTTTTTGTTACATGTATGATGTATGACCTAATTCTTTTGATTGTATTTGCAGAACTTAAAGGGGA
AGAAATGT
Protein sequenceShow/hide protein sequence
MSSLQCSKPGEQSQQQQKHVQHQQDHCFSHVTDKIKGVFGHHHGQPPVAAHGHEHGHGHSGNANAGHCKPAAGLKKKEHGHKNKEGGLLHKIKDAFSDHSSDSSDSDSDK
ECHKAHHKKKASFLSLPLKIISANI