CuGenDBv2

Tan0015920 (gene) of Snake gourd v1 genome

Gene ID: Tan0015920
Organism: Trichosanthes anguina (Snake gourd v1)
Description: keratin, type II cytoskeletal 3 isoform X2
Genome location: LG10:56877051..56889407
RNA-Seq Expression: Tan0015920
Synteny: Tan0015920
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (E-value, % identity, alignment)
XP_022932819.1 keratin, type II cytoskeletal 2 epidermal isoform X1 [Cucurbita moschata] (E-value: 1.4e-61; identity: 85.42%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNGSR E GED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR
        GDLL+G++SIFP +DEI ALVDEL LNT+KLIR T++EI+KWKR
Subjt:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR

XP_022932820.1 keratin, type II cytoskeletal 2 epidermal isoform X2 [Cucurbita moschata] (E-value: 1.9e-63; identity: 86.71%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNGSR E GED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR
        GDLL+G++SIFPQDEI ALVDEL LNT+KLIR T++EI+KWKR
Subjt:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR

XP_022974026.1 keratin, type II cytoskeletal 3 isoform X2 [Cucurbita maxima] (E-value: 7.3e-63; identity: 85.31%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNGSR+E GED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR
        GDLL+G++SIFPQDEI A+VDEL LNT+KLI+ T++EI+KWKR
Subjt:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR

XP_023521209.1 fibroin heavy chain isoform X2 [Cucurbita pepo subsp. pepo] (E-value: 7.3e-63; identity: 86.71%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNGSR E  ED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR
        GDLL+G +SIFPQDEI ALVDEL LNT+KLIR T+REI+KWKR
Subjt:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR

XP_038889093.1 ctenidin-1 isoform X1 [Benincasa hispida] (E-value: 1.4e-61; identity: 86.11%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRN NG+R + GED+ GLLW LPVLKSS  GKLGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFPQ-DEISALVDELVLNTRKLIRATSREIEKWKR
        GDLLHG +SIFPQ D+I ALVDELVLN+++LIRATSREI+KWKR
Subjt:  GDLLHGQKSIFPQ-DEISALVDELVLNTRKLIRATSREIEKWKR

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0KSM8 Uncharacterized protein (E-value: 1.1e-61; identity: 84.72%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNG+R +DGED+ GLLW LPVLKSS  G LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR
        GD+L G++SIFP QD+I ALVD+LVLNT++LIRATS+EI+KWKR
Subjt:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR

A0A1S4E347 glycine-rich cell wall structural protein 1 (E-value: 1.1e-61; identity: 85.42%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNG+R  DGED+ GLLW LPVLKSS  G LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR
        GD+L G +SIFP QD+I ALVD+LVLNT++LIRATSREI+KWKR
Subjt:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR

A0A6J1EY31 keratin, type II cytoskeletal 2 epidermal isoform X1 (E-value: 6.7e-62; identity: 85.42%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNGSR E GED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR
        GDLL+G++SIFP +DEI ALVDEL LNT+KLIR T++EI+KWKR
Subjt:  GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR

A0A6J1F360 keratin, type II cytoskeletal 2 epidermal isoform X2 (E-value: 9.3e-64; identity: 86.71%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNGSR E GED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR
        GDLL+G++SIFPQDEI ALVDEL LNT+KLIR T++EI+KWKR
Subjt:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR

A0A6J1IA95 keratin, type II cytoskeletal 3 isoform X2 (E-value: 3.5e-63; identity: 85.31%)
Query:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV
        MPRNMNGSR+E GED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV
Subjt:  MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV

Query:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR
        GDLL+G++SIFPQDEI A+VDEL LNT+KLI+ T++EI+KWKR
Subjt:  GDLLHGQKSIFPQDEISALVDELVLNTRKLIRATSREIEKWKR

SwissProt top hits (E-value, % identity, alignment)
Q9JL61 DNA-binding protein Rfx5 (E-value: 8.0e-04; identity: 49.33%)
Query:  NGLLWKLPVLKSSTIGK---LGPAFGLGVGCGVGFGVGLVGGAGFGPGI-PGLQLGFGLGAGCGVGLGFGYGVGR
        N +L  +P L  +  G    LGP FG G G G G G GL  GAG GPG+ PGL  G G G G G+G G G G GR
Subjt:  NGLLWKLPVLKSSTIGK---LGPAFGLGVGCGVGFGVGLVGGAGFGPGI-PGLQLGFGLGAGCGVGLGFGYGVGR

Arabidopsis top hits (E-value, % identity, alignment)
AT1G66820.1 glycine-rich protein (E-value: 3.2e-08; identity: 47.44%)
Query:  LKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQ-----LGFGLGAGCGVGLGFGYGVGRGIAQDD-KRRY
        +K++T+   GP  G G+GCG G G+GL GG G G    GL      LGFG+G G G G G+G+GVG G + DD K R+
Subjt:  LKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQ-----LGFGLGAGCGVGLGFGYGVGRGIAQDD-KRRY

AT4G10330.1 glycine-rich protein (E-value: 4.9e-41; identity: 60.45%)
Query:  SRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLHGQ
        +R+  G+DD GLLWKLP ++   IGK+GPAFGLGVGCG GFG GL+GG GFGPG+PGLQ G G GAGCG+G+GFGYGVGRG A D  R Y NV     G+
Subjt:  SRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLHGQ

Query:  KSIFPQDEISALVDELVLNTRKLIRATSREIEKW
         S+   +E+ +L+DELV++T+KL++AT+ EI+KW
Subjt:  KSIFPQDEISALVDELVLNTRKLIRATSREIEKW

AT4G14301.1 unknown protein (E-value: 6.3e-04; identity: 55.17%)
Query:  IGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRG
        IGK G  FG G+G G GFG G+  G GFG GI G   G G G G G G GFG G+G+G
Subjt:  IGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRG
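
The % identity values listed for each hit can be cross-checked against the alignment midline (the unlabeled row between Query and Subjt). The sketch below assumes the usual BLAST convention that a letter in the midline marks an identical residue, while `+` and spaces mark conservative substitutions, mismatches, or gaps, and that identity is reported as identities divided by the aligned length (gaps included). The function name `percent_identity` is illustrative only; the rows are copied from the first GenBank hit (XP_022932819.1).

```python
def percent_identity(query_rows, midline_rows):
    """Return identities / aligned length as a percentage.

    Assumption: letters in the midline mark identical residues; '+' and
    spaces mark conservative substitutions, mismatches, or gaps.
    The aligned length is taken from the Query rows (gap characters included).
    """
    aligned_length = sum(len(row) for row in query_rows)
    identities = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identities / aligned_length, 2)


# Query and midline rows of the XP_022932819.1 alignment, as shown on this page.
query_rows = [
    "MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNV",
    "GDLLHGQKSIFP-QDEISALVDELVLNTRKLIRATSREIEKWKR",
]
midline_rows = [
    "MPRNMNGSR E GED++G+LWKLPVLKSS IG+LGPAFGLGVGCGVGFG+GLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNV",
    "GDLL+G++SIFP +DEI ALVDEL LNT+KLIR T++EI+KWKR",
]

print(percent_identity(query_rows, midline_rows))
```

Taking the length from the Query rows rather than the midline avoids undercounting when a midline row begins or ends with a mismatch and its surrounding whitespace has been trimmed.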


Sequences
CDS sequence
ATGCCTCGAAATATGAACGGAAGCAGACAGGAAGACGGAGAAGATGACAACGGATTACTGTGGAAGCTTCCAGTTCTGAAATCTTCCACAATCGGAAAGTTAGGTCCCGC
CTTCGGTCTCGGCGTCGGTTGTGGCGTCGGCTTTGGCGTCGGCCTCGTCGGAGGTGCTGGATTTGGTCCGGGAATTCCTGGTTTACAACTTGGCTTTGGTCTTGGTGCTG
GGTGTGGAGTTGGCTTGGGATTTGGCTATGGTGTTGGCAGGGGCATTGCCCAAGATGACAAACGGAGATACTCTAACGTTGGGGATCTATTACATGGCCAAAAAAGTATT
TTTCCTCAGGATGAGATTAGCGCGCTTGTTGACGAGCTTGTCCTCAATACAAGGAAGCTTATCCGAGCTACGTCAAGGGAGATTGAGAAGTGGAAAAGATGA
mRNA sequence
CACACGGGAGCTCCTCAGAAAACTCTCAACGAAGAATGTATAATGTTAGGGCTTTGTATTGTTGTTGATAAAGTTTAGATGTGGAATTAGTTTGGAAACTTCACATCTAT
TCTTCCGTTGCTTATTCGTCCCGAAACCCCAAGGAAGAGATGCCTCGAAATATGAACGGAAGCAGACAGGAAGACGGAGAAGATGACAACGGATTACTGTGGAAGCTTCC
AGTTCTGAAATCTTCCACAATCGGAAAGTTAGGTCCCGCCTTCGGTCTCGGCGTCGGTTGTGGCGTCGGCTTTGGCGTCGGCCTCGTCGGAGGTGCTGGATTTGGTCCGG
GAATTCCTGGTTTACAACTTGGCTTTGGTCTTGGTGCTGGGTGTGGAGTTGGCTTGGGATTTGGCTATGGTGTTGGCAGGGGCATTGCCCAAGATGACAAACGGAGATAC
TCTAACGTTGGGGATCTATTACATGGCCAAAAAAGTATTTTTCCTCAGGATGAGATTAGCGCGCTTGTTGACGAGCTTGTCCTCAATACAAGGAAGCTTATCCGAGCTAC
GTCAAGGGAGATTGAGAAGTGGAAAAGATGAGATGACTTTTATGTTTTTACATTGCACAAGTCCAACCCCCTCCCCTATCATTAGAAAGAAACCACAATCAAAATCATAG
AAGAAAGAAACATGCTGGCTGTCTATACGGTTTTATTTGTCTATGGCTTTGTATAGGCAAACTTCATGTTCAATTGTTCTGAGAGATGTATTTGGTTGCTCTAAATCATC
ATGTGCAGTGGGTTTTGAGTAGTTACAAAATGTCGAATACTTAAATTAGTTTAGGTGGTCTTTAAAAGGCTGATTGATAATGTTCATACTACAGAAATAGCCGTTTCATT
AGAGGTTTTATTATGTTTTAATTTAAGCCTTTTCTGGCTCGTGATAATGTTCAATTGTTACGTTGAGTTGCATCAAACTAAATATTCAAGCAATGGAGATAATTTGATTT
CCTTCACAAA
Protein sequence
MPRNMNGSRQEDGEDDNGLLWKLPVLKSSTIGKLGPAFGLGVGCGVGFGVGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLHGQKSI
FPQDEISALVDELVLNTRKLIRATSREIEKWKR
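
The protein sequence should correspond to a codon-by-codon translation of the CDS. As a minimal sketch of that relationship, the following translates the first 36 and last 18 nucleotides of the CDS above, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1; stop codons appear as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGCCTCGAAATATGAACGGAAGCAGACAGGAAGAC"  # first 36 nt of the CDS
cds_end = "GAGAAGTGGAAAAGATGA"                      # last 18 nt, including the stop codon

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full four-line CDS to `translate` works the same way, since whitespace is removed before translation; the trailing `*` corresponds to the TGA stop codon, which is not part of the listed protein.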