CuGenDBv2

Tan0017006 (gene) of Snake gourd v1 genome

Gene ID: Tan0017006
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein ALP1-like protein
Genome location: LG10:60228942..60229825
RNA-Seq Expression: Tan0017006
Synteny: Tan0017006
Gene Ontology terms: NA
InterPro domains: NA
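The genome location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQ:START..END' string into (sequence name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seq_name, start, end = parse_location("LG10:60228942..60229825")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end - start + 1

print(seq_name, start, end, span_bp)
```

Under that assumption the gene spans 884 bp on LG10.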


Homology
GenBank top hits (e value, % identity, alignment)
KAF3433792.1 hypothetical protein FNV43_RR24895 [Rhamnella rubrinervis] (e value: 1.2e-45; identity: 52.38%)
Query:  MEDLDGEVYTKVLKLLHGDV--------------------AWRKLFLLMPDARKKD---------FIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCE
        M+++DG+ Y+K +KLLH DV                     +  L+L+  +  + D          +  H+YV+ELLNG+++RCYDCFRM K  FI+FCE
Subjt:  MEDLDGEVYTKVLKLLHGDV--------------------AWRKLFLLMPDARKKD---------FIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCE

Query:  DLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK
        +LK+KTNLK SR+++VQE+VAIFLL + HNERNR+ AERFQHSG+TISR FN VLKKVC LGVE+IC  N D V  EI   PKYYPFFK
Subjt:  DLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii] (e value: 1.5e-40; identity: 44.02%)
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR-------------------------------------------------KKDFIHSHNYVIELLNGN
        M+DLD + Y KVL+ LHGDV WR++F+ MP+ R                                                 +   +  H+YV+E+LNG+
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR-------------------------------------------------KKDFIHSHNYVIELLNGN

Query:  KTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIIS
        + RC+  FRMK   FI+FCE LK   NLK SRYL++QE+V IFLL + HNERNR+  ERFQHSG TIS  F++VLK VCKLGV II PP+ D++P +I  
Subjt:  KTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIIS

Query:  KPKYYPFFK
          KY+PFFK
Subjt:  KPKYYPFFK

KAF7148819.1 hypothetical protein RHSIM_Rhsim03G0151700 [Rhododendron simsii] (e value: 2.0e-40; identity: 43.33%)
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR--------------------------------------------------KKDFIHSHNYVIELLNG
        M+DLD + Y KVL+ LHGD+ WR++F+ MP+ R                                                  +   +  H+YV+E+LNG
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR--------------------------------------------------KKDFIHSHNYVIELLNG

Query:  NKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEII
        ++ RC+  FRMK   FI+FCE LK   NLK SRYL++QE+V IFLL + HNERNR+  ERFQHSG TIS  F++VLK VCKLGV II PP+ D++P +I 
Subjt:  NKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEII

Query:  SKPKYYPFFK
           KY+PFFK
Subjt:  SKPKYYPFFK

KAG8371481.1 hypothetical protein BUALT_Bualt13G0092200 [Buddleja alternifolia] (e value: 2.1e-37; identity: 52.5%)
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH
        +E ++ +V    L L   +  + +L+LL    R    +  H YV+EL+N N TRCYD FRMK   FI+F   L     LK SRYLS  E+VAIFL I++H
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH

Query:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK
           +R+AAE+FQHSG+TIS+VF++VLK +CKLGVEII PPN D VP EI+  PKYYPFFK
Subjt:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK

KAG8372299.1 hypothetical protein BUALT_Bualt12G0051800 [Buddleja alternifolia] (e value: 1.6e-37; identity: 53.12%)
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH
        +E ++ +V    L L   +  + +L+LL    R    +  H YV+EL+N N TRCYD FRMK   FI+F   L  K  LK SRYLS  E+VAIFL I++H
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH

Query:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK
           +R+AAE+FQHSG+TIS+VF++VLK +CKLGVEII PPN D VP EI+  PKYYPFFK
Subjt:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCZ7 Uncharacterized protein (e value: 1.0e-37; identity: 50.28%)
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNL---------------------
        M+D+DGE Y K+LKL HG                      H+YVIELLNGN +RC+DCFR ++ TF+ FCEDLK+KTNL                     
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNL---------------------

Query:  ----------------KASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNL
                        KASRYL+VQEKVAIFLLI+SHNE NRI  ERFQHSG TIS  FN+VL+KVCKLG+EII PPN+
Subjt:  ----------------KASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNL

A0A1S3E695 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like (e value: 2.0e-25; identity: 48.78%)
Query:  IHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII
        +    +V E+LNG++T C+D FRMKK  F++FC +L+ K  L  SR + V+EKVA FL I+ HN R+R+A+ RFQHS +TISR F +VL+ VC+LG E+I
Subjt:  IHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII

Query:  CPPNLDNVPLEIISKPKYYPFFK
           +++ +P  I +  KYYP+FK
Subjt:  CPPNLDNVPLEIISKPKYYPFFK

A0A2N9GND4 Uncharacterized protein (e value: 2.2e-29; identity: 51.16%)
Query:  RKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKL
        +++D    HNY++E+LNG  + CY+ FRM+K  FIS C+ LK+   L+ SR++SVQEKVAIF+L + H+ RNR+  +RFQHSG+TISR FN VL  + KL
Subjt:  RKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKL

Query:  GVEIICPPN-LDNVPLEIISKPKYYPFFK
          ++I P   L  +P+ I  KPKYYP+FK
Subjt:  GVEIICPPN-LDNVPLEIISKPKYYPFFK

A0A3Q7Y331 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like (e value: 2.0e-25; identity: 48.78%)
Query:  IHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII
        +    +V E+LNG++T C+D FRMKK  F++FC +L+ K  L  SR + V+EKVA FL I+ HN R+R+A+ RFQHS +TISR F +VL+ VC+LG E+I
Subjt:  IHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII

Query:  CPPNLDNVPLEIISKPKYYPFFK
           +++ +P  I +  KYYP+FK
Subjt:  CPPNLDNVPLEIISKPKYYPFFK

A0A6A4KNZ4 Myb_DNA-bind_3 domain-containing protein (Fragment) (e value: 7.2e-28; identity: 48.28%)
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKK----------------DFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRY
        MEDL  + Y  VL+ LHGDV WR++FL MP+ ++                 +F+  H+YV+E+LNG++ RC+  FRMK   FI+FCE LK   NLK SRY
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKK----------------DFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRY

Query:  LSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKV
        L++QE+V IFLL + HNER+R+  E FQHSG TIS  F  V+  V
Subjt:  LSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G43722.1 unknown protein (e value: 2.1e-08; identity: 29.57%)
Query:  LNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPP---NLDN
        L  +   C    RM    F + C  L+   +L+ +  +S++E VA+FL I  HNE  R    RF  + +T+ R F +VL     L  + I  P    L  
Subjt:  LNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPP---NLDN

Query:  VPLEIISKPKYYPFF
        +P  +    +Y+P+F
Subjt:  VPLEIISKPKYYPFF

AT5G28730.1 unknown protein (e value: 2.4e-12; identity: 36.04%)
Query:  NKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDN---VPL
        N+  C    RM    F   CE L  K  L++S  +S+ E VAIFL+I + N+  R  A RF H+ +TI R F+ VLK + +L VE I P  ++    +  
Subjt:  NKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDN---VPL

Query:  EIISKPKYYPF
         +    +Y+PF
Subjt:  EIISKPKYYPF

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e value: 9.0e-15; identity: 33.88%)
Query:  LMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLK
        L  +  K      + +V ++LNG   +C++ FRM K  F   C+ L+ +  L+ +  + ++ ++AIFL I+ HN R R   E F +SG+TISR FN VL 
Subjt:  LMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLK

Query:  KVCKLGVEIICPPNLDNVPLE
         V  +  +    PN ++  LE
Subjt:  KVCKLGVEIICPPNLDNVPLE


Sequences
CDS sequence
ATGGAAGATCTTGATGGCGAGGTCTATACTAAAGTACTCAAACTTTTGCATGGAGATGTTGCCTGGAGAAAGCTATTTTTGCTCATGCCTGATGCTAGGAAGAAAGATTT
CATTCATAGTCATAATTATGTGATTGAGTTGTTAAATGGCAACAAGACGAGATGTTATGATTGCTTTAGGATGAAAAAAGGTACATTCATATCTTTTTGTGAAGATTTAA
AAGCGAAGACAAACCTGAAAGCATCTAGGTATCTTTCTGTTCAAGAGAAAGTTGCTATTTTTTTATTAATCGTATCACATAATGAGAGAAATCGTATAGCAGCAGAAAGG
TTTCAACATTCAGGTCAAACTATTTCTAGAGTTTTTAACCAAGTTTTGAAAAAGGTTTGCAAGCTTGGAGTAGAAATTATTTGTCCACCAAATCTAGACAATGTGCCACT
AGAGATCATATCCAAGCCTAAATATTATCCTTTCTTTAAGGTAATGCCTTGA
mRNA sequence
ATGGAAGATCTTGATGGCGAGGTCTATACTAAAGTACTCAAACTTTTGCATGGAGATGTTGCCTGGAGAAAGCTATTTTTGCTCATGCCTGATGCTAGGAAGAAAGATTT
CATTCATAGTCATAATTATGTGATTGAGTTGTTAAATGGCAACAAGACGAGATGTTATGATTGCTTTAGGATGAAAAAAGGTACATTCATATCTTTTTGTGAAGATTTAA
AAGCGAAGACAAACCTGAAAGCATCTAGGTATCTTTCTGTTCAAGAGAAAGTTGCTATTTTTTTATTAATCGTATCACATAATGAGAGAAATCGTATAGCAGCAGAAAGG
TTTCAACATTCAGGTCAAACTATTTCTAGAGTTTTTAACCAAGTTTTGAAAAAGGTTTGCAAGCTTGGAGTAGAAATTATTTGTCCACCAAATCTAGACAATGTGCCACT
AGAGATCATATCCAAGCCTAAATATTATCCTTTCTTTAAGGTAATGCCTTGA
Protein sequence
MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHNYVIELLNGNKTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAER
FQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFKVMP
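
The relationship between the CDS and protein sequences can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and a CDS that starts in frame at its first base; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (frame 1) into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 18 and last 12 bases of the CDS listed above.
print(translate("ATGGAAGATCTTGATGGC"))  # MEDLDG
print(translate("GTAATGCCTTGA"))        # VMP
```

The two fragments translate to the first six and last three residues of the protein sequence shown above. Passing the full CDS (line breaks are ignored) allows the whole protein to be compared the same way.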