CuGenDBv2

Tan0003226 (gene) of Snake gourd v1 genome

Gene ID: Tan0003226
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein ALP1-like protein
Genome location: LG02:25747005..25747892
RNA-Seq Expression: Tan0003226
Synteny: Tan0003226
Gene Ontology terms: NA
InterPro domains: NA
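The genome location field uses a `sequence:start..end` notation. As a minimal sketch, the snippet below splits such a string and computes the span length, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG02:25747005..25747892")
print(seqid, span_length(start, end))  # LG02 888
```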


Homology
GenBank top hits (e value, % identity, alignment)
KAF3433792.1 hypothetical protein FNV43_RR24895 [Rhamnella rubrinervis]  e value: 3.2e-46  % identity: 52.91
Query:  MEDLDGEVYTKVLKLLHGDV--------------------AWRKLFLLMPDARKKD---------FIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCE
        M+++DG+ Y+K +KLLH DV                     +  L+L+  +  + D          +  HDYV+ELLNG+ +RCYDCFRM K  FI+FCE
Subjt:  MEDLDGEVYTKVLKLLHGDV--------------------AWRKLFLLMPDARKKD---------FIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCE

Query:  DLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK
        +LK+KTNLK SR+++VQE+VAIFLL + HNERNR+ AERFQHSG+TISR FN VLKKVC LGVE+IC  N D V  EI   PKYYPFFK
Subjt:  DLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]  e value: 4.0e-41  % identity: 44.5
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR-------------------------------------------------KKDFIHSHDYVIELLNGN
        M+DLD + Y KVL+ LHGDV WR++F+ MP+ R                                                 +   +  HDYV+E+LNG+
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR-------------------------------------------------KKDFIHSHDYVIELLNGN

Query:  NTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIIS
          RC+  FRMK   FI+FCE LK   NLK SRYL++QE+V IFLL + HNERNR+  ERFQHSG TIS  F++VLK VCKLGV II PP+ D++P +I  
Subjt:  NTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIIS

Query:  KPKYYPFFK
          KY+PFFK
Subjt:  KPKYYPFFK

KAF7148819.1 hypothetical protein RHSIM_Rhsim03G0151700 [Rhododendron simsii]  e value: 5.3e-41  % identity: 43.81
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR--------------------------------------------------KKDFIHSHDYVIELLNG
        M+DLD + Y KVL+ LHGD+ WR++F+ MP+ R                                                  +   +  HDYV+E+LNG
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDAR--------------------------------------------------KKDFIHSHDYVIELLNG

Query:  NNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEII
        +  RC+  FRMK   FI+FCE LK   NLK SRYL++QE+V IFLL + HNERNR+  ERFQHSG TIS  F++VLK VCKLGV II PP+ D++P +I 
Subjt:  NNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEII

Query:  SKPKYYPFFK
           KY+PFFK
Subjt:  SKPKYYPFFK

KAG8371481.1 hypothetical protein BUALT_Bualt13G0092200 [Buddleja alternifolia]  e value: 2.1e-37  % identity: 52.5
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH
        +E ++ +V    L L   +  + +L+LL    R    +  H YV+EL+N N TRCYD FRMK   FI+F   L     LK SRYLS  E+VAIFL I++H
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH

Query:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK
           +R+AAE+FQHSG+TIS+VF++VLK +CKLGVEII PPN D VP EI+  PKYYPFFK
Subjt:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK

KAG8372299.1 hypothetical protein BUALT_Bualt12G0051800 [Buddleja alternifolia]  e value: 1.6e-37  % identity: 53.12
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH
        +E ++ +V    L L   +  + +L+LL    R    +  H YV+EL+N N TRCYD FRMK   FI+F   L  K  LK SRYLS  E+VAIFL I++H
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSH

Query:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK
           +R+AAE+FQHSG+TIS+VF++VLK +CKLGVEII PPN D VP EI+  PKYYPFFK
Subjt:  NERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCZ7 Uncharacterized protein  e value: 1.5e-38  % identity: 50.84
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNL---------------------
        M+D+DGE Y K+LKL HG                      HDYVIELLNGN++RC+DCFR ++ TF+ FCEDLK+KTNL                     
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNL---------------------

Query:  ----------------KASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNL
                        KASRYL+VQEKVAIFLLI+SHNE NRI  ERFQHSG TIS  FN+VL+KVCKLG+EII PPN+
Subjt:  ----------------KASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNL

A0A1S3E695 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like  e value: 1.1e-25  % identity: 48.78
Query:  IHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII
        +   ++V E+LNG+ T C+D FRMKK  F++FC +L+ K  L  SR + V+EKVA FL I+ HN R+R+A+ RFQHS +TISR F +VL+ VC+LG E+I
Subjt:  IHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII

Query:  CPPNLDNVPLEIISKPKYYPFFK
           +++ +P  I +  KYYP+FK
Subjt:  CPPNLDNVPLEIISKPKYYPFFK

A0A2N9GND4 Uncharacterized protein  e value: 3.8e-29  % identity: 50.39
Query:  RKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKL
        +++D    H+Y++E+LNG ++ CY+ FRM+K  FIS C+ LK+   L+ SR++SVQEKVAIF+L + H+ RNR+  +RFQHSG+TISR FN VL  + KL
Subjt:  RKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKL

Query:  GVEIICPPN-LDNVPLEIISKPKYYPFFK
          ++I P   L  +P+ I  KPKYYP+FK
Subjt:  GVEIICPPN-LDNVPLEIISKPKYYPFFK

A0A3Q7Y331 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like  e value: 1.1e-25  % identity: 48.78
Query:  IHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII
        +   ++V E+LNG+ T C+D FRMKK  F++FC +L+ K  L  SR + V+EKVA FL I+ HN R+R+A+ RFQHS +TISR F +VL+ VC+LG E+I
Subjt:  IHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEII

Query:  CPPNLDNVPLEIISKPKYYPFFK
           +++ +P  I +  KYYP+FK
Subjt:  CPPNLDNVPLEIISKPKYYPFFK

A0A6A4KNZ4 Myb_DNA-bind_3 domain-containing protein (Fragment)  e value: 1.9e-28  % identity: 48.97
Query:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKK----------------DFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRY
        MEDL  + Y  VL+ LHGDV WR++FL MP+ ++                 +F+  HDYV+E+LNG+  RC+  FRMK   FI+FCE LK   NLK SRY
Subjt:  MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKK----------------DFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRY

Query:  LSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKV
        L++QE+V IFLL + HNER+R+  E FQHSG TIS  F  V+  V
Subjt:  LSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G43722.1 unknown protein  e value: 2.8e-08  % identity: 30.56
Query:  CYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPP---NLDNVPLEIIS
        C    RM    F + C  L+   +L+ +  +S++E VA+FL I  HNE  R    RF  + +T+ R F +VL     L  + I  P    L  +P  +  
Subjt:  CYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPP---NLDNVPLEIIS

Query:  KPKYYPFF
          +Y+P+F
Subjt:  KPKYYPFF

AT5G28730.1 unknown protein  e value: 3.2e-12  % identity: 36.04
Query:  NNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDN---VPL
        N   C    RM    F   CE L  K  L++S  +S+ E VAIFL+I + N+  R  A RF H+ +TI R F+ VLK + +L VE I P  ++    +  
Subjt:  NNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDN---VPL

Query:  EIISKPKYYPF
         +    +Y+PF
Subjt:  EIISKPKYYPF

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)  e value: 1.8e-15  % identity: 34.71
Query:  LMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLK
        L  +  K      + +V ++LNG N +C++ FRM K  F   C+ L+ +  L+ +  + ++ ++AIFL I+ HN R R   E F +SG+TISR FN VL 
Subjt:  LMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAERFQHSGQTISRVFNQVLK

Query:  KVCKLGVEIICPPNLDNVPLE
         V  +  +    PN ++  LE
Subjt:  KVCKLGVEIICPPNLDNVPLE


Sequences
CDS sequence
ATGGAAGATCTTGATGGCGAGGTCTATACTAAAGTACTCAAACTTTTGCATGGAGATGTTGCCTGGAGAAAGCTATTTTTGCTCATGCCTGATGCTAGGAAGAAAGATTT
CATTCATAGTCATGATTATGTGATTGAGTTGTTAAATGGCAACAACACGAGATGTTATGATTGCTTTAGGATGAAAAAAGGTACATTCATATCTTTTTGTGAAGATTTAA
AAGCGAAGACAAACCTGAAAGCATCTAGGTATCTTTCTGTTCAAGAGAAAGTTGCTATTTTTTTATTAATCGTATCACATAATGAGAGAAATCGTATAGCAGCAGAAAGG
TTTCAACATTCAGGTCAAACTATTTCTAGAGTTTTTAACCAAGTTTTGAAAAAGGTTTGCAAGCTTGGAGTAGAAATTATTTGTCCACCAAATCTAGACAATGTGCCACT
AGAGATCATATCCAAGCCTAAATATTATCCTTTCTTTAAGGTAATGCCTTGA
mRNA sequence
ATGGAAGATCTTGATGGCGAGGTCTATACTAAAGTACTCAAACTTTTGCATGGAGATGTTGCCTGGAGAAAGCTATTTTTGCTCATGCCTGATGCTAGGAAGAAAGATTT
CATTCATAGTCATGATTATGTGATTGAGTTGTTAAATGGCAACAACACGAGATGTTATGATTGCTTTAGGATGAAAAAAGGTACATTCATATCTTTTTGTGAAGATTTAA
AAGCGAAGACAAACCTGAAAGCATCTAGGTATCTTTCTGTTCAAGAGAAAGTTGCTATTTTTTTATTAATCGTATCACATAATGAGAGAAATCGTATAGCAGCAGAAAGG
TTTCAACATTCAGGTCAAACTATTTCTAGAGTTTTTAACCAAGTTTTGAAAAAGGTTTGCAAGCTTGGAGTAGAAATTATTTGTCCACCAAATCTAGACAATGTGCCACT
AGAGATCATATCCAAGCCTAAATATTATCCTTTCTTTAAGGTAATGCCTTGA
Protein sequence
MEDLDGEVYTKVLKLLHGDVAWRKLFLLMPDARKKDFIHSHDYVIELLNGNNTRCYDCFRMKKGTFISFCEDLKAKTNLKASRYLSVQEKVAIFLLIVSHNERNRIAAER
FQHSGQTISRVFNQVLKKVCKLGVEIICPPNLDNVPLEIISKPKYYPFFKVMP
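
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is one way to do that, assuming the standard nuclear genetic code applies to this gene (an assumption; the page does not say which table was used). `translate` is a hypothetical helper written for illustration, and the two calls use only the first and last few codons of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS (line breaks allowed) and drop one trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


print(translate("ATGGAAGATCTTGATGGC"))  # MEDLDG (start of the CDS)
print(translate("GTAATGCCTTGA"))        # VMP (end of the CDS, stop removed)
```

Passing the full CDS, pasted as a single string, and comparing the result with the protein sequence gives a quick consistency check on the record.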