; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014264 (gene) of Snake gourd v1 genome

Gene IDTan0014264
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationLG05:7527842..7528810
RNA-Seq ExpressionTan0014264
SyntenyTan0014264
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3433792.1 hypothetical protein FNV43_RR24895 [Rhamnella rubrinervis]8.1e-4952.79Show/hide
Query:  ECFKVLNAIEDLDGESYTKILKLLHGDV--------------------AWRKLFLLMPDARKKDFIR----------HDYVIELLSGNETRCFDCFRMKK
        +CF++LN ++++DG+SY+K +KLLH DV                     +  L+L+  +  + D +R          HDYV+ELL+G+E+RC+DCFRM K
Subjt:  ECFKVLNAIEDLDGESYTKILKLLHGDV--------------------AWRKLFLLMPDARKKDFIR----------HDYVIELLSGNETRCFDCFRMKK

Query:  NTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK
        + FI FCE+LK+KTNLK SR++TVQE+VAIFLL I HNE+NR+ AERFQHSG+TISR FN VLKKV  LGVE+IC  + D V PEI   PKYYPFFK
Subjt:  NTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]2.5e-4245.16Show/hide
Query:  ECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIR--------------------------------------------------HDY
        EC  +LN ++DLD +SY K+L+ LHGDV WR++F+ MP+ R+  +I+                                                  HDY
Subjt:  ECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIR--------------------------------------------------HDY

Query:  VIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVD
        V+E+L+G+E RC   FRMK   FIAFCE LK   NLK SRYLT+QE+V IFLL I HNE+NR+  ERFQHSG TIS  F+ VLK V KLGV II PP  D
Subjt:  VIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVD

Query:  NVPPEITSKPKYYPFFK
        ++P +I    KY+PFFK
Subjt:  NVPPEITSKPKYYPFFK

KAF7148819.1 hypothetical protein RHSIM_Rhsim03G0151700 [Rhododendron simsii]3.3e-4244.5Show/hide
Query:  ECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIR---------------------------------------------------HD
        EC  +LN ++DLD +SY K+L+ LHGD+ WR++F+ MP+ R+  +I+                                                   HD
Subjt:  ECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIR---------------------------------------------------HD

Query:  YVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDV
        YV+E+L+G+E RC   FRMK   FIAFCE LK   NLK SRYLT+QE+V IFLL I HNE+NR+  ERFQHSG TIS  F+ VLK V KLGV II PP  
Subjt:  YVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDV

Query:  DNVPPEITSKPKYYPFFK
        D++P +I    KY+PFFK
Subjt:  DNVPPEITSKPKYYPFFK

KAG8372299.1 hypothetical protein BUALT_Bualt12G0051800 [Buddleja alternifolia]7.8e-3648.21Show/hide
Query:  MECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVA
        +  + ++  +E ++ E    +  +L     + +L+LL    R      H YV+EL++ N TRC+D FRMK + FI F   L  K  LK SRYL+  E+VA
Subjt:  MECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVA

Query:  IFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK
        IFL II+H   +R+AAE+FQHSG+TIS+VF+ VLK + KLGVEII PP+ D VPPEI   PKYYPFFK
Subjt:  IFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK

XP_030481634.1 protein ALP1-like [Cannabis sativa]5.4e-3745.2Show/hide
Query:  ECFKVLNAIEDLDGESYTKILKLLHGDVA----------WRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASR
        EC + LN I+ + GE Y K ++    +V+          + +L+L     R      H+YV+E+L G+E+RC+D FRM K+ FI FC  LK K  L+ SR
Subjt:  ECFKVLNAIEDLDGESYTKILKLLHGDVA----------WRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASR

Query:  YLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK
        YL+V+E+V++FL ++ HNE++R+ AERFQHS  TIS  F  VL+ V +L  E+I PP  D VPPEI   PKYYPF K
Subjt:  YLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK

TrEMBL top hitse value%identityAlignment
A0A0A0LCZ7 Uncharacterized protein1.3e-3651.12Show/hide
Query:  IEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNL----------------------
        ++D+DGE+Y KILKL HG                     HDYVIELL+GN++RCFDCFR ++ TF+ FCEDLK+KTNL                      
Subjt:  IEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNL----------------------

Query:  ---------------KASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDV
                       KASRYLTVQEKVAIFLLIISHNE NRI  ERFQHSG TIS  FN VL+KV KLG+EII PP++
Subjt:  ---------------KASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDV

A0A2N9GND4 Uncharacterized protein6.1e-2649.59Show/hide
Query:  HDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICP-
        H+Y++E+L+G ++ C++ FRM+KN FI+ C+ LK+   L+ SR+++VQEKVAIF+L I H+ +NR+  +RFQHSG+TISR FN VL  + KL  ++I P 
Subjt:  HDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICP-

Query:  PDVDNVPPEITSKPKYYPFFK
          +  +P  I  KPKYYP+FK
Subjt:  PDVDNVPPEITSKPKYYPFFK

A0A443P2U0 Protein ALP1-like protein6.1e-2642.21Show/hide
Query:  DGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNR
        D E++T  + L+ G     +L +     R K    +++V ++L G+  RC+D FRM+K+ F   C  LK++  L  S++L+V+E+VAIFL+ I H+ +NR
Subjt:  DGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNR

Query:  IAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFF
        + A+RFQHSG+TIS  F  VLK ++ LG E+I PP   + PPEI +  KYYP+F
Subjt:  IAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFF

A0A445H7N1 Uncharacterized protein3.3e-2445.52Show/hide
Query:  FLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVL
        F+     R     R  Y I+ L G+ETRC++ FRMKK+ F+ FCE LK   NL   + ++++E VA+FL+II HN ++R+ AERFQHS  T+S+ F ++L
Subjt:  FLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVL

Query:  KKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK
        K V KLG  II   +     P I   PKYYP+FK
Subjt:  KKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFK

A0A6A4KNZ4 Myb_DNA-bind_3 domain-containing protein (Fragment)5.1e-3350Show/hide
Query:  ECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKK----------------DFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKT
        EC  +LN +EDL  +SY  +L+ LHGDV WR++FL MP+ ++                 +F+ HDYV+E+L+G+E RC   FRMK   FIAFCE LK   
Subjt:  ECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKK----------------DFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKT

Query:  NLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKV
        NLK SRYLT+QE+V IFLL I HNE++R+  E FQHSG TIS  F +V+  V
Subjt:  NLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein9.6e-0827.83Show/hide
Query:  LSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPP---DVDN
        L  +   C    RM    F   C  L+   +L+ +  ++++E VA+FL I  HNE  R    RF  + +T+ R F  VL     L  + I  P   ++  
Subjt:  LSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPP---DVDN

Query:  VPPEITSKPKYYPFF
        +P  +    +Y+P+F
Subjt:  VPPEITSKPKYYPFF

AT5G28730.1 unknown protein1.3e-1233.33Show/hide
Query:  LLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLK
        LL  +A  ++ I H      +  NE  C    RM    F   CE L  K  L++S  +++ E VAIFL+I + N+  R  A RF H+ +TI R F+ VLK
Subjt:  LLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLK

Query:  KVLKLGVEIICPPDVDN---VPPEITSKPKYYPFFKVMHSIFFF
         + +L VE I P  V+    +   +    +Y+PF   +  I  F
Subjt:  KVLKLGVEIICPPDVDN---VPPEITSKPKYYPFFKVMHSIFFF

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.6e-1337.11Show/hide
Query:  YVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICP
        +V ++L+G   +CF+ FRM K  F   C+ L+ +  L+ +  + ++ ++AIFL II HN + R   E F +SG+TISR FN VL  V+ +  +   P
Subjt:  YVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNEKNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATGCTTCAAGGTTTTGAATGCAATAGAGGATCTTGATGGAGAGTCTTATACTAAAATACTCAAACTTTTGCATGGAGATGTTGCTTGGAGAAAGTTATTTTTGCT
TATGCCTGATGCGAGGAAGAAAGATTTCATACGTCATGATTATGTGATTGAGCTACTAAGTGGCAACGAGACAAGATGCTTTGATTGCTTTAGGATGAAAAAAAATACAT
TCATAGCTTTTTGTGAAGATTTAAAAGCGAAAACAAATTTGAAAGCTTCTAGGTATCTTACTGTTCAAGAGAAAGTTGCCATTTTTTTATTGATCATATCACATAATGAG
AAAAATCGTATAGCAGCAGAAAGGTTTCAACATTCGGGTCAAACTATTTCTCGAGTTTTTAACCTTGTTTTGAAAAAAGTTTTGAAGCTTGGAGTAGAAATCATTTGTCC
ACCCGATGTAGACAATGTACCACCAGAGATCACATCTAAACCTAAATATTACCCTTTCTTTAAGGTAATGCATTCAATTTTTTTTTTCACAAACATAACATTTTTAAAAT
ATATTTATAGTCATATATATCTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATGCTTCAAGGTTTTGAATGCAATAGAGGATCTTGATGGAGAGTCTTATACTAAAATACTCAAACTTTTGCATGGAGATGTTGCTTGGAGAAAGTTATTTTTGCT
TATGCCTGATGCGAGGAAGAAAGATTTCATACGTCATGATTATGTGATTGAGCTACTAAGTGGCAACGAGACAAGATGCTTTGATTGCTTTAGGATGAAAAAAAATACAT
TCATAGCTTTTTGTGAAGATTTAAAAGCGAAAACAAATTTGAAAGCTTCTAGGTATCTTACTGTTCAAGAGAAAGTTGCCATTTTTTTATTGATCATATCACATAATGAG
AAAAATCGTATAGCAGCAGAAAGGTTTCAACATTCGGGTCAAACTATTTCTCGAGTTTTTAACCTTGTTTTGAAAAAAGTTTTGAAGCTTGGAGTAGAAATCATTTGTCC
ACCCGATGTAGACAATGTACCACCAGAGATCACATCTAAACCTAAATATTACCCTTTCTTTAAGGTAATGCATTCAATTTTTTTTTTCACAAACATAACATTTTTAAAAT
ATATTTATAGTCATATATATCTTTTTTAG
Protein sequenceShow/hide protein sequence
MECFKVLNAIEDLDGESYTKILKLLHGDVAWRKLFLLMPDARKKDFIRHDYVIELLSGNETRCFDCFRMKKNTFIAFCEDLKAKTNLKASRYLTVQEKVAIFLLIISHNE
KNRIAAERFQHSGQTISRVFNLVLKKVLKLGVEIICPPDVDNVPPEITSKPKYYPFFKVMHSIFFFTNITFLKYIYSHIYLF