CuGenDBv2

Tan0003202 (gene) of Snake gourd v1 genome

Gene ID: Tan0003202
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG09:46674632..46683249
RNA-Seq Expression: Tan0003202
Synteny: Tan0003202
Gene Ontology terms: NA
InterPro domains: NA
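
The genome location above follows a sequence:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span could be parsed as follows. The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDBv2.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG09:46674632..46683249")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the span on LG09 works out to 8,618 bp.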


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa] (e value: 3.0e-24; % identity: 60.87)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFE
        CH +     ++  GL  T+ VD+EEMV +FLHILAHDVKNRVI R+F RSGE +SRHFN+VL  V++LHD LLKKP+P+ N CTD+RW+ FE
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFE

TYK08389.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] (e value: 3.2e-26; % identity: 54.39)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY
        CH +     +++ GLV T+ +D+EEMV +FLHILAHDVKNR+I ++FARSGE VSRHFN+VL  +L+LHD LLKKP+P+TN+CTD RWK FE    N   
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY

Query:  AVDATRFVLDINEV
        A+D T   ++++ +
Subjt:  AVDATRFVLDINEV

XP_008441953.1 PREDICTED: uncharacterized protein LOC103485952 [Cucumis melo] (e value: 3.2e-26; % identity: 54.39)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY
        CH +     +++ GLV T+ +D+EEMV +FLHILAHDVKNR+I ++FARSGE VSRHFN+VL  +L+LHD LLKKP+P+TN+CTD RWK FE    N   
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY

Query:  AVDATRFVLDINEV
        A+D T   ++++ +
Subjt:  AVDATRFVLDINEV

XP_008455792.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo] (e value: 4.7e-25; % identity: 63.04)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFE
        CH +     ++T GLV T+ +D+EEMV +FLHILAHDVKNR+I R+F RSGE VSRHFN+VL    +LHD LLKKP+P+TN+CTD RWK FE
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFE

XP_030483302.1 protein ALP1-like [Cannabis sativa] (e value: 1.4e-26; % identity: 58.56)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY
        CH +     K+TGGL  ++NVD+EEMV IFLHI+AHDVKNR++ RQFARSGE VSRHFN+VLN +L LHDLLLKKP  I + C DERWK    W  N   
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY

Query:  AVDATRFVLDI
        A+D T   +++
Subjt:  AVDATRFVLDI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4J8 uncharacterized protein LOC103485952 (e value: 1.6e-26; % identity: 54.39)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY
        CH +     +++ GLV T+ +D+EEMV +FLHILAHDVKNR+I ++FARSGE VSRHFN+VL  +L+LHD LLKKP+P+TN+CTD RWK FE    N   
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY

Query:  AVDATRFVLDINEV
        A+D T   ++++ +
Subjt:  AVDATRFVLDINEV

A0A1S3C1U8 putative nuclease HARBI1 (e value: 2.3e-25; % identity: 63.04)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFE
        CH +     ++T GLV T+ +D+EEMV +FLHILAHDVKNR+I R+F RSGE VSRHFN+VL    +LHD LLKKP+P+TN+CTD RWK FE
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFE

A0A5A7U0J9 Putative nuclease HARBI1 (e value: 1.6e-26; % identity: 54.39)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY
        CH +     +++ GLV T+ +D+EEMV +FLHILAHDVKNR+I ++FARSGE VSRHFN+VL  +L+LHD LLKKP+P+TN+CTD RWK FE    N   
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY

Query:  AVDATRFVLDINEV
        A+D T   ++++ +
Subjt:  AVDATRFVLDINEV

A0A5D3CDG6 Putative nuclease HARBI1 (e value: 1.6e-26; % identity: 54.39)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY
        CH +     +++ GLV T+ +D+EEMV +FLHILAHDVKNR+I ++FARSGE VSRHFN+VL  +L+LHD LLKKP+P+TN+CTD RWK FE    N   
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY

Query:  AVDATRFVLDINEV
        A+D T   ++++ +
Subjt:  AVDATRFVLDINEV

A0A803QNC5 Uncharacterized protein (e value: 7.0e-27; % identity: 58.56)
Query:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY
        CH +     K+TGGL  ++NVD+EEMV IFLHI+AHDVKNR++ RQFARSGE VSRHFN+VLN +L LHDLLLKKP  I + C DERWK    W  N   
Subjt:  CHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGY

Query:  AVDATRFVLDI
        A+D T   +++
Subjt:  AVDATRFVLDI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e value: 4.7e-07; % identity: 40.68)
Query:  KSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQL
        ++ G L  T  + IE  + IFL I+ H+++ R +   F  SGE +SRHFN VLN V+ +
Subjt:  KSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQL
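
The % identity values in the hit tables can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST convention: letters in the midline mark identical residues, "+" marks conservative substitutions, and identity is the number of identical positions divided by the alignment length. The function name `percent_identity` is a hypothetical helper. The strings are copied from the AT5G41980.1 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    ASSUMPTION: alphabetic characters in the midline are identical
    residues; '+' and spaces are non-identical positions. The alignment
    length is taken from the query line (this alignment has no gaps).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline of the Arabidopsis hit AT5G41980.1 (59 aligned positions).
QUERY = "KSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQL"
MIDLINE = "++ G L  T  + IE  + IFL I+ H+++ R +   F  SGE +SRHFN VLN V+ +"

identity = percent_identity(QUERY, MIDLINE)
print(f"{identity:.2f}")
```

Here 24 identical positions over 59 aligned residues give 40.68, which agrees with the value reported for this hit.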


Sequences
CDS sequence
ATGGTGGTTGGAAATAAATGTCATTTTATTGGAGAATTAAAAAAAAAATCAACTGGCGGTTTAGTACCGACACAGAATGTTGATATCGAAGAAATGGTTGTTATATTCCT
GCATATCCTAGCACATGATGTTAAGAATCGGGTGATTCACAGGCAATTTGCCCGGTCCGGTGAGATTGTTTCTAGACACTTCAACTTGGTGCTAAACACAGTTTTACAGC
TACACGATTTGTTATTGAAAAAACCAGAACCAATCACCAACACATGCACTGATGAGCGATGGAAAAGTTTTGAGGGTTGGAGACCTAATCCTGGATACGCTGTGGATGCG
ACCCGCTTTGTATTAGATATAAACGAGGTCATCCAACTCGTTCATGTGGTTGACATGCGAGTGGGGGCATCCAGTGCAATGAGATTGTACAAGACCGGGCCGTGA
mRNA sequence
ATGGTGGTTGGAAATAAATGTCATTTTATTGGAGAATTAAAAAAAAAATCAACTGGCGGTTTAGTACCGACACAGAATGTTGATATCGAAGAAATGGTTGTTATATTCCT
GCATATCCTAGCACATGATGTTAAGAATCGGGTGATTCACAGGCAATTTGCCCGGTCCGGTGAGATTGTTTCTAGACACTTCAACTTGGTGCTAAACACAGTTTTACAGC
TACACGATTTGTTATTGAAAAAACCAGAACCAATCACCAACACATGCACTGATGAGCGATGGAAAAGTTTTGAGGGTTGGAGACCTAATCCTGGATACGCTGTGGATGCG
ACCCGCTTTGTATTAGATATAAACGAGGTCATCCAACTCGTTCATGTGGTTGACATGCGAGTGGGGGCATCCAGTGCAATGAGATTGTACAAGACCGGGCCGTGA
Protein sequence
MVVGNKCHFIGELKKKSTGGLVPTQNVDIEEMVVIFLHILAHDVKNRVIHRQFARSGEIVSRHFNLVLNTVLQLHDLLLKKPEPITNTCTDERWKSFEGWRPNPGYAVDA
TRFVLDINEVIQLVHVVDMRVGASSAMRLYKTGP
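
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (the record does not name a translation table), the final line of the CDS above can be translated and compared with the final line of the protein. `CODON_TABLE` and `translate` are illustrative names, not CuGenDBv2 tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Last line of the CDS listed above (in frame, ends with the TGA stop codon).
CDS_TAIL = (
    "ACCCGCTTTGTATTAGATATAAACGAGGTCATCCAACTCGTTCATGTGGTTGACATGCGAGTG"
    "GGGGCATCCAGTGCAATGAGATTGTACAAGACCGGGCCGTGA"
)

print(translate(CDS_TAIL))
```

The printed peptide should match the last line of the protein sequence, TRFVLDINEVIQLVHVVDMRVGASSAMRLYKTGP.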