; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004975 (gene) of Snake gourd v1 genome

Gene IDTan0004975
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationLG01:81941798..81962615
RNA-Seq ExpressionTan0004975
SyntenyTan0004975
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051908.1 putative gag protein [Cucumis melo var. makuwa]3.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

TYK02449.1 F15O4.13 [Cucumis melo var. makuwa]3.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

TYK21797.1 uncharacterized protein E5676_scaffold991G00010 [Cucumis melo var. makuwa]1.0e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R+ QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

TYK22420.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]3.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

TYK26105.1 F15O4.13 [Cucumis melo var. makuwa]3.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

TrEMBL top hitse value%identityAlignment
A0A5A7U9D0 Putative gag protein1.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

A0A5A7VII0 Uncharacterized protein1.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

A0A5D3C3D3 F15O4.131.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

A0A5D3DDM5 Uncharacterized protein5.0e-1567.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R+ QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

A0A5D3DRJ1 F15O4.131.9e-1467.11Show/hide
Query:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN
        MS D  E+VP+V+DPN+AILQ IQG++E+MREER+ERRAQQQRE R  QEDE MFDL   ER LGGRGN     RN
Subjt:  MSQDNKEKVPQVVDPNMAILQGIQGVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACAAACAATTGATCCATTGCATGTATCTGAAGGAGCAATAACAAGGAGCAAGACCAAGAATATTCAAGAGGCTTCCACATTGCATCTCCAAAAGCTCGTTAATGC
ACATGGAGAGATAAAGATATTTGAGCCCAAAATTATTTATAATATGGCACGAGAAAAGTTGTGTAGTTTGAAAGATGGCACGGTGGACAAAAAAAGTATTCATTGGATTA
GCCCCGATCAAGCTAATTCTGGAGTTCTTATAAAAAATATGTCACAAGATAATAAAGAAAAAGTTCCGCAAGTGGTAGATCCAAATATGGCTATTCTTCAAGGAATTCAA
GGTGTGATGGAGATGATGAGGGAAGAAAGAGAAGAAAGGAGAGCACAACAACAAAGAGAAGAACGAATCTTGCAAGAAGATGAATGCATGTTTGATTTACAGGTACAAGA
AAGAAACTTAGGAGGAAGAGGAAATGATAGTTTTGTGAATAGGAATGAACCGACACAACAAAGAAGCATGTTTGTTGCTGTCAAAAGAGTGGAGGCGGAAAGCTCCAATG
CTAAAAAGAATGAAGCTTCAAAGGAGAAATTTACTTTACTTCCATTGTCTCCATATGAAGTACATTGTGATCATTTGAAATTAGAGAAGAAAAGAAAAGAACTTGAACAA
AAGGCCCGACCCGCTCATTGGCCCGAGAGGGACTCTGTTTTTAGTGTTACGAACATTCGTGAAGGATTGACTTGTTGTTATTGGTCAATATCCGTGGACACAGAAATATA
TCTGCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAACAAACAATTGATCCATTGCATGTATCTGAAGGAGCAATAACAAGGAGCAAGACCAAGAATATTCAAGAGGCTTCCACATTGCATCTCCAAAAGCTCGTTAATGC
ACATGGAGAGATAAAGATATTTGAGCCCAAAATTATTTATAATATGGCACGAGAAAAGTTGTGTAGTTTGAAAGATGGCACGGTGGACAAAAAAAGTATTCATTGGATTA
GCCCCGATCAAGCTAATTCTGGAGTTCTTATAAAAAATATGTCACAAGATAATAAAGAAAAAGTTCCGCAAGTGGTAGATCCAAATATGGCTATTCTTCAAGGAATTCAA
GGTGTGATGGAGATGATGAGGGAAGAAAGAGAAGAAAGGAGAGCACAACAACAAAGAGAAGAACGAATCTTGCAAGAAGATGAATGCATGTTTGATTTACAGGTACAAGA
AAGAAACTTAGGAGGAAGAGGAAATGATAGTTTTGTGAATAGGAATGAACCGACACAACAAAGAAGCATGTTTGTTGCTGTCAAAAGAGTGGAGGCGGAAAGCTCCAATG
CTAAAAAGAATGAAGCTTCAAAGGAGAAATTTACTTTACTTCCATTGTCTCCATATGAAGTACATTGTGATCATTTGAAATTAGAGAAGAAAAGAAAAGAACTTGAACAA
AAGGCCCGACCCGCTCATTGGCCCGAGAGGGACTCTGTTTTTAGTGTTACGAACATTCGTGAAGGATTGACTTGTTGTTATTGGTCAATATCCGTGGACACAGAAATATA
TCTGCAGTGA
Protein sequenceShow/hide protein sequence
MKQTIDPLHVSEGAITRSKTKNIQEASTLHLQKLVNAHGEIKIFEPKIIYNMAREKLCSLKDGTVDKKSIHWISPDQANSGVLIKNMSQDNKEKVPQVVDPNMAILQGIQ
GVMEMMREEREERRAQQQREERILQEDECMFDLQVQERNLGGRGNDSFVNRNEPTQQRSMFVAVKRVEAESSNAKKNEASKEKFTLLPLSPYEVHCDHLKLEKKRKELEQ
KARPAHWPERDSVFSVTNIREGLTCCYWSISVDTEIYLQ