CuGenDBv2

Tan0011954 (gene) of Snake gourd v1 genome

Gene ID: Tan0011954
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG01:99145911..99146276
RNA-Seq Expression: Tan0011954
Synteny: Tan0011954
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
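
The genome location can be turned into a span length with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive, which this page does not state, and parse_location is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("LG01:99145911..99146276")
# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, span)
```

Under that assumption the span is 366 bp, the same length as the CDS listed for this gene (121 codons plus a stop codon).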


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035676.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.6e-31; identity: 67.68%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ
        M+SSI+ LLAS+KL GDN+ TWK+N+NTILV DDL+F+LTEECPQ P+S A+R+ R+AYDRWI+ANEKA+VYI+ S+S+VLAKKHE + T K+I+DSL+
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ

KAA0046201.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.3e-32; identity: 70.41%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSL
        M++SI+ LLAS+KL GDN+ TWK N+NTILV +DL+FVLTEECPQ P+STA+R+VR+AYDRW++ANEKA+VYII ++S+VLAKKHE + TAK+IMDSL
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSL

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia] (e value: 1.3e-31; identity: 67.68%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ
        M+SSI+ LLASEKL G N+ TWKNN+NTILV DDL+FVLTEECPQ P++ A+R+VR+A+DRW++AN+KA+VYI+ S+++VLAKKHE ++TAK+IMDSL+
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida] (e value: 1.1e-32; identity: 69%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQE
        M+SSII LL SEKL GDN+  WK+N+NTILV DDL+FVLTEECPQ P+S A+R+VR+AYDRW++ANEKA++YI+ S+S+VLAKKHE + TAK+I+DSL+E
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQE

XP_038891685.1 uncharacterized protein LOC120081079 [Benincasa hispida] (e value: 2.1e-31; identity: 68%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQE
        M+S II LLASEKL GDN+  WK+N+NTIL+ DDL+FVL+EECPQ P+S A+R+VR+AYDRW++ANEKA VYI+ S+S+VLAKKHE + TAK+I+DSL+E
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T0E9 Gag/pol protein (e value: 7.9e-32; identity: 67.68%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ
        M+SSI+ LLAS+KL GDN+ TWK+N+NTILV DDL+F+LTEECPQ P+S A+R+ R+AYDRWI+ANEKA+VYI+ S+S+VLAKKHE + T K+I+DSL+
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ

A0A5A7TWX1 Gag/pol protein (e value: 1.4e-31; identity: 69.7%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ
        M+S I+ LLASEKL  DN+ TWK+N+NTILV DDL+FVLTEECPQ P+S A+R+ R+AYDRWI+ANEKA+VYI+ S+S+VLAKKHE + TAK+IMDSL+
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ

A0A5A7TXW7 Gag/pol protein (e value: 1.6e-32; identity: 70.41%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSL
        M++SI+ LLAS+KL GDN+ TWK N+NTILV +DL+FVLTEECPQ P+STA+R+VR+AYDRW++ANEKA+VYII ++S+VLAKKHE + TAK+IMDSL
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSL

A0A6J1DUZ9 uncharacterized protein LOC111024294 (e value: 1.0e-31; identity: 67.68%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ
        M+SSI+ LLASEKL G N+ TWKNN+NTILV DDL+FVLTEECPQ P+  A+R+VR+A+DRW++AN+KA+VYI+ S+++VLAKKHE ++TAK+IMDSL+
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ

A0A6J1DXQ5 uncharacterized protein LOC111024457 (e value: 6.1e-32; identity: 67.68%)
Query:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ
        M+SSI+ LLASEKL G N+ TWKNN+NTILV DDL+FVLTEECPQ P++ A+R+VR+A+DRW++AN+KA+VYI+ S+++VLAKKHE ++TAK+IMDSL+
Subjt:  MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQ

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
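
The % identity values in the hit lists can be related to the alignment match lines. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, '+' a conservative substitution, a space a mismatch) and assumes % identity is the number of identical columns divided by the alignment length; neither is stated on this page. percent_identity is a hypothetical helper, and the match line is the one shown for KAA0035676.1.

```python
def percent_identity(midline):
    """Percent of alignment columns whose match-line character is a residue letter."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Match line of the KAA0035676.1 alignment, copied from the GenBank hit list.
midline = (
    "M+SSI+ LLAS+KL GDN+ TWK+N+NTILV DDL+F+LTEECPQ P+S A+R+ R+AYDRWI+"
    "ANEKA+VYI+ S+S+VLAKKHE + T K+I+DSL+"
)
print(len(midline), round(percent_identity(midline), 2))
```

For this hit the match line has 67 letters over 99 columns, which rounds to the 67.68 reported for KAA0035676.1.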

Sequences
CDS sequence
ATGTCTAGCTCAATAATAGCTTTACTAGCTTCCGAAAAATTAGTGGGAGATAACTTCCAAACATGGAAGAATAATATCAACACGATCTTAGTAACTGACGACCTGAAGTT
CGTGCTTACTGAGGAATGTCCTCAATTACCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCTTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATAA
TTGTCAGCTTGTCTGAAGTCTTGGCAAAGAAGCATGAGTTGATGGTCACCGCTAAGAAGATCATGGATTCGTTGCAGGAATGTTTGGACAACAGTCCTTTCAGGTCAGGC
ACGATTCGATCAAACACGTCTTCAACGCACGGATGA
mRNA sequence
ATGTCTAGCTCAATAATAGCTTTACTAGCTTCCGAAAAATTAGTGGGAGATAACTTCCAAACATGGAAGAATAATATCAACACGATCTTAGTAACTGACGACCTGAAGTT
CGTGCTTACTGAGGAATGTCCTCAATTACCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCTTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATAA
TTGTCAGCTTGTCTGAAGTCTTGGCAAAGAAGCATGAGTTGATGGTCACCGCTAAGAAGATCATGGATTCGTTGCAGGAATGTTTGGACAACAGTCCTTTCAGGTCAGGC
ACGATTCGATCAAACACGTCTTCAACGCACGGATGA
Protein sequence
MSSSIIALLASEKLVGDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIRANEKAKVYIIVSLSEVLAKKHELMVTAKKIMDSLQECLDNSPFRSG
TIRSNTSSTHG
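
The CDS and protein sequences can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code, which this page does not state; translate and CODON_TABLE are illustrative names, and the CDS is the four lines above joined together.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# CDS of Tan0011954, joined from the wrapped lines above.
cds = (
    "ATGTCTAGCTCAATAATAGCTTTACTAGCTTCCGAAAAATTAGTGGGAGATAACTTCCAAACATGGAAGAATAATATCAACACGATCTTAGTAACTGACGACCTGAAGTT"
    "CGTGCTTACTGAGGAATGTCCTCAATTACCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCTTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATAA"
    "TTGTCAGCTTGTCTGAAGTCTTGGCAAAGAAGCATGAGTTGATGGTCACCGCTAAGAAGATCATGGATTCGTTGCAGGAATGTTTGGACAACAGTCCTTTCAGGTCAGGC"
    "ACGATTCGATCAAACACGTCTTCAACGCACGGATGA"
)
protein = translate(cds)
print(len(cds), len(protein))
print(protein)
```

The 366-nt CDS should translate to the 121-residue protein listed above, with TGA as the stop codon.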