; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022408 (gene) of Snake gourd v1 genome

Gene IDTan0022408
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG09:6684761..6688170
RNA-Seq ExpressionTan0022408
SyntenyTan0022408
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046201.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-3471.03Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M++SI+ LLAS KL  DN+ TWK N+NTILV +DL+FVLTEECPQ P+STA+R+VR+AYDRW+K NEKA+VYIIAN+S+VLAKKHE + TAKEIMDSL  
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQQSF
        MFGQ S+
Subjt:  MFGQQSF

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-3471.15Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M+S I+ LLAS+KL RDN+ TWK+N+NTILV DDL+FVLTEECPQ P+S A+R+ R+AYDRWIK NEKA+VYI+A++S+VLAKKHE + TAKEIMDSL+ 
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]8.1e-3367.59Show/hide
Query:  KNMSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSL
        K MS+S I LLASDKL  DN+  WK+N+NTILV DDL+FVLTEECP  P+  A+R+VRDAYDRW+K NEKA+VYI+A++SEVL+KKHE + T +EIMDSL
Subjt:  KNMSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSL

Query:  QDMFGQQS
        Q +FGQ S
Subjt:  QDMFGQQS

XP_038882242.1 uncharacterized protein LOC120073466 [Benincasa hispida]4.8e-3368.22Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M+S ++ LLAS+ L  DN+ TWK+++NTILV DDLKFVLT+ECP +P+S A+R VRDAYDRW KVNEKA+VYI+AN+S+VLAKKHE M T+KEIM+SL+ 
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQQSF
        MFGQ SF
Subjt:  MFGQQSF

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida]1.3e-3366.36Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M+SSII LL S+KL  DN+  WK+N+NTILV DDL+FVLTEECPQ P+S A+R+VR+AYDRW+K NEKA++YI+A++S+VLAKKHE + TAKEI+DSL++
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQQSF
        +FGQ S+
Subjt:  MFGQQSF

TrEMBL top hitse value%identityAlignment
A0A5A7T0E9 Gag/pol protein3.9e-3368.27Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M+SSI+ LLAS KL  DN+ TWK+N+NTILV DDL+F+LTEECPQ P+S A+R+ R+AYDRWIK NEKA+VYI+A++S+VLAKKHE + T KEI+DSL+ 
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

A0A5A7TWX1 Gag/pol protein1.6e-3471.15Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M+S I+ LLAS+KL RDN+ TWK+N+NTILV DDL+FVLTEECPQ P+S A+R+ R+AYDRWIK NEKA+VYI+A++S+VLAKKHE + TAKEIMDSL+ 
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

A0A5A7TXW7 Gag/pol protein5.5e-3571.03Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M++SI+ LLAS KL  DN+ TWK N+NTILV +DL+FVLTEECPQ P+STA+R+VR+AYDRW+K NEKA+VYIIAN+S+VLAKKHE + TAKEIMDSL  
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQQSF
        MFGQ S+
Subjt:  MFGQQSF

A0A5D3BBF3 Gag/pol protein5.1e-3369.23Show/hide
Query:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD
        M+S I+ LLAS+KL RDN+ TWK+N+NTILV DDL+FVLTEECPQ  +S A+R+ R+AYDRWIK NEKA+VYI++++S+VLAKKHE + TAKEIMDSL+ 
Subjt:  MSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQD

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

A0A6J1DWG6 uncharacterized protein LOC1110250213.9e-3367.59Show/hide
Query:  KNMSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSL
        K MS+S I LLASDKL  DN+  WK+N+NTILV DDL+FVLTEECP  P+  A+R+VRDAYDRW+K NEKA+VYI+A++SEVL+KKHE + T +EIMDSL
Subjt:  KNMSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSL

Query:  QDMFGQQS
        Q +FGQ S
Subjt:  QDMFGQQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGGGTTGGAGACCTAATCCTGGATATACTCAGGATGCGACCGCTTTGTATTAAGATACAAACGAGAACAAAAAAGATCCTACCATCTTCCTTCTTCCAAAAGAA
GAACATCCGAGAAGCCAAGTGGTGGTGTTCGGTCTTCGTTCGAGAGAAGAGTTCGAGTAGTTTGAGATCGTTGGGAGAACACGAAGAGTTCGTGAACGGAAACGAAAATC
GAAGCACGTCTACAATAAATCTAAGAGTTAGATTGATTTTAACGAAATTCAGCTGCACAACTTCACTGCGACGATTCGATCGTCTTCCGCTGCGTGGAAGTTTCATTCCC
TTCAATTGGTATCAGAGCCATACGTTGGTTCTTTGTTGTGCACTGTTTTTCGGTAAAATTAGGCATTTTGTTGTAAATCGAGTCTGTAAGCTCGAGTCGTTCGTGGCAAG
AGTTGGTGTGAAGAAATCGGAGGGGAAATGGGCGAGAATCGACGAAAAACAGCAAGAGTTTGACTTGGACCAGACAATCCCTTCGGAGGGCCTTGATCATGGGAGTCGAA
ACACCGTGAATTCTCAAAAGGGATACAGTTTCCTTGTTGTTTTGCCTTCCTGGTTCACCCTTCGGTGGCTATTGTTTGGACGGATACTTGGAAACTTAAAGATTGAGGCT
AAAAATATGTCTAGCTCAATAATCGCTTTACTTGCTTCCGACAAACTAGTGAGAGATAACTTCCAAACGTGGAAGAACAACATAAACACGATTTTAGTAACTGATGACCT
GAAGTTCGTGCTTACTGAAGAGTGTCCTCAGTTGCCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCATACGATCGATGGATCAAGGTCAATGAAAAGGCCAAGGTCT
ATATCATTGCCAACTTGTCTGAAGTATTGGCAAAGAAGCATGAGTTGATGGTCACCGCCAAGGAGATCATGGATTCGTTGCAGGACATGTTTGGACAACAGTCCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGGGGTTGGAGACCTAATCCTGGATATACTCAGGATGCGACCGCTTTGTATTAAGATACAAACGAGAACAAAAAAGATCCTACCATCTTCCTTCTTCCAAAAGAA
GAACATCCGAGAAGCCAAGTGGTGGTGTTCGGTCTTCGTTCGAGAGAAGAGTTCGAGTAGTTTGAGATCGTTGGGAGAACACGAAGAGTTCGTGAACGGAAACGAAAATC
GAAGCACGTCTACAATAAATCTAAGAGTTAGATTGATTTTAACGAAATTCAGCTGCACAACTTCACTGCGACGATTCGATCGTCTTCCGCTGCGTGGAAGTTTCATTCCC
TTCAATTGGTATCAGAGCCATACGTTGGTTCTTTGTTGTGCACTGTTTTTCGGTAAAATTAGGCATTTTGTTGTAAATCGAGTCTGTAAGCTCGAGTCGTTCGTGGCAAG
AGTTGGTGTGAAGAAATCGGAGGGGAAATGGGCGAGAATCGACGAAAAACAGCAAGAGTTTGACTTGGACCAGACAATCCCTTCGGAGGGCCTTGATCATGGGAGTCGAA
ACACCGTGAATTCTCAAAAGGGATACAGTTTCCTTGTTGTTTTGCCTTCCTGGTTCACCCTTCGGTGGCTATTGTTTGGACGGATACTTGGAAACTTAAAGATTGAGGCT
AAAAATATGTCTAGCTCAATAATCGCTTTACTTGCTTCCGACAAACTAGTGAGAGATAACTTCCAAACGTGGAAGAACAACATAAACACGATTTTAGTAACTGATGACCT
GAAGTTCGTGCTTACTGAAGAGTGTCCTCAGTTGCCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCATACGATCGATGGATCAAGGTCAATGAAAAGGCCAAGGTCT
ATATCATTGCCAACTTGTCTGAAGTATTGGCAAAGAAGCATGAGTTGATGGTCACCGCCAAGGAGATCATGGATTCGTTGCAGGACATGTTTGGACAACAGTCCTTTTAG
Protein sequenceShow/hide protein sequence
MDGVGDLILDILRMRPLCIKIQTRTKKILPSSFFQKKNIREAKWWCSVFVREKSSSSLRSLGEHEEFVNGNENRSTSTINLRVRLILTKFSCTTSLRRFDRLPLRGSFIP
FNWYQSHTLVLCCALFFGKIRHFVVNRVCKLESFVARVGVKKSEGKWARIDEKQQEFDLDQTIPSEGLDHGSRNTVNSQKGYSFLVVLPSWFTLRWLLFGRILGNLKIEA
KNMSSSIIALLASDKLVRDNFQTWKNNINTILVTDDLKFVLTEECPQLPSSTASRSVRDAYDRWIKVNEKAKVYIIANLSEVLAKKHELMVTAKEIMDSLQDMFGQQSF