CuGenDBv2

Tan0006679 (gene) of Snake gourd v1 genome

Gene ID: Tan0006679
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG08:32491858..32492178
RNA-Seq Expression: Tan0006679
Synteny: Tan0006679
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038359.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.0e-38 | % identity: 76.19
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        M+SSI+QLLAF+KL+ DNY  WKSNLNTILVVDDLRFVLTEECP  PSS A++  RKAYDRWI+ANEK  VYILA++SDVL+KKHES+ TAKEIM+SL+ 
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQP
        MFGQP
Subjt:  MFGQP
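
The % identity reported for a hit can be cross-checked against the match line of its alignment. The sketch below assumes the BLAST-style convention in which the match line repeats the residue at identical positions and shows `+` or a space elsewhere, and it assumes a gap-free alignment scored over the aligned query length, as appears to be the case for the KAA0038359.1 hit above. The function name `percent_identity` is illustrative and not part of CuGenDB.

```python
def percent_identity(query_blocks, match_blocks):
    """Percent identity from BLAST-style query and match lines (gap-free)."""
    aligned = sum(len(q) for q in query_blocks)
    # Letters on the match line mark identical residues; '+' and ' ' do not.
    identical = sum(ch.isalpha() for m in match_blocks for ch in m)
    return round(100.0 * identical / aligned, 2)


# Query and match lines of the KAA0038359.1 hit, copied from the alignment above.
query_blocks = [
    "MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA",
    "MFGQP",
]
match_blocks = [
    "M+SSI+QLLAF+KL+ DNY  WKSNLNTILVVDDLRFVLTEECP  PSS A++  RKAYDRWI+ANEK  VYILA++SDVL+KKHES+ TAKEIM+SL+ ",
    "MFGQP",
]

print(percent_identity(query_blocks, match_blocks))
```

Counting letters rather than comparing columns keeps the check independent of leading or trailing whitespace on the match line, which is easily lost when the page is copied as text.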

TYJ97035.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.0e-38 | % identity: 76.19
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        M+SSI+QLLAF+KL+ DNY  WKSNLNTILVVDDLRFVLTEECP  PSS A++  RKAYDRWI+ANEK  VYILA++SDVL+KKHES+ TAKEIM+SL+ 
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia] | e value: 3.5e-38 | % identity: 75.47
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        MS+S IQLLA DKL+ DNYG WKSNLNTILV+DDLRFVLTEECPP P+  ANR VR AYDRW++ANEK  VYILA+IS+VLSKKHE + T +EIM+SLQA
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQPS
        +FGQPS
Subjt:  MFGQPS

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida] | e value: 2.3e-37 | % identity: 73.58
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        MS+SI+Q LA +KL+DDNYGTWKSNLNTILV+DDL+FVLTEECPP P+   NR +  A+DRW +ANEK  VYILA+ISD+LSKKHE MV AKEIM+SLQA
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQPS
        +FGQPS
Subjt:  MFGQPS

XP_038881660.1 uncharacterized protein LOC120073110 [Benincasa hispida] | e value: 5.5e-39 | % identity: 77.36
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        M+SSIIQLLAF+KL+DDNY  WKSNLNTILVVDDLRFVLTEECP  P+S ANR V +AYDRW++ANEK  VYILAN+SDVL+KKHES+ TAKEI++SL+ 
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQPS
        MFGQPS
Subjt:  MFGQPS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TAH8 Gag/pol protein | e value: 2.9e-38 | % identity: 76.19
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        M+SSI+QLLAF+KL+ DNY  WKSNLNTILVVDDLRFVLTEECP  PSS A++  RKAYDRWI+ANEK  VYILA++SDVL+KKHES+ TAKEIM+SL+ 
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

A0A5A7TXW7 Gag/pol protein | e value: 1.9e-37 | % identity: 74.53
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        M++SI+QLLA  KL+ DNY TWK NLNTILVV+DLRFVLTEECP  P+STANR VR+AYDRW++ANEK  VYI+AN+SDVL+KKHES+ TAKEIM+SL  
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQPS
        MFGQPS
Subjt:  MFGQPS

A0A5D3BF11 Gag/pol protein | e value: 2.9e-38 | % identity: 76.19
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        M+SSI+QLLAF+KL+ DNY  WKSNLNTILVVDDLRFVLTEECP  PSS A++  RKAYDRWI+ANEK  VYILA++SDVL+KKHES+ TAKEIM+SL+ 
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

A0A5D3C306 Gag/pol protein | e value: 2.5e-37 | % identity: 75.24
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        M+SSI+QLLAF+KL+DDNY  +KSNLN ILVVDDLRFVLTEECP  P+S ANR  RKAYDRWI+ANEK  VYILA++SDVL+KKHES+ T KEIM+SL+ 
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

A0A6J1DWG6 uncharacterized protein LOC111025021 | e value: 1.7e-38 | % identity: 75.47
Query:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA
        MS+S IQLLA DKL+ DNYG WKSNLNTILV+DDLRFVLTEECPP P+  ANR VR AYDRW++ANEK  VYILA+IS+VLSKKHE + T +EIM+SLQA
Subjt:  MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQA

Query:  MFGQPS
        +FGQPS
Subjt:  MFGQPS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTCGAGCTCTATTATTCAGTTACTTGCCTTCGACAAACTTAGCGACGATAACTACGGAACGTGGAAATCAAACTTGAATACGATTCTTGTTGTTGATGATCTGAGGTT
TGTCTTAACGGAGGAATGTCCTCCCCCTCCTAGCTCGACTGCAAACCGAATTGTTCGGAAAGCATATGACAGATGGATTAGGGCTAATGAGAAGACCCTTGTCTACATCT
TAGCAAACATATCTGATGTGTTGTCTAAGAAGCATGAAAGCATGGTCACCGCAAAGGAGATCATGGAATCATTGCAGGCGATGTTTGGACAACCGTCCTGA
mRNA sequence
ATGTCGAGCTCTATTATTCAGTTACTTGCCTTCGACAAACTTAGCGACGATAACTACGGAACGTGGAAATCAAACTTGAATACGATTCTTGTTGTTGATGATCTGAGGTT
TGTCTTAACGGAGGAATGTCCTCCCCCTCCTAGCTCGACTGCAAACCGAATTGTTCGGAAAGCATATGACAGATGGATTAGGGCTAATGAGAAGACCCTTGTCTACATCT
TAGCAAACATATCTGATGTGTTGTCTAAGAAGCATGAAAGCATGGTCACCGCAAAGGAGATCATGGAATCATTGCAGGCGATGTTTGGACAACCGTCCTGA
Protein sequence
MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQAMFGQPS
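
The three sequences and the genome location can be checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and inclusive genome coordinates; the helper names `translate` and `span_length` are illustrative and not part of CuGenDB. It translates the CDS in frame up to the first stop codon, compares the result with the listed protein, and compares the length of the genomic span with the CDS length.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location):
    """Length in bp of a 'chrom:start..end' location (inclusive coordinates assumed)."""
    start, end = location.split(":")[1].split("..")
    return int(end) - int(start) + 1


# Values copied from this record.
CDS = (
    "ATGTCGAGCTCTATTATTCAGTTACTTGCCTTCGACAAACTTAGCGACGATAACTACGGAACGTGGAAATCAAACTTGAATACGATTCTTGTTGTTGATGATCTGAGGTT"
    "TGTCTTAACGGAGGAATGTCCTCCCCCTCCTAGCTCGACTGCAAACCGAATTGTTCGGAAAGCATATGACAGATGGATTAGGGCTAATGAGAAGACCCTTGTCTACATCT"
    "TAGCAAACATATCTGATGTGTTGTCTAAGAAGCATGAAAGCATGGTCACCGCAAAGGAGATCATGGAATCATTGCAGGCGATGTTTGGACAACCGTCCTGA"
)
PROTEIN = (
    "MSSSIIQLLAFDKLSDDNYGTWKSNLNTILVVDDLRFVLTEECPPPPSSTANRIVRKAYDRWIRANEKTLVYILANISDVLSKKHESMVTAKEIMESLQAMFGQPS"
)
LOCATION = "LG08:32491858..32492178"

print("CDS translates to listed protein:", translate(CDS) == PROTEIN)
print("Genomic span equals CDS length:", span_length(LOCATION) == len(CDS))
```

The CDS and mRNA sequences listed here are identical, so a genomic span equal to the CDS length is consistent with a single-exon gene model without annotated UTRs.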