; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018065 (gene) of Snake gourd v1 genome

Gene IDTan0018065
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG01:23405892..23406233
RNA-Seq ExpressionTan0018065
SyntenyTan0018065
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025358.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-4581.42Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNK LALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QC KTPQEVE +RRIPYAS V SLM+VMLCTRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        I SRYQSNSG  H
Subjt:  IVSRYQSNSGYGH

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-4682.3Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNKTLALSQ++YIDK+LVRY MQN KKG LPFRHG+HLS +Q PKTPQEVE MRRIPYAS V SLM+VMLCTRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN G  H
Subjt:  IVSRYQSNSGYGH

KAA0043583.1 putative Integrase core domain [Cucumis melo var. makuwa]1.7e-4684.07Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNKTLALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QCPKTPQEVE MRRIPYASVV SLM+VML TRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN G  H
Subjt:  IVSRYQSNSGYGH

KAA0059556.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-4581.42Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKD+GEAQ+VLGIQI+R+RKNKTLALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QCPKTPQEVE MRRIPYAS V SLM+ MLCTRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN    H
Subjt:  IVSRYQSNSGYGH

KAA0062886.1 gag/pol protein [Cucumis melo var. makuwa]3.8e-4682.3Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNKTLALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QCPKTPQEVE MRRIPYAS V SLM+ MLCTRPDICY VG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN G  H
Subjt:  IVSRYQSNSGYGH

TrEMBL top hitse value%identityAlignment
A0A5A7SGT6 Gag/pol protein9.0e-4681.42Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNK LALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QC KTPQEVE +RRIPYAS V SLM+VMLCTRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        I SRYQSNSG  H
Subjt:  IVSRYQSNSGYGH

A0A5A7TJH9 Putative Integrase core domain8.2e-4784.07Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNKTLALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QCPKTPQEVE MRRIPYASVV SLM+VML TRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN G  H
Subjt:  IVSRYQSNSGYGH

A0A5A7TKM4 Gag/pol protein4.1e-4682.3Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNKTLALSQ++YIDK+LVRY MQN KKG LPFRHG+HLS +Q PKTPQEVE MRRIPYAS V SLM+VMLCTRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN G  H
Subjt:  IVSRYQSNSGYGH

A0A5A7UZF3 Gag/pol protein6.9e-4681.42Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKD+GEAQ+VLGIQI+R+RKNKTLALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QCPKTPQEVE MRRIPYAS V SLM+ MLCTRPDICYAVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN    H
Subjt:  IVSRYQSNSGYGH

A0A5A7V8W0 Gag/pol protein1.8e-4682.3Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLGEAQ+VLGIQI+R+RKNKTLALSQ++YIDKMLVRY MQN KKG LPFRHG+HLS +QCPKTPQEVE MRRIPYAS V SLM+ MLCTRPDICY VG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        IVSRYQSN G  H
Subjt:  IVSRYQSNSGYGH

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.5e-0933.63Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHL----STKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDIC
        M DL E +  +GI+I    +   + LSQS+Y+ K+L ++ M+N    + P    I+     S + C             P  S++  LM++MLCTRPD+ 
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHL----STKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDIC

Query:  YAVGIVSRYQSNS
         AV I+SRY S +
Subjt:  YAVGIVSRYQSNS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-2548.67Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLG AQ +LG++I+R R ++ L LSQ  YI+++L R+ M+N K  + P    + LS K CP T +E   M ++PY+S V SLM+ M+CTRPDI +AVG
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRYQSNSGYGH
        +VSR+  N G  H
Subjt:  IVSRYQSNSGYGH

P25600 Putative transposon Ty5-1 protein YCL074W2.8e-0431.43Show/hide
Query:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG
        MKDLG+    LG+ I     N  + LS   YI K     ++  FK    P  +   L     P            PY S+V  L+F     RPDI Y V 
Subjt:  MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVG

Query:  IVSRY
        ++SR+
Subjt:  IVSRY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATTCTTCGGAATCGCAAGAACAAAACGCTAGCACTGTCTCAGTCATCTTATATCGACAAGATGTTGGT
TAGATATAAGATGCAGAATTTCAAGAAGGGTGCATTACCTTTCAGGCATGGAATTCATTTGTCTACGAAACAATGTCCTAAGACACCTCAAGAAGTTGAGGGTATGAGAC
GCATTCCCTATGCATCTGTTGTCAGTAGTCTGATGTTTGTCATGCTATGTACTCGACCCGACATATGCTATGCAGTGGGAATAGTCAGCAGGTATCAGTCTAATTCAGGA
TATGGTCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATTCTTCGGAATCGCAAGAACAAAACGCTAGCACTGTCTCAGTCATCTTATATCGACAAGATGTTGGT
TAGATATAAGATGCAGAATTTCAAGAAGGGTGCATTACCTTTCAGGCATGGAATTCATTTGTCTACGAAACAATGTCCTAAGACACCTCAAGAAGTTGAGGGTATGAGAC
GCATTCCCTATGCATCTGTTGTCAGTAGTCTGATGTTTGTCATGCTATGTACTCGACCCGACATATGCTATGCAGTGGGAATAGTCAGCAGGTATCAGTCTAATTCAGGA
TATGGTCACTAG
Protein sequenceShow/hide protein sequence
MKDLGEAQFVLGIQILRNRKNKTLALSQSSYIDKMLVRYKMQNFKKGALPFRHGIHLSTKQCPKTPQEVEGMRRIPYASVVSSLMFVMLCTRPDICYAVGIVSRYQSNSG
YGH