CuGenDBv2

Tan0018636 (gene) of Snake gourd v1 genome

Gene ID: Tan0018636
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG07:37707383..37707790
RNA-Seq Expression: Tan0018636
Synteny: Tan0018636
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032020.1 gag/pol protein [Cucumis melo var. makuwa], e value 1.4e-44, identity 76.12%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL++IVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa], e value 4.2e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa], e value 4.2e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa], e value 4.2e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa], e value 4.2e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein, e value 2.0e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

A0A5A7SMN4 Gag/pol protein, e value 7.0e-45, identity 76.12%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL++IVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

A0A5A7TU93 Gag/pol protein, e value 2.0e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

A0A5A7V4M1 Gag/pol protein, e value 2.0e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

A0A5D3CPJ6 Gag/pol protein, e value 2.0e-44, identity 75.37%
Query:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG
        MFGQ   QI+H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNE+QTF+SLMK KG
Subjt:  MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKG

Query:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK
        Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK
Subjt:  QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
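
The % identity values listed for the hits above can be cross-checked against the alignment text itself. In this BLAST-style layout the middle line of each block carries a letter where the query and subject residues are identical, `+` where they are similar, and a space where they differ. Because the Subjt rows as shown here repeat the query, the sketch below relies on the middle line only. It is a minimal Python illustration under those assumptions; the function name `percent_identity` is hypothetical and is not part of CuGenDBv2.

```python
def percent_identity(query_rows, midline_rows):
    """Return (identities, aligned_length, percent) for a BLAST-style alignment.

    query_rows   -- residue strings taken from the 'Query:' lines
    midline_rows -- the matching middle lines, with the left margin removed
    """
    identities = 0
    aligned_length = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces are often lost in copied text, so pad the midline.
        midline = midline.ljust(len(query))
        aligned_length += len(query)
        # A letter in the midline marks an identical residue pair;
        # '+' (similar) and ' ' (different) are not counted.
        identities += sum(ch.isalpha() for ch in midline)
    percent = round(100.0 * identities / aligned_length, 2)
    return identities, aligned_length, percent


# Second alignment block of the first GenBank hit (KAA0032020.1).
query = ["QADGEANLFGHSRRFHKGSSSGTKSCSSSSGLKK"]
midline = ["Q  GEAN+   +R+FH+GS+SGTKS  SSSG KK"]
print(percent_identity(query, midline))
```

Passing both blocks of a hit gives the figure for the whole alignment. For KAA0032020.1 a manual count of the middle lines appears to give 102 identities over 134 aligned positions, which rounds to the listed 76.12; the hits listed at 75.37 appear to differ by a single residue (101 of 134).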

Sequences
CDS sequence
ATGTTTGGACAACCGTTTGGACAGATTCGGCACAAATCTCTCAAATACGTTTATAACTCCCGTATGATGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATTGT
CCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATTGACGAGCAGAGTCAGGTATCGTTCATCTTGGAATCTCTTCCGAAGAGTTTTCTGCAATTCCGCAGCAATGCGG
TGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAAGTACAAACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGTTTGGC
CATTCCAGAAGGTTCCATAAAGGTTCATCCTCTGGGACAAAGTCCTGTAGCTCATCTTCTGGGCTTAAGAAGACCTAA
mRNA sequence
ATGTTTGGACAACCGTTTGGACAGATTCGGCACAAATCTCTCAAATACGTTTATAACTCCCGTATGATGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATTGT
CCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATTGACGAGCAGAGTCAGGTATCGTTCATCTTGGAATCTCTTCCGAAGAGTTTTCTGCAATTCCGCAGCAATGCGG
TGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAAGTACAAACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGTTTGGC
CATTCCAGAAGGTTCCATAAAGGTTCATCCTCTGGGACAAAGTCCTGTAGCTCATCTTCTGGGCTTAAGAAGACCTAA
Protein sequence
MFGQPFGQIRHKSLKYVYNSRMMEGSSVREHVLDLIVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEVQTFQSLMKNKGQADGEANLFG
HSRRFHKGSSSGTKSCSSSSGLKKT
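
The CDS and protein records above can be checked against each other by translating the CDS. The genome location spans 408 bp (37707383 to 37707790 inclusive), which matches 135 codons plus a stop codon for the 135-residue protein. The following is a minimal Python sketch that assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and is not taken from CuGenDBv2.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTTGGACAACCGTTTGGACAGATTCGG"))  # first 30 nt of the CDS
print(translate("CTTAAGAAGACCTAA"))                 # last 15 nt of the CDS
```

The two fragments translate to `MFGQPFGQIR` and `LKKT`, which agree with the start and end of the protein sequence; pasting the full CDS into `translate` extends the same check to the whole record.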