; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016816 (gene) of Snake gourd v1 genome

Gene IDTan0016816
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:47470684..47473068
RNA-Seq ExpressionTan0016816
SyntenyTan0016816
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]7.6e-4851.17Show/hide
Query:  WKIQGNLEVTKGEELVKKVFFKDLCNNITNTRLTIASAAG--FDPFKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILAS
        WK   N +  +  E+ +K +  D+ ++ T   L      G  +  +KN IN +L++DDL+FVL E+CPQVPA NA ++V++ Y RW KANEK + YILAS
Subjt:  WKIQGNLEVTKGEELVKKVFFKDLCNNITNTRLTIASAAG--FDPFKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILAS

Query:  LSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNACMKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMN
        LS  LA+K E + +A EIM  +Q +      QI+H++LKY+YNA M +G SVREHVL+++VHF+V  MNE VIDE SQV FILESL +SFLQF +N VMN
Subjt:  LSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNACMKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMN

Query:  KKEYNLTSLLHEL
        K  Y LT+LL+EL
Subjt:  KKEYNLTSLLHEL

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-4760.71Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +KN INT+L++DDL+FVL EECPQVPA NA ++V++ Y RW KANEK + YILASLS  LA+K E + +A EIM  +Q +      QI+H++LKY+YNA 
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        M +G SVREHVL+++VHF+V EMN  VIDE SQV FILESL +SFLQF +N VMNK  Y LT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]5.8e-4861.31Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +KN INT+L++DDL+FVL EECPQVPA NA Q+V++ Y RW KANEK + YILASLS  LA+K E + +A EIM  +Q +      QI+H++LKY+YNA 
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        M +G SVREHVL+++VHF+V EMN  VIDE SQV FILESL +SFLQF +N VMNK  Y LT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

KAA0066490.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-4861.31Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +KN INT+L++DDL+FVL EECPQVPA NA ++V++ Y RW KANEK + YILASLS  LA+K E V +A EIM  +Q +      QI+H++LKY+YNA 
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        M +G SVREHVL+++VHFH+ EMN  VIDE SQV FILESL +SFLQF +N VMNK  Y LT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

XP_038885834.1 uncharacterized protein LOC120076130 [Benincasa hispida]4.8e-5058.33Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +K N+NTILV+DDL+F+L EECP +P+ NA ++V+DAYNRW + N+KV  YILA++S  LA+K E + +  +IM +++ +   P   +RH+S+KY+YN  
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        MK+G SVREHVL+++VHF+V E+NEVV+DE+SQ+ FILESL KSFLQFCTN +MNK EYNLT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein8.2e-4860.71Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +KN INT+L++DDL+FVL EECPQVPA NA ++V++ Y RW KANEK + YILASLS  LA+K E + +A EIM  +Q +      QI+H++LKY+YNA 
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        M +G SVREHVL+++VHF+V EMN  VIDE SQV FILESL +SFLQF +N VMNK  Y LT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

A0A5A7SNP8 Gag/pol protein3.7e-4851.17Show/hide
Query:  WKIQGNLEVTKGEELVKKVFFKDLCNNITNTRLTIASAAG--FDPFKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILAS
        WK   N +  +  E+ +K +  D+ ++ T   L      G  +  +KN IN +L++DDL+FVL E+CPQVPA NA ++V++ Y RW KANEK + YILAS
Subjt:  WKIQGNLEVTKGEELVKKVFFKDLCNNITNTRLTIASAAG--FDPFKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILAS

Query:  LSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNACMKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMN
        LS  LA+K E + +A EIM  +Q +      QI+H++LKY+YNA M +G SVREHVL+++VHF+V  MNE VIDE SQV FILESL +SFLQF +N VMN
Subjt:  LSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNACMKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMN

Query:  KKEYNLTSLLHEL
        K  Y LT+LL+EL
Subjt:  KKEYNLTSLLHEL

A0A5A7V6N0 Gag/pol protein2.8e-4861.31Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +KN INT+L++DDL+FVL EECPQVPA NA Q+V++ Y RW KANEK + YILASLS  LA+K E + +A EIM  +Q +      QI+H++LKY+YNA 
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        M +G SVREHVL+++VHF+V EMN  VIDE SQV FILESL +SFLQF +N VMNK  Y LT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

A0A5A7VH46 Gag/pol protein1.3e-4861.31Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +KN INT+L++DDL+FVL EECPQVPA NA ++V++ Y RW KANEK + YILASLS  LA+K E V +A EIM  +Q +      QI+H++LKY+YNA 
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        M +G SVREHVL+++VHFH+ EMN  VIDE SQV FILESL +SFLQF +N VMNK  Y LT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

A0A5D3CPJ6 Gag/pol protein8.2e-4860.71Show/hide
Query:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC
        +KN INT+L++DDL+FVL EECPQVPA NA ++V++ Y RW KANEK + YILASLS  LA+K E + +A EIM  +Q +      QI+H++LKY+YNA 
Subjt:  FKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKLEGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNAC

Query:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL
        M +G SVREHVL+++VHF+V EMN  VIDE SQV FILESL +SFLQF +N VMNK  Y LT+LL+EL
Subjt:  MKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTCGTGGAAAATTCAAGGAAACCTTGAAGTCACAAAAGGGGAGGAACTCGTGAAAAAAGTGTTCTTCAAAGATCTATGTAACAATATCACGAACACACGATTAAC
AATCGCTTCTGCTGCGGGATTCGATCCCTTCAAAAATAACATTAATACAATTTTAGTTGTGGACGACCTGAAGTTCGTACTTTATGAGGAATGTCCTCAAGTCCCTGCTC
AAAACGCGCCTCAATCTGTTAAGGACGCGTATAACCGTTGGACTAAAGCCAATGAGAAGGTCAAGGTTTATATCCTGGCTAGCCTATCTGTAGATCTGGCTAGGAAACTT
GAGGGTGTGGACTCAGCTCATGAGATCATGAGTTATATGCAAAATTTGTCAGAATATCCATATGAACAGATTCGACATGAATCCCTCAAATACGTTTATAACGCGTGCAT
GAAGAAGGGAACTTCAGTGAGAGAACATGTTCTCGATCTGTTGGTTCACTTCCACGTGGTCGAAATGAATGAAGTGGTTATAGACGAGCAAAGTCAGGTGTTATTCATCC
TCGAATCTCTTTCAAAGAGCTTCTTGCAATTCTGCACCAATGGAGTGATGAACAAGAAAGAGTATAATCTGACTTCCCTCCTTCACGAGCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTCGTGGAAAATTCAAGGAAACCTTGAAGTCACAAAAGGGGAGGAACTCGTGAAAAAAGTGTTCTTCAAAGATCTATGTAACAATATCACGAACACACGATTAAC
AATCGCTTCTGCTGCGGGATTCGATCCCTTCAAAAATAACATTAATACAATTTTAGTTGTGGACGACCTGAAGTTCGTACTTTATGAGGAATGTCCTCAAGTCCCTGCTC
AAAACGCGCCTCAATCTGTTAAGGACGCGTATAACCGTTGGACTAAAGCCAATGAGAAGGTCAAGGTTTATATCCTGGCTAGCCTATCTGTAGATCTGGCTAGGAAACTT
GAGGGTGTGGACTCAGCTCATGAGATCATGAGTTATATGCAAAATTTGTCAGAATATCCATATGAACAGATTCGACATGAATCCCTCAAATACGTTTATAACGCGTGCAT
GAAGAAGGGAACTTCAGTGAGAGAACATGTTCTCGATCTGTTGGTTCACTTCCACGTGGTCGAAATGAATGAAGTGGTTATAGACGAGCAAAGTCAGGTGTTATTCATCC
TCGAATCTCTTTCAAAGAGCTTCTTGCAATTCTGCACCAATGGAGTGATGAACAAGAAAGAGTATAATCTGACTTCCCTCCTTCACGAGCTATAA
Protein sequenceShow/hide protein sequence
MVSWKIQGNLEVTKGEELVKKVFFKDLCNNITNTRLTIASAAGFDPFKNNINTILVVDDLKFVLYEECPQVPAQNAPQSVKDAYNRWTKANEKVKVYILASLSVDLARKL
EGVDSAHEIMSYMQNLSEYPYEQIRHESLKYVYNACMKKGTSVREHVLDLLVHFHVVEMNEVVIDEQSQVLFILESLSKSFLQFCTNGVMNKKEYNLTSLLHEL