; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009093 (gene) of Snake gourd v1 genome

Gene IDTan0009093
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG10:39824441..39826168
RNA-Seq ExpressionTan0009093
SyntenyTan0009093
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054515.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-4946.72Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------
        MSSS I LL  D+L G+N+ TWKS LN ILVI DLRFVL EECP  P+  A+++VRDA+D+WT+ANDKAR++ILAS+SD+L                   
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------

Query:  ----------------------------------------------------------------------------------AKEPLELIHSDLCGPMNV
                                                                                          AKEPLELIHS LCGPMNV
Subjt:  ----------------------------------------------------------------------------------AKEPLELIHSDLCGPMNV

Query:  KARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
        KARGG+EYFISFIDDYSRYGYLYLM HKSE LEKFKEYK EVEN L K IK LRSDRGG
Subjt:  KARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

KAA0059677.1 gag/pol protein [Cucumis melo var. makuwa]6.2e-4743.17Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------
        M+SS +QLLA +KLNGDN+  WKSNLNTILV+DDLRFVLTEECP  P+S+A +T   A+D+W +AN+KA VYILAS+SDVLAK+                
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------

Query:  -------------------------------------------------------------------------------------------------PLE
                                                                                                          LE
Subjt:  -------------------------------------------------------------------------------------------------PLE

Query:  LIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
        L+H D CGPMNVKARG YEYFISFIDDYSRYG++YL+ +KS + EKFKEYKAEVEN  GKTIKTLRSDRGG
Subjt:  LIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

KAA0059678.1 gag/pol protein [Cucumis melo var. makuwa]9.9e-4541.84Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------
        MSSS I LL  D+L  +N+ TWKS LN ILVI DLRFVL EECP  P+  A+++VRDA+D+WT+ANDK+R++IL S+SD+L                   
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
             AKEPLELIHSDLCGPMNVKARGG+EYFI FIDDYSRYGYLYLM HK E LEKFKEYKAEVEN L K IK LRSDRGG
Subjt:  -----AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

TYJ98650.1 gag/pol protein [Cucumis melo var. makuwa]6.0e-4243.65Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------
        MSS  I LL  D+L G+N+ TWKS LN ILVI DLRFVL EEC   P   A+++VRDA+D+WT+ANDKAR++ILAS+SD+L                   
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------

Query:  ----------------------------------------------------------------------------------------AKEPLELIHSDL
                                                                                                AKEPLELIHSDL
Subjt:  ----------------------------------------------------------------------------------------AKEPLELIHSDL

Query:  CGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSL
        CGPMNVKARGG+EYFISFIDDYSRYGYLYLM HKSE LEKFKEYK EVEN L
Subjt:  CGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSL

TYK29762.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-4542.45Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------
        M+SS +QLLAS+KLNGDN+  WKSNLNTIL + DLRFVLT +CP  P+S+A RT R A+D+W +AN+KARVYILAS+SDVLAK+                
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---PLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTI-KTLRSDRGG
           PLELIH DLC PMNVKA+GGYEYFISFIDDYSRYG++YL+ +KS++ EKFKEYKAEVEN  GKTI  TLRSDRGG
Subjt:  ---PLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTI-KTLRSDRGG

TrEMBL top hitse value%identityAlignment
A0A5A7ULG4 Gag/pol protein1.4e-4946.72Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------
        MSSS I LL  D+L G+N+ TWKS LN ILVI DLRFVL EECP  P+  A+++VRDA+D+WT+ANDKAR++ILAS+SD+L                   
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------

Query:  ----------------------------------------------------------------------------------AKEPLELIHSDLCGPMNV
                                                                                          AKEPLELIHS LCGPMNV
Subjt:  ----------------------------------------------------------------------------------AKEPLELIHSDLCGPMNV

Query:  KARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
        KARGG+EYFISFIDDYSRYGYLYLM HKSE LEKFKEYK EVEN L K IK LRSDRGG
Subjt:  KARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

A0A5A7UWW4 Gag/pol protein4.8e-4541.84Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------
        MSSS I LL  D+L  +N+ TWKS LN ILVI DLRFVL EECP  P+  A+++VRDA+D+WT+ANDK+R++IL S+SD+L                   
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
             AKEPLELIHSDLCGPMNVKARGG+EYFI FIDDYSRYGYLYLM HK E LEKFKEYKAEVEN L K IK LRSDRGG
Subjt:  -----AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

A0A5A7UYF5 Gag/pol protein3.0e-4743.17Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------
        M+SS +QLLA +KLNGDN+  WKSNLNTILV+DDLRFVLTEECP  P+S+A +T   A+D+W +AN+KA VYILAS+SDVLAK+                
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------

Query:  -------------------------------------------------------------------------------------------------PLE
                                                                                                          LE
Subjt:  -------------------------------------------------------------------------------------------------PLE

Query:  LIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
        L+H D CGPMNVKARG YEYFISFIDDYSRYG++YL+ +KS + EKFKEYKAEVEN  GKTIKTLRSDRGG
Subjt:  LIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

A0A5D3DDH0 Gag/pol protein5.0e-4239.39Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------
        MSSS I LL  D+L G N+  WKS LN ILVI DLRFVL E+CP   +  A+++VRDA+D+WT+AND+A ++ILAS+SD+L                   
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVL-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
                            AKEPLELIHSDLCGPMNVKARG +EYFISFIDDYSRYGYLYLM HKSE LEKFKEYKAEVEN L K IK LRSDRGG
Subjt:  --------------------AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

A0A5D3E1K0 Gag/pol protein7.4e-4642.45Show/hide
Query:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------
        M+SS +QLLAS+KLNGDN+  WKSNLNTIL + DLRFVLT +CP  P+S+A RT R A+D+W +AN+KARVYILAS+SDVLAK+                
Subjt:  MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKE----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---PLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTI-KTLRSDRGG
           PLELIH DLC PMNVKA+GGYEYFISFIDDYSRYG++YL+ +KS++ EKFKEYKAEVEN  GKTI  TLRSDRGG
Subjt:  ---PLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTI-KTLRSDRGG

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.2e-0632Show/hide
Query:  KEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRG
        K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+ +KS+    F+++ A+ E      +  L  D G
Subjt:  KEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-1343.84Show/hide
Query:  LELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
        L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  + F+++ A VE   G+ +K LRSD GG
Subjt:  LELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

Q12491 Transposon Ty2-B Gag-Pol polyprotein8.0e-0530.26Show/hide
Query:  EPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSE--TLEKFKEYKAEVENSLGKTIKTLRSDRG
        EP + +H+D+ GP++   +    YFISF D+ +R+ ++Y +H + E   L  F    A ++N     +  ++ DRG
Subjt:  EPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSE--TLEKFKEYKAEVENSLGKTIKTLRSDRG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.2e-0634.94Show/hide
Query:  SISDVLAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
        S S + +  PLE I+SD+     + +   Y Y++ F+D ++RY +LY +  KS+  E F  +K  +EN     I T  SD GG
Subjt:  SISDVLAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.5e-0736.14Show/hide
Query:  SISDVLAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG
        S S + + +PLE I+SD+     + +   Y Y++ F+D ++RY +LY +  KS+  + F  +K+ VEN     I TL SD GG
Subjt:  SISDVLAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGACAAACTTAACGGCGATAACTTTGGAACTTGGAAATCAAACTTGAATACAATTCTTGTAATTGATGATCTAAGGTT
CGTCTTGACGGAGGAATGTCCTTCCCCTCCCAGCTCGTCTGCAACCCGAACAGTTCGGGATGCATTTGACAAATGGACTAGAGCTAATGATAAAGCCCGGGTCTACATCT
TAGCCAGCATATCTGATGTGTTAGCCAAAGAACCCTTGGAACTCATACATTCGGATCTCTGTGGTCCAATGAATGTCAAGGCACGAGGAGGGTATGAATATTTCATCAGT
TTCATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCATAAGTCCGAAACCCTTGAAAAGTTCAAGGAATATAAGGCAGAGGTTGAGAATTCGTTAGGTAA
AACGATTAAAACACTTCGATCAGATCGAGGTGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGACAAACTTAACGGCGATAACTTTGGAACTTGGAAATCAAACTTGAATACAATTCTTGTAATTGATGATCTAAGGTT
CGTCTTGACGGAGGAATGTCCTTCCCCTCCCAGCTCGTCTGCAACCCGAACAGTTCGGGATGCATTTGACAAATGGACTAGAGCTAATGATAAAGCCCGGGTCTACATCT
TAGCCAGCATATCTGATGTGTTAGCCAAAGAACCCTTGGAACTCATACATTCGGATCTCTGTGGTCCAATGAATGTCAAGGCACGAGGAGGGTATGAATATTTCATCAGT
TTCATTGATGATTATTCAAGGTATGGCTATCTTTACCTAATGCATCATAAGTCCGAAACCCTTGAAAAGTTCAAGGAATATAAGGCAGAGGTTGAGAATTCGTTAGGTAA
AACGATTAAAACACTTCGATCAGATCGAGGTGGATAG
Protein sequenceShow/hide protein sequence
MSSSFIQLLASDKLNGDNFGTWKSNLNTILVIDDLRFVLTEECPSPPSSSATRTVRDAFDKWTRANDKARVYILASISDVLAKEPLELIHSDLCGPMNVKARGGYEYFIS
FIDDYSRYGYLYLMHHKSETLEKFKEYKAEVENSLGKTIKTLRSDRGG