; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039034 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039034
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr2:34170928..34183905
RNA-Seq ExpressionLag0039034
SyntenyLag0039034
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026154.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-2543.32Show/hide
Query:  MQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFLPDLE
        +QNSKK  L FRH + +SKE CP+TPQ VEDMR +P                    YAVGIV+                  A EAA + VWLRKFL DLE
Subjt:  MQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFLPDLE

Query:  GVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVV-----------SFMKALWTKLYEDHLEGLDLRDMY
         VPN+NL ITLYCD+S  + N KEP SHK+ + IE            G ++V           SF K L  K++E HLE L LRDMY
Subjt:  GVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVV-----------SFMKALWTKLYEDHLEGLDLRDMY

KAA0035552.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]3.7e-2850.68Show/hide
Query:  QNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----
        +NSKKD L F++G+ +SKE CP+TPQ VE+MRH       V A EAAI+ VW RKFL DLE V N+NL ITLYCD+S A+ N KEPRSHK+ + IE    
Subjt:  QNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----

Query:  -----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY
                         L  +I   F K L TK++E HLE L LRDMY
Subjt:  -----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY

KAA0053385.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]3.7e-2843.46Show/hide
Query:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFL
        +RY MQNSKKD L FRHG+ +SKE CP+TPQ  EDMR +P                    YAVGIV+                  A +AA + +WLRKFL
Subjt:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFL

Query:  PDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVVS-----------FMKALWTKLYEDHLEGLDLRDMY
         DLE VPN+NL ITLYCD+S A+ N KE RSHK+ + IE            G ++V+           F K L  K++E HLE L LRDMY
Subjt:  PDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVVS-----------FMKALWTKLYEDHLEGLDLRDMY

KAA0060794.1 putative Integrase core domain [Cucumis melo var. makuwa]5.4e-2740.48Show/hide
Query:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT---------------------------------
        +RY MQNSKKD L F+HG+ +SKE CP+TPQ VEDMR +P                    YAVGIV+                                 
Subjt:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT---------------------------------

Query:  ----AYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE---------------------LNGSIVVSFMKALWTKLYEDH
            A EAA + VWLRKFL DLE VPN+NL ITLYCD+S A+ N KEPRSHK+ + IE                        +I   F K L  K+++DH
Subjt:  ----AYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE---------------------LNGSIVVSFMKALWTKLYEDH

Query:  LEGLDLRDMY
        LE L LRDMY
Subjt:  LEGLDLRDMY

TYK30982.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]4.9e-2851.02Show/hide
Query:  NSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE-----
        NSKKD L F++G+ +SKE CP+TPQ VE+MRH       V A EAAI+ VW RKFL DLE V N+NL ITLYCD+S A+ N KEPRSHK+ + IE     
Subjt:  NSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE-----

Query:  ----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY
                        L  +I   F K L TK++E HLE L LRDMY
Subjt:  ----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY

TrEMBL top hitse value%identityAlignment
A0A5A7SKC5 Gag/pol protein6.4e-2643.32Show/hide
Query:  MQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFLPDLE
        +QNSKK  L FRH + +SKE CP+TPQ VEDMR +P                    YAVGIV+                  A EAA + VWLRKFL DLE
Subjt:  MQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFLPDLE

Query:  GVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVV-----------SFMKALWTKLYEDHLEGLDLRDMY
         VPN+NL ITLYCD+S  + N KEP SHK+ + IE            G ++V           SF K L  K++E HLE L LRDMY
Subjt:  GVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVV-----------SFMKALWTKLYEDHLEGLDLRDMY

A0A5A7SWB2 Retrovirus-related pol polyprotein from transposon tnt 1-941.8e-2850.68Show/hide
Query:  QNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----
        +NSKKD L F++G+ +SKE CP+TPQ VE+MRH       V A EAAI+ VW RKFL DLE V N+NL ITLYCD+S A+ N KEPRSHK+ + IE    
Subjt:  QNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----

Query:  -----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY
                         L  +I   F K L TK++E HLE L LRDMY
Subjt:  -----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY

A0A5A7UI63 Putative gag-pol polyprotein1.8e-2843.46Show/hide
Query:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFL
        +RY MQNSKKD L FRHG+ +SKE CP+TPQ  EDMR +P                    YAVGIV+                  A +AA + +WLRKFL
Subjt:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT------------------AYEAAIKTVWLRKFL

Query:  PDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVVS-----------FMKALWTKLYEDHLEGLDLRDMY
         DLE VPN+NL ITLYCD+S A+ N KE RSHK+ + IE            G ++V+           F K L  K++E HLE L LRDMY
Subjt:  PDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE----------LNGSIVVS-----------FMKALWTKLYEDHLEGLDLRDMY

A0A5A7V0F0 Putative Integrase core domain2.6e-2740.48Show/hide
Query:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT---------------------------------
        +RY MQNSKKD L F+HG+ +SKE CP+TPQ VEDMR +P                    YAVGIV+                                 
Subjt:  IRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMP--------------------YAVGIVT---------------------------------

Query:  ----AYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE---------------------LNGSIVVSFMKALWTKLYEDH
            A EAA + VWLRKFL DLE VPN+NL ITLYCD+S A+ N KEPRSHK+ + IE                        +I   F K L  K+++DH
Subjt:  ----AYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE---------------------LNGSIVVSFMKALWTKLYEDH

Query:  LEGLDLRDMY
        LE L LRDMY
Subjt:  LEGLDLRDMY

A0A5D3E512 Retrovirus-related pol polyprotein from transposon tnt 1-942.4e-2851.02Show/hide
Query:  NSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE-----
        NSKKD L F++G+ +SKE CP+TPQ VE+MRH       V A EAAI+ VW RKFL DLE V N+NL ITLYCD+S A+ N KEPRSHK+ + IE     
Subjt:  NSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIE-----

Query:  ----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY
                        L  +I   F K L TK++E HLE L LRDMY
Subjt:  ----------------LNGSIVVSFMKALWTKLYEDHLEGLDLRDMY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTAAGAAACCCTATCCATTTCTGGAGTTTGCGTTGTTCCGAATCTCATGATCAAGCCCTCCGTAGGGTGATCGCTCCCAGGGCGACTCGCAGGCAGATCATAGG
AATCTCACGGTGCAACCTAATGGAGGAGACCGGGGTCACTCCCACGGTGACTCGAGGCCAAGCATGGACCTCACGATGTGAACTCTTGAGGACAACCAGCCCGAACTGTG
TCAGAATAGCCCAAACCGGACCCAAATTGCTCAACCCGAACCAAATACTCCAAGAACCGGTCCAACCCACGGAGAACTGGACCGAACCGAACTTTGAACCGCAAACCATC
AGCGCACACGGGAAAAAAAACAAAATACTTGCCTTGGACCCGCCTCGAACTTGCGCCCTCAAGGATGCGTCGAGACGCTGTAGGCCAAGCGTCCCGACGCTGGCCCGATT
TTTTCCAGCGCCTGATCTTCAGCGTCGCGACGCCGTTGAGCAGGGTCGCGACGCTACGCCTCTGGACAGCCTCTACACCCCCTCCCTGCGTTTTTCCCCATCTCCGGCCA
TCTCTCTCTCGCTCTCTCCCGCGGATAGCTCGTGCCGCCGCTTGCAGCGCTGCCGCCTAGCTCAGCCGCCGTCGCCCATCTCCTCCACGCGCAGCCGCCGCCGCCATCTT
CGCGTTTCTCTCTCTCTCTCTCGATTTCGGTTCGCGTGGAAGCAGCCCCCAGCCGCCGGTGTCCTCGTCGTCGTTCCAGCCGCCGCCGCGGATCCTCGCCGTAGCCGTCG
CCGTTGCTGTCCCTCGCCGGAAAACAGAGACCCAAGCGTCGTCGCTCTTTTTCCCCTCTTTTCCTTGCGTTTCAACAAGATTCGCGCGCGTCCAGCAGTCCGAGCCTCGC
TTTTGTGCGATTTTGCTTCTGTTCAGCAAGCGGTTTGGCCTCGAATCTCCTTGTCGGCGCCGTCTAAGTGTTCGATAAGGTTCGAAACACTTCAGCTTGGATACCCATTG
CCCAAGGAGCGTTCTAACACGTTGTTAGAGATTATGCAGGTGATGAGTTTTATGTGCCCGGTGATGCGGACGAGGAGATTCATGAGGGATGACCACATGGGTCCAGTTAC
CTGTGATCTAAAATTTTGGTTTCGTCCTATGTATGCCGTTATGCTGCCGAAATTTTCGGAGCTCTCGGAATTGAATCCTCTAGCGGAAGCGGTTTTCACCGATCGTGATT
TAATTTCAATTATTCGAATACCGACCGTTCTTCACAGCACTATGCTCCTCAATGAACAGTTCGTCAACGATCTCTCCAACGAACACAACCATTTGGATCACCCACTAATC
TCTGAGTTCTCGAGAACCCTTTACGAGCTAGCAGGTGGACCCAATGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATTGGCTAAACTCATTAACCAACAGTTC
CTCTCGGGCCAGGAGAGGATGGGCGCCCTTGTTCAAGACCCGGAATCAGCCCTTAAGGGAACACACATCTGCTTACCCCAATAGGAGAAGGAGTGAATTCCATCTTGTAC
TGTTATGTTCCCAGCCCCCATTCGGTCTTGCCCCTGAAATGGATACCCCCACTCGCATGTCTCCTACATGGATGCTTTGGATCATTGCATATGTATCGAATACAAAGTGG
GCCGTATCACATAGTGTTACCAGGATAAGATATGAGATGCAAAATTCCAAGAAGGATTCACTATCTTTCAGGCATGGAATTCTTATGTCTAAGGAACTATGTCCTGAGAC
ACCTCAAAACGTTGAGGATATGAGACATATGCCCTATGCAGTAGGGATTGTCACTGCTTATGAAGCAGCGATAAAGACTGTATGGCTTAGGAAGTTCTTGCCTGATTTGG
AAGGTGTTCCAAATGTGAACTTGCTCATCACACTTTATTGTGATAGTAGTCGTGCTATGGAAAATTTAAAAGAACCTCGCAGCCACAAGAAAAGTAGGCAAATTGAATTA
AATGGAAGTATTGTTGTTTCATTCATGAAGGCTCTCTGGACTAAATTGTATGAGGATCATCTAGAAGGTCTAGATCTGCGAGATATGTACAAAGATACCTATACCGGTAA
GTCGATCCTTCACGAGTGTTCGCAACTACATTTGGGTCAAATTACCGTTTTACCCATGTGTTACCTCTGGCTCCGTAAGTACCAGTGCTCCTCTAATGAACAACCTGTTT
ATGGTCCAACCAGTAAACAGAAAGTCCCTCTCGGGCCAGTGAGAGGGCGGGATCCCTTTGTTCAAGACTCGGAGTCACCATTAAGGGAACACTCATCTACTTCTTCTAGA
AGCGGGAAGGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCTAAGAAACCCTATCCATTTCTGGAGTTTGCGTTGTTCCGAATCTCATGATCAAGCCCTCCGTAGGGTGATCGCTCCCAGGGCGACTCGCAGGCAGATCATAGG
AATCTCACGGTGCAACCTAATGGAGGAGACCGGGGTCACTCCCACGGTGACTCGAGGCCAAGCATGGACCTCACGATGTGAACTCTTGAGGACAACCAGCCCGAACTGTG
TCAGAATAGCCCAAACCGGACCCAAATTGCTCAACCCGAACCAAATACTCCAAGAACCGGTCCAACCCACGGAGAACTGGACCGAACCGAACTTTGAACCGCAAACCATC
AGCGCACACGGGAAAAAAAACAAAATACTTGCCTTGGACCCGCCTCGAACTTGCGCCCTCAAGGATGCGTCGAGACGCTGTAGGCCAAGCGTCCCGACGCTGGCCCGATT
TTTTCCAGCGCCTGATCTTCAGCGTCGCGACGCCGTTGAGCAGGGTCGCGACGCTACGCCTCTGGACAGCCTCTACACCCCCTCCCTGCGTTTTTCCCCATCTCCGGCCA
TCTCTCTCTCGCTCTCTCCCGCGGATAGCTCGTGCCGCCGCTTGCAGCGCTGCCGCCTAGCTCAGCCGCCGTCGCCCATCTCCTCCACGCGCAGCCGCCGCCGCCATCTT
CGCGTTTCTCTCTCTCTCTCTCGATTTCGGTTCGCGTGGAAGCAGCCCCCAGCCGCCGGTGTCCTCGTCGTCGTTCCAGCCGCCGCCGCGGATCCTCGCCGTAGCCGTCG
CCGTTGCTGTCCCTCGCCGGAAAACAGAGACCCAAGCGTCGTCGCTCTTTTTCCCCTCTTTTCCTTGCGTTTCAACAAGATTCGCGCGCGTCCAGCAGTCCGAGCCTCGC
TTTTGTGCGATTTTGCTTCTGTTCAGCAAGCGGTTTGGCCTCGAATCTCCTTGTCGGCGCCGTCTAAGTGTTCGATAAGGTTCGAAACACTTCAGCTTGGATACCCATTG
CCCAAGGAGCGTTCTAACACGTTGTTAGAGATTATGCAGGTGATGAGTTTTATGTGCCCGGTGATGCGGACGAGGAGATTCATGAGGGATGACCACATGGGTCCAGTTAC
CTGTGATCTAAAATTTTGGTTTCGTCCTATGTATGCCGTTATGCTGCCGAAATTTTCGGAGCTCTCGGAATTGAATCCTCTAGCGGAAGCGGTTTTCACCGATCGTGATT
TAATTTCAATTATTCGAATACCGACCGTTCTTCACAGCACTATGCTCCTCAATGAACAGTTCGTCAACGATCTCTCCAACGAACACAACCATTTGGATCACCCACTAATC
TCTGAGTTCTCGAGAACCCTTTACGAGCTAGCAGGTGGACCCAATGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATTGGCTAAACTCATTAACCAACAGTTC
CTCTCGGGCCAGGAGAGGATGGGCGCCCTTGTTCAAGACCCGGAATCAGCCCTTAAGGGAACACACATCTGCTTACCCCAATAGGAGAAGGAGTGAATTCCATCTTGTAC
TGTTATGTTCCCAGCCCCCATTCGGTCTTGCCCCTGAAATGGATACCCCCACTCGCATGTCTCCTACATGGATGCTTTGGATCATTGCATATGTATCGAATACAAAGTGG
GCCGTATCACATAGTGTTACCAGGATAAGATATGAGATGCAAAATTCCAAGAAGGATTCACTATCTTTCAGGCATGGAATTCTTATGTCTAAGGAACTATGTCCTGAGAC
ACCTCAAAACGTTGAGGATATGAGACATATGCCCTATGCAGTAGGGATTGTCACTGCTTATGAAGCAGCGATAAAGACTGTATGGCTTAGGAAGTTCTTGCCTGATTTGG
AAGGTGTTCCAAATGTGAACTTGCTCATCACACTTTATTGTGATAGTAGTCGTGCTATGGAAAATTTAAAAGAACCTCGCAGCCACAAGAAAAGTAGGCAAATTGAATTA
AATGGAAGTATTGTTGTTTCATTCATGAAGGCTCTCTGGACTAAATTGTATGAGGATCATCTAGAAGGTCTAGATCTGCGAGATATGTACAAAGATACCTATACCGGTAA
GTCGATCCTTCACGAGTGTTCGCAACTACATTTGGGTCAAATTACCGTTTTACCCATGTGTTACCTCTGGCTCCGTAAGTACCAGTGCTCCTCTAATGAACAACCTGTTT
ATGGTCCAACCAGTAAACAGAAAGTCCCTCTCGGGCCAGTGAGAGGGCGGGATCCCTTTGTTCAAGACTCGGAGTCACCATTAAGGGAACACTCATCTACTTCTTCTAGA
AGCGGGAAGGAGTGA
Protein sequenceShow/hide protein sequence
MDLRNPIHFWSLRCSESHDQALRRVIAPRATRRQIIGISRCNLMEETGVTPTVTRGQAWTSRCELLRTTSPNCVRIAQTGPKLLNPNQILQEPVQPTENWTEPNFEPQTI
SAHGKKNKILALDPPRTCALKDASRRCRPSVPTLARFFPAPDLQRRDAVEQGRDATPLDSLYTPSLRFSPSPAISLSLSPADSSCRRLQRCRLAQPPSPISSTRSRRRHL
RVSLSLSRFRFAWKQPPAAGVLVVVPAAAADPRRSRRRCCPSPENRDPSVVALFPLFSLRFNKIRARPAVRASLLCDFASVQQAVWPRISLSAPSKCSIRFETLQLGYPL
PKERSNTLLEIMQVMSFMCPVMRTRRFMRDDHMGPVTCDLKFWFRPMYAVMLPKFSELSELNPLAEAVFTDRDLISIIRIPTVLHSTMLLNEQFVNDLSNEHNHLDHPLI
SEFSRTLYELAGGPNGPTDQKLQRYETNWLNSLTNSSSRARRGWAPLFKTRNQPLREHTSAYPNRRRSEFHLVLLCSQPPFGLAPEMDTPTRMSPTWMLWIIAYVSNTKW
AVSHSVTRIRYEMQNSKKDSLSFRHGILMSKELCPETPQNVEDMRHMPYAVGIVTAYEAAIKTVWLRKFLPDLEGVPNVNLLITLYCDSSRAMENLKEPRSHKKSRQIEL
NGSIVVSFMKALWTKLYEDHLEGLDLRDMYKDTYTGKSILHECSQLHLGQITVLPMCYLWLRKYQCSSNEQPVYGPTSKQKVPLGPVRGRDPFVQDSESPLREHSSTSSR
SGKE