CuGenDBv2

Lag0021983 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021983
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr7:15392276..15403478
RNA-Seq Expression: Lag0021983
Synteny: Lag0021983
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
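
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span of the locus. The helper names (`Location`, `parse_location`) are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED to be 1-based, inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Under the 1-based inclusive assumption both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr7:15392276..15403478'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("chr7:15392276..15403478")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under that assumption the locus spans 11,203 bp on chr7.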


Homology
GenBank top hits | e value | % identity | Alignment
KAA0046140.1 gag/pol protein [Cucumis melo var. makuwa] | 8.8e-50 | 60 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        M+SS+VQLLAS+KLN DNY  WKSNLNT+LV+DDLRF+LT EC   S S   R SQ+ YD W +AN K +V+IL+S+SDVL+KKHES+AT KEIMDSL+ 
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAKEL-----QTYQSFTKGKGKENEANVVS
        MFGQ   FLRH+ +KY+Y  +MKEGT +REHVL+MM+HFN  +          Q+ T+GKGK+ EANV +
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAKEL-----QTYQSFTKGKGKENEANVVS

KAA0047871.1 gag/pol protein [Cucumis melo var. makuwa] | 2.3e-50 | 70.63 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        M+SS+VQLLASEKLNGDNY TWKSNLNT+LV+DDLRFVLTEEC   S S   R SQ  YD W + N K +V+IL S+SDVL+KKHES+ATTKEIMDSL+ 
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK
        MFGQP   LRH+A+KY+Y  +MKEGTSVREHVL+MM+HFN A+
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK

TYK05765.1 gag/pol protein [Cucumis melo var. makuwa] | 1.5e-49 | 53.59 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        M+SS+VQLLA EKLN DNY  +KSNLN +LV+DDLRFVLTEEC     S   R S+  YD W +AN K +V+IL S+SDVL+KKHES+ATTKEIMDSL+ 
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG
        MFGQP  F+RHKA+KY+Y  +MKEGTSVREHVL+MM+HFN AK                                            ELQ +Q+ TKGKG
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG

Query:  KENEANVVS
        KE EANV +
Subjt:  KENEANVVS

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia] | 8.0e-51 | 55.02 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        MS+S +QLLAS+KLNGDNYG WKSNLNT+LVIDDLRFVLTEEC P       R  +D YD W +AN K +V+IL SIS+VLSKKHE +ATT+EIMDSLQA
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG
        +FGQPS+ L H A+KYVYN +MKEG+SVREHVLNMMVHFN A+                                            ELQ Y+S  K KG
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG

Query:  KENEANVVS
         E EANV +
Subjt:  KENEANVVS

XP_038904195.1 uncharacterized protein LOC120090541 [Benincasa hispida] | 6.8e-50 | 69.93 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        MS+S++QLLASEKLNGDNYGTWKSN+NT+LVIDDLRFVLTEEC P+ G    R   D YDIW +AN K +V+IL SI DVLSKKHE +AT +EI+DSLQ+
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK
        +FGQPS+   H A+K+VYN +MKEGT VREHVLNMMVHFN A+
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TSK9 Gag/pol protein | 4.3e-50 | 60 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        M+SS+VQLLAS+KLN DNY  WKSNLNT+LV+DDLRF+LT EC   S S   R SQ+ YD W +AN K +V+IL+S+SDVL+KKHES+AT KEIMDSL+ 
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAKEL-----QTYQSFTKGKGKENEANVVS
        MFGQ   FLRH+ +KY+Y  +MKEGT +REHVL+MM+HFN  +          Q+ T+GKGK+ EANV +
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAKEL-----QTYQSFTKGKGKENEANVVS

A0A5A7TXW7 Gag/pol protein | 3.6e-49 | 61.96 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        M++S+VQLLAS+KLNGDNY TWK NLNT+LV++DLRFVLTEEC     ST  R  ++ YD W +AN K +V+I+ ++SDVL+KKHES+AT KEIMDSL  
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAKELQTYQSFTKGKGKENEANV
        MFGQPS  L+H+A+KY+Y  Q+KEGTSVREHVL+MM+HFN A   + +Q+ T  K KE E+NV
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAKELQTYQSFTKGKGKENEANV

A0A5A7U2U3 Gag/pol protein | 1.1e-50 | 70.63 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        M+SS+VQLLASEKLNGDNY TWKSNLNT+LV+DDLRFVLTEEC   S S   R SQ  YD W + N K +V+IL S+SDVL+KKHES+ATTKEIMDSL+ 
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK
        MFGQP   LRH+A+KY+Y  +MKEGTSVREHVL+MM+HFN A+
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK

A0A5D3C306 Gag/pol protein | 7.3e-50 | 53.59 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        M+SS+VQLLA EKLN DNY  +KSNLN +LV+DDLRFVLTEEC     S   R S+  YD W +AN K +V+IL S+SDVL+KKHES+ATTKEIMDSL+ 
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG
        MFGQP  F+RHKA+KY+Y  +MKEGTSVREHVL+MM+HFN AK                                            ELQ +Q+ TKGKG
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG

Query:  KENEANVVS
        KE EANV +
Subjt:  KENEANVVS

A0A6J1DWG6 uncharacterized protein LOC111025021 | 3.9e-51 | 55.02 | (below)
Query:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA
        MS+S +QLLAS+KLNGDNYG WKSNLNT+LVIDDLRFVLTEEC P       R  +D YD W +AN K +V+IL SIS+VLSKKHE +ATT+EIMDSLQA
Subjt:  MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQA

Query:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG
        +FGQPS+ L H A+KYVYN +MKEG+SVREHVLNMMVHFN A+                                            ELQ Y+S  K KG
Subjt:  MFGQPSSFLRHKALKYVYNSQMKEGTSVREHVLNMMVHFNAAK--------------------------------------------ELQTYQSFTKGKG

Query:  KENEANVVS
         E EANV +
Subjt:  KENEANVVS

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGTCTAGCTCTTTAGTGCAACTATTAGCTTCAGAAAAACTTAATGGCGACAACTATGGAACTTGGAAATCAAACTTAAACACATTATTAGTAATTGATGATCTTCGATT
CGTCTTGACGGAAGAGTGTCTGCCTAACTCTGGCTCAACAGGAACCCGAGTGAGTCAGGATGTCTATGACATATGGACTAGAGCCAACAACAAGGAAAAAGTCTTCATCT
TGATAAGCATATCTGATGTTTTGTCAAAGAAACACGAGAGCGTGGCTACGACCAAAGAAATTATGGACTCATTACAAGCAATGTTTGGACAACCGTCCTCATTTCTTAGA
CATAAGGCTCTCAAATATGTTTACAATTCTCAAATGAAAGAAGGAACTTCTGTTCGAGAGCATGTTTTGAACATGATGGTGCATTTCAATGCAGCCAAGGAGCTTCAGAC
TTACCAGTCCTTTACTAAAGGTAAAGGAAAAGAAAACGAAGCAAACGTTGTTTCTGGTTGTTCATTAGAGGAGTACTGGGAACTTAAGGACAAAGATGTAAACCAGGGAG
ATCTCTCTCAAAAGACTCCCACAAGTCTCCTGCTTCAAGAAGTCTTAGAGATATACCTGTGTAGCCGATTGGTGGTGTTCGCACAAGAGGGTTGCTGCGTTTTCATTCTT
ATTGGCAAGGAAAGGCGAATTGATCAAGGCTTTCTACAAAAGAAACTAGTTCCTGGAAGGCACTTGGAGAAGGCGAGGAAAACAGAGGAAAAGCTGGAATTTGTCCAGAA
ATGCGACTGCATTTCTGGAAGGCAAAATGAAATGCGACCGCATTTCTGGAAAAACAAAGACTTTGCATGCTATGTGAGTTTGATCTCGAAAGGGACAGACGATATCAGTT
CTCACAAAGTGCTTAGGGTTGGTCGTCGTGAGAGGTTGAGAATACCGTCTACGAGAGATGTCTGGAAAGTGCACAAATGCAGCTGTTGGGTTCCTAGAATTCCTCCCGGA
ACTCTGGATCTTTTGGAATGTGTTTCAACCCCAATGAAGTACTACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCACTGGCGTTCACCAATACACAATGAT
CTATTTTTCTCCCAGCTTTGCTCCAGCACAGATTTTGCTCCCGTTGTTCATTAGAGGAGCGCTGAGTTATAGTGGGGAAGATCTCGTGGTGGTGTTCGTTGAAGTATTCA
GAGTCGTCAACGGAGGTAAGGAGGATGCAAAATCTGAATTACCAAGAAGCATGTCAGAGCTTAATGATGAAATCAGCATAATCGCAGTGGTAAATCAGGATCCCATGGTC
GAGCGCATGATGCATGATTTAGCAACTTCTGTTCCGAGTCTTCTCTCATCTCTTGGAGACCTTGAAGTCAGCAGAGAAATCATATACATCACCCAAGCGCATGATGAAGA
ATTTCACAAGTTTGGGATCGCTTCAATAGGTTATATGCACTTCCTCATCGCCCACTGTCCGAGGAGTCCTTACTTTTGCACTTCGTTGATGGTCTGGCGCTTGACGATAG
AAGAATGCTCGATATAG
mRNA sequence
ATGTCTAGCTCTTTAGTGCAACTATTAGCTTCAGAAAAACTTAATGGCGACAACTATGGAACTTGGAAATCAAACTTAAACACATTATTAGTAATTGATGATCTTCGATT
CGTCTTGACGGAAGAGTGTCTGCCTAACTCTGGCTCAACAGGAACCCGAGTGAGTCAGGATGTCTATGACATATGGACTAGAGCCAACAACAAGGAAAAAGTCTTCATCT
TGATAAGCATATCTGATGTTTTGTCAAAGAAACACGAGAGCGTGGCTACGACCAAAGAAATTATGGACTCATTACAAGCAATGTTTGGACAACCGTCCTCATTTCTTAGA
CATAAGGCTCTCAAATATGTTTACAATTCTCAAATGAAAGAAGGAACTTCTGTTCGAGAGCATGTTTTGAACATGATGGTGCATTTCAATGCAGCCAAGGAGCTTCAGAC
TTACCAGTCCTTTACTAAAGGTAAAGGAAAAGAAAACGAAGCAAACGTTGTTTCTGGTTGTTCATTAGAGGAGTACTGGGAACTTAAGGACAAAGATGTAAACCAGGGAG
ATCTCTCTCAAAAGACTCCCACAAGTCTCCTGCTTCAAGAAGTCTTAGAGATATACCTGTGTAGCCGATTGGTGGTGTTCGCACAAGAGGGTTGCTGCGTTTTCATTCTT
ATTGGCAAGGAAAGGCGAATTGATCAAGGCTTTCTACAAAAGAAACTAGTTCCTGGAAGGCACTTGGAGAAGGCGAGGAAAACAGAGGAAAAGCTGGAATTTGTCCAGAA
ATGCGACTGCATTTCTGGAAGGCAAAATGAAATGCGACCGCATTTCTGGAAAAACAAAGACTTTGCATGCTATGTGAGTTTGATCTCGAAAGGGACAGACGATATCAGTT
CTCACAAAGTGCTTAGGGTTGGTCGTCGTGAGAGGTTGAGAATACCGTCTACGAGAGATGTCTGGAAAGTGCACAAATGCAGCTGTTGGGTTCCTAGAATTCCTCCCGGA
ACTCTGGATCTTTTGGAATGTGTTTCAACCCCAATGAAGTACTACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCACTGGCGTTCACCAATACACAATGAT
CTATTTTTCTCCCAGCTTTGCTCCAGCACAGATTTTGCTCCCGTTGTTCATTAGAGGAGCGCTGAGTTATAGTGGGGAAGATCTCGTGGTGGTGTTCGTTGAAGTATTCA
GAGTCGTCAACGGAGGTAAGGAGGATGCAAAATCTGAATTACCAAGAAGCATGTCAGAGCTTAATGATGAAATCAGCATAATCGCAGTGGTAAATCAGGATCCCATGGTC
GAGCGCATGATGCATGATTTAGCAACTTCTGTTCCGAGTCTTCTCTCATCTCTTGGAGACCTTGAAGTCAGCAGAGAAATCATATACATCACCCAAGCGCATGATGAAGA
ATTTCACAAGTTTGGGATCGCTTCAATAGGTTATATGCACTTCCTCATCGCCCACTGTCCGAGGAGTCCTTACTTTTGCACTTCGTTGATGGTCTGGCGCTTGACGATAG
AAGAATGCTCGATATAG
Protein sequence
MSSSLVQLLASEKLNGDNYGTWKSNLNTLLVIDDLRFVLTEECLPNSGSTGTRVSQDVYDIWTRANNKEKVFILISISDVLSKKHESVATTKEIMDSLQAMFGQPSSFLR
HKALKYVYNSQMKEGTSVREHVLNMMVHFNAAKELQTYQSFTKGKGKENEANVVSGCSLEEYWELKDKDVNQGDLSQKTPTSLLLQEVLEIYLCSRLVVFAQEGCCVFIL
IGKERRIDQGFLQKKLVPGRHLEKARKTEEKLEFVQKCDCISGRQNEMRPHFWKNKDFACYVSLISKGTDDISSHKVLRVGRRERLRIPSTRDVWKVHKCSCWVPRIPPG
TLDLLECVSTPMKYYRSSTRQLAIPSSVTGVHQYTMIYFSPSFAPAQILLPLFIRGALSYSGEDLVVVFVEVFRVVNGGKEDAKSELPRSMSELNDEISIIAVVNQDPMV
ERMMHDLATSVPSLLSSLGDLEVSREIIYITQAHDEEFHKFGIASIGYMHFLIAHCPRSPYFCTSLMVWRLTIEECSI
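
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is illustrative only: it assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 30 nt and the last 18 nt of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), written in the
# conventional TCAG ordering so codons and amino acids line up position by position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Codons containing characters
    other than A, C, G, T (for example N) raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS above, then the final 18 nt (in frame).
print(translate("ATGTCTAGCTCTTTAGTGCAACTATTAGCT"))
print(translate("GAAGAATGCTCGATATAG"))
```

The two fragments translate to MSSSLVQLLA and EECSI, matching the start and end of the protein sequence listed above; passing the full CDS text (line breaks included) runs the same check over the whole record.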