; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022174 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022174
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr7:20365064..20369856
RNA-Seq ExpressionLag0022174
SyntenyLag0022174
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040928.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-2149.25Show/hide
Query:  MDLEMKSLHFNSIWDLV-----NLLDGVILLTLKEASSSFPNESLGEAELVLSQFSYVYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKD
        MDLEM+S++ NSIW LV     N+ +  +   +  ++ +F    + +  L+ +       + +VS YQSNPG D W  V  IL+YL+RT+ YMLVYG+KD
Subjt:  MDLEMKSLHFNSIWDLV-----NLLDGVILLTLKEASSSFPNESLGEAELVLSQFSYVYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKD

Query:  LILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        LIL GYTD+D  TDKD  KSTS SVFTLNGGA+V
Subjt:  LILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

KAA0049866.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-2140.1Show/hide
Query:  MDLEMKSLHFNSIWDLVNLLDGV------------------------------------ILLTLKEASSS------------------------------
        MDLEM+S++FNS+W+LV+L +GV                                     +  LK+AS S                              
Subjt:  MDLEMKSLHFNSIWDLVNLLDGV------------------------------------ILLTLKEASSS------------------------------

Query:  -FPNESLGEAELVLSQFSY------VYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGA
         F  + LGEA+ VL   ++       Y + +VS YQSNPG D W  V  I +YLRR R YMLVYGAKDLIL GYTD D  TDKD  KSTS SVFTLNGGA
Subjt:  -FPNESLGEAELVLSQFSY------VYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGA

Query:  IV
        IV
Subjt:  IV

TYK16417.1 gag/pol protein [Cucumis melo var. makuwa]5.2e-2173.68Show/hide
Query:  YKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        Y +++VS YQSNPG D W VV  IL+YLRRTR YMLVYGAKDLIL GYTD+D  TDKD  KSTS SVFTLNGGA+V
Subjt:  YKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

TYK19425.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-2549.69Show/hide
Query:  MDLEMKSLHFNSIWDLVNLLDGV--ILLTL----------KEASSSFPNESLGEAELVLSQ---------FSYV---------YKIKVVSEYQSNPGFDL
        MDLEM+S++FNS+W+L  L +G   ILL++           +  ++F N +L E  + +SQ           YV         Y + +VS YQSNPG D 
Subjt:  MDLEMKSLHFNSIWDLVNLLDGV--ILLTL----------KEASSSFPNESLGEAELVLSQ---------FSYV---------YKIKVVSEYQSNPGFDL

Query:  WNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        W  V  IL+YLRRTR YMLVYGAKDLIL GYTD D  TDKD  KSTS SVFTLNGGA+V
Subjt:  WNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

TYK23832.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-2142.53Show/hide
Query:  MDLEMKSLHFNSIWDLVNLLD------------------GVILLTLKEASSSFPNESLGEAELVL---------------SQFSYV------------YK
        MDLEM+S++ N +W LV+  +                  G +    K  ++ F  + LG A+ VL               SQ SYV             K
Subjt:  MDLEMKSLHFNSIWDLVNLLD------------------GVILLTLKEASSSFPNESLGEAELVL---------------SQFSYV------------YK

Query:  IKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        + +VS YQSNPG D W  V  IL+YLR+T++YMLVYG+KDLIL GYTD+D  TDKD  KSTS SVFTLNG A+V
Subjt:  IKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

TrEMBL top hitse value%identityAlignment
A0A5D3CK74 Gag/pol protein8.6e-2249.25Show/hide
Query:  MDLEMKSLHFNSIWDLV-----NLLDGVILLTLKEASSSFPNESLGEAELVLSQFSYVYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKD
        MDLEM+S++ NSIW LV     N+ +  +   +  ++ +F    + +  L+ +       + +VS YQSNPG D W  V  IL+YL+RT+ YMLVYG+KD
Subjt:  MDLEMKSLHFNSIWDLV-----NLLDGVILLTLKEASSSFPNESLGEAELVLSQFSYVYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKD

Query:  LILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        LIL GYTD+D  TDKD  KSTS SVFTLNGGA+V
Subjt:  LILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

A0A5D3CWZ1 Gag/pol protein2.5e-2173.68Show/hide
Query:  YKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        Y +++VS YQSNPG D W VV  IL+YLRRTR YMLVYGAKDLIL GYTD+D  TDKD  KSTS SVFTLNGGA+V
Subjt:  YKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

A0A5D3D7C0 Gag/pol protein1.7e-2549.69Show/hide
Query:  MDLEMKSLHFNSIWDLVNLLDGV--ILLTL----------KEASSSFPNESLGEAELVLSQ---------FSYV---------YKIKVVSEYQSNPGFDL
        MDLEM+S++FNS+W+L  L +G   ILL++           +  ++F N +L E  + +SQ           YV         Y + +VS YQSNPG D 
Subjt:  MDLEMKSLHFNSIWDLVNLLDGV--ILLTL----------KEASSSFPNESLGEAELVLSQ---------FSYV---------YKIKVVSEYQSNPGFDL

Query:  WNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        W  V  IL+YLRRTR YMLVYGAKDLIL GYTD D  TDKD  KSTS SVFTLNGGA+V
Subjt:  WNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

A0A5D3DC34 Gag/pol protein5.0e-2240.1Show/hide
Query:  MDLEMKSLHFNSIWDLVNLLDGV------------------------------------ILLTLKEASSS------------------------------
        MDLEM+S++FNS+W+LV+L +GV                                     +  LK+AS S                              
Subjt:  MDLEMKSLHFNSIWDLVNLLDGV------------------------------------ILLTLKEASSS------------------------------

Query:  -FPNESLGEAELVLSQFSY------VYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGA
         F  + LGEA+ VL   ++       Y + +VS YQSNPG D W  V  I +YLRR R YMLVYGAKDLIL GYTD D  TDKD  KSTS SVFTLNGGA
Subjt:  -FPNESLGEAELVLSQFSY------VYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGA

Query:  IV
        IV
Subjt:  IV

A0A5D3DJU9 Gag/pol protein1.9e-2142.53Show/hide
Query:  MDLEMKSLHFNSIWDLVNLLD------------------GVILLTLKEASSSFPNESLGEAELVL---------------SQFSYV------------YK
        MDLEM+S++ N +W LV+  +                  G +    K  ++ F  + LG A+ VL               SQ SYV             K
Subjt:  MDLEMKSLHFNSIWDLVNLLD------------------GVILLTLKEASSSFPNESLGEAELVL---------------SQFSYV------------YK

Query:  IKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV
        + +VS YQSNPG D W  V  IL+YLR+T++YMLVYG+KDLIL GYTD+D  TDKD  KSTS SVFTLNG A+V
Subjt:  IKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-1042.86Show/hide
Query:  LVLSQFSYVYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAI
        +V ++    + + VVS +  NPG + W  V +IL+YLR T    L +G  D ILKGYTD D+  D D  KS++  +FT +GGAI
Subjt:  LVLSQFSYVYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAI

Q6NKZ9 Probable receptor-like serine/threonine-protein kinase At4g345008.3e-0643.08Show/hide
Query:  GKIVLKGL--YFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        GK+  K L    G  A+      MLVYEY++NGNLE+W HG +     LTW   MK+ + TA+ L
Subjt:  GKIVLKGL--YFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

Q8LEB6 Probable receptor-like protein kinase At5g185003.0e-0849.21Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G   MLVYEYVNNGNLE+W  G    H+ LTW+A +K+L+ TA+AL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

Q9LRP3 Probable receptor-like protein kinase At3g174204.0e-0850.79Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G   MLVYEY+NNGNLE+W HG M     LTW+A +KVL+ TA+AL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

Q9SJG2 Probable receptor-like protein kinase At2g429601.1e-1055.56Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G+  MLVYEYVN+GNLE+W HGAM QH  LTW+A MK++  TAQAL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

Arabidopsis top hitse value%identityAlignment
AT1G56720.1 Protein kinase superfamily protein1.0e-1155.56Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G   +LVYEYVNNGNLE+W HGAM QH  LTW+A MKVL+ T++AL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

AT1G56720.2 Protein kinase superfamily protein1.0e-1155.56Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G   +LVYEYVNNGNLE+W HGAM QH  LTW+A MKVL+ T++AL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

AT1G56720.3 Protein kinase superfamily protein1.0e-1155.56Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G   +LVYEYVNNGNLE+W HGAM QH  LTW+A MKVL+ T++AL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

AT2G42960.1 Protein kinase superfamily protein7.9e-1255.56Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G+  MLVYEYVN+GNLE+W HGAM QH  LTW+A MK++  TAQAL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL

AT3G59110.1 Protein kinase superfamily protein3.9e-1153.97Show/hide
Query:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL
        G +  K L   L   + G+  MLVYEYVN+GNLE+W HGAM +   LTW+A MK+L+ TAQAL
Subjt:  GKIVLKGLYFGLAANLVGMLPMLVYEYVNNGNLEKWSHGAMCQHDMLTWKACMKVLLSTAQAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTGGAAATGAAGTCTTTGCATTTCAATTCCATTTGGGATCTTGTAAATTTGCTTGATGGGGTTATCTTACTGACATTAAAAGAGGCTAGCAGCTCATTTCCAAA
TGAAAGTCTGGGAGAAGCTGAGTTAGTCTTGTCTCAATTTTCTTATGTCTACAAGATTAAGGTTGTCAGTGAGTATCAATCAAATCCAGGATTTGATCTCTGGAATGTCG
TTACGTACATCCTCCAGTATCTTAGGAGAACGAGGTTTTATATGCTCGTGTATGGCGCTAAGGATTTGATCCTTAAAGGATACACTGACACAGATGTTTTAACTGATAAG
GATTTGATGAAATCTACATCAGTGTCTGTCTTCACTCTTAATGGAGGAGCAATAGTCGCGACCACCCGAGCAGATGCTGAGGATGCGTATCGTGTCGTCAGAAAAGATCT
TGTTTCAGATCCATGGATCTTTGCTAGCTCCAGCATTATATCTTCCTCTTTTATTGCTGATCATTTACATTGCTTCTTCAATCTCGCGCCTCGGATCTATGGTCACACGG
TTGTTCATGGAGGTGGAAAAATTGTACTGAAGGGTCTTTATTTTGGATTGGCTGCAAATCTTGTTGGGATGTTACCGATGCTAGTATATGAATATGTGAACAATGGAAAT
CTAGAAAAGTGGTCGCATGGAGCCATGTGCCAACATGACATGCTTACTTGGAAGGCTTGCATGAAGGTGCTTCTTAGCACCGCTCAGGCGCTCTACAACGAAGGAGGTGT
TTGGTTTTGTTGGTACTTTTTCGATTCTGTTGAACTTGTTCCTTGTCTCGAATTAAGTATATACCAAACGGATTTCAGATTCTTCACCCTATCAAGATTTAAGATTGTGT
TTGATGCAAGATCTAACTTGCTTAGTTTTAGGATATTTGGGGTCTTGTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTGGAAATGAAGTCTTTGCATTTCAATTCCATTTGGGATCTTGTAAATTTGCTTGATGGGGTTATCTTACTGACATTAAAAGAGGCTAGCAGCTCATTTCCAAA
TGAAAGTCTGGGAGAAGCTGAGTTAGTCTTGTCTCAATTTTCTTATGTCTACAAGATTAAGGTTGTCAGTGAGTATCAATCAAATCCAGGATTTGATCTCTGGAATGTCG
TTACGTACATCCTCCAGTATCTTAGGAGAACGAGGTTTTATATGCTCGTGTATGGCGCTAAGGATTTGATCCTTAAAGGATACACTGACACAGATGTTTTAACTGATAAG
GATTTGATGAAATCTACATCAGTGTCTGTCTTCACTCTTAATGGAGGAGCAATAGTCGCGACCACCCGAGCAGATGCTGAGGATGCGTATCGTGTCGTCAGAAAAGATCT
TGTTTCAGATCCATGGATCTTTGCTAGCTCCAGCATTATATCTTCCTCTTTTATTGCTGATCATTTACATTGCTTCTTCAATCTCGCGCCTCGGATCTATGGTCACACGG
TTGTTCATGGAGGTGGAAAAATTGTACTGAAGGGTCTTTATTTTGGATTGGCTGCAAATCTTGTTGGGATGTTACCGATGCTAGTATATGAATATGTGAACAATGGAAAT
CTAGAAAAGTGGTCGCATGGAGCCATGTGCCAACATGACATGCTTACTTGGAAGGCTTGCATGAAGGTGCTTCTTAGCACCGCTCAGGCGCTCTACAACGAAGGAGGTGT
TTGGTTTTGTTGGTACTTTTTCGATTCTGTTGAACTTGTTCCTTGTCTCGAATTAAGTATATACCAAACGGATTTCAGATTCTTCACCCTATCAAGATTTAAGATTGTGT
TTGATGCAAGATCTAACTTGCTTAGTTTTAGGATATTTGGGGTCTTGTTGTGA
Protein sequenceShow/hide protein sequence
MDLEMKSLHFNSIWDLVNLLDGVILLTLKEASSSFPNESLGEAELVLSQFSYVYKIKVVSEYQSNPGFDLWNVVTYILQYLRRTRFYMLVYGAKDLILKGYTDTDVLTDK
DLMKSTSVSVFTLNGGAIVATTRADAEDAYRVVRKDLVSDPWIFASSSIISSSFIADHLHCFFNLAPRIYGHTVVHGGGKIVLKGLYFGLAANLVGMLPMLVYEYVNNGN
LEKWSHGAMCQHDMLTWKACMKVLLSTAQALYNEGGVWFCWYFFDSVELVPCLELSIYQTDFRFFTLSRFKIVFDARSNLLSFRIFGVLL