; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001626 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001626
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationchr4:33774483..33775106
RNA-Seq ExpressionLag0001626
SyntenyLag0001626
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia]1.2e-6164.57Show/hide
Query:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK
        RR+R R D   IEPPYPWS E RA +H L+YL+SN I+TI GDV C++CE+ YE++Y+L+ KF+EIA FIE+ RD MH+RAP  W + +LP+CE C EE 
Subjt:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK

Query:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH
        CVEP+IP+ + DN    INWLFLLLGQL+G L+L QLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSN LF+
Subjt:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma]6.9e-6264.57Show/hide
Query:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK
        RR+R R D   IEPPYPWS E RA +H L+YL+SN I+TI GDV C++CE+ YE++Y+L+ KF+EIA FIE+ RD MH+RAP  W + +LP+CE C EE 
Subjt:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK

Query:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH
        CVEP+IP+ + DN    INWLFLLLGQL+G L+L QLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSN LF+
Subjt:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia]1.7e-6868.62Show/hide
Query:  PPPPPPPPPPQPPEHPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSS
        P P     P  P    RR+R  L NTPI+PPYPWSTEH+AVVH L+YLR N+ILTITGDV C RCEK+Y ++YDL+ KFEEIASFIE+N+ T+H+RAP S
Subjt:  PPPPPPPPPPQPPEHPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSS

Query:  WVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFHR
        W +    DC+LCGEE CV P IPE D+ NINWLFLLLGQ++G L+L  LKYFCAYTNNHRTGAKNRL+YLTYL LCKQLQPS  LFHR
Subjt:  WVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFHR

XP_022135938.1 probable serine/threonine-protein kinase samkC [Momordica charantia]1.9e-6459.91Show/hide
Query:  NTNQSYEGLNLELSLCLPPPPPPPPPPQPPEHP---RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFE
        NT+QS +      S    P   P    Q  + P   RR R +  +T IEPPYPWST +RAVVH L YL+ N+ILTITGDV+C +C+K+Y+++YDLV KF+
Subjt:  NTNQSYEGLNLELSLCLPPPPPPPPPPQPPEHP---RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFE

Query:  EIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHD----NINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALC
        EIASFIE+N+DT+H+RAPSSW +  LP+C+ CG+E C+ PVIP  D D    NINWLFLLLGQ++GCL L  LKYFC YTNNHRT AK+RL+YLTYL+LC
Subjt:  EIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHD----NINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALC

Query:  KQLQPSNILFHR
        KQLQPS  LFHR
Subjt:  KQLQPSNILFHR

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]2.5e-6451.33Show/hide
Query:  NTNQSYEGLNLELSLCLPPPP----PPPPPPQPP--------------------------------------------------------EHPRRTRFRL
        N N      NLELSL LP PP    PPPPPP PP                                                          PRR R R 
Subjt:  NTNQSYEGLNLELSLCLPPPP----PPPPPPQPP--------------------------------------------------------EHPRRTRFRL

Query:  DNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIP
        D T IEPPYPWST+ RAV+H+L YL+SN I+TI G+V+C++CE++YEM+YDL+ KF EIA FIE  +D+MH+RAP  W   +LP+C LC +E+CVEPVI 
Subjt:  DNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIP

Query:  ENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILF
        E D+  INWLFLLLG+ LGCL+L QLKYFCA TN HRTGAKNRLLYL YL LC QLQPSN LF
Subjt:  ENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILF

TrEMBL top hitse value%identityAlignment
A0A1S3AZB1 protein PAF1 homolog1.4e-5758.08Show/hide
Query:  PPP-------PPPPPPP------QPPE--HPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASF
        PPP       PP  P P      QPPE   P+R R + DN+ IEPPYPWSTE  AV+HKL+YL +N ILTI G+V+C+RC+++ E++Y+L+ KF+EI  F
Subjt:  PPP-------PPPPPPP------QPPE--HPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASF

Query:  IEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSN
        IE+ +D MH+RAP  WV+ +L +C  C +E+CVEP+I E  + NINWLFLLLG  LGCL+L QLKYFC  TN HRTGAK+RL+YLTYLALCKQLQP++
Subjt:  IEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSN

A0A6J1C462 uncharacterized protein LOC1110077688.2e-6968.62Show/hide
Query:  PPPPPPPPPPQPPEHPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSS
        P P     P  P    RR+R  L NTPI+PPYPWSTEH+AVVH L+YLR N+ILTITGDV C RCEK+Y ++YDL+ KFEEIASFIE+N+ T+H+RAP S
Subjt:  PPPPPPPPPPQPPEHPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSS

Query:  WVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFHR
        W +    DC+LCGEE CV P IPE D+ NINWLFLLLGQ++G L+L  LKYFCAYTNNHRTGAKNRL+YLTYL LCKQLQPS  LFHR
Subjt:  WVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFHR

A0A6J1C690 probable serine/threonine-protein kinase samkC9.3e-6559.91Show/hide
Query:  NTNQSYEGLNLELSLCLPPPPPPPPPPQPPEHP---RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFE
        NT+QS +      S    P   P    Q  + P   RR R +  +T IEPPYPWST +RAVVH L YL+ N+ILTITGDV+C +C+K+Y+++YDLV KF+
Subjt:  NTNQSYEGLNLELSLCLPPPPPPPPPPQPPEHP---RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFE

Query:  EIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHD----NINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALC
        EIASFIE+N+DT+H+RAPSSW +  LP+C+ CG+E C+ PVIP  D D    NINWLFLLLGQ++GCL L  LKYFC YTNNHRT AK+RL+YLTYL+LC
Subjt:  EIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHD----NINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALC

Query:  KQLQPSNILFHR
        KQLQPS  LFHR
Subjt:  KQLQPSNILFHR

A0A6J1GM83 mucin-16-like5.7e-6264.57Show/hide
Query:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK
        RR+R R D   IEPPYPWS E RA +H L+YL+SN I+TI GDV C++CE+ YE++Y+L+ KF+EIA FIE+ RD MH+RAP  W + +LP+CE C EE 
Subjt:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK

Query:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH
        CVEP+IP+ + DN    INWLFLLLGQL+G L+L QLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSN LF+
Subjt:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH

A0A6J1I8I0 uncharacterized protein KIAA0754-like2.2e-6164Show/hide
Query:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK
        RR+R R D   IEPPYPWS E RA +H L+YL+SN I+ I GDV C++CE+ YE++Y+L+ KF+EIA FIE+ RD MH+RAP  W + +LP+CE C EE 
Subjt:  RRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEK

Query:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH
        CVEP+IP+ + DN    INWLFLLLGQL+G L+L QLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSN LF+
Subjt:  CVEPVIPENDHDN----INWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein5.3e-4447.45Show/hide
Query:  GLNLELSLCLPPPPPPPPPPQPPEHPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNR
        GL    S   PPP   P       +  R+     +  I PP+PW+T  R  +  L+YL SN+I TITG+V+CR CEK Y++ Y+L E+F E+  F    +
Subjt:  GLNLELSLCLPPPPPPPPPPQPPEHPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNR

Query:  DTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILF
          M +RA   W       CELCG EK V+PVI E     INWLFLLLGQ LG   L QLK FC ++ NHRTGAK+R+LYLTY+ LCK LQP + LF
Subjt:  DTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILF

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.6e-3746.75Show/hide
Query:  IEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDH
        I PPYPW+T+    +     L SN I  I+G V C+ C++   ++Y+L EKF E+  +I+ N++ M  RAP SW    L  C  C  E  ++PV+ E   
Subjt:  IEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDH

Query:  DNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQP
        + INWLFLLLGQ+LGC  L QL+YFC   + HRTG+K+R++Y+TYL+LCKQL P
Subjt:  DNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.8e-2343.44Show/hide
Query:  IEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDH
        I PPYPW+T+    +     L SN I  I+G V C+ C++   ++Y+L EKF E+  +I+ N++ M  RAP SW    L  C  C  E  ++PV+ E   
Subjt:  IEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRDTMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDH

Query:  DNINWLFLLLGQLLGCLRLHQL
        + INWLFLLLGQ+LGC  L QL
Subjt:  DNINWLFLLLGQLLGCLRLHQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATACCAACCAAAGCTACGAGGGTCTCAATCTCGAACTCTCCCTCTGCCTGCCGCCGCCGCCTCCTCCACCTCCTCCTCCGCAGCCACCCGAACATCCAAGACG
AACGAGGTTTCGACTAGACAACACGCCGATCGAGCCACCATATCCATGGTCCACAGAGCACCGGGCGGTAGTCCACAAGCTTGACTACCTCCGATCAAACCGCATCCTGA
CGATCACCGGCGACGTCGAATGCAGGCGGTGCGAGAAACGGTACGAGATGAAGTACGATCTGGTGGAGAAGTTCGAGGAGATAGCGAGTTTCATAGAGCAAAACAGGGAC
ACCATGCACGAGAGAGCGCCGAGTTCGTGGGTGGACTCGGTTTTGCCGGATTGCGAGTTATGTGGGGAAGAGAAGTGCGTGGAGCCGGTGATTCCTGAGAATGATCACGA
CAACATTAACTGGCTGTTCTTGCTTTTGGGACAATTGCTTGGATGTTTGAGACTCCATCAGCTCAAATATTTCTGTGCTTACACCAACAATCATCGGACTGGTGCAAAGA
ATCGCCTTCTTTATCTCACTTATCTTGCTTTGTGTAAGCAGCTTCAGCCCTCCAATATACTGTTCCATCGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATACCAACCAAAGCTACGAGGGTCTCAATCTCGAACTCTCCCTCTGCCTGCCGCCGCCGCCTCCTCCACCTCCTCCTCCGCAGCCACCCGAACATCCAAGACG
AACGAGGTTTCGACTAGACAACACGCCGATCGAGCCACCATATCCATGGTCCACAGAGCACCGGGCGGTAGTCCACAAGCTTGACTACCTCCGATCAAACCGCATCCTGA
CGATCACCGGCGACGTCGAATGCAGGCGGTGCGAGAAACGGTACGAGATGAAGTACGATCTGGTGGAGAAGTTCGAGGAGATAGCGAGTTTCATAGAGCAAAACAGGGAC
ACCATGCACGAGAGAGCGCCGAGTTCGTGGGTGGACTCGGTTTTGCCGGATTGCGAGTTATGTGGGGAAGAGAAGTGCGTGGAGCCGGTGATTCCTGAGAATGATCACGA
CAACATTAACTGGCTGTTCTTGCTTTTGGGACAATTGCTTGGATGTTTGAGACTCCATCAGCTCAAATATTTCTGTGCTTACACCAACAATCATCGGACTGGTGCAAAGA
ATCGCCTTCTTTATCTCACTTATCTTGCTTTGTGTAAGCAGCTTCAGCCCTCCAATATACTGTTCCATCGCTGA
Protein sequenceShow/hide protein sequence
MENTNQSYEGLNLELSLCLPPPPPPPPPPQPPEHPRRTRFRLDNTPIEPPYPWSTEHRAVVHKLDYLRSNRILTITGDVECRRCEKRYEMKYDLVEKFEEIASFIEQNRD
TMHERAPSSWVDSVLPDCELCGEEKCVEPVIPENDHDNINWLFLLLGQLLGCLRLHQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNILFHR