; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001627 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001627
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationchr4:33777919..33778602
RNA-Seq ExpressionLag0001627
SyntenyLag0001627
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia]1.1e-6560.27Show/hide
Query:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE
        P +VEE   +  ++H   ++E+  +     P     H       RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y IE
Subjt:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE

Query:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL
        Y+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+EE CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL+
Subjt:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL

Query:  YLTYLALCKQLQPSNRLFH
        +LTYLALCKQLQPSNRLF+
Subjt:  YLTYLALCKQLQPSNRLFH

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-6660.27Show/hide
Query:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE
        P +VEE   +  ++H   ++E+  +     P     H       RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y IE
Subjt:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE

Query:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL
        Y+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+EE CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL+
Subjt:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL

Query:  YLTYLALCKQLQPSNRLFH
        +LTYLALCKQLQPSNRLF+
Subjt:  YLTYLALCKQLQPSNRLFH

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia]7.8e-6769.54Show/hide
Query:  RRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEK
        RRSR +L N PIKPPYPWSTE +AVVH+L YLR  +I+TI+GDV+C RC+ QY IEYDL+ KFEEIA FIE+N+ TLHDRAP  W +P   DC++C EE 
Subjt:  RRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEK

Query:  CVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR
        CV P IP+ D +   INWLFLLLGQ++G LKL+ LKYFCAYTNNHRTGAKNRL+YLTYL LCKQLQPS  LFHR
Subjt:  CVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR

XP_022953023.1 mucin-16-like [Cucurbita moschata]1.1e-6560.27Show/hide
Query:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE
        P +VEE   +  ++H   ++E+  +     P     H       RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y IE
Subjt:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE

Query:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL
        Y+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+EE CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL+
Subjt:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL

Query:  YLTYLALCKQLQPSNRLFH
        +LTYLALCKQLQPSNRLF+
Subjt:  YLTYLALCKQLQPSNRLFH

XP_022972400.1 uncharacterized protein KIAA0754-like [Cucurbita maxima]9.5e-6556.25Show/hide
Query:  ETNQSQVGL--NLELSLHPPSVEEEELSLHPS------SSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSR
        ET+   V +   +E +L  P+   + +   P+       +VEE  +     P     H       RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S 
Subjt:  ETNQSQVGL--NLELSLHPPSVEEEELSLHPS------SSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSR

Query:  EIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLK
         IV I GDV+C +C+  Y IEY+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+EE CVEP+IPD  DD +F +INWLFLLLGQL+G LKLK
Subjt:  EIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLK

Query:  QLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH
        QLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSNRLF+
Subjt:  QLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH

TrEMBL top hitse value%identityAlignment
A0A1S3AZB1 protein PAF1 homolog7.3e-5552.53Show/hide
Query:  NLELSLHPPSVEEEELSLHPSSSVEEDDDTPPPPP-----PPPEHHHPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQ
        +L L L  P +  + L   P+   +      P P       PPE   P+R RT+ +N  I+PPYPWSTE+ AV+H LEYL +  I+TI G+VKC RC  +
Subjt:  NLELSLHPPSVEEEELSLHPSSSVEEDDDTPPPPP-----PPPEHHHPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQ

Query:  YMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNR
          IEY+L+ KF+EI RFIER +D +HDRAP  W++P L +C  C +E+CVEP+I + +     INWLFLLLG  LG LKL QLKYFC  TN HRTGAK+R
Subjt:  YMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNR

Query:  LLYLTYLALCKQLQPSN
        L+YLTYLALCKQLQP++
Subjt:  LLYLTYLALCKQLQPSN

A0A6J1C462 uncharacterized protein LOC1110077683.8e-6769.54Show/hide
Query:  RRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEK
        RRSR +L N PIKPPYPWSTE +AVVH+L YLR  +I+TI+GDV+C RC+ QY IEYDL+ KFEEIA FIE+N+ TLHDRAP  W +P   DC++C EE 
Subjt:  RRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEK

Query:  CVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR
        CV P IP+ D +   INWLFLLLGQ++G LKL+ LKYFCAYTNNHRTGAKNRL+YLTYL LCKQLQPS  LFHR
Subjt:  CVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR

A0A6J1C690 probable serine/threonine-protein kinase samkC1.1e-6359.13Show/hide
Query:  SLHPSSSVEEDDDTPPPPPPPPEHHH-----PRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIAR
        SL PS       ++P   P   E         RR R K  +  I+PPYPWST  RAVVH+L+YL+  +I+TI+GDVKC +C+ QY IEYDLV KF+EIA 
Subjt:  SLHPSSSVEEDDDTPPPPPPPPEHHH-----PRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIAR

Query:  FIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIP--DDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQ
        FIE+N+DTLHDRAPS W +P LP+C+ C +E C+ PVIP  D+D+++  INWLFLLLGQ++G L LK LKYFC YTNNHRT AK+RL+YLTYL+LCKQLQ
Subjt:  FIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIP--DDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQ

Query:  PSNRLFHR
        PS  LFHR
Subjt:  PSNRLFHR

A0A6J1GM83 mucin-16-like5.4e-6660.27Show/hide
Query:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE
        P +VEE   +  ++H   ++E+  +     P     H       RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y IE
Subjt:  PPSVEE---EELSLHPSSSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIE

Query:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL
        Y+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+EE CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL+
Subjt:  YDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLL

Query:  YLTYLALCKQLQPSNRLFH
        +LTYLALCKQLQPSNRLF+
Subjt:  YLTYLALCKQLQPSNRLFH

A0A6J1I8I0 uncharacterized protein KIAA0754-like4.6e-6556.25Show/hide
Query:  ETNQSQVGL--NLELSLHPPSVEEEELSLHPS------SSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSR
        ET+   V +   +E +L  P+   + +   P+       +VEE  +     P     H       RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S 
Subjt:  ETNQSQVGL--NLELSLHPPSVEEEELSLHPS------SSVEEDDDTPPPPPPPPEHH-----HPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSR

Query:  EIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLK
         IV I GDV+C +C+  Y IEY+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+EE CVEP+IPD  DD +F +INWLFLLLGQL+G LKLK
Subjt:  EIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLK

Query:  QLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH
        QLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSNRLF+
Subjt:  QLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein1.0e-4044Show/hide
Query:  PSSSVEEDDDTPPPPPPPPEHHHPR--RSRTKLNNR--PIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIER
        PS       +  PPP   P     R  RSR+ ++ +   I PP+PW+T +R  + +LEYL S +I TI+G+V+C  C+  Y + Y+L ++F E+ +F   
Subjt:  PSSSVEEDDDTPPPPPPPPEHHHPR--RSRTKLNNR--PIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIER

Query:  NRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLF
         +  + DRA   W  P    C +C  EK V+PVI    E   +INWLFLLLGQ LG   L+QLK FC ++ NHRTGAK+R+LYLTY+ LCK LQP + LF
Subjt:  NRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLF

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)3.7e-3538.53Show/hide
Query:  LELSLHPPSVE-EEELSLHPSSSVEEDDDTPPP-----------------PPPPPEHHHPRRS----RTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSRE
        LE ++ PP+V     L   PS  V      PPP                 PP      + +R        + +R I PPYPW+T++   + +   L S  
Subjt:  LELSLHPPSVE-EEELSLHPSSSVEEDDDTPPP-----------------PPPPPEHHHPRRS----RTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSRE

Query:  IVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLK
        I  ISG V C  C     +EY+L +KF E+  +I+ N++ +  RAP  W +P L  CR CK E  ++PV+ +  EE   INWLFLLLGQ+LG   L QL+
Subjt:  IVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLK

Query:  YFCAYTNNHRTGAKNRLLYLTYLALCKQLQP
        YFC   + HRTG+K+R++Y+TYL+LCKQL P
Subjt:  YFCAYTNNHRTGAKNRLLYLTYLALCKQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown2.4e-2135.18Show/hide
Query:  LELSLHPPSVE-EEELSLHPSSSVEEDDDTPPP-----------------PPPPPEHHHPRRS----RTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSRE
        LE ++ PP+V     L   PS  V      PPP                 PP      + +R        + +R I PPYPW+T++   + +   L S  
Subjt:  LELSLHPPSVE-EEELSLHPSSSVEEDDDTPPP-----------------PPPPPEHHHPRRS----RTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSRE

Query:  IVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQL
        I  ISG V C  C     +EY+L +KF E+  +I+ N++ +  RAP  W +P L  CR CK E  ++PV+ +  EE   INWLFLLLGQ+LG   L QL
Subjt:  IVTISGDVKCGRCKSQYMIEYDLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTAACCAAAGCCAAGTGGGTCTCAATCTCGAACTCTCCCTCCATCCGCCGTCGGTGGAGGAGGAGGAACTCTCCCTCCATCCGTCATCCTCGGTGGAGGAGGA
CGACGACACACCGCCTCCACCGCCACCGCCACCCGAACATCATCATCCAAGACGAAGTAGGACGAAACTGAACAACAGGCCGATCAAGCCGCCATATCCATGGTCGACGG
AGCAGCGAGCGGTGGTCCACAACCTCGAGTACCTCCGGTCGAGGGAGATCGTGACGATCAGCGGCGACGTGAAATGCGGGCGGTGCAAGAGCCAGTACATGATAGAGTAC
GATCTGGTGCAGAAGTTCGAGGAGATAGCGAGGTTCATAGAGAGGAACAGGGACACGCTGCACGACAGAGCGCCGAGCTGCTGGCTGAGCCCGGCATTGCCGGACTGCAG
AATCTGTAAAGAAGAGAAGTGCGTGGAGCCGGTGATTCCTGATGATGATGAGGAGTTTGTGAAGATCAATTGGCTGTTCTTGCTTCTGGGACAGTTGCTGGGAACTTTGA
AACTCAAACAACTCAAATACTTCTGCGCTTACACCAACAATCATCGGACTGGTGCAAAGAATCGCCTTCTTTATCTCACTTATCTTGCTTTGTGCAAGCAGCTTCAGCCC
TCCAACAGATTGTTCCATCGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACTAACCAAAGCCAAGTGGGTCTCAATCTCGAACTCTCCCTCCATCCGCCGTCGGTGGAGGAGGAGGAACTCTCCCTCCATCCGTCATCCTCGGTGGAGGAGGA
CGACGACACACCGCCTCCACCGCCACCGCCACCCGAACATCATCATCCAAGACGAAGTAGGACGAAACTGAACAACAGGCCGATCAAGCCGCCATATCCATGGTCGACGG
AGCAGCGAGCGGTGGTCCACAACCTCGAGTACCTCCGGTCGAGGGAGATCGTGACGATCAGCGGCGACGTGAAATGCGGGCGGTGCAAGAGCCAGTACATGATAGAGTAC
GATCTGGTGCAGAAGTTCGAGGAGATAGCGAGGTTCATAGAGAGGAACAGGGACACGCTGCACGACAGAGCGCCGAGCTGCTGGCTGAGCCCGGCATTGCCGGACTGCAG
AATCTGTAAAGAAGAGAAGTGCGTGGAGCCGGTGATTCCTGATGATGATGAGGAGTTTGTGAAGATCAATTGGCTGTTCTTGCTTCTGGGACAGTTGCTGGGAACTTTGA
AACTCAAACAACTCAAATACTTCTGCGCTTACACCAACAATCATCGGACTGGTGCAAAGAATCGCCTTCTTTATCTCACTTATCTTGCTTTGTGCAAGCAGCTTCAGCCC
TCCAACAGATTGTTCCATCGCTGA
Protein sequenceShow/hide protein sequence
METNQSQVGLNLELSLHPPSVEEEELSLHPSSSVEEDDDTPPPPPPPPEHHHPRRSRTKLNNRPIKPPYPWSTEQRAVVHNLEYLRSREIVTISGDVKCGRCKSQYMIEY
DLVQKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEEKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQP
SNRLFHR