CuGenDBv2

Lag0006852 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006852
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF1068)
Genome location: chr6:46488754..46491568
RNA-Seq Expression: Lag0006852
Synteny: Lag0006852
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139277.1 uncharacterized protein LOC101212944 [Cucumis sativus] | e value: 2.3e-87 | % identity: 95.93
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_008457359.1 PREDICTED: uncharacterized protein LOC103497063 [Cucumis melo] | e value: 1.0e-87 | % identity: 96.51
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_022143173.1 uncharacterized protein LOC111013108 [Momordica charantia] | e value: 2.3e-87 | % identity: 96.51
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKA QLGDSK SC PCICDCPPPLSLLKIAPGLANLSVTDCG NDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_022938183.1 uncharacterized protein LOC111444344 [Cucurbita moschata] | e value: 1.5e-86 | % identity: 95.32
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPAL+WRFKKAL LGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLK+EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG
        EAVAGEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IKERKLTSLWERRARQMGW+G
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG

XP_038890751.1 uncharacterized protein LOC120080238 [Benincasa hispida] | e value: 4.6e-88 | % identity: 96.51
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PGLANLS+TDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LG38 Uncharacterized protein | e value: 1.1e-87 | % identity: 95.93
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A1S3C6L6 uncharacterized protein LOC103497063 | e value: 5.0e-88 | % identity: 96.51
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A5A7VC79 DUF1068 domain-containing protein | e value: 5.0e-88 | % identity: 96.51
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A6J1CQ21 uncharacterized protein LOC111013108 | e value: 1.1e-87 | % identity: 96.51
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKA QLGDSK SC PCICDCPPPLSLLKIAPGLANLSVTDCG NDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A6J1FI57 uncharacterized protein LOC111444344 | e value: 7.2e-87 | % identity: 95.32
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPAL+WRFKKAL LGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLK+EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG
        EAVAGEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IKERKLTSLWERRARQMGW+G
Subjt:  EAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05070.1 Protein of unknown function (DUF1068) | e value: 3.2e-31 | % identity: 43.98
Query:  RSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAV
        R  A L+  L +  +  A  + GP LYW   +AL    S +SC  C C+C    S + I   L+N S  DC  +DP++ ++ EK + +LLTEELKL+EA 
Subjt:  RSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAV

Query:  AGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW
        + E  +  ++ L EAK+  S YQ+EA+KC +  ETCEEARE+AE  + +++KLTS WE RARQ GW
Subjt:  AGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW

AT2G24290.1 Protein of unknown function (DUF1068) | e value: 5.7e-68 | % identity: 73.14
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS---CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEEL
        M+RRSG C+R CLVIF+VVSAL VCGPALYW+  K   +G ++++   C PC+CD PPPLSLL+IAPGLANLS+T CG +DP+LK+EMEK FVDLLTEEL
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS---CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEEL

Query:  KLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        KLQEAVA EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERA+AL++KERK+T LWERRARQ+GWEGE
Subjt:  KLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

AT2G32580.1 Protein of unknown function (DUF1068) | e value: 1.8e-34 | % identity: 46.34
Query:  ACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGE
        A L+  L + A+     + GP LYW   +AL +  S TSC+ C+CDC   L LL I  GL+N S TDC   DP++ ++ EK + +LLTEELK +EA + E
Subjt:  ACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGE

Query:  HTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
          + ++  L EAK+  S YQ+EA+KC +  ETCEEARE+AE  +++++KLTS+WE+RARQ G++
Subjt:  HTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

AT4G04360.1 Protein of unknown function (DUF1068) | e value: 1.5e-28 | % identity: 42.11
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDS-KTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKL
        M+RR     +   V+  +     + GP+LYW   +   + DS  +SC PC+CDC     LL I  GL+N S  DC  ++ +  +E E  F +++ EELKL
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDS-KTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKL

Query:  QEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
        +EA A E     +  L +AK+AASQYQ+EA+KC    ETCE ARE+AEA + ++R+L+ +WE RARQ GW+
Subjt:  QEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

AT4G30996.1 Protein of unknown function (DUF1068) | e value: 1.3e-72 | % identity: 78.16
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELK
        M RRSG C+R CLVIFAVVSAL VCGPALYW+F K   +G ++ +  C PC+CDCPPPLSLL+IAPGLANLS+TDCG +DP+LKQEMEKQFVDLLTEELK
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELK

Query:  LQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        LQEAVA EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERAEAL+IKERK+TSLWE+RARQ GWEGE
Subjt:  LQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE


Sequences
CDS sequence
ATGTCACGCCGATCTGGGGCTTGCCTGCGGTGTTGTCTCGTGATTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAGGCTTT
GCAATTGGGTGATTCCAAAACCTCCTGTGCTCCTTGCATCTGCGATTGTCCACCCCCTTTATCCCTTTTGAAGATTGCCCCTGGTCTGGCCAACCTCTCCGTCACAGACT
GTGGAGGTAATGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGACCTTTTGACAGAGGAATTGAAACTTCAAGAAGCAGTTGCTGGCGAACATACTCGCCAT
ATGAACATCACTTTATTTGAGGCAAAAAGGGCAGCTTCTCAGTATCAGAGGGAGGCTGAAAAGTGCATTGCTGCCACAGAAACTTGTGAAGAGGCCCGAGAACGCGCCGA
GGCATTGATGATCAAGGAGAGGAAGCTTACATCATTGTGGGAGCGACGGGCCCGCCAAATGGGTTGGGAAGGGGAATAA
mRNA sequence
ATGTCACGCCGATCTGGGGCTTGCCTGCGGTGTTGTCTCGTGATTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAGGCTTT
GCAATTGGGTGATTCCAAAACCTCCTGTGCTCCTTGCATCTGCGATTGTCCACCCCCTTTATCCCTTTTGAAGATTGCCCCTGGTCTGGCCAACCTCTCCGTCACAGACT
GTGGAGGTAATGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGACCTTTTGACAGAGGAATTGAAACTTCAAGAAGCAGTTGCTGGCGAACATACTCGCCAT
ATGAACATCACTTTATTTGAGGCAAAAAGGGCAGCTTCTCAGTATCAGAGGGAGGCTGAAAAGTGCATTGCTGCCACAGAAACTTGTGAAGAGGCCCGAGAACGCGCCGA
GGCATTGATGATCAAGGAGAGGAAGCTTACATCATTGTGGGAGCGACGGGCCCGCCAAATGGGTTGGGAAGGGGAATAA
Protein sequence
MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRH
MNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
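
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` and the table construction are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above
print(translate("ATGTCACGCCGATCTGGGGCTTGC"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, after joining its lines, is one way to confirm that the two records agree.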