; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034666 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034666
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSOUL heme-binding family protein
Genome locationchr3:9581542..9584573
RNA-Seq ExpressionLag0034666
SyntenyLag0034666
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587902.1 hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sororia]8.3e-5872.73Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        F+YDDLRHVFDEQGI RT YD++VRF+DP+TKYD I+ Y+LNIALLREFF+ EI LHWVK               KFILLPWKPELVLTG S+MG++P T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFSLE LWDVFKQLRFY T +LESPKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

KAG7021789.1 hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma]6.3e-5872.73Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        F+YDDLRHVFDEQGI RT YD++VRF+DP+TKYD I+ Y+LNIALLREFF+ EI LHWVK               KFILLPWKPELVLTG S+MG++P T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFSLE LWDVFKQLRFY T +LESPKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata]1.4e-5772.08Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        F+YDDLRHVFDEQGI RT YDD+VRF+DP+TKYD I+ Y+LNIALLREFF+ EI LHWVK               KFILLPWKPELVLTG S+MG++P T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFS+E LWDVFKQ RFY T +LESPKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]4.1e-5771.43Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        F+YDDLRHVFDEQGI RT YD++VRF+DP+TKYD I+ Y+LNIALLREFF+ EI LHWVK               KFILLPWKPELVLTG S+MG++P T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFS+E LWDVFKQ RFY T +LESPKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]1.2e-5670.78Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        F+YDDLRHVFDEQGI RT YD++VRF+DP+TKYD I+ Y+LNIALLREFF+ EI  HWVK               KFILLPWKPELVLTG S+MG++P T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVD+WDS+QNNDYFSLE LWDVFKQ RFY T +LESPKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

TrEMBL top hitse value%identityAlignment
A0A6J1CV62 uncharacterized protein LOC111014503 isoform X28.6e-5366.23Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        FLY+DLRHVFD QGI  T YD+ VRF+DP+TKY+ I  Y+LNIALLR+ F+ +  LHWVK               KF+LLPWKPELVLTG S+M +DP+T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC HVDLWDSVQNN+YFSLEGLWD+FKQ RFY T +LESP+YQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X13.9e-5366.88Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        FLY+DL H+FDEQGI RT YDDQVRF+DP+TK+D IT YL NI+LLRE F+ E  LHWVK               KF+LLPWKP+LV TG S+MG++P+T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFS+EGL DVFKQLRFY T +LESPKY+ LKRT NYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

A0A6J1EZQ2 uncharacterized protein LOC1114408396.8e-5872.08Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        F+YDDLRHVFDEQGI RT YDD+VRF+DP+TKYD I+ Y+LNIALLREFF+ EI LHWVK               KFILLPWKPELVLTG S+MG++P T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFS+E LWDVFKQ RFY T +LESPKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

A0A6J1HKM5 uncharacterized protein LOC1114650222.0e-5771.43Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        F+YDDLRHVFDEQGI RT YD++VRF+DP+TKYD I+ Y+LNIALLREFF+ EI LHWVK               KFILLPWKPELVLTG S+MG++P T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFS+E LWDVFKQ RFY T +LESPKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

A0A6J1KHA6 uncharacterized protein LOC111495248 isoform X11.0e-5368.18Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        FLY DL H+FDEQGI RT YDDQVRF+DP+TK+D IT YL NI+LLRE FK E  LHWVK               KF+LLPWKP+LV TG S+MG++P+T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
         KFC+HVDLWDS+QNNDYFS+EGL DVFKQLRFY T +LESPKY+ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-1631.79Show/hide
Query:  TLSEEDKEAMNMTAYGTIILNLSNSVLRQVIDEENPLKIWTRLNELYETKNVHNLIYLREKFFTYKMDAGKTLSENLDEFKKMTTEFKNLGEKIGDENEA
        T+  ED   ++  A   I L+LS+ V+  +IDE+    IWTRL  LY +K + N +YL+++ +   M  G     +L+ F  + T+  NLG KI +E++A
Subjt:  TLSEEDKEAMNMTAYGTIILNLSNSVLRQVIDEENPLKIWTRLNELYETKNVHNLIYLREKFFTYKMDAGKTLSENLDEFKKMTTEFKNLGEKIGDENEA

Query:  FVLLNSLLDSYKEVKNALKYGRVSITTDAIISTIKIKELELWPPRKKSQKEGHIKRECYSLKRKNQYHRSKKN
         +LLNSL  SY  +   + +G+ +I    + S + + E      RKK + +G   +   +  R   Y RS  N
Subjt:  FVLLNSLLDSYKEVKNALKYGRVSITTDAIISTIKIKELELWPPRKKSQKEGHIKRECYSLKRKNQYHRSKKN

Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein8.0e-5160.39Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        FLY+DL H+FD+QGI +T YD++V+F+DP+TK+D I+ YL NIA L+  F  +  LHW K               KFI LPWKPELV TG+S+M V+P+T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
        +KFC+H+DLWDS++NNDYFSLEGL DVFKQLR Y T  LE+PKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE

AT5G20140.2 SOUL heme-binding family protein8.0e-5160.39Show/hide
Query:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT
        FLY+DL H+FD+QGI +T YD++V+F+DP+TK+D I+ YL NIA L+  F  +  LHW K               KFI LPWKPELV TG+S+M V+P+T
Subjt:  FLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVK---------------KFILLPWKPELVLTGISVMGVDPDT

Query:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE
        +KFC+H+DLWDS++NNDYFSLEGL DVFKQLR Y T  LE+PKYQ LKRTANYE
Subjt:  DKFCTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGACCGATTGGTGGATTTTTTTGTATGACGATCTCCGCCATGTGTTCGACGAACAGGGGATCGGTCGGACGACGTACGACGACCAAGTGCGATTCCAAGACCCACT
CACAAAATATGATAACATCACTAGTTATTTGCTGAATATTGCCCTGTTGCGAGAATTCTTCAAGCTTGAGATCACATTGCACTGGGTCAAGAAGTTCATCCTTCTTCCAT
GGAAACCAGAATTAGTTTTGACTGGAATTTCCGTTATGGGCGTCGATCCAGATACGGACAAGTTCTGTACCCACGTGGATCTATGGGATTCCGTACAAAATAATGACTAC
TTTTCTCTAGAAGGATTGTGGGATGTATTTAAACAGTTGAGATTTTATGCGACTAAAAAATTGGAATCACCCAAGTATCAGACATTGAAAAGGACTGCAAATTATGAGAC
CCTGCTTGCGTTAGCAGATCCCACGAAGTTGCTAGCTACTCTATCGGAAGAAGACAAAGAAGCCATGAATATGACCGCATATGGAACCATCATCTTGAACTTAAGCAACA
GTGTGTTGAGACAAGTCATAGATGAGGAGAATCCTTTGAAAATCTGGACAAGACTTAATGAACTCTATGAGACCAAGAATGTGCATAATCTGATATACTTGAGAGAGAAA
TTCTTCACATACAAGATGGATGCAGGGAAAACACTGTCAGAAAATCTTGATGAGTTCAAGAAGATGACTACTGAGTTCAAGAACCTAGGTGAAAAGATAGGAGATGAGAA
CGAGGCATTTGTGTTACTTAACTCACTTCTTGATTCATACAAAGAAGTGAAAAATGCCCTCAAATATGGGAGAGTGTCAATCACCACAGATGCAATTATATCGACAATAA
AAATCAAAGAACTGGAACTTTGGCCACCAAGAAAGAAATCTCAGAAGGAAGGGCATATCAAGAGAGAATGCTATTCCTTGAAGAGAAAGAACCAATACCACCGATCTAAG
AAGAACAAACAACCCGAGGCTTCAGTTGGAGAGAACTCCATTACATATTCAGATTTTTTGGCTACTACAGACCAAAGAATGTCTCTAAGGCTCAAAGACGAATTAGTAAA
ACTGTTGAGAAATGTTAGGCATGTTCCTACCTTGAAAAGGAATTTAATTTTCCTAGGATTGCTTGGCTCAATTGGATGCACATATGGAGGAAAAGGAGGCACAATTGAGA
TAAAAAAGGACTCCAAAACAGTGCTGATTGGGGAGAAAATAAATGGTCTTTATGTTGTCAAAGACATAGAAATGGTGCAGTTCAAAAGGCCTTCAAGATTTCAGAAAAGT
TCATTTGATAATTGTGTTCATGTGAATCAGAGAACTTTCAGTCGTAATGTTTTTCTCCTACGGTACGTATATGACATGTTGTTAGTGAAGGATTGGAATTCCCCAATTCC
GTTACAGCGGAAGCAATTGGACCGTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGACCGATTGGTGGATTTTTTTGTATGACGATCTCCGCCATGTGTTCGACGAACAGGGGATCGGTCGGACGACGTACGACGACCAAGTGCGATTCCAAGACCCACT
CACAAAATATGATAACATCACTAGTTATTTGCTGAATATTGCCCTGTTGCGAGAATTCTTCAAGCTTGAGATCACATTGCACTGGGTCAAGAAGTTCATCCTTCTTCCAT
GGAAACCAGAATTAGTTTTGACTGGAATTTCCGTTATGGGCGTCGATCCAGATACGGACAAGTTCTGTACCCACGTGGATCTATGGGATTCCGTACAAAATAATGACTAC
TTTTCTCTAGAAGGATTGTGGGATGTATTTAAACAGTTGAGATTTTATGCGACTAAAAAATTGGAATCACCCAAGTATCAGACATTGAAAAGGACTGCAAATTATGAGAC
CCTGCTTGCGTTAGCAGATCCCACGAAGTTGCTAGCTACTCTATCGGAAGAAGACAAAGAAGCCATGAATATGACCGCATATGGAACCATCATCTTGAACTTAAGCAACA
GTGTGTTGAGACAAGTCATAGATGAGGAGAATCCTTTGAAAATCTGGACAAGACTTAATGAACTCTATGAGACCAAGAATGTGCATAATCTGATATACTTGAGAGAGAAA
TTCTTCACATACAAGATGGATGCAGGGAAAACACTGTCAGAAAATCTTGATGAGTTCAAGAAGATGACTACTGAGTTCAAGAACCTAGGTGAAAAGATAGGAGATGAGAA
CGAGGCATTTGTGTTACTTAACTCACTTCTTGATTCATACAAAGAAGTGAAAAATGCCCTCAAATATGGGAGAGTGTCAATCACCACAGATGCAATTATATCGACAATAA
AAATCAAAGAACTGGAACTTTGGCCACCAAGAAAGAAATCTCAGAAGGAAGGGCATATCAAGAGAGAATGCTATTCCTTGAAGAGAAAGAACCAATACCACCGATCTAAG
AAGAACAAACAACCCGAGGCTTCAGTTGGAGAGAACTCCATTACATATTCAGATTTTTTGGCTACTACAGACCAAAGAATGTCTCTAAGGCTCAAAGACGAATTAGTAAA
ACTGTTGAGAAATGTTAGGCATGTTCCTACCTTGAAAAGGAATTTAATTTTCCTAGGATTGCTTGGCTCAATTGGATGCACATATGGAGGAAAAGGAGGCACAATTGAGA
TAAAAAAGGACTCCAAAACAGTGCTGATTGGGGAGAAAATAAATGGTCTTTATGTTGTCAAAGACATAGAAATGGTGCAGTTCAAAAGGCCTTCAAGATTTCAGAAAAGT
TCATTTGATAATTGTGTTCATGTGAATCAGAGAACTTTCAGTCGTAATGTTTTTCTCCTACGGTACGTATATGACATGTTGTTAGTGAAGGATTGGAATTCCCCAATTCC
GTTACAGCGGAAGCAATTGGACCGTTCGTAA
Protein sequenceShow/hide protein sequence
MLTDWWIFLYDDLRHVFDEQGIGRTTYDDQVRFQDPLTKYDNITSYLLNIALLREFFKLEITLHWVKKFILLPWKPELVLTGISVMGVDPDTDKFCTHVDLWDSVQNNDY
FSLEGLWDVFKQLRFYATKKLESPKYQTLKRTANYETLLALADPTKLLATLSEEDKEAMNMTAYGTIILNLSNSVLRQVIDEENPLKIWTRLNELYETKNVHNLIYLREK
FFTYKMDAGKTLSENLDEFKKMTTEFKNLGEKIGDENEAFVLLNSLLDSYKEVKNALKYGRVSITTDAIISTIKIKELELWPPRKKSQKEGHIKRECYSLKRKNQYHRSK
KNKQPEASVGENSITYSDFLATTDQRMSLRLKDELVKLLRNVRHVPTLKRNLIFLGLLGSIGCTYGGKGGTIEIKKDSKTVLIGEKINGLYVVKDIEMVQFKRPSRFQKS
SFDNCVHVNQRTFSRNVFLLRYVYDMLLVKDWNSPIPLQRKQLDRS