; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005544 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005544
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPhotosystem I P700 chlorophyll a apoprotein A2
Genome locationscaffold7:25969927..25971903
RNA-Seq ExpressionSpg005544
SyntenySpg005544
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001280 - Photosystem I PsaA/PsaB
IPR036408 - Photosystem I PsaA/PsaB superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043405.1 hypothetical protein E6C27_scaffold1639G00040 [Cucumis melo var. makuwa]1.1e-2840.24Show/hide
Query:  DVLASSSGIVAGPGDLSSFSIKDLLSL----SQEAKSVLINALIVSDEPQTD--ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLS
        D L++S  ++ G    S   I +   L    S EA  V I  +   D  Q    A KE  + +     +KGE STS  K  ++ DEK SN  ILRY PLS
Subjt:  DVLASSSGIVAGPGDLSSFSIKDLLSL----SQEAKSVLINALIVSDEPQTD--ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLS

Query:  RRKKGESPFAECPKNL--------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVA
        RR+KGESPF + P+ L              KKLL+E + +P +RK  GY+SPEP+ IIR+ K KV  +NHITV EVD  +EKE   Q+ S F +IRP V 
Subjt:  RRKKGESPFAECPKNL--------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVA

Query:  RASVFQKLIVNEIDEESAQPTNSSTRSSVFRRLSMFIGEGDSTFQ-LRMSR
        R +VF++L V + + +  Q T+S  + S  +RL+M   +   T Q LR +R
Subjt:  RASVFQKLIVNEIDEESAQPTNSSTRSSVFRRLSMFIGEGDSTFQ-LRMSR

KAA0047672.1 uncharacterized protein E6C27_scaffold115G001730 [Cucumis melo var. makuwa]2.9e-2940.71Show/hide
Query:  KDLLSLSQEAKSVLINALIVSDEP----QTDARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNLK----KL
        K  L      ++V I  L+V+ E     ++ A ++  +  +     K E STS  K  ++ +E  SN PILRY PLSR KKGESPF E P+ LK    ++
Subjt:  KDLLSLSQEAKSVLINALIVSDEP----QTDARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNLK----KL

Query:  LKEDYSL---PTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSSTRSSVF
        LKE +++     T+K LGY+SPEP+RI R+GK KV  +NHITV+EVD  + KE   Q+TS F RI P VARA VF++L + E   +  Q T++  R S F
Subjt:  LKEDYSL---PTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSSTRSSVF

Query:  RRLSMFIGEGDSTFQLRMSRDLQLFK
        +RL++   E     Q  M+     F+
Subjt:  RRLSMFIGEGDSTFQLRMSRDLQLFK

KAA0055957.1 uncharacterized protein E6C27_scaffold319G00830 [Cucumis melo var. makuwa]4.5e-3039.66Show/hide
Query:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------
        A KE  + +      K E ST+  K  ++ DEK SN PILRY PLSRRKKGESPF E P+ L                                      
Subjt:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------

Query:  ------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSS
              KKLL+E + +P +RK LGY+SPEP+RI R+GK KV   NHITV+EVD  +EKE  +Q+TS F RI P VARA VF++L + E   +  Q T++ 
Subjt:  ------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSS

Query:  TRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK
         R S F+RL++   E     Q  M+     F+
Subjt:  TRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.3e-2637.83Show/hide
Query:  KGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL-----------------------------------------------------
        K E STS  K  +V DEK SN PILRY PLSRRKKGESPF E P+ L                                                     
Subjt:  KGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL-----------------------------------------------------

Query:  -------------------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVF
                                 KKLL+E +++P +RK LGY+ PEP+RI R+GK KV  +NHITV+EVD  +EKE   Q+TS F R+ P VARA VF
Subjt:  -------------------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVF

Query:  QKLIVNEIDEESAQPTNSSTRSSVFRRLSM
        ++L + E + +  Q T+S  R S F+RL+M
Subjt:  QKLIVNEIDEESAQPTNSSTRSSVFRRLSM

TYK28162.1 uncharacterized protein E5676_scaffold289G00760 [Cucumis melo var. makuwa]6.5e-2937.6Show/hide
Query:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------
        A KE  + +      K E ST+  K  ++ DEK SN PILRY PLSRRKKGESPF E P+ L                                      
Subjt:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------

Query:  ----------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEID
                        KKLL+E + +P +RK LGY+SPEP+RI R+GK KV  +NHITV+EVD  +EKE  +Q+TS F RI P VARA VF++L + E +
Subjt:  ----------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEID

Query:  EESAQPTNSSTRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK
         +  Q T++  + S F+RL++   E     Q  M+     F+
Subjt:  EESAQPTNSSTRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK

TrEMBL top hitse value%identityAlignment
A0A5A7TPR5 RNase H domain-containing protein5.4e-2940.24Show/hide
Query:  DVLASSSGIVAGPGDLSSFSIKDLLSL----SQEAKSVLINALIVSDEPQTD--ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLS
        D L++S  ++ G    S   I +   L    S EA  V I  +   D  Q    A KE  + +     +KGE STS  K  ++ DEK SN  ILRY PLS
Subjt:  DVLASSSGIVAGPGDLSSFSIKDLLSL----SQEAKSVLINALIVSDEPQTD--ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLS

Query:  RRKKGESPFAECPKNL--------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVA
        RR+KGESPF + P+ L              KKLL+E + +P +RK  GY+SPEP+ IIR+ K KV  +NHITV EVD  +EKE   Q+ S F +IRP V 
Subjt:  RRKKGESPFAECPKNL--------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVA

Query:  RASVFQKLIVNEIDEESAQPTNSSTRSSVFRRLSMFIGEGDSTFQ-LRMSR
        R +VF++L V + + +  Q T+S  + S  +RL+M   +   T Q LR +R
Subjt:  RASVFQKLIVNEIDEESAQPTNSSTRSSVFRRLSMFIGEGDSTFQ-LRMSR

A0A5A7UMY2 Reverse transcriptase domain-containing protein2.2e-3039.66Show/hide
Query:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------
        A KE  + +      K E ST+  K  ++ DEK SN PILRY PLSRRKKGESPF E P+ L                                      
Subjt:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------

Query:  ------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSS
              KKLL+E + +P +RK LGY+SPEP+RI R+GK KV   NHITV+EVD  +EKE  +Q+TS F RI P VARA VF++L + E   +  Q T++ 
Subjt:  ------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSS

Query:  TRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK
         R S F+RL++   E     Q  M+     F+
Subjt:  TRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK

A0A5D3BY54 Ty3-gypsy retrotransposon protein1.1e-2637.83Show/hide
Query:  KGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL-----------------------------------------------------
        K E STS  K  +V DEK SN PILRY PLSRRKKGESPF E P+ L                                                     
Subjt:  KGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL-----------------------------------------------------

Query:  -------------------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVF
                                 KKLL+E +++P +RK LGY+ PEP+RI R+GK KV  +NHITV+EVD  +EKE   Q+TS F R+ P VARA VF
Subjt:  -------------------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVF

Query:  QKLIVNEIDEESAQPTNSSTRSSVFRRLSM
        ++L + E + +  Q T+S  R S F+RL+M
Subjt:  QKLIVNEIDEESAQPTNSSTRSSVFRRLSM

A0A5D3C8N8 Ribonuclease H1.4e-2940.71Show/hide
Query:  KDLLSLSQEAKSVLINALIVSDEP----QTDARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNLK----KL
        K  L      ++V I  L+V+ E     ++ A ++  +  +     K E STS  K  ++ +E  SN PILRY PLSR KKGESPF E P+ LK    ++
Subjt:  KDLLSLSQEAKSVLINALIVSDEP----QTDARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNLK----KL

Query:  LKEDYSL---PTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSSTRSSVF
        LKE +++     T+K LGY+SPEP+RI R+GK KV  +NHITV+EVD  + KE   Q+TS F RI P VARA VF++L + E   +  Q T++  R S F
Subjt:  LKEDYSL---PTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSSTRSSVF

Query:  RRLSMFIGEGDSTFQLRMSRDLQLFK
        +RL++   E     Q  M+     F+
Subjt:  RRLSMFIGEGDSTFQLRMSRDLQLFK

A0A5D3DXC7 Reverse transcriptase domain-containing protein3.1e-2937.6Show/hide
Query:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------
        A KE  + +      K E ST+  K  ++ DEK SN PILRY PLSRRKKGESPF E P+ L                                      
Subjt:  ARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNL--------------------------------------

Query:  ----------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEID
                        KKLL+E + +P +RK LGY+SPEP+RI R+GK KV  +NHITV+EVD  +EKE  +Q+TS F RI P VARA VF++L + E +
Subjt:  ----------------KKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEID

Query:  EESAQPTNSSTRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK
         +  Q T++  + S F+RL++   E     Q  M+     F+
Subjt:  EESAQPTNSSTRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFK

SwissProt top hitse value%identityAlignment
P19431 Photosystem I P700 chlorophyll a apoprotein A21.2e-0971.74Show/hide
Query:  MKGFQEKYDSIEDE-NPGDFLVHHAIALGLHTTTLILVKGALDARG
        + G  +K  S+  +  PGDFLVHHAIALGLHTTTLILVKGALDARG
Subjt:  MKGFQEKYDSIEDE-NPGDFLVHHAIALGLHTTTLILVKGALDARG

P31088 Photosystem I P700 chlorophyll a apoprotein A2 11.6e-0988.24Show/hide
Query:  PGDFLVHHAIALGLHTTTLILVKGALDARGREAL
        PGDFLVHHAIALGLHTTTLILVKGALDARG + +
Subjt:  PGDFLVHHAIALGLHTTTLILVKGALDARGREAL

P58565 Photosystem I P700 chlorophyll a apoprotein A2 11.6e-0988.24Show/hide
Query:  PGDFLVHHAIALGLHTTTLILVKGALDARGREAL
        PGDFLVHHAIALGLHTTTLILVKGALDARG + +
Subjt:  PGDFLVHHAIALGLHTTTLILVKGALDARGREAL

Q0ID47 Photosystem I P700 chlorophyll a apoprotein A29.2e-1058.93Show/hide
Query:  MKGFQEKYDSIEDEN-------PGDFLVHHAIALGLHTTTLILVKGALDARGREAL
        M G+ +  + +   N       PGDFLVHHAIALGLHTTTLILVKGALDARG + +
Subjt:  MKGFQEKYDSIEDEN-------PGDFLVHHAIALGLHTTTLILVKGALDARGREAL

Q9R6T9 Photosystem I P700 chlorophyll a apoprotein A29.2e-1058.93Show/hide
Query:  MKGFQEKYDSIEDEN-------PGDFLVHHAIALGLHTTTLILVKGALDARGREAL
        M G+ +  + +   N       PGDFLVHHAIALGLHTTTLILVKGALDARG + +
Subjt:  MKGFQEKYDSIEDEN-------PGDFLVHHAIALGLHTTTLILVKGALDARGREAL

Arabidopsis top hitse value%identityAlignment
ATCG00340.1 Photosystem I, PsaA/PsaB protein1.1e-1088.24Show/hide
Query:  PGDFLVHHAIALGLHTTTLILVKGALDARGREAL
        PGDFLVHHAIALGLHTTTLILVKGALDARG + +
Subjt:  PGDFLVHHAIALGLHTTTLILVKGALDARGREAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGATTCCAAGAAAAATATGATTCCATCGAAGACGAGAACCCCGGAGATTTCTTGGTTCATCATGCTATTGCTTTAGGTTTACATACAACTACATTGATCTTAGT
AAAAGGTGCTTTAGATGCACGTGGAAGAGAGGCACTTGAAACTGTCACGTGTCACATTGTGGACGTGGTGGAAGATGATGATGTCCTTGCTAGCTCCTCAGGAATAGTGG
CAGGTCCAGGAGACTTATCCTCCTTCAGCATAAAGGACTTATTGTCACTTTCTCAGGAAGCTAAAAGTGTCCTTATTAATGCATTGATAGTGTCTGATGAACCACAAACT
GATGCAAGAAAGGAAGCTATTGAAGAAGTGCAGGCATCCGACCTGAAAAAGGGTGAAACATCTACAAGCCTTGAGAAACCTAAGGTTGTAAAGGATGAGAAATGTTCAAA
TTCACCTATCTTACGATACGCCCCTCTATCTCGACGTAAAAAGGGTGAATCACCTTTCGCAGAATGCCCAAAAAACTTAAAGAAGCTTCTAAAGGAAGATTATAGTCTGC
CTACAACGAGAAAAGAACTTGGATATCAGTCACCTGAGCCAGTTCGCATAATAAGAAGAGGGAAGGCGAAAGTGGCATACACAAATCATATAACAGTAGAGGAGGTTGAT
GACTCAAAAGAAAAAGAGAGTGTCGACCAACAAACTTCTGTTTTTAGGCGCATCAGGCCACCAGTTGCTCGTGCTTCAGTCTTTCAGAAGTTAATTGTGAATGAAATAGA
TGAAGAAAGTGCACAACCTACCAATAGCTCCACTCGATCTTCAGTTTTTCGAAGGTTAAGTATGTTCATTGGGGAAGGAGACAGTACATTTCAACTCCGGATGTCACGCG
ACCTTCAGCTTTTCAAAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGATTCCAAGAAAAATATGATTCCATCGAAGACGAGAACCCCGGAGATTTCTTGGTTCATCATGCTATTGCTTTAGGTTTACATACAACTACATTGATCTTAGT
AAAAGGTGCTTTAGATGCACGTGGAAGAGAGGCACTTGAAACTGTCACGTGTCACATTGTGGACGTGGTGGAAGATGATGATGTCCTTGCTAGCTCCTCAGGAATAGTGG
CAGGTCCAGGAGACTTATCCTCCTTCAGCATAAAGGACTTATTGTCACTTTCTCAGGAAGCTAAAAGTGTCCTTATTAATGCATTGATAGTGTCTGATGAACCACAAACT
GATGCAAGAAAGGAAGCTATTGAAGAAGTGCAGGCATCCGACCTGAAAAAGGGTGAAACATCTACAAGCCTTGAGAAACCTAAGGTTGTAAAGGATGAGAAATGTTCAAA
TTCACCTATCTTACGATACGCCCCTCTATCTCGACGTAAAAAGGGTGAATCACCTTTCGCAGAATGCCCAAAAAACTTAAAGAAGCTTCTAAAGGAAGATTATAGTCTGC
CTACAACGAGAAAAGAACTTGGATATCAGTCACCTGAGCCAGTTCGCATAATAAGAAGAGGGAAGGCGAAAGTGGCATACACAAATCATATAACAGTAGAGGAGGTTGAT
GACTCAAAAGAAAAAGAGAGTGTCGACCAACAAACTTCTGTTTTTAGGCGCATCAGGCCACCAGTTGCTCGTGCTTCAGTCTTTCAGAAGTTAATTGTGAATGAAATAGA
TGAAGAAAGTGCACAACCTACCAATAGCTCCACTCGATCTTCAGTTTTTCGAAGGTTAAGTATGTTCATTGGGGAAGGAGACAGTACATTTCAACTCCGGATGTCACGCG
ACCTTCAGCTTTTCAAAGGTTAA
Protein sequenceShow/hide protein sequence
MKGFQEKYDSIEDENPGDFLVHHAIALGLHTTTLILVKGALDARGREALETVTCHIVDVVEDDDVLASSSGIVAGPGDLSSFSIKDLLSLSQEAKSVLINALIVSDEPQT
DARKEAIEEVQASDLKKGETSTSLEKPKVVKDEKCSNSPILRYAPLSRRKKGESPFAECPKNLKKLLKEDYSLPTTRKELGYQSPEPVRIIRRGKAKVAYTNHITVEEVD
DSKEKESVDQQTSVFRRIRPPVARASVFQKLIVNEIDEESAQPTNSSTRSSVFRRLSMFIGEGDSTFQLRMSRDLQLFKG