; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G04000 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G04000
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionthylakoid lumenal protein TL20.3, chloroplastic
Genome locationClcChr01:3815037..3820251
RNA-Seq ExpressionClc01G04000
SyntenyClc01G04000
Gene Ontology termsNA
InterPro domainsIPR001646 - Pentapeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147585.1 thylakoid lumenal protein TL20.3, chloroplastic isoform X1 [Cucumis sativus]7.6e-14194.27Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE
        MALSSI SLSVKCLPL+SSKS+ P SLQ RKQI++VSQ NPQKDQ QDCS+RKHIGKITEPKRWQKL STALAAAAVI FSSGMPSVAELNKYEADTRGE
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE

Query:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
        FGIGSAAQ+GSADLRKAVH+NENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
Subjt:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV

Query:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQ TGLCEATK
Subjt:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

XP_008437129.1 PREDICTED: thylakoid lumenal protein TL20.3, chloroplastic [Cucumis melo]1.0e-14094.31Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRK--HIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTR
        MALSSI SLS KCLPL+SSKSK PSSLQPRK+I+ VSQ NPQKDQ QDCS+RK  HIGKITEPKRWQKL STALAAAAVI FSSGMPSVAELNKYEADTR
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRK--HIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTR

Query:  GEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAI
        GEFGIGSAAQ+GSADLRKAVH+NENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAI
Subjt:  GEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAI

Query:  IVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        IVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
Subjt:  IVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

XP_022922205.1 thylakoid lumenal protein TL20.3, chloroplastic isoform X2 [Cucurbita moschata]1.6e-13892.47Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE
        MALSS+ SLS+KCLP SSSKSK+PSSLQPRK  ++VSQ N QKDQ QDCS+RKHIGKI+EPKRWQK  STALAAAAVISFSSGMPS+AELNKYEADTRGE
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE

Query:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
        FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEAN TNAVLVRSVLTRSDLGGA IV
Subjt:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV

Query:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLL+RDGFCD+GTGLCEATK
Subjt:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

XP_038875985.1 thylakoid lumenal protein TL20.3, chloroplastic isoform X1 [Benincasa hispida]2.8e-14396.06Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE
        MALSSI SLSVKCLPLSSSKSKLPS LQPRK I+LVSQ NPQKDQ QDCS+RKHIGK TEPKRWQKL STALAAAAVISFSSGMPS+AELNKYEA+TRGE
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE

Query:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
        FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
Subjt:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV

Query:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
Subjt:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

XP_038875987.1 thylakoid lumenal protein TL20.3, chloroplastic isoform X2 [Benincasa hispida]1.3e-14095.34Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE
        MALSSI SLSVKCLPLSSSKSKLPS LQPRK I+LVSQ NPQKDQ QDCS+RKHIGK TEPKRWQKL STALAAAAVISFSSGMPS+AELNKYEA+TRGE
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE

Query:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
        FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFS  DLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
Subjt:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV

Query:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
Subjt:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

TrEMBL top hitse value%identityAlignment
A0A0A0KP67 Uncharacterized protein3.7e-14194.27Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE
        MALSSI SLSVKCLPL+SSKS+ P SLQ RKQI++VSQ NPQKDQ QDCS+RKHIGKITEPKRWQKL STALAAAAVI FSSGMPSVAELNKYEADTRGE
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE

Query:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
        FGIGSAAQ+GSADLRKAVH+NENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
Subjt:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV

Query:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQ TGLCEATK
Subjt:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

A0A1S3ATE2 thylakoid lumenal protein TL20.3, chloroplastic4.8e-14194.31Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRK--HIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTR
        MALSSI SLS KCLPL+SSKSK PSSLQPRK+I+ VSQ NPQKDQ QDCS+RK  HIGKITEPKRWQKL STALAAAAVI FSSGMPSVAELNKYEADTR
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRK--HIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTR

Query:  GEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAI
        GEFGIGSAAQ+GSADLRKAVH+NENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAI
Subjt:  GEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAI

Query:  IVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        IVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
Subjt:  IVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

A0A6J1E2Q9 thylakoid lumenal protein TL20.3, chloroplastic isoform X13.2e-13791.49Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQ---DCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADT
        MALSS+ SLS+KCLP SSSKSK+PSSLQPRK  ++VSQ N QKDQ Q   DCS+RKHIGKI+EPKRWQK  STALAAAAVISFSSGMPS+AELNKYEADT
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQ---DCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADT

Query:  RGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGA
        RGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEAN TNAVLVRSVLTRSDLGGA
Subjt:  RGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGA

Query:  IIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
         IVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLL+RDGFCD+GTGLCEATK
Subjt:  IIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

A0A6J1E828 thylakoid lumenal protein TL20.3, chloroplastic isoform X27.7e-13992.47Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE
        MALSS+ SLS+KCLP SSSKSK+PSSLQPRK  ++VSQ N QKDQ QDCS+RKHIGKI+EPKRWQK  STALAAAAVISFSSGMPS+AELNKYEADTRGE
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE

Query:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
        FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEAN TNAVLVRSVLTRSDLGGA IV
Subjt:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV

Query:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLL+RDGFCD+GTGLCEATK
Subjt:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

A0A6J1I174 thylakoid lumenal protein TL20.3, chloroplastic isoform X23.8e-13892.11Show/hide
Query:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE
        MALSS+ SLS+KCLP SSSKSK+PSSLQPRK  ++VSQ N QK Q QDCS+RKHIGKI+EPKRWQKL STALAAAAVISFSSGMPS+AELNKYEADTRGE
Subjt:  MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGE

Query:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV
        FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEAN TNAVLVRSVLTRSDLGGA IV
Subjt:  FGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIV

Query:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK
        GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLL+RDGFCD+GTGLCEA+K
Subjt:  GADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK

SwissProt top hitse value%identityAlignment
B1WVN5 Pentapeptide repeat protein Rfr324.7e-1638.52Show/hide
Query:  GSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGAD
        GS+A +    L       ++   A FT+AD+ +S+FS     GA    +     +  GADL++ L        A+ TNAVL  +++ R+    A I GAD
Subjt:  GSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGAD

Query:  FSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC
        FS AV+D+ +   LC  A G NP TGVSTR SLGC
Subjt:  FSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC

P73709 Uncharacterized protein slr18193.1e-0435.24Show/hide
Query:  SAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADF
        S A+  + + + A+  + N  R N T      S+ S    N A L +A   K N   A+L +    R  L EANF NA LV     R+DL  A +VGADF
Subjt:  SAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADF

Query:  SDAVI
          A +
Subjt:  SDAVI

Q52118 Uncharacterized protein in mobD 3'region2.2e-0534.62Show/hide
Query:  SAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKA---VAY--KTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAII
        S A   +ADL++A   N N   A+ T+A++ ++D      +GA L  A   +AY  + + S A+LS+  + R  L++AN ++A L    L R+DL  AI+
Subjt:  SAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKA---VAY--KTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAII

Query:  VGAD
         GA+
Subjt:  VGAD

Q55837 Uncharacterized protein slr05163.1e-0434.12Show/hide
Query:  NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDA
        + R  N  +A +  SD SG   +G  L +A+  + N +GA+LS+T +    L EAN   A L  + L RS L    + GA+   A
Subjt:  NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDA

Q8H1Q1 Thylakoid lumenal protein TL20.3, chloroplastic1.9e-9770.92Show/hide
Query:  MALSSIFSLSVKCLPL---SSSKSKLPSSLQP--RKQIALVSQTNPQKDQIQDCSD-RKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYE
        MA SS+  L +K L +   SSS S+ P   Q    +++ L S++N    +I+D S+ R+      E   W+++ S A+ AAAVI+ SSG+P++AELN++E
Subjt:  MALSSIFSLSVKCLPL---SSSKSKLPSSLQP--RKQIALVSQTNPQKDQIQDCSD-RKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYE

Query:  ADTRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDL
        ADTRGEFGIGSAAQ+GSADL K VH NENFRRANFTSADMRESDFSG TFNGAYLEKAVAYK NFSGADLSDTLMDRMVLNEAN TNAVLVRSVLTRSDL
Subjt:  ADTRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDL

Query:  GGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCE
        GGA I GADFSDAVIDL QKQALCKYA+GTNP+TGV TR SLGCGNSRRNAYG+PSSPLLSAPPQ+LL RDGFCD+ TGLC+
Subjt:  GGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCE

Arabidopsis top hitse value%identityAlignment
AT1G12250.1 Pentapeptide repeat-containing protein1.3e-9870.92Show/hide
Query:  MALSSIFSLSVKCLPL---SSSKSKLPSSLQP--RKQIALVSQTNPQKDQIQDCSD-RKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYE
        MA SS+  L +K L +   SSS S+ P   Q    +++ L S++N    +I+D S+ R+      E   W+++ S A+ AAAVI+ SSG+P++AELN++E
Subjt:  MALSSIFSLSVKCLPL---SSSKSKLPSSLQP--RKQIALVSQTNPQKDQIQDCSD-RKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYE

Query:  ADTRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDL
        ADTRGEFGIGSAAQ+GSADL K VH NENFRRANFTSADMRESDFSG TFNGAYLEKAVAYK NFSGADLSDTLMDRMVLNEAN TNAVLVRSVLTRSDL
Subjt:  ADTRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDL

Query:  GGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCE
        GGA I GADFSDAVIDL QKQALCKYA+GTNP+TGV TR SLGCGNSRRNAYG+PSSPLLSAPPQ+LL RDGFCD+ TGLC+
Subjt:  GGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCE

AT1G12250.2 Pentapeptide repeat-containing protein8.0e-9686.7Show/hide
Query:  AAAVISFSSGMPSVAELNKYEADTRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMV
        AAAVI+ SSG+P++AELN++EADTRGEFGIGSAAQ+GSADL K VH NENFRRANFTSADMRESDFSG TFNGAYLEKAVAYK NFSGADLSDTLMDRMV
Subjt:  AAAVISFSSGMPSVAELNKYEADTRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMV

Query:  LNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTG
        LNEAN TNAVLVRSVLTRSDLGGA I GADFSDAVIDL QKQALCKYA+GTNP+TGV TR SLGCGNSRRNAYG+PSSPLLSAPPQ+LL RDGFCD+ TG
Subjt:  LNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTG

Query:  LCE
        LC+
Subjt:  LCE

AT5G53490.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-0424.57Show/hide
Query:  ASTALAAAAVISFSSGMPSVA-ELNKYE-ADTRGEFGIGSAAQFGSADLRKAVHVNE--NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGAD
        A T   A+ VI+ +  +P ++ E ++ E A      G  +       DLR   + N+  N +    ++A M  + F G       + KA A + +F G +
Subjt:  ASTALAAAAVISFSSGMPSVA-ELNKYE-ADTRGEFGIGSAAQFGSADLRKAVHVNE--NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGAD

Query:  LSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC
         ++ ++DR+   ++N   AV   +VL+ S    A +    F D +I     Q +C+     N       R  LGC
Subjt:  LSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC

AT5G53490.2 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-0424.57Show/hide
Query:  ASTALAAAAVISFSSGMPSVA-ELNKYE-ADTRGEFGIGSAAQFGSADLRKAVHVNE--NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGAD
        A T   A+ VI+ +  +P ++ E ++ E A      G  +       DLR   + N+  N +    ++A M  + F G       + KA A + +F G +
Subjt:  ASTALAAAAVISFSSGMPSVA-ELNKYE-ADTRGEFGIGSAAQFGSADLRKAVHVNE--NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGAD

Query:  LSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC
         ++ ++DR+   ++N   AV   +VL+ S    A +    F D +I     Q +C+     N       R  LGC
Subjt:  LSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC

AT5G53490.3 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-0424.57Show/hide
Query:  ASTALAAAAVISFSSGMPSVA-ELNKYE-ADTRGEFGIGSAAQFGSADLRKAVHVNE--NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGAD
        A T   A+ VI+ +  +P ++ E ++ E A      G  +       DLR   + N+  N +    ++A M  + F G       + KA A + +F G +
Subjt:  ASTALAAAAVISFSSGMPSVA-ELNKYE-ADTRGEFGIGSAAQFGSADLRKAVHVNE--NFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGAD

Query:  LSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC
         ++ ++DR+   ++N   AV   +VL+ S    A +    F D +I     Q +C+     N       R  LGC
Subjt:  LSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACTTTCTTCCATTTTCTCACTATCAGTAAAATGTCTACCCTTATCTTCTTCTAAATCAAAGCTTCCGAGTTCTCTACAACCCAGAAAACAAATTGCCCTGGTCTC
TCAAACAAACCCACAAAAGGATCAAATTCAAGATTGTTCGGACAGGAAGCACATTGGTAAGATTACAGAGCCCAAGAGATGGCAAAAGTTGGCTTCAACTGCTTTGGCTG
CTGCTGCAGTCATTAGTTTTAGCTCAGGCATGCCTTCAGTTGCTGAACTCAACAAGTATGAGGCTGATACCCGTGGTGAATTCGGAATTGGATCTGCCGCTCAATTTGGT
TCTGCAGATCTCAGGAAAGCAGTGCATGTTAACGAAAATTTCAGGAGAGCCAATTTCACATCTGCTGATATGAGGGAATCAGATTTCAGCGGCTGTACATTCAACGGCGC
ATATCTTGAAAAAGCTGTTGCATACAAGACAAATTTCTCAGGTGCTGATTTGAGCGATACATTGATGGATCGTATGGTATTAAATGAGGCAAATTTTACAAATGCAGTAC
TAGTTAGATCAGTGTTGACACGGAGCGATTTAGGCGGTGCCATAATTGTAGGTGCGGACTTTAGTGACGCCGTTATTGACCTTCCTCAGAAGCAAGCACTTTGCAAGTAT
GCGAGTGGGACGAACCCAGTGACGGGAGTGAGCACGAGGGCGAGCTTAGGGTGCGGAAACAGCCGTAGAAATGCGTACGGAACGCCGTCGTCTCCTCTGCTCAGCGCACC
GCCGCAGCAGCTGCTTGACCGCGACGGTTTCTGCGACCAAGGCACTGGCCTTTGTGAGGCCACTAAATAG
mRNA sequenceShow/hide mRNA sequence
AAAAACATGACTAGTGCATAAAATAGCACGAAAGGCAAACCAAACAAACAAGCGAACACGTATTCAAATCCTTAACCAGAAGTTGTCTACCTACAGCAAGAAACTGATAA
ATCTCTTCCTGTCCAATCGTTTTCTTTCCCCTCTTTTCCCTCTTCTCAACTCTTCCAGCAAAATATTTATCTTTCTTGGCGCAGAAAGATCCCCCAATTCCAACTTCTCA
TGGCACTTTCTTCCATTTTCTCACTATCAGTAAAATGTCTACCCTTATCTTCTTCTAAATCAAAGCTTCCGAGTTCTCTACAACCCAGAAAACAAATTGCCCTGGTCTCT
CAAACAAACCCACAAAAGGATCAAATTCAAGATTGTTCGGACAGGAAGCACATTGGTAAGATTACAGAGCCCAAGAGATGGCAAAAGTTGGCTTCAACTGCTTTGGCTGC
TGCTGCAGTCATTAGTTTTAGCTCAGGCATGCCTTCAGTTGCTGAACTCAACAAGTATGAGGCTGATACCCGTGGTGAATTCGGAATTGGATCTGCCGCTCAATTTGGTT
CTGCAGATCTCAGGAAAGCAGTGCATGTTAACGAAAATTTCAGGAGAGCCAATTTCACATCTGCTGATATGAGGGAATCAGATTTCAGCGGCTGTACATTCAACGGCGCA
TATCTTGAAAAAGCTGTTGCATACAAGACAAATTTCTCAGGTGCTGATTTGAGCGATACATTGATGGATCGTATGGTATTAAATGAGGCAAATTTTACAAATGCAGTACT
AGTTAGATCAGTGTTGACACGGAGCGATTTAGGCGGTGCCATAATTGTAGGTGCGGACTTTAGTGACGCCGTTATTGACCTTCCTCAGAAGCAAGCACTTTGCAAGTATG
CGAGTGGGACGAACCCAGTGACGGGAGTGAGCACGAGGGCGAGCTTAGGGTGCGGAAACAGCCGTAGAAATGCGTACGGAACGCCGTCGTCTCCTCTGCTCAGCGCACCG
CCGCAGCAGCTGCTTGACCGCGACGGTTTCTGCGACCAAGGCACTGGCCTTTGTGAGGCCACTAAATAGATACATAAAGACTTGGAGGGCTCTTCTCGAGAAGTGTTGAT
TGATGTTGTATCGTGGAAATCATTGCCTAGAAAACAAAATCGGCGCGACCTCGATGCTAAATCGCCAGCAGCCGGTTAGTCCCCAGAGTTAAGATAACCTCCGATCTATG
GAGTGCAATTGTTAGAGAATGGAGGAAATTGATGAAAAAATGTCCAGAAAACAGAGATCAAGTAGAATGAAGAGAGGAAGAAGATCAATGTATTCTATGCAAGACCAAAA
TTTTTGAAAGCTTATTAAGAAGAAGAAGGGAAGGAAACCGTTTTAGATCTAAGATTGATTTAACAATTTATTCCGATCTTCCAATTAATCCCACACGATTATTCATGTAA
GTTTTTCACTAATTTAATGTAATTTGAATTTATATAACATATTTATTTTTTTGGAAAATTATTATAAATAGAAAAAATATCAAATTATTTATAAATATAGAAAATTTTCA
CTTTCCATCTGTG
Protein sequenceShow/hide protein sequence
MALSSIFSLSVKCLPLSSSKSKLPSSLQPRKQIALVSQTNPQKDQIQDCSDRKHIGKITEPKRWQKLASTALAAAAVISFSSGMPSVAELNKYEADTRGEFGIGSAAQFG
SADLRKAVHVNENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKY
ASGTNPVTGVSTRASLGCGNSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQGTGLCEATK