; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G018710 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G018710
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProtein TIC 20
Genome locationCicolChr02:1529703..1531863
RNA-Seq ExpressionCcUC02G018710
SyntenyCcUC02G018710
Gene Ontology termsGO:0045037 - protein import into chloroplast stroma (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0009706 - chloroplast inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005691 - Chloroplast protein import component Tic20


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011953.1 Protein TIC 20-IV, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-15279.37Show/hide
Query:  CPIKYDITDVPYSCMQSNTALPSASLFPSARKT------GKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRL
        CPIKY+I DVPYSCM S TAL +ASLF    +T      GKMFAV PS+ ARL CNSPNPLL   P  T A  +SVRY++V LK CFKP QVV+ QCRRL
Subjt:  CPIKYDITDVPYSCMQSNTALPSASLFPSARKT------GKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRL

Query:  GSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVP
        G ANL T LVLSI+NTKQE CKELK SSS GMLIS +SAAASP L GEQGSLFHKLPLLPPR  AGKSP+AFRDDSYSVKR+S VT+KPEWWWRTLACVP
Subjt:  GSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVP

Query:  YLMALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQY
        YLMALQMSSTAYYLLPLLEHL+VDNLIFYVPG+VQ LPWWFP LYFNLAYFG+VRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQY
Subjt:  YLMALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQY

Query:  WGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF
        W AVGFIYIS+L +CIR SLLGTYAKIPFI ENALIHTFFS GRY+RPF
Subjt:  WGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF

XP_004144571.2 protein TIC 20-IV, chloroplastic [Cucumis sativus]1.8e-14782.98Show/hide
Query:  MQSNTAL-PSASLFPSARKTGKMFAVGPSNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQEL
        MQSNTAL  SASLFP ARKTGKMFA                     PT TA GAS+RY  VFLKRCFKP  VV  QCRRLGSANLDTKL LSIA TKQ+ 
Subjt:  MQSNTAL-PSASLFPSARKTGKMFAVGPSNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQEL

Query:  CKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLLPLLEH
        CKELK SSSRGM IS ISAAASP+LSGEQGSLFHKLPLLPPR  A K PRAFRDDSYSVKR SGVTQKP+WW RTLACVPYLMALQMSSTAYYL+PLLEH
Subjt:  CKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLLPLLEH

Query:  LDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSL
        LDVDNLIFYVPGSVQ+LPWWFPMLYFNLAYFG+VRNKELPHF+RFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYW AV FIYIS LLVCIRSSL
Subjt:  LDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSL

Query:  LGTYAKIPFIFENALIHTFFSIGRYYRPF
        LGTYAKIPFIFENALIHTFF+IGRYYRPF
Subjt:  LGTYAKIPFIFENALIHTFFSIGRYYRPF

XP_022952618.1 protein TIC 20-IV, chloroplastic-like isoform X1 [Cucurbita moschata]1.4e-14278.51Show/hide
Query:  MQSNTALPSASLF------PSARKTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA
        M S TAL +AS F          K GKM AV PS+ ARL CNS NPLL + P TT A  +SVRY++V LK CFKP QVV+ QCRRL  ANL T LVLSI+
Subjt:  MQSNTALPSASLF------PSARKTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA

Query:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL
        NTKQE CKELK SSSRGMLIS +SAAASP L GEQGSLFHKLPLLPPR  AGKSP+ FRDDSY VKR+S VT+KPEWWWRTLACVPYLMALQMSSTAYYL
Subjt:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL

Query:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV
        LPLLEHL+VDNLIFYVPG+VQ LPWWFPMLYFNLAYFG+VRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQYW AVGFIYIS+L +
Subjt:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV

Query:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF
        CIRSSLLGTYAKIPFI ENALIHTFFS GRY+RPF
Subjt:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF

XP_023554543.1 protein TIC 20-IV, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo]2.5e-14479.1Show/hide
Query:  MQSNTALPSASLFPSARKT------GKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA
        M S TAL +ASLF    +T      GKMFAV PS+ ARL CNSPNPLL + P TT A  +SVRY++V LK CFKP QVV+ QCRRL  ANL T LVLSI+
Subjt:  MQSNTALPSASLFPSARKT------GKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA

Query:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL
        NTKQE CKELK SSSRGML+S +SAAASP L GEQGSLFHKLPLLPPR  A KSP+AFRDDSY VKR+S VT+KPEWWWRTLACVPYLMALQMSSTAYYL
Subjt:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL

Query:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV
        LPLLEHL+VDNLIFYVPG+VQ LPWWFPMLYFNLAYFG+VRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQYW AVGFIYIS+L +
Subjt:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV

Query:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF
        CIRSSLLGTYAKIPFI ENALIHTFFS GRY+RPF
Subjt:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF

XP_038888476.1 protein TIC 20-IV, chloroplastic [Benincasa hispida]2.0e-16290.71Show/hide
Query:  RKTGKMFAVGPSNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQI
        +KTGKMFAVGPSNARLHCNSPNPLLP PP T TA GASVRYK+VFLKRCFKP QVV+ QC R GS+NLDTKLVLSI NTKQE  KELK SSS+G+LISQ+
Subjt:  RKTGKMFAVGPSNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQI

Query:  SAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKL
        SAAASPHLSGEQGSLFHKLPLLPPRNCAGKSP+AFRDDSYSVKR+SGVTQKPEWWWRTLACVPYLMALQMSSTAYYLLPLLEHLDVDNLIFY+PG VQKL
Subjt:  SAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKL

Query:  PWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIH
        PWWFPMLYFNLAYFG+VRNKELPHFIRFHVMMGMLLET LDI+WY+SNFMPLIHYNGTYAMQYW AVGFIYIS LLVCIRSSLLGTYAKIPFIFENALIH
Subjt:  PWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIH

Query:  TFFSIGRYYRPF
        TFFSIGRYYRPF
Subjt:  TFFSIGRYYRPF

TrEMBL top hitse value%identityAlignment
A0A0A0K289 Protein TIC 201.1e-15382.27Show/hide
Query:  KCPIKYDITDVPYSCMQSNTAL-PSASLFPSARKTGKMFAVGPSNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANL
        K  IKY+I DV YSCMQSNTAL  SASLFP ARKTGKMFA                     PT TA GAS+RY  VFLKRCFKP  VV  QCRRLGSANL
Subjt:  KCPIKYDITDVPYSCMQSNTAL-PSASLFPSARKTGKMFAVGPSNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANL

Query:  DTKLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL
        DTKL LSIA TKQ+ CKELK SSSRGM IS ISAAASP+LSGEQGSLFHKLPLLPPR  A K PRAFRDDSYSVKR SGVTQKP+WW RTLACVPYLMAL
Subjt:  DTKLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL

Query:  QMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVG
        QMSSTAYYL+PLLEHLDVDNLIFYVPGSVQ+LPWWFPMLYFNLAYFG+VRNKELPHF+RFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYW AV 
Subjt:  QMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVG

Query:  FIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF
        FIYIS LLVCIRSSLLGTYAKIPFIFENALIHTFF+IGRYYRPF
Subjt:  FIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF

A0A6J1GL44 Protein TIC 206.6e-14378.51Show/hide
Query:  MQSNTALPSASLF------PSARKTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA
        M S TAL +AS F          K GKM AV PS+ ARL CNS NPLL + P TT A  +SVRY++V LK CFKP QVV+ QCRRL  ANL T LVLSI+
Subjt:  MQSNTALPSASLF------PSARKTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA

Query:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL
        NTKQE CKELK SSSRGMLIS +SAAASP L GEQGSLFHKLPLLPPR  AGKSP+ FRDDSY VKR+S VT+KPEWWWRTLACVPYLMALQMSSTAYYL
Subjt:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL

Query:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV
        LPLLEHL+VDNLIFYVPG+VQ LPWWFPMLYFNLAYFG+VRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQYW AVGFIYIS+L +
Subjt:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV

Query:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF
        CIRSSLLGTYAKIPFI ENALIHTFFS GRY+RPF
Subjt:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF

A0A6J1GM78 Protein TIC 202.3e-14078.21Show/hide
Query:  MQSNTALPSASLF------PSARKTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA
        M S TAL +AS F          K GKM AV PS+ ARL CNS NPLL + P TT A  +SVR  +V LK CFKP QVV+ QCRRL  ANL T LVLSI+
Subjt:  MQSNTALPSASLF------PSARKTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA

Query:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL
        NTKQE CKELK SSSRGMLIS +SAAASP L GEQGSLFHKLPLLPPR  AGKSP+ FRDDSY VKR+S VT+KPEWWWRTLACVPYLMALQMSSTAYYL
Subjt:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL

Query:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV
        LPLLEHL+VDNLIFYVPG+VQ LPWWFPMLYFNLAYFG+VRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQYW AVGFIYIS+L +
Subjt:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV

Query:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF
        CIRSSLLGTYAKIPFI ENALIHTFFS GRY+RPF
Subjt:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF

A0A6J1GMA4 Protein TIC 204.3e-14281.73Show/hide
Query:  KTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQI
        K GKM AV PS+ ARL CNS NPLL + P TT A  +SVRY++V LK CFKP QVV+ QCRRL  ANL T LVLSI+NTKQE CKELK SSSRGMLIS +
Subjt:  KTGKMFAVGPSN-ARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQI

Query:  SAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKL
        SAAASP L GEQGSLFHKLPLLPPR  AGKSP+ FRDDSY VKR+S VT+KPEWWWRTLACVPYLMALQMSSTAYYLLPLLEHL+VDNLIFYVPG+VQ L
Subjt:  SAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKL

Query:  PWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIH
        PWWFPMLYFNLAYFG+VRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQYW AVGFIYIS+L +CIRSSLLGTYAKIPFI ENALIH
Subjt:  PWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIH

Query:  TFFSIGRYYRPF
        TFFS GRY+RPF
Subjt:  TFFSIGRYYRPF

A0A6J1HZ71 Protein TIC 204.3e-14277.61Show/hide
Query:  MQSNTALPSASLFPSARKT------GKMFAVGP-SNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA
        M S TAL +ASLF    +T      GKMFAV P S+ARL CNSPNPLL + P T  A  +SVRY++V LK CFKP QVV+ QCRRLG ANL   LVL I+
Subjt:  MQSNTALPSASLFPSARKT------GKMFAVGP-SNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIA

Query:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL
        NTKQE CKELK SSSRGM+IS +SAAASP L GEQGSLFHKLPLLPPR  A KSP+AFRDDSY VKR+S +T+KPEWWWRTLACVPYLMALQMSSTAYYL
Subjt:  NTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMALQMSSTAYYL

Query:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV
        LPLLEHL+VDNLIFYVPG+VQ LPWWFPMLY NLAYFG+VRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQYW AVGFIYIS+L +
Subjt:  LPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLV

Query:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF
        CIRSSL+GTYAKIPFI ENALIHTFFS GRY+RPF
Subjt:  CIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF

SwissProt top hitse value%identityAlignment
Q8GZ79 Protein TIC 20-I, chloroplastic5.5e-4649.06Show/hide
Query:  SRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLLPLLEHLDVD
        SRG+ +S +SA++S  L+GEQGSL   LP+LP R     +PRA +D   S  R+  +T+KP+WWWRTLAC+PYLM L      + TAY+L P LE  D +
Subjt:  SRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLLPLLEHLDVD

Query:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTY
         L +   G++ +LP WF M YF +AY G+VR KE PHF RFHV+MGMLLE +L +I   S +MPL  Y G + M +W AV F Y+  +L  IR +L G Y
Subjt:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTY

Query:  AKIPFIFENALI
        A IPF+ + A I
Subjt:  AKIPFIFENALI

Q9ZQZ9 Protein TIC 20-IV, chloroplastic8.0e-5342.52Show/hide
Query:  NLDT--KLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGE------QGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRT
        N+D+  KL LS ++  +   +E+   S+   +    +A +S  L          G   H+ P+ P         R  +DD + +K    + ++PEWWWRT
Subjt:  NLDT--KLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGE------QGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLLPLLE-HLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNG
        LACVPYL++LQ+S   +Y+ P LE H  + ++I+++PG++ + P WF M+Y  L Y  +V+NKELPH++RFH+MMGMLLET+L +IW  SNF PLIH+ G
Subjt:  LACVPYLMALQMSSTAYYLLPLLE-HLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNG

Query:  TYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRP
         + M YW A+GF YI +LL CIR +L G YA+IPF+ + A IHT F++G + RP
Subjt:  TYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRP

Q9ZST8 Protein TIC 20, chloroplastic1.0e-3634.33Show/hide
Query:  GASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAF
        G +V   +V    C  P +V +S  R     +L+ K                     RGM  + +SA +S  LSG Q  L   +P+LP  + +  +PRA 
Subjt:  GASVRYKNVFLKRCFKPGQVVMSQCRRLGSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAF

Query:  RDDSYSVKRYSGVTQKPEWWWRTLACVPYLM----ALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVM
        +D S    R+  +T+KP WWWRTL+C+PYL+    A   + TAY+L P + +       F +  ++  LP W  + YF +AY  +VR KE PHF RFHV 
Subjt:  RDDSYSVKRYSGVTQKPEWWWRTLACVPYLM----ALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVM

Query:  MGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALI
        +GML+E +L +    S +MP   Y G   M +W    F+++   + CIR +L+G YA +PF+ + A I
Subjt:  MGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALI

Arabidopsis top hitse value%identityAlignment
AT1G04940.1 translocon at the inner envelope membrane of chloroplasts 203.9e-4749.06Show/hide
Query:  SRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLLPLLEHLDVD
        SRG+ +S +SA++S  L+GEQGSL   LP+LP R     +PRA +D   S  R+  +T+KP+WWWRTLAC+PYLM L      + TAY+L P LE  D +
Subjt:  SRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLLPLLEHLDVD

Query:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTY
         L +   G++ +LP WF M YF +AY G+VR KE PHF RFHV+MGMLLE +L +I   S +MPL  Y G + M +W AV F Y+  +L  IR +L G Y
Subjt:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSLLGTY

Query:  AKIPFIFENALI
        A IPF+ + A I
Subjt:  AKIPFIFENALI

AT1G04945.3 HIT-type Zinc finger family protein1.3e-4248.98Show/hide
Query:  SRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLLPLLEHLDVD
        SRG+ +S +SA++S  L+GEQGSL   LP+LP R     +PRA +D   S  R+  +T+KP+WWWRTLAC+PYLM L      + TAY+L P LE  D +
Subjt:  SRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLLPLLEHLDVD

Query:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSL
         L +   G++ +LP WF M YF +AY G+VR KE PHF RFHV+MGMLLE +L +I   S +MPL  Y G + M +W AV F Y+  +L  IR +L
Subjt:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIYISVLLVCIRSSL

AT4G03320.1 translocon at the inner envelope membrane of chloroplasts 20-IV5.7e-5442.52Show/hide
Query:  NLDT--KLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGE------QGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRT
        N+D+  KL LS ++  +   +E+   S+   +    +A +S  L          G   H+ P+ P         R  +DD + +K    + ++PEWWWRT
Subjt:  NLDT--KLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGE------QGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLLPLLE-HLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNG
        LACVPYL++LQ+S   +Y+ P LE H  + ++I+++PG++ + P WF M+Y  L Y  +V+NKELPH++RFH+MMGMLLET+L +IW  SNF PLIH+ G
Subjt:  LACVPYLMALQMSSTAYYLLPLLE-HLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNG

Query:  TYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRP
         + M YW A+GF YI +LL CIR +L G YA+IPF+ + A IHT F++G + RP
Subjt:  TYAMQYWGAVGFIYISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAAACGACCCGAAAGGTTTTGGGTATTTCCTATTAAGTGCCCAATAAAATATGATATTACAGATGTACCCTATTCTTGCATGCAAAGTAACACGGCCCTA
CCCTCCGCCTCACTTTTCCCTTCTGCGCGAAAAACAGGAAAGATGTTTGCGGTTGGTCCGAGCAACGCCCGACTCCACTGCAACTCACCGAATCCGCTCCTCCCG
CTGCCGCCGCCGACAACGACGGCGATAGGCGCATCCGTACGGTACAAGAATGTGTTTCTGAAAAGATGTTTCAAGCCCGGCCAAGTGGTTATGTCCCAATGTCGT
CGATTGGGTTCTGCTAACTTGGACACTAAACTTGTTTTATCTATTGCCAATACTAAACAGGAACTTTGTAAAGAACTGAAGCTTTCTTCTTCTAGAGGAATGTTG
ATATCACAGATTTCTGCAGCAGCATCCCCACATCTATCTGGGGAACAAGGCAGTCTATTCCACAAACTTCCACTTTTACCTCCACGAAATTGTGCTGGAAAGAGT
CCTCGAGCATTCAGAGACGACTCTTACAGTGTAAAACGCTACTCTGGGGTCACCCAGAAACCAGAATGGTGGTGGAGAACTTTGGCTTGTGTTCCATATTTGATG
GCTTTGCAGATGTCAAGTACAGCATATTACCTCTTGCCCTTGTTGGAACACTTGGATGTTGATAATTTGATATTTTATGTCCCTGGATCTGTTCAGAAGTTACCG
TGGTGGTTCCCCATGTTGTACTTCAACCTCGCATACTTCGGACTCGTGAGGAATAAAGAATTGCCTCATTTCATTCGGTTCCATGTCATGATGGGGATGTTATTA
GAAACCTCGCTCGACATCATATGGTATGCAAGCAATTTCATGCCACTCATACATTACAATGGTACATATGCAATGCAGTATTGGGGAGCAGTGGGGTTCATCTAC
ATTTCCGTCTTGTTGGTGTGTATAAGGAGTTCTCTCTTGGGGACTTATGCCAAAATACCATTCATTTTCGAAAATGCACTTATTCATACATTCTTTAGTATAGGA
CGATATTATAGACCATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTAAAACGACCCGAAAGGTTTTGGGTATTTCCTATTAAGTGCCCAATAAAATATGATATTACAGATGTACCCTATTCTTGCATGCAAAGTAACACGGCCCTA
CCCTCCGCCTCACTTTTCCCTTCTGCGCGAAAAACAGGAAAGATGTTTGCGGTTGGTCCGAGCAACGCCCGACTCCACTGCAACTCACCGAATCCGCTCCTCCCG
CTGCCGCCGCCGACAACGACGGCGATAGGCGCATCCGTACGGTACAAGAATGTGTTTCTGAAAAGATGTTTCAAGCCCGGCCAAGTGGTTATGTCCCAATGTCGT
CGATTGGGTTCTGCTAACTTGGACACTAAACTTGTTTTATCTATTGCCAATACTAAACAGGAACTTTGTAAAGAACTGAAGCTTTCTTCTTCTAGAGGAATGTTG
ATATCACAGATTTCTGCAGCAGCATCCCCACATCTATCTGGGGAACAAGGCAGTCTATTCCACAAACTTCCACTTTTACCTCCACGAAATTGTGCTGGAAAGAGT
CCTCGAGCATTCAGAGACGACTCTTACAGTGTAAAACGCTACTCTGGGGTCACCCAGAAACCAGAATGGTGGTGGAGAACTTTGGCTTGTGTTCCATATTTGATG
GCTTTGCAGATGTCAAGTACAGCATATTACCTCTTGCCCTTGTTGGAACACTTGGATGTTGATAATTTGATATTTTATGTCCCTGGATCTGTTCAGAAGTTACCG
TGGTGGTTCCCCATGTTGTACTTCAACCTCGCATACTTCGGACTCGTGAGGAATAAAGAATTGCCTCATTTCATTCGGTTCCATGTCATGATGGGGATGTTATTA
GAAACCTCGCTCGACATCATATGGTATGCAAGCAATTTCATGCCACTCATACATTACAATGGTACATATGCAATGCAGTATTGGGGAGCAGTGGGGTTCATCTAC
ATTTCCGTCTTGTTGGTGTGTATAAGGAGTTCTCTCTTGGGGACTTATGCCAAAATACCATTCATTTTCGAAAATGCACTTATTCATACATTCTTTAGTATAGGA
CGATATTATAGACCATTCTAG
Protein sequenceShow/hide protein sequence
MLKRPERFWVFPIKCPIKYDITDVPYSCMQSNTALPSASLFPSARKTGKMFAVGPSNARLHCNSPNPLLPLPPPTTTAIGASVRYKNVFLKRCFKPGQVVMSQCR
RLGSANLDTKLVLSIANTKQELCKELKLSSSRGMLISQISAAASPHLSGEQGSLFHKLPLLPPRNCAGKSPRAFRDDSYSVKRYSGVTQKPEWWWRTLACVPYLM
ALQMSSTAYYLLPLLEHLDVDNLIFYVPGSVQKLPWWFPMLYFNLAYFGLVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWGAVGFIY
ISVLLVCIRSSLLGTYAKIPFIFENALIHTFFSIGRYYRPF