; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0002828 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0002828
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProtein TIC 20
Genome locationchr01:2997506..2999672
RNA-Seq ExpressionPI0002828
SyntenyPI0002828
Gene Ontology termsGO:0045037 - protein import into chloroplast stroma (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0009706 - chloroplast inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005691 - Chloroplast protein import component Tic20


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011953.1 Protein TIC 20-IV, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-12970.52Show/hide
Query:  IKYEIRDVPYSCMQSNTALSSASLFSFARKQERCLRL-------------LRTTPDSP-----------------------SCY-------AQCRRLGSA
        IKYEIRDVPYSCM S TALS+ASLF F  +  +  ++             LR    +P                        C+        QCRRLG A
Subjt:  IKYEIRDVPYSCMQSNTALSSASLFSFARKQERCLRL-------------LRTTPDSP-----------------------SCY-------AQCRRLGSA

Query:  NLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLM
        NL T L LSI+NTKQ PCKELKFSSS GMLISH+SAAASP L GEQGSLFHKLPLLPPR YA K P+AFRDDSYSVKRHS VT+KPEWWWRTLACVPYLM
Subjt:  NLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLM

Query:  ALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAA
        ALQMSSTAYYL+PLLEHL+  NLIFYVPG+VQ LPWWFP LYFNLAYFGVVRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGTY+MQYWAA
Subjt:  ALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAA

Query:  VGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        VGFIYIS L +CIR SL GTYAKIPF+ ENALIHTFFS GRY+RPF
Subjt:  VGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

XP_004144571.2 protein TIC 20-IV, chloroplastic [Cucumis sativus]5.8e-13781.53Show/hide
Query:  MQSNTAL-SSASLFSFARKQERCLRLLRTTPDSPS---------------CY-------AQCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLIS
        MQSNTAL SSASLF FARK  +      TT  +P+               C+        QCRRLGSANLDTKLALSI  TKQ+PCKELKFSSSRGM IS
Subjt:  MQSNTAL-SSASLFSFARKQERCLRLLRTTPDSPS---------------CY-------AQCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLIS

Query:  HISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQ
        HISAAASP+LSGEQGSLFHKLPLLPPR YA+KGPRAFRDDSYSVKR SGVTQKP+WW RTLACVPYLMALQMSSTAYYL+PLLEHLD  NLIFYVPGSVQ
Subjt:  HISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQ

Query:  KLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENAL
        +LPWWFPMLYFNLAYFGVVRNKELPHF+RFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAV FIYISSLLVCIRSSL GTYAKIPF+FENAL
Subjt:  KLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENAL

Query:  IHTFFSIGRYYRPF
        IHTFF+IGRYYRPF
Subjt:  IHTFFSIGRYYRPF

XP_008455368.1 PREDICTED: protein TIC 20-IV, chloroplastic isoform X1 [Cucumis melo]3.0e-13392.52Show/hide
Query:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT
        QCRRL SANLDTKLALSIT  KQ+PCKELKFSSSRGMLISHIS+AASPHLSGEQG LFHKLPL PPRN+ARKGPRAFRDDSYSVKR SGVTQKPEWW RT
Subjt:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
        LACVPYLMA+QMSSTAYYL+PLLEHLD  NLIFYVPGSVQ+LPWWFPMLYFNLAYFGVVRNKELPHF RFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
Subjt:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT

Query:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        YAMQYWAAVGFIYISSLLVCIRSSL GTYAKIPF+FENALIHTFF+IGRYYRPF
Subjt:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

XP_016901684.1 PREDICTED: protein TIC 20-IV, chloroplastic isoform X2 [Cucumis melo]3.0e-13392.52Show/hide
Query:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT
        QCRRL SANLDTKLALSIT  KQ+PCKELKFSSSRGMLISHIS+AASPHLSGEQG LFHKLPL PPRN+ARKGPRAFRDDSYSVKR SGVTQKPEWW RT
Subjt:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
        LACVPYLMA+QMSSTAYYL+PLLEHLD  NLIFYVPGSVQ+LPWWFPMLYFNLAYFGVVRNKELPHF RFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
Subjt:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT

Query:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        YAMQYWAAVGFIYISSLLVCIRSSL GTYAKIPF+FENALIHTFF+IGRYYRPF
Subjt:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

XP_038888476.1 protein TIC 20-IV, chloroplastic [Benincasa hispida]4.0e-13090.16Show/hide
Query:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT
        QC R GS+NLDTKL LSITNTKQ P KELK SSS+G+LIS +SAAASPHLSGEQGSLFHKLPLLPPRN A K P+AFRDDSYSVKRHSGVTQKPEWWWRT
Subjt:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
        LACVPYLMALQMSSTAYYL+PLLEHLD  NLIFY+PG VQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLET LDI+WY+SNFMPLIHYNGT
Subjt:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT

Query:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        YAMQYWAAVGFIYISSLLVCIRSSL GTYAKIPF+FENALIHTFFSIGRYYRPF
Subjt:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

TrEMBL top hitse value%identityAlignment
A0A0A0K289 Protein TIC 201.3e-15082.3Show/hide
Query:  MTPKVVDISRKRSIKYEIRDVPYSCMQSNTAL-SSASLFSFARKQERCLRLLRTTPDSPS---------------CY-------AQCRRLGSANLDTKLA
        M PKVVDISRKRSIKYEIRDV YSCMQSNTAL SSASLF FARK  +      TT  +P+               C+        QCRRLGSANLDTKLA
Subjt:  MTPKVVDISRKRSIKYEIRDVPYSCMQSNTAL-SSASLFSFARKQERCLRLLRTTPDSPS---------------CY-------AQCRRLGSANLDTKLA

Query:  LSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMALQMSST
        LSI  TKQ+PCKELKFSSSRGM ISHISAAASP+LSGEQGSLFHKLPLLPPR YA+KGPRAFRDDSYSVKR SGVTQKP+WW RTLACVPYLMALQMSST
Subjt:  LSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMALQMSST

Query:  AYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYIS
        AYYL+PLLEHLD  NLIFYVPGSVQ+LPWWFPMLYFNLAYFGVVRNKELPHF+RFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAV FIYIS
Subjt:  AYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYIS

Query:  SLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        SLLVCIRSSL GTYAKIPF+FENALIHTFF+IGRYYRPF
Subjt:  SLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

A0A1S3C1G8 Protein TIC 201.4e-13392.52Show/hide
Query:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT
        QCRRL SANLDTKLALSIT  KQ+PCKELKFSSSRGMLISHIS+AASPHLSGEQG LFHKLPL PPRN+ARKGPRAFRDDSYSVKR SGVTQKPEWW RT
Subjt:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
        LACVPYLMA+QMSSTAYYL+PLLEHLD  NLIFYVPGSVQ+LPWWFPMLYFNLAYFGVVRNKELPHF RFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
Subjt:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT

Query:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        YAMQYWAAVGFIYISSLLVCIRSSL GTYAKIPF+FENALIHTFF+IGRYYRPF
Subjt:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

A0A1S4E0E4 Protein TIC 201.4e-13392.52Show/hide
Query:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT
        QCRRL SANLDTKLALSIT  KQ+PCKELKFSSSRGMLISHIS+AASPHLSGEQG LFHKLPL PPRN+ARKGPRAFRDDSYSVKR SGVTQKPEWW RT
Subjt:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
        LACVPYLMA+QMSSTAYYL+PLLEHLD  NLIFYVPGSVQ+LPWWFPMLYFNLAYFGVVRNKELPHF RFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
Subjt:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT

Query:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        YAMQYWAAVGFIYISSLLVCIRSSL GTYAKIPF+FENALIHTFF+IGRYYRPF
Subjt:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

A0A6J1GL44 Protein TIC 203.0e-12385.43Show/hide
Query:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT
        QCRRL  ANL T L LSI+NTKQ PCKELKFSSSRGMLISH+SAAASP L GEQGSLFHKLPLLPPR YA K P+ FRDDSY VKRHS VT+KPEWWWRT
Subjt:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
        LACVPYLMALQMSSTAYYL+PLLEHL+  NLIFYVPG+VQ LPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGT
Subjt:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT

Query:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        Y+MQYWAAVGFIYIS L +CIRSSL GTYAKIPF+ ENALIHTFFS GRY+RPF
Subjt:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

A0A6J1GMA4 Protein TIC 203.0e-12385.43Show/hide
Query:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT
        QCRRL  ANL T L LSI+NTKQ PCKELKFSSSRGMLISH+SAAASP L GEQGSLFHKLPLLPPR YA K P+ FRDDSY VKRHS VT+KPEWWWRT
Subjt:  QCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRT

Query:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT
        LACVPYLMALQMSSTAYYL+PLLEHL+  NLIFYVPG+VQ LPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLL T+LDI+WY SNFMPLIHYNGT
Subjt:  LACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGT

Query:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF
        Y+MQYWAAVGFIYIS L +CIRSSL GTYAKIPF+ ENALIHTFFS GRY+RPF
Subjt:  YAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF

SwissProt top hitse value%identityAlignment
Q8GZ79 Protein TIC 20-I, chloroplastic1.9e-4548.58Show/hide
Query:  SRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLVPLLEHLDAY
        SRG+ +S++SA++S  L+GEQGSL   LP+LP R      PRA +D   S  R   +T+KP+WWWRTLAC+PYLM L      + TAY+L P LE  +  
Subjt:  SRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLVPLLEHLDAY

Query:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTY
           F   G++ +LP WF M YF +AY G+VR KE PHF RFHV+MGMLLE +L +I   S +MPL  Y G + M +W AV F Y+ ++L  IR +L G Y
Subjt:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTY

Query:  AKIPFLFENALI
        A IPF+ + A I
Subjt:  AKIPFLFENALI

Q9ZQZ9 Protein TIC 20-IV, chloroplastic8.3e-5444.44Show/hide
Query:  NLDT--KLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGP----RAFRDDSYSVKRHSGVTQKPEWWWRTLA
        N+D+  KL LS ++  +R  +E+   S+   +    +A +S  L        + LP L P     + P    R  +DD + +K    + ++PEWWWRTLA
Subjt:  NLDT--KLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGP----RAFRDDSYSVKRHSGVTQKPEWWWRTLA

Query:  CVPYLMALQMSSTAYYLVPLLEHLDAY-NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTY
        CVPYL++LQ+S   +Y+ P LE  DA  ++I+++PG++ + P WF M+Y  L Y  VV+NKELPH++RFH+MMGMLLET+L +IW  SNF PLIH+ G +
Subjt:  CVPYLMALQMSSTAYYLVPLLEHLDAY-NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTY

Query:  AMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRP
         M YW A+GF YI  LL CIR +L G YA+IPF+ + A IHT F++G + RP
Subjt:  AMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRP

Q9ZST8 Protein TIC 20, chloroplastic6.6e-3538.86Show/hide
Query:  RGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLM----ALQMSSTAYYLVPLLEHLDAYN
        RGM  + +SA +S  LSG Q  L   +P+LP  + +   PRA +D S    R   +T+KP WWWRTL+C+PYL+    A   + TAY+L P + +     
Subjt:  RGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLM----ALQMSSTAYYLVPLLEHLDAYN

Query:  LIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTYA
          F +  ++  LP W  + YF +AY  +VR KE PHF RFHV +GML+E +L +    S +MP   Y G   M +W    F+++ + + CIR +L G YA
Subjt:  LIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTYA

Query:  KIPFLFENALI
         +PF+ + A I
Subjt:  KIPFLFENALI

Arabidopsis top hitse value%identityAlignment
AT1G04940.1 translocon at the inner envelope membrane of chloroplasts 201.3e-4648.58Show/hide
Query:  SRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLVPLLEHLDAY
        SRG+ +S++SA++S  L+GEQGSL   LP+LP R      PRA +D   S  R   +T+KP+WWWRTLAC+PYLM L      + TAY+L P LE  +  
Subjt:  SRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLVPLLEHLDAY

Query:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTY
           F   G++ +LP WF M YF +AY G+VR KE PHF RFHV+MGMLLE +L +I   S +MPL  Y G + M +W AV F Y+ ++L  IR +L G Y
Subjt:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTY

Query:  AKIPFLFENALI
        A IPF+ + A I
Subjt:  AKIPFLFENALI

AT1G04945.3 HIT-type Zinc finger family protein2.0e-4248.47Show/hide
Query:  SRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLVPLLEHLDAY
        SRG+ +S++SA++S  L+GEQGSL   LP+LP R      PRA +D   S  R   +T+KP+WWWRTLAC+PYLM L      + TAY+L P LE  +  
Subjt:  SRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMAL----QMSSTAYYLVPLLEHLDAY

Query:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSL
           F   G++ +LP WF M YF +AY G+VR KE PHF RFHV+MGMLLE +L +I   S +MPL  Y G + M +W AV F Y+ ++L  IR +L
Subjt:  NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSL

AT4G03320.1 translocon at the inner envelope membrane of chloroplasts 20-IV5.9e-5544.44Show/hide
Query:  NLDT--KLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGP----RAFRDDSYSVKRHSGVTQKPEWWWRTLA
        N+D+  KL LS ++  +R  +E+   S+   +    +A +S  L        + LP L P     + P    R  +DD + +K    + ++PEWWWRTLA
Subjt:  NLDT--KLALSITNTKQRPCKELKFSSSRGMLISHISAAASPHLSGEQGSLFHKLPLLPPRNYARKGP----RAFRDDSYSVKRHSGVTQKPEWWWRTLA

Query:  CVPYLMALQMSSTAYYLVPLLEHLDAY-NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTY
        CVPYL++LQ+S   +Y+ P LE  DA  ++I+++PG++ + P WF M+Y  L Y  VV+NKELPH++RFH+MMGMLLET+L +IW  SNF PLIH+ G +
Subjt:  CVPYLMALQMSSTAYYLVPLLEHLDAY-NLIFYVPGSVQKLPWWFPMLYFNLAYFGVVRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTY

Query:  AMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRP
         M YW A+GF YI  LL CIR +L G YA+IPF+ + A IHT F++G + RP
Subjt:  AMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCCAAAAGTTGTGGATATTTCTAGAAAGAGGTCGATAAAATATGAAATTAGAGATGTACCCTATTCTTGCATGCAAAGTAACACGGCCCTATCTTCCGCCTCACT
TTTCTCTTTTGCGCGAAAACAGGAAAGATGTTTGCGGTTGCTCCGAACAACGCCCGACTCCCCTAGTTGTTATGCCCAATGTCGTCGATTGGGCTCTGCCAACTTAGACA
CAAAACTTGCTTTATCTATTACCAATACTAAACAGAGACCTTGTAAAGAACTGAAGTTTTCTTCTTCTAGAGGAATGTTGATATCACATATTTCTGCAGCAGCATCCCCA
CATCTATCCGGGGAACAAGGCAGTCTATTCCATAAACTTCCACTTTTGCCTCCACGAAATTATGCTAGAAAGGGTCCTCGAGCATTCAGAGACGACTCTTACAGTGTAAA
ACGCCACTCTGGGGTCACCCAAAAACCAGAATGGTGGTGGAGGACTTTGGCTTGTGTTCCATATTTGATGGCTTTGCAAATGTCTAGTACAGCATATTACCTCGTGCCCT
TGTTGGAACACTTGGATGCTTATAATTTGATATTTTATGTCCCGGGATCCGTTCAGAAGTTACCGTGGTGGTTCCCCATGTTGTACTTCAACCTTGCATACTTCGGAGTC
GTGAGGAATAAAGAATTGCCTCATTTCATTCGGTTCCATGTCATGATGGGAATGTTATTAGAAACCTCCCTTGACATCATATGGTATGCTAGCAATTTCATGCCACTCAT
ACATTACAATGGTACATACGCAATGCAATATTGGGCTGCAGTGGGGTTCATCTACATTTCCAGCTTGTTGGTGTGTATAAGGAGTTCTCTCTTTGGGACTTATGCCAAAA
TACCATTCCTTTTCGAAAACGCACTTATTCATACATTCTTTAGTATAGGACGATATTATCGACCCTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCCCAAAAGTTGTGGATATTTCTAGAAAGAGGTCGATAAAATATGAAATTAGAGATGTACCCTATTCTTGCATGCAAAGTAACACGGCCCTATCTTCCGCCTCACT
TTTCTCTTTTGCGCGAAAACAGGAAAGATGTTTGCGGTTGCTCCGAACAACGCCCGACTCCCCTAGTTGTTATGCCCAATGTCGTCGATTGGGCTCTGCCAACTTAGACA
CAAAACTTGCTTTATCTATTACCAATACTAAACAGAGACCTTGTAAAGAACTGAAGTTTTCTTCTTCTAGAGGAATGTTGATATCACATATTTCTGCAGCAGCATCCCCA
CATCTATCCGGGGAACAAGGCAGTCTATTCCATAAACTTCCACTTTTGCCTCCACGAAATTATGCTAGAAAGGGTCCTCGAGCATTCAGAGACGACTCTTACAGTGTAAA
ACGCCACTCTGGGGTCACCCAAAAACCAGAATGGTGGTGGAGGACTTTGGCTTGTGTTCCATATTTGATGGCTTTGCAAATGTCTAGTACAGCATATTACCTCGTGCCCT
TGTTGGAACACTTGGATGCTTATAATTTGATATTTTATGTCCCGGGATCCGTTCAGAAGTTACCGTGGTGGTTCCCCATGTTGTACTTCAACCTTGCATACTTCGGAGTC
GTGAGGAATAAAGAATTGCCTCATTTCATTCGGTTCCATGTCATGATGGGAATGTTATTAGAAACCTCCCTTGACATCATATGGTATGCTAGCAATTTCATGCCACTCAT
ACATTACAATGGTACATACGCAATGCAATATTGGGCTGCAGTGGGGTTCATCTACATTTCCAGCTTGTTGGTGTGTATAAGGAGTTCTCTCTTTGGGACTTATGCCAAAA
TACCATTCCTTTTCGAAAACGCACTTATTCATACATTCTTTAGTATAGGACGATATTATCGACCCTTCTAG
Protein sequenceShow/hide protein sequence
MTPKVVDISRKRSIKYEIRDVPYSCMQSNTALSSASLFSFARKQERCLRLLRTTPDSPSCYAQCRRLGSANLDTKLALSITNTKQRPCKELKFSSSRGMLISHISAAASP
HLSGEQGSLFHKLPLLPPRNYARKGPRAFRDDSYSVKRHSGVTQKPEWWWRTLACVPYLMALQMSSTAYYLVPLLEHLDAYNLIFYVPGSVQKLPWWFPMLYFNLAYFGV
VRNKELPHFIRFHVMMGMLLETSLDIIWYASNFMPLIHYNGTYAMQYWAAVGFIYISSLLVCIRSSLFGTYAKIPFLFENALIHTFFSIGRYYRPF