; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G018300 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G018300
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionFUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages;
Genome locationchr09:27132318..27150429
RNA-Seq ExpressionLsi09G018300
SyntenyLsi09G018300
Gene Ontology termsGO:0006457 - protein folding (biological process)
GO:0015031 - protein transport (biological process)
GO:0005576 - extracellular region (cellular component)
InterPro domainsIPR008881 - Trigger factor, ribosome-binding, bacterial
IPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily
IPR036611 - Trigger factor ribosome-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039924.1 uncharacterized protein E6C27_scaffold122G002040 [Cucumis melo var. makuwa]1.0e-28984.36Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE
        KGLGKAGTDILGG VKGAGK+VETVGDV EKAPVVGGIGTVVE TGKAIEN GE TEDFGE+VF+K+E  PK+           +  +D+  EDY++D +
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE

Query:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK
        +  +       ++YEDDDDIDEAEKKLMK++IDD   + EEEE++E   KV+PKN SLKSIRN KYLRYISESEN+DGLLRYS KNIVGPYSKFSV ASK
Subjt:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK

Query:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS
        TK GFFHIRCCYNNKFWVRLSE+SNYIAAIANEEEDDTSKWSCTLFE IFVPEKTG YYIRHVQLNTFLC+AEGDPSPYNDCLVARVED+T IDENLVLS
Subjt:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS

Query:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV
        A  DWDSIFILPKYVAFKSNNDQYLEPSGKYLKFS SSVEDPAVVFEII+MQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI++DNPN LFWPVKVDNNIV
Subjt:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV

Query:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV
        AFRNKGNNRFCKRL+T+GKTNCLNAAVGTIT+TARLEVTEIVVARSVED++YRVNDARVYGKKILTVSKGVAINNTKV DK+SLKFRYEKKVERTWSSSV
Subjt:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV

Query:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT
        SSTFGIATKF +KIPTVGS+KFELSLEVSS NTREETEKEKSFVET ETITIPAMSKVKFSAMVTQA CDVPFSYTRRDTLKDGRQVTHRLEDG+FTGVT
Subjt:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT

Query:  TYDYKFETEKVQSL
        TYDYKFETEKV+SL
Subjt:  TYDYKFETEKVQSL

KAE8646727.1 hypothetical protein Csa_005365 [Cucumis sativus]1.2e-27484.51Show/hide
Query:  APVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKT-IVDQINEDYRDDDEEGDSKESENLDENY-----EDDDDIDEAEKKLMKNEID
        APVVG +GTVVE TGKAIEN GE TEDFGE+VFEK+ENKP++G K++   D + + Y  + +  +    E++D++      EDDDDIDEAEKKLMK++ID
Subjt:  APVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKT-IVDQINEDYRDDDEEGDSKESENLDENY-----EDDDDIDEAEKKLMKNEID

Query:  DDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTKRGFFHIRCCYNNKFWVRLSENSNYIAAIANE
        D   + EEEE++E   KV+PKN SLKSIRN KYLRYISESEN+DGLLR+S KNIVGPYSKFSV ASKTK GFFHIRCCYNNKFWVRLSE+SNYIAA+ANE
Subjt:  DDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTKRGFFHIRCCYNNKFWVRLSENSNYIAAIANE

Query:  EEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLK
        EEDDTSKWSCTLFE IFVPEKTG YYIRHVQLNTFLC+AEGDPSPYNDCLVARVED+TTIDENLVL A  DWDSIFILPKYVAFKSNND+YLEPSGKYLK
Subjt:  EEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLK

Query:  FSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDT
        FS SSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI +DNPN LFWPVKVDNNIVAFRNKGNNRFCKRLTT+GKTNCLNAAVGTIT+T
Subjt:  FSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDT

Query:  ARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSSTFGIATKFTSKIPTVGSLKFELSLEVSSGNT
        ARLE TEIVVARSVED+EYRVNDARVYGKKILTVSKGVAINNTKV DK+SLKFRYEKKVERTWSSSVSSTFGIATKF +KIPTVGSLKFELSLEVSS NT
Subjt:  ARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSSTFGIATKFTSKIPTVGSLKFELSLEVSSGNT

Query:  REETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTYDYKFETEKVQSL
        REETEKEKSFVETGETITIPAMSKVKFSAMVTQA CDVPFSYTRRDTLKDGRQVTHRLEDG+FTGVTTYDYKFETEKV+SL
Subjt:  REETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTYDYKFETEKVQSL

KAG6575375.1 hypothetical protein SDJN03_26014, partial [Cucurbita argyrosperma subsp. sororia]1.8e-26277.28Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKTIVDQINEDYRDDDEEGDSKESE
        +GLGKAGTD LGGV+KGAGK+VETVGDVAEKAP+VGG+GTVVE+TGKAIEN GE TEDFGE+VF+K EN PK+G      DQ+ EDY D+D         
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKTIVDQINEDYRDDDEEGDSKESE

Query:  NLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTKRGFFHI
            N  DD+DIDEAEKKLM +E      D + ++DDEA AK +PKNFSLKS RNNKYLRYISESE++DGLLR+S KNIVGPYSKF++RAS+T+ G  HI
Subjt:  NLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTKRGFFHI

Query:  RCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAAMDWDSI
        RCCYNNKFWVRLSE+SNYIAAIANEEE+D SKWSCTLFE IF+P+K  H YIRHVQLNTFLC+AE DPSPYNDCL ARVED++TID+NLVL  AMDWDSI
Subjt:  RCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAAMDWDSI

Query:  FILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAFRNKGNN
        FILPKYVAFK NN +YLEPSGKYLKFS S+VED +VVFEIIS QDGYV IKHV+SGKYW+RDPNWIWC+S +  +DNPNALFWPVKVD+NIVA RNKGNN
Subjt:  FILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAFRNKGNN

Query:  RFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSSTFGIAT
         FCKRLTTEGKTNCLNAAV TITDTARLEV EIVVARS+ED+EYRVNDARVYGKKILTVSKGVAINNT+V DKV +KFRYEKKVE +WSSSVSSTFGI+T
Subjt:  RFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSSTFGIAT

Query:  KFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTYDYKFET
        K ++KIPTVG LKFELS+EVS G++    E+EKSFVET ETITIP MSKVKFSA+VTQACCDVPFSYT++DTLKDGRQV+HRLEDGIF GVTTYDYKFET
Subjt:  KFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTYDYKFET

Query:  EKV
        EK+
Subjt:  EKV

XP_004140683.2 uncharacterized protein LOC101212952 [Cucumis sativus]2.1e-29084.8Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKT-IVDQINEDYRDDDEEGDSKES
        KGLGKAGTDILGG VKGAGK+VETVGDV EKAPVVG +GTVVE TGKAIEN GE TEDFGE+VFEK+ENKP++G K++   D + + Y  + +  +    
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKT-IVDQINEDYRDDDEEGDSKES

Query:  ENLDENY-----EDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTK
        E++D++      EDDDDIDEAEKKLMK++IDD   + EEEE++E   KV+PKN SLKSIRN KYLRYISESEN+DGLLR+S KNIVGPYSKFSV ASKTK
Subjt:  ENLDENY-----EDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTK

Query:  RGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAA
         GFFHIRCCYNNKFWVRLSE+SNYIAA+ANEEEDDTSKWSCTLFE IFVPEKTG YYIRHVQLNTFLC+AEGDPSPYNDCLVARVED+TTIDENLVL A 
Subjt:  RGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAA

Query:  MDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAF
         DWDSIFILPKYVAFKSNND+YLEPSGKYLKFS SSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI +DNPN LFWPVKVDNNIVAF
Subjt:  MDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAF

Query:  RNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSS
        RNKGNNRFCKRLTT+GKTNCLNAAVGTIT+TARLE TEIVVARSVED+EYRVNDARVYGKKILTVSKGVAINNTKV DK+SLKFRYEKKVERTWSSSVSS
Subjt:  RNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSS

Query:  TFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTY
        TFGIATKF +KIPTVGSLKFELSLEVSS NTREETEKEKSFVETGETITIPAMSKVKFSAMVTQA CDVPFSYTRRDTLKDGRQVTHRLEDG+FTGVTTY
Subjt:  TFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTY

Query:  DYKFETEKVQSL
        DYKFETEKV+SL
Subjt:  DYKFETEKVQSL

XP_008460195.1 PREDICTED: uncharacterized protein LOC103499080 [Cucumis melo]6.0e-29084.36Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE
        KGLGKAGTDILGG VKGAGK+VETVGDV EKAPVVGGIGTVVE TGKAIEN GE TEDFGE+VF+K+E  PK+           +  +D+  EDY++D +
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE

Query:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK
        +  +       ++YEDDDDIDEAEKKLMK++IDD   + EEEE++E   KV+PKN SLKSIRN KYLRYISESEN+DGLLRYS KNIVGPYSKFSV ASK
Subjt:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK

Query:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS
        TK GFFHIRCCYNNKFWVRLSE+SNYIAAIANEEEDDTSKWSCTLFE IFVPEKTG YYIRHVQLNTFLC+AEGDPSPYNDCLVARVED+T IDENLVLS
Subjt:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS

Query:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV
        A  DWDSIFILPKYVAFKSNNDQYLEPSGKYLKFS SSVEDPAVVFEII+MQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI++DNPN LFWPVKVDNNIV
Subjt:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV

Query:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV
        AFRNKGNNRFCKRL+T+GKTNCLNAAVGTIT+TARLEVTEIVVARSVED++YRVNDARVYGKKILTVSKGVAINNTKV DK+SLKFRYEKKVERTWSSSV
Subjt:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV

Query:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT
        SSTFGIATKF +KIPTVGS+KFELSLEVSS NTREETEKEKSFVET ETITIPAMSKVKFSAMVTQA CDVPFSYTRRDTLKDGRQVTHRLEDG+FTGVT
Subjt:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT

Query:  TYDYKFETEKVQSL
        TYDYKFETEKV+SL
Subjt:  TYDYKFETEKVQSL

TrEMBL top hitse value%identityAlignment
A0A0A0K983 Uncharacterized protein1.0e-29084.8Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKT-IVDQINEDYRDDDEEGDSKES
        KGLGKAGTDILGG VKGAGK+VETVGDV EKAPVVG +GTVVE TGKAIEN GE TEDFGE+VFEK+ENKP++G K++   D + + Y  + +  +    
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKT-IVDQINEDYRDDDEEGDSKES

Query:  ENLDENY-----EDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTK
        E++D++      EDDDDIDEAEKKLMK++IDD   + EEEE++E   KV+PKN SLKSIRN KYLRYISESEN+DGLLR+S KNIVGPYSKFSV ASKTK
Subjt:  ENLDENY-----EDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTK

Query:  RGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAA
         GFFHIRCCYNNKFWVRLSE+SNYIAA+ANEEEDDTSKWSCTLFE IFVPEKTG YYIRHVQLNTFLC+AEGDPSPYNDCLVARVED+TTIDENLVL A 
Subjt:  RGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAA

Query:  MDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAF
         DWDSIFILPKYVAFKSNND+YLEPSGKYLKFS SSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI +DNPN LFWPVKVDNNIVAF
Subjt:  MDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAF

Query:  RNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSS
        RNKGNNRFCKRLTT+GKTNCLNAAVGTIT+TARLE TEIVVARSVED+EYRVNDARVYGKKILTVSKGVAINNTKV DK+SLKFRYEKKVERTWSSSVSS
Subjt:  RNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSS

Query:  TFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTY
        TFGIATKF +KIPTVGSLKFELSLEVSS NTREETEKEKSFVETGETITIPAMSKVKFSAMVTQA CDVPFSYTRRDTLKDGRQVTHRLEDG+FTGVTTY
Subjt:  TFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTY

Query:  DYKFETEKVQSL
        DYKFETEKV+SL
Subjt:  DYKFETEKVQSL

A0A0A0KD65 Uncharacterized protein4.6e-28081.09Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKTIVDQINEDYRDDD---EEGDSK
        KGLGKAGTDILGG VKGAGK VETVG+ AEKAPVVGGIGTVVE TGKAIEN G+ TE+ GEKVFE KE KPKK LK TI+DQINEDY  DD   ++GDSK
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKTIVDQINEDYRDDD---EEGDSK

Query:  ESENLD-------------ENYEDD--DDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGP
        ESE                E  E+D  D+IDEAEK+LMK++I+D   + EE E+DE + KV+PKNFSLK +RNNKYLRYISESEN+DGLLRYSSKNIVGP
Subjt:  ESENLD-------------ENYEDD--DDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGP

Query:  YSKFSVRASKTKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDL
        YSKF++R+SKTK GFFHIRCCYNNKFWVRLSENS+YIAAIANEEEDDTSKWS TLFE IFV EK G  YIRHVQLN FLCIAEG P PYNDCLVARVED+
Subjt:  YSKFSVRASKTKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDL

Query:  TTIDENLVLSAAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALF
        +TIDENL LSA MDWDSIFILP+YVAFK NND+YLEPS KYLKFSGSS E+PAVVF+IISMQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI +DNPN LF
Subjt:  TTIDENLVLSAAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALF

Query:  WPVKVDNNIVAFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEK
        WPVKVDNNIVAFRNKGNNRFCKRLTT+GKTNCLNAAVGTIT+TARLE TEIVVARS+ED++YRVNDARVYG K LTVSKGVAINNTKV DKVSLK RYEK
Subjt:  WPVKVDNNIVAFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEK

Query:  KVERTWSSSVSSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHR
        KVERTWSSSVSSTFG+AT+F SKIPTVGSLKFELSLEVS   TREETEKEKSFVE+GE I IPAMSKVKFSA+V QACCD+PFSYTRRDTLKDGRQVTHR
Subjt:  KVERTWSSSVSSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHR

Query:  LEDGIFTGVTTYDYKFETEKVQSL
        L+DGIF GVTTYDYK ETEKV+SL
Subjt:  LEDGIFTGVTTYDYKFETEKVQSL

A0A1S3CBI1 uncharacterized protein LOC1034990802.9e-29084.36Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE
        KGLGKAGTDILGG VKGAGK+VETVGDV EKAPVVGGIGTVVE TGKAIEN GE TEDFGE+VF+K+E  PK+           +  +D+  EDY++D +
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE

Query:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK
        +  +       ++YEDDDDIDEAEKKLMK++IDD   + EEEE++E   KV+PKN SLKSIRN KYLRYISESEN+DGLLRYS KNIVGPYSKFSV ASK
Subjt:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK

Query:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS
        TK GFFHIRCCYNNKFWVRLSE+SNYIAAIANEEEDDTSKWSCTLFE IFVPEKTG YYIRHVQLNTFLC+AEGDPSPYNDCLVARVED+T IDENLVLS
Subjt:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS

Query:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV
        A  DWDSIFILPKYVAFKSNNDQYLEPSGKYLKFS SSVEDPAVVFEII+MQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI++DNPN LFWPVKVDNNIV
Subjt:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV

Query:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV
        AFRNKGNNRFCKRL+T+GKTNCLNAAVGTIT+TARLEVTEIVVARSVED++YRVNDARVYGKKILTVSKGVAINNTKV DK+SLKFRYEKKVERTWSSSV
Subjt:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV

Query:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT
        SSTFGIATKF +KIPTVGS+KFELSLEVSS NTREETEKEKSFVET ETITIPAMSKVKFSAMVTQA CDVPFSYTRRDTLKDGRQVTHRLEDG+FTGVT
Subjt:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT

Query:  TYDYKFETEKVQSL
        TYDYKFETEKV+SL
Subjt:  TYDYKFETEKVQSL

A0A5A7T8Z0 Uncharacterized protein4.9e-29084.36Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE
        KGLGKAGTDILGG VKGAGK+VETVGDV EKAPVVGGIGTVVE TGKAIEN GE TEDFGE+VF+K+E  PK+           +  +D+  EDY++D +
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLK--------KTIVDQINEDYRDDDE

Query:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK
        +  +       ++YEDDDDIDEAEKKLMK++IDD   + EEEE++E   KV+PKN SLKSIRN KYLRYISESEN+DGLLRYS KNIVGPYSKFSV ASK
Subjt:  EGDSKESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASK

Query:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS
        TK GFFHIRCCYNNKFWVRLSE+SNYIAAIANEEEDDTSKWSCTLFE IFVPEKTG YYIRHVQLNTFLC+AEGDPSPYNDCLVARVED+T IDENLVLS
Subjt:  TKRGFFHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLS

Query:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV
        A  DWDSIFILPKYVAFKSNNDQYLEPSGKYLKFS SSVEDPAVVFEII+MQDGYVRIKHVSSGKYWIRDP+WIWC+SIDI++DNPN LFWPVKVDNNIV
Subjt:  AAMDWDSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIV

Query:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV
        AFRNKGNNRFCKRL+T+GKTNCLNAAVGTIT+TARLEVTEIVVARSVED++YRVNDARVYGKKILTVSKGVAINNTKV DK+SLKFRYEKKVERTWSSSV
Subjt:  AFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSV

Query:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT
        SSTFGIATKF +KIPTVGS+KFELSLEVSS NTREETEKEKSFVET ETITIPAMSKVKFSAMVTQA CDVPFSYTRRDTLKDGRQVTHRLEDG+FTGVT
Subjt:  SSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVT

Query:  TYDYKFETEKVQSL
        TYDYKFETEKV+SL
Subjt:  TYDYKFETEKVQSL

A0A6J1GPP7 uncharacterized protein LOC1114563419.7e-26277.19Show/hide
Query:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKTIVDQINEDYRDDDEEGDSKESE
        +GLGKAGTD LGGV+KGAGK+VETVGDVAEKAP+VGG+GTVVE+TGKAIEN GE TEDFGE+VF+K EN PK+G      DQ+ EDY             
Subjt:  KGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKTIVDQINEDYRDDDEEGDSKESE

Query:  NLDENYEDDDDIDEAEKKLMKNE---IDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTKRGF
               DD+DIDEAEKKLM +E   + DDS  E+ ++DDEA AK +PKNFSLKS RNNKYLRYISESE++DGLLR+S KNIVGPYSKF++RAS+T+ G 
Subjt:  NLDENYEDDDDIDEAEKKLMKNE---IDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTKRGF

Query:  FHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAAMDW
         HIRCCYNNKFWVRLSE+SNYIAAIANEEE+D SKWSCTLFE IF+P+K  H YIRHVQLNTFLC+AE DPSPYNDCL ARVED++TID+NLVL  AMDW
Subjt:  FHIRCCYNNKFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAAMDW

Query:  DSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAFRNK
        DSIFILPKYVAFK NN +YLEPSGKYLKFS S+VED +VVFEIIS QDGYV IKHV+SGKYW+RDPNWIWCES +  +DNPNALFWPVKVD+NIVA RNK
Subjt:  DSIFILPKYVAFKSNNDQYLEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAFRNK

Query:  GNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSSTFG
        GNN FCKRLTTEGKTNCLNAAV TITDTARLEV EIVVARS+ED+EYRVNDARVYGKKILTVSKGVAINNT+V DKV +KFRYEKKVE +WSSSVSSTFG
Subjt:  GNNRFCKRLTTEGKTNCLNAAVGTITDTARLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSSTFG

Query:  IATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTYDYK
        I+TK ++KIPTVG LKFELS+EVS G++    E+EKSFVET ETITIP MSKVKFSA+VTQACCDVPFSYT++DTLKDGRQV+HRLEDGIF GVTTYDYK
Subjt:  IATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFVETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTYDYK

Query:  FETEK
        FETEK
Subjt:  FETEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G30695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 253 Blast hits to 253 proteins in 72 species: Archae - 0; Bacteria - 138; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink).5.3e-3458.82Show/hide
Query:  AAASDPEDVSVSSSQFEDFSVTNATDTTENKELKIRVEVSGVKTRAIFNNVFDKMVAEAQPIPGFRRVKGGNIQRHIPRDILLEILGPSKVYKQVIKEVI
        A  + P DV  SS   E   +T     T N E+K+ V+VSG KT+ +FN+VF+KMVA AQPIPGFRRVKGG    +IP+D+LLEILG SKVYKQVIK++I
Subjt:  AAASDPEDVSVSSSQFEDFSVTNATDTTENKELKIRVEVSGVKTRAIFNNVFDKMVAEAQPIPGFRRVKGGNIQRHIPRDILLEILGPSKVYKQVIKEVI

Query:  NSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDE
        NS +  YV++E LKVGK+L + QSYEDLE+ FEP E
Subjt:  NSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDE

AT2G30695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).5.3e-3458.82Show/hide
Query:  AAASDPEDVSVSSSQFEDFSVTNATDTTENKELKIRVEVSGVKTRAIFNNVFDKMVAEAQPIPGFRRVKGGNIQRHIPRDILLEILGPSKVYKQVIKEVI
        A  + P DV  SS   E   +T     T N E+K+ V+VSG KT+ +FN+VF+KMVA AQPIPGFRRVKGG    +IP+D+LLEILG SKVYKQVIK++I
Subjt:  AAASDPEDVSVSSSQFEDFSVTNATDTTENKELKIRVEVSGVKTRAIFNNVFDKMVAEAQPIPGFRRVKGGNIQRHIPRDILLEILGPSKVYKQVIKEVI

Query:  NSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDE
        NS +  YV++E LKVGK+L + QSYEDLE+ FEP E
Subjt:  NSTVAAYVEKEALKVGKDLRIEQSYEDLEDQFEPDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACCTGTTGGTCTAGTGGCAGTGGGAACATCCAAAAAAAGCCAAAGGGCTAAGGGGTTGTGGGTTCAATCCATGGTGACCACCTACCTAGGATTTAATATCCTACG
AAGTTTTGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAACGTTGAAAAATGCTAAAGTAGTTGTAGAATCTGAAGAGGAAAACCAGATACAACTAAGAGTGGACT
TGACTGGGGATGAGACACAAAAGTTGTTGTTTCTGGTATCAACTATGGAATCCCCAAAAACAGAGGCACATGATGCTGCAGCTTCTGATCCTGAAGATGTGAGTGTTTCT
TCTTCTCAGTTTGAAGACTTCTCCGTCACTAATGCTACTGATACAACTGAGAATAAAGAACTAAAGATTCGTGTTGAGGTGTCTGGAGTCAAAACTCGAGCAATTTTCAA
CAATGTGTTTGACAAAATGGTTGCTGAAGCCCAGCCTATTCCAGGCTTTAGAAGAGTGAAAGGAGGTAATATCCAGAGGCATATACCCCGAGACATTCTATTAGAGATAC
TGGGACCTTCTAAGGTGTACAAACAAGTTATTAAGGAAGTTATCAACTCTACTGTTGCTGCATATGTGGAAAAGGAAGCTCTAAAAGTGGGTAAAGACTTGAGAATAGAG
CAAAGCTATGAGGATCTTGAAGACCAATTTGAACCAGATGAAAAAGGACTAGGGAAAGCTGGGACTGACATTTTGGGAGGAGTTGTGAAAGGAGCAGGAAAGGTGGTTGA
AACAGTGGGAGATGTGGCTGAGAAGGCGCCGGTCGTCGGTGGCATCGGCACTGTCGTGGAGAGCACCGGGAAGGCAATCGAAAATGCCGGTGAGGTGACCGAGGATTTCG
GCGAAAAAGTATTTGAAAAGAAAGAAAATAAGCCCAAAAAAGGTCTTAAAAAAACTATAGTCGACCAAATTAATGAAGATTATCGTGACGACGACGAGGAAGGTGACTCG
AAAGAAAGCGAAAACCTTGATGAAAACTATGAAGACGATGATGACATAGATGAAGCAGAGAAGAAGTTGATGAAGAATGAAATAGATGATGATTCAGGTGACGAAGAAGA
AGAAGAAGACGACGAAGCAGCAGCGAAGGTACTCCCGAAGAATTTCTCCCTCAAATCCATCCGCAACAACAAATACCTTCGCTACATAAGCGAAAGCGAAAACTCAGATG
GACTCCTCCGTTACTCCAGCAAGAACATTGTCGGTCCGTATTCGAAATTCTCCGTTCGCGCATCGAAAACCAAACGGGGTTTCTTCCACATAAGATGTTGTTACAACAAC
AAATTCTGGGTTCGTTTATCTGAAAACTCCAACTACATTGCAGCCATTGCCAACGAAGAAGAAGACGACACATCGAAATGGTCGTGCACTTTGTTCGAACTGATTTTCGT
ACCGGAAAAAACCGGACATTACTACATCCGTCATGTTCAACTCAACACCTTCCTTTGCATAGCTGAAGGAGATCCTTCACCTTACAATGATTGTTTAGTTGCAAGAGTTG
AAGACTTAACAACCATTGACGAGAATCTTGTTCTGTCAGCCGCCATGGATTGGGACTCCATATTTATACTACCAAAATACGTAGCTTTCAAAAGCAACAACGACCAATAT
CTAGAACCATCTGGAAAATACCTTAAATTTTCAGGTTCTAGCGTGGAAGATCCAGCCGTTGTGTTTGAGATAATATCCATGCAAGATGGGTATGTTCGTATCAAACATGT
GAGTTCAGGTAAGTATTGGATTCGAGATCCGAATTGGATATGGTGTGAATCAATCGACATTGAAAAAGACAACCCCAACGCTTTGTTTTGGCCTGTGAAAGTTGATAACA
ATATCGTGGCGTTTCGTAACAAAGGCAACAACCGTTTTTGCAAGAGGTTGACGACGGAAGGCAAGACTAATTGCCTTAATGCCGCGGTTGGAACGATTACGGATACCGCA
CGTTTGGAAGTAACAGAGATTGTTGTTGCAAGAAGTGTGGAAGATATTGAGTATCGTGTTAATGATGCAAGAGTTTATGGTAAGAAGATTCTCACTGTGTCAAAAGGGGT
TGCTATTAACAACACGAAAGTTGAAGATAAAGTAAGTTTGAAGTTTAGGTATGAGAAGAAGGTGGAAAGAACATGGAGTTCGTCGGTGTCGTCGACTTTCGGAATTGCTA
CCAAGTTTACATCGAAGATTCCAACGGTTGGGAGTTTGAAGTTTGAGCTTTCGTTGGAGGTCTCGAGTGGAAACACGAGGGAAGAAACGGAGAAGGAAAAATCATTTGTC
GAGACCGGAGAGACGATAACTATACCGGCAATGTCGAAGGTGAAGTTTAGTGCAATGGTAACACAAGCTTGTTGTGATGTTCCTTTTTCCTATACTCGAAGGGACACTTT
GAAAGATGGAAGACAAGTGACACATCGTTTGGAAGATGGTATTTTCACAGGTGTTACAACTTATGATTATAAATTTGAGACTGAAAAAGTACAATCACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGACCTGTTGGTCTAGTGGCAGTGGGAACATCCAAAAAAAGCCAAAGGGCTAAGGGGTTGTGGGTTCAATCCATGGTGACCACCTACCTAGGATTTAATATCCTACG
AAGTTTTGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAACGTTGAAAAATGCTAAAGTAGTTGTAGAATCTGAAGAGGAAAACCAGATACAACTAAGAGTGGACT
TGACTGGGGATGAGACACAAAAGTTGTTGTTTCTGGTATCAACTATGGAATCCCCAAAAACAGAGGCACATGATGCTGCAGCTTCTGATCCTGAAGATGTGAGTGTTTCT
TCTTCTCAGTTTGAAGACTTCTCCGTCACTAATGCTACTGATACAACTGAGAATAAAGAACTAAAGATTCGTGTTGAGGTGTCTGGAGTCAAAACTCGAGCAATTTTCAA
CAATGTGTTTGACAAAATGGTTGCTGAAGCCCAGCCTATTCCAGGCTTTAGAAGAGTGAAAGGAGGTAATATCCAGAGGCATATACCCCGAGACATTCTATTAGAGATAC
TGGGACCTTCTAAGGTGTACAAACAAGTTATTAAGGAAGTTATCAACTCTACTGTTGCTGCATATGTGGAAAAGGAAGCTCTAAAAGTGGGTAAAGACTTGAGAATAGAG
CAAAGCTATGAGGATCTTGAAGACCAATTTGAACCAGATGAAAAAGGACTAGGGAAAGCTGGGACTGACATTTTGGGAGGAGTTGTGAAAGGAGCAGGAAAGGTGGTTGA
AACAGTGGGAGATGTGGCTGAGAAGGCGCCGGTCGTCGGTGGCATCGGCACTGTCGTGGAGAGCACCGGGAAGGCAATCGAAAATGCCGGTGAGGTGACCGAGGATTTCG
GCGAAAAAGTATTTGAAAAGAAAGAAAATAAGCCCAAAAAAGGTCTTAAAAAAACTATAGTCGACCAAATTAATGAAGATTATCGTGACGACGACGAGGAAGGTGACTCG
AAAGAAAGCGAAAACCTTGATGAAAACTATGAAGACGATGATGACATAGATGAAGCAGAGAAGAAGTTGATGAAGAATGAAATAGATGATGATTCAGGTGACGAAGAAGA
AGAAGAAGACGACGAAGCAGCAGCGAAGGTACTCCCGAAGAATTTCTCCCTCAAATCCATCCGCAACAACAAATACCTTCGCTACATAAGCGAAAGCGAAAACTCAGATG
GACTCCTCCGTTACTCCAGCAAGAACATTGTCGGTCCGTATTCGAAATTCTCCGTTCGCGCATCGAAAACCAAACGGGGTTTCTTCCACATAAGATGTTGTTACAACAAC
AAATTCTGGGTTCGTTTATCTGAAAACTCCAACTACATTGCAGCCATTGCCAACGAAGAAGAAGACGACACATCGAAATGGTCGTGCACTTTGTTCGAACTGATTTTCGT
ACCGGAAAAAACCGGACATTACTACATCCGTCATGTTCAACTCAACACCTTCCTTTGCATAGCTGAAGGAGATCCTTCACCTTACAATGATTGTTTAGTTGCAAGAGTTG
AAGACTTAACAACCATTGACGAGAATCTTGTTCTGTCAGCCGCCATGGATTGGGACTCCATATTTATACTACCAAAATACGTAGCTTTCAAAAGCAACAACGACCAATAT
CTAGAACCATCTGGAAAATACCTTAAATTTTCAGGTTCTAGCGTGGAAGATCCAGCCGTTGTGTTTGAGATAATATCCATGCAAGATGGGTATGTTCGTATCAAACATGT
GAGTTCAGGTAAGTATTGGATTCGAGATCCGAATTGGATATGGTGTGAATCAATCGACATTGAAAAAGACAACCCCAACGCTTTGTTTTGGCCTGTGAAAGTTGATAACA
ATATCGTGGCGTTTCGTAACAAAGGCAACAACCGTTTTTGCAAGAGGTTGACGACGGAAGGCAAGACTAATTGCCTTAATGCCGCGGTTGGAACGATTACGGATACCGCA
CGTTTGGAAGTAACAGAGATTGTTGTTGCAAGAAGTGTGGAAGATATTGAGTATCGTGTTAATGATGCAAGAGTTTATGGTAAGAAGATTCTCACTGTGTCAAAAGGGGT
TGCTATTAACAACACGAAAGTTGAAGATAAAGTAAGTTTGAAGTTTAGGTATGAGAAGAAGGTGGAAAGAACATGGAGTTCGTCGGTGTCGTCGACTTTCGGAATTGCTA
CCAAGTTTACATCGAAGATTCCAACGGTTGGGAGTTTGAAGTTTGAGCTTTCGTTGGAGGTCTCGAGTGGAAACACGAGGGAAGAAACGGAGAAGGAAAAATCATTTGTC
GAGACCGGAGAGACGATAACTATACCGGCAATGTCGAAGGTGAAGTTTAGTGCAATGGTAACACAAGCTTGTTGTGATGTTCCTTTTTCCTATACTCGAAGGGACACTTT
GAAAGATGGAAGACAAGTGACACATCGTTTGGAAGATGGTATTTTCACAGGTGTTACAACTTATGATTATAAATTTGAGACTGAAAAAGTACAATCACTTTGATTTTCAT
AATGTGGTTTGAGATTTGTTAATTATATATGTGTTTGTGTGGGAGGAAATGTTTGAGTTGGAGGACTTAGATTGTGATTTTGATGTTTTTATGTGAAGAAAATTATCACA
ATTATCTAAGGGGGCAAAGAATAAAAGGTATGAGGATGAGTGTTTGTGTTATTTTACCTAATTTTGAGTTTGGAAATCTCAATAAATATTTCAAGCTCAAATTTCATGAC
TTTAA
Protein sequenceShow/hide protein sequence
MGPVGLVAVGTSKKSQRAKGLWVQSMVTTYLGFNILRSFEAAITDYKGNAITLKNAKVVVESEEENQIQLRVDLTGDETQKLLFLVSTMESPKTEAHDAAASDPEDVSVS
SSQFEDFSVTNATDTTENKELKIRVEVSGVKTRAIFNNVFDKMVAEAQPIPGFRRVKGGNIQRHIPRDILLEILGPSKVYKQVIKEVINSTVAAYVEKEALKVGKDLRIE
QSYEDLEDQFEPDEKGLGKAGTDILGGVVKGAGKVVETVGDVAEKAPVVGGIGTVVESTGKAIENAGEVTEDFGEKVFEKKENKPKKGLKKTIVDQINEDYRDDDEEGDS
KESENLDENYEDDDDIDEAEKKLMKNEIDDDSGDEEEEEDDEAAAKVLPKNFSLKSIRNNKYLRYISESENSDGLLRYSSKNIVGPYSKFSVRASKTKRGFFHIRCCYNN
KFWVRLSENSNYIAAIANEEEDDTSKWSCTLFELIFVPEKTGHYYIRHVQLNTFLCIAEGDPSPYNDCLVARVEDLTTIDENLVLSAAMDWDSIFILPKYVAFKSNNDQY
LEPSGKYLKFSGSSVEDPAVVFEIISMQDGYVRIKHVSSGKYWIRDPNWIWCESIDIEKDNPNALFWPVKVDNNIVAFRNKGNNRFCKRLTTEGKTNCLNAAVGTITDTA
RLEVTEIVVARSVEDIEYRVNDARVYGKKILTVSKGVAINNTKVEDKVSLKFRYEKKVERTWSSSVSSTFGIATKFTSKIPTVGSLKFELSLEVSSGNTREETEKEKSFV
ETGETITIPAMSKVKFSAMVTQACCDVPFSYTRRDTLKDGRQVTHRLEDGIFTGVTTYDYKFETEKVQSL