; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021894 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021894
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionFUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages;
Genome locationtig00153841:774687..798721
RNA-Seq ExpressionSgr021894
SyntenySgr021894
Gene Ontology termsGO:0005576 - extracellular region (cellular component)
InterPro domainsIPR008998 - Agglutinin domain
IPR036242 - Agglutinin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039924.1 uncharacterized protein E6C27_scaffold122G002040 [Cucumis melo var. makuwa]3.5e-24771.15Show/hide
Query:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD
        +G+GKAGTDILGG V GAGK+VE VG+    A +VGG +G V+EGTGKAIE+VGEATED GE VF K+E  PK S+   DL+   + + D   + ED+K+
Subjt:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD

Query:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT
          +  D       ++DD+IDEAEKKLM+S+   +    ++EE  EE +KVIPK+ SL+  RN +YLRYI+E E +DGLLRYS KN+VGPYSKF++ ASKT
Subjt:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT

Query:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS
        +PGF HIRCCYNNKFWV  SEDS YIAA+ANEEEDD SKWSCTLFEPI VPEK G YY RHVQLNTFLC+AEGDP+PY DCLVARVED+T IDE+LVLS+
Subjt:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS

Query:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA
          DWDSIFILPKYVAFK NND YLEPSGKYLKFSAS+VEDP+V FE+I M DGY+R+KHV+SG Y IRDP+WIWC SI+ ++D+PNTLFWPVKVDN  VA
Subjt:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA

Query:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS
         RNKGNNRFCKRL+T+GKTNCLNAAV TITETARLE  EIV+ARS+++VDYR+NDARVY KKILTV+KG+AINNT+V+DK++ KFRYEKKVERTW SSVS
Subjt:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS

Query:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT
        STFG+AT+FK+KIPTVGS+KFE   EV+ E+TREETEKEKSF ETAETIT+P MSKVKFS ++TQA CDVPFSYT+RDTLKDG+QVTHR EDG+FTGVTT
Subjt:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT

Query:  FDYKFETEKL
        +DYKFETEK+
Subjt:  FDYKFETEKL

KAG6575375.1 hypothetical protein SDJN03_26014, partial [Cucurbita argyrosperma subsp. sororia]1.0e-23868.31Show/hide
Query:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA
        +RG+GKAGTD LGGV+ GAGKLVE VG+    A IVGG VG V+E TGKAIE++GE TED GE VF K EN PK   D + ++  DND+D          
Subjt:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA

Query:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTEP
                    +D++IDEAEKKLM  E     + SD ++  E  +K IPK+FSL+  RN +YLRYI+E E++DGLLR+S KN+VGPYSKFAIRAS+TEP
Subjt:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTEP

Query:  GFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTVD
        G VHIRCCYNNKFWV  SEDS YIAA+ANEEE+D+SKWSCTLFEPI +P+K  +Y RHVQLNTFLCLAE DP+PY DCL ARVED++TID++LVL + +D
Subjt:  GFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTVD

Query:  WDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALRN
        WDSIFILPKYVAFK NN  YLEPSGKYLKFSASNVED SV FE+I   DGY+ +KHVNSG Y +RDPNWIWC S N  +D+PN LFWPVKVD+  VALRN
Subjt:  WDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALRN

Query:  KGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTF
        KGNN FCKRLTTEGKTNCLNAAV TIT+TARLE +EIV+ARSI++V+YR+NDARVY KKILTV+KG+AINNTEVADKV  KFRYEKKVE +W SSVSSTF
Subjt:  KGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTF

Query:  GLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDY
        G++T+  +KIPTVG LKFE   EV++  +    E+EKSF ETAETIT+PPMSKVKFS ++TQA CDVPFSYTQ+DTLKDG+QV+HR EDGIF GVTT+DY
Subjt:  GLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDY

Query:  KFETEKLPL
        KFETEKLPL
Subjt:  KFETEKLPL

XP_004140683.2 uncharacterized protein LOC101212952 [Cucumis sativus]5.1e-24670.92Show/hide
Query:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDG-----DNDDDIKMEDF
        +G+GKAGTDILGG V GAGK+VE VG+    A +VG  VG V+EGTGKAIE+VGEATED GE VF K+ENKP+       + D      + + D + ED+
Subjt:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDG-----DNDDDIKMEDF

Query:  KDSAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRAS
        K+  +  D +     ++DD+IDEAEKKLM+S+   +    ++EE  EE +KVIPK+ SL+  RN +YLRYI+E E +DGLLR+S KN+VGPYSKF++ AS
Subjt:  KDSAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRAS

Query:  KTEPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVL
        KT+PGF HIRCCYNNKFWV  SEDS YIAAVANEEEDD SKWSCTLFEPI VPEK G YY RHVQLNTFLC+AEGDP+PY DCLVARVED+TTIDE+LVL
Subjt:  KTEPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVL

Query:  SSTVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKF
         +  DWDSIFILPKYVAFK NND YLEPSGKYLKFSAS+VEDP+V FE+I M DGY+R+KHV+SG Y IRDP+WIWC SI+  +D+PNTLFWPVKVDN  
Subjt:  SSTVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKF

Query:  VALRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSS
        VA RNKGNNRFCKRLTT+GKTNCLNAAV TITETARLEA EIV+ARS+++V+YR+NDARVY KKILTV+KG+AINNT+V DK++ KFRYEKKVERTW SS
Subjt:  VALRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSS

Query:  VSSTFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGV
        VSSTFG+AT+FK+KIPTVGSLKFE   EV+ E+TREETEKEKSF ET ETIT+P MSKVKFS ++TQA CDVPFSYT+RDTLKDG+QVTHR EDG+FTGV
Subjt:  VSSTFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGV

Query:  TTFDYKFETEKL
        TT+DYKFETEK+
Subjt:  TTFDYKFETEKL

XP_008460195.1 PREDICTED: uncharacterized protein LOC103499080 [Cucumis melo]3.5e-24771.15Show/hide
Query:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD
        +G+GKAGTDILGG V GAGK+VE VG+    A +VGG +G V+EGTGKAIE+VGEATED GE VF K+E  PK S+   DL+   + + D   + ED+K+
Subjt:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD

Query:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT
          +  D       ++DD+IDEAEKKLM+S+   +    ++EE  EE +KVIPK+ SL+  RN +YLRYI+E E +DGLLRYS KN+VGPYSKF++ ASKT
Subjt:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT

Query:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS
        +PGF HIRCCYNNKFWV  SEDS YIAA+ANEEEDD SKWSCTLFEPI VPEK G YY RHVQLNTFLC+AEGDP+PY DCLVARVED+T IDE+LVLS+
Subjt:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS

Query:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA
          DWDSIFILPKYVAFK NND YLEPSGKYLKFSAS+VEDP+V FE+I M DGY+R+KHV+SG Y IRDP+WIWC SI+ ++D+PNTLFWPVKVDN  VA
Subjt:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA

Query:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS
         RNKGNNRFCKRL+T+GKTNCLNAAV TITETARLE  EIV+ARS+++VDYR+NDARVY KKILTV+KG+AINNT+V+DK++ KFRYEKKVERTW SSVS
Subjt:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS

Query:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT
        STFG+AT+FK+KIPTVGS+KFE   EV+ E+TREETEKEKSF ETAETIT+P MSKVKFS ++TQA CDVPFSYT+RDTLKDG+QVTHR EDG+FTGVTT
Subjt:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT

Query:  FDYKFETEKL
        +DYKFETEK+
Subjt:  FDYKFETEKL

XP_022991799.1 uncharacterized protein LOC111488338 [Cucurbita maxima]4.3e-23767.16Show/hide
Query:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA
        +RG+GKAGTD LGGV+ GAGKLVE VG+    A IVGG +G V+E TG+AIE++GE TED GE +F K EN PK   D + D+D DND+D          
Subjt:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA

Query:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTEP
                        IDEAEKKLM  E     + SD ++  E  +K IP++FSL+  RN +YLRYI+E E+SDGLLR+S KN+VGPYSKFAIRASKT+P
Subjt:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTEP

Query:  GFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTVD
        G VHIRCCYNNKFWV  SEDS YIAA+ANEEE+D+SKWSCTLFEPI +P+K  +Y RHVQLNTFLC+AE DP+PY DC+ AR+ED++TID++LVL + +D
Subjt:  GFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTVD

Query:  WDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALRN
        WDSIFILPKYVAFK NN  YLEPSGKYLKFSASNVED S+ FE+I   DGY+ +KHVNSG Y +RDPNWIWC S N  +D+PN LFWPVKVD+  VALRN
Subjt:  WDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALRN

Query:  KGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTF
        KGNN FCKRLTTEGKTNCLNAAV TIT+TARLE +EIV+ARSI++V+YR+NDARVY KKILTV+KG+AINNTEVADKV  KFRYEKKVE +W SSVSSTF
Subjt:  KGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTF

Query:  GLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDY
        G++T+  +KIPTVG LKFE   EV++  +    E+EKSF ET ETIT+PPMSKVKFS ++TQA CDVPFSYTQ+DTLKDG+QV+HR EDGIF GVTT+DY
Subjt:  GLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDY

Query:  KFETEKLPL
        KFETEKLPL
Subjt:  KFETEKLPL

TrEMBL top hitse value%identityAlignment
A0A0A0K983 Uncharacterized protein2.5e-24670.92Show/hide
Query:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDG-----DNDDDIKMEDF
        +G+GKAGTDILGG V GAGK+VE VG+    A +VG  VG V+EGTGKAIE+VGEATED GE VF K+ENKP+       + D      + + D + ED+
Subjt:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDG-----DNDDDIKMEDF

Query:  KDSAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRAS
        K+  +  D +     ++DD+IDEAEKKLM+S+   +    ++EE  EE +KVIPK+ SL+  RN +YLRYI+E E +DGLLR+S KN+VGPYSKF++ AS
Subjt:  KDSAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRAS

Query:  KTEPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVL
        KT+PGF HIRCCYNNKFWV  SEDS YIAAVANEEEDD SKWSCTLFEPI VPEK G YY RHVQLNTFLC+AEGDP+PY DCLVARVED+TTIDE+LVL
Subjt:  KTEPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVL

Query:  SSTVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKF
         +  DWDSIFILPKYVAFK NND YLEPSGKYLKFSAS+VEDP+V FE+I M DGY+R+KHV+SG Y IRDP+WIWC SI+  +D+PNTLFWPVKVDN  
Subjt:  SSTVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKF

Query:  VALRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSS
        VA RNKGNNRFCKRLTT+GKTNCLNAAV TITETARLEA EIV+ARS+++V+YR+NDARVY KKILTV+KG+AINNT+V DK++ KFRYEKKVERTW SS
Subjt:  VALRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSS

Query:  VSSTFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGV
        VSSTFG+AT+FK+KIPTVGSLKFE   EV+ E+TREETEKEKSF ET ETIT+P MSKVKFS ++TQA CDVPFSYT+RDTLKDG+QVTHR EDG+FTGV
Subjt:  VSSTFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGV

Query:  TTFDYKFETEKL
        TT+DYKFETEK+
Subjt:  TTFDYKFETEKL

A0A1S3CBI1 uncharacterized protein LOC1034990801.7e-24771.15Show/hide
Query:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD
        +G+GKAGTDILGG V GAGK+VE VG+    A +VGG +G V+EGTGKAIE+VGEATED GE VF K+E  PK S+   DL+   + + D   + ED+K+
Subjt:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD

Query:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT
          +  D       ++DD+IDEAEKKLM+S+   +    ++EE  EE +KVIPK+ SL+  RN +YLRYI+E E +DGLLRYS KN+VGPYSKF++ ASKT
Subjt:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT

Query:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS
        +PGF HIRCCYNNKFWV  SEDS YIAA+ANEEEDD SKWSCTLFEPI VPEK G YY RHVQLNTFLC+AEGDP+PY DCLVARVED+T IDE+LVLS+
Subjt:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS

Query:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA
          DWDSIFILPKYVAFK NND YLEPSGKYLKFSAS+VEDP+V FE+I M DGY+R+KHV+SG Y IRDP+WIWC SI+ ++D+PNTLFWPVKVDN  VA
Subjt:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA

Query:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS
         RNKGNNRFCKRL+T+GKTNCLNAAV TITETARLE  EIV+ARS+++VDYR+NDARVY KKILTV+KG+AINNT+V+DK++ KFRYEKKVERTW SSVS
Subjt:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS

Query:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT
        STFG+AT+FK+KIPTVGS+KFE   EV+ E+TREETEKEKSF ETAETIT+P MSKVKFS ++TQA CDVPFSYT+RDTLKDG+QVTHR EDG+FTGVTT
Subjt:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT

Query:  FDYKFETEKL
        +DYKFETEK+
Subjt:  FDYKFETEKL

A0A5A7T8Z0 Uncharacterized protein1.7e-24771.15Show/hide
Query:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD
        +G+GKAGTDILGG V GAGK+VE VG+    A +VGG +G V+EGTGKAIE+VGEATED GE VF K+E  PK S+   DL+   + + D   + ED+K+
Subjt:  RGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSK---DLVADEDGDNDDDIKMEDFKD

Query:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT
          +  D       ++DD+IDEAEKKLM+S+   +    ++EE  EE +KVIPK+ SL+  RN +YLRYI+E E +DGLLRYS KN+VGPYSKF++ ASKT
Subjt:  SAEYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKT

Query:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS
        +PGF HIRCCYNNKFWV  SEDS YIAA+ANEEEDD SKWSCTLFEPI VPEK G YY RHVQLNTFLC+AEGDP+PY DCLVARVED+T IDE+LVLS+
Subjt:  EPGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNG-YYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSS

Query:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA
          DWDSIFILPKYVAFK NND YLEPSGKYLKFSAS+VEDP+V FE+I M DGY+R+KHV+SG Y IRDP+WIWC SI+ ++D+PNTLFWPVKVDN  VA
Subjt:  TVDWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVA

Query:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS
         RNKGNNRFCKRL+T+GKTNCLNAAV TITETARLE  EIV+ARS+++VDYR+NDARVY KKILTV+KG+AINNT+V+DK++ KFRYEKKVERTW SSVS
Subjt:  LRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVS

Query:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT
        STFG+AT+FK+KIPTVGS+KFE   EV+ E+TREETEKEKSF ETAETIT+P MSKVKFS ++TQA CDVPFSYT+RDTLKDG+QVTHR EDG+FTGVTT
Subjt:  STFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTT

Query:  FDYKFETEKL
        +DYKFETEK+
Subjt:  FDYKFETEKL

A0A6J1GPP7 uncharacterized protein LOC1114563415.1e-23667.87Show/hide
Query:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA
        +RG+GKAGTD LGGV+ GAGKLVE VG+    A IVGG VG V+E TGKAIE++GE TED GE VF K EN PK   D + ++  DND            
Subjt:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA

Query:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEA-SKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTE
                        IDEAEKKLM  E     + S+D +  +EA +K IPK+FSL+  RN +YLRYI+E E++DGLLR+S KN+VGPYSKFAIRAS+TE
Subjt:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEA-SKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTE

Query:  PGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTV
        PG VHIRCCYNNKFWV  SEDS YIAA+ANEEE+D+SKWSCTLFEPI +P+K  +Y RHVQLNTFLCLAE DP+PY DCL ARVED++TID++LVL + +
Subjt:  PGFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTV

Query:  DWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALR
        DWDSIFILPKYVAFK NN  YLEPSGKYLKFSASNVED SV FE+I   DGY+ +KHVNSG Y +RDPNWIWC+S N  +D+PN LFWPVKVD+  VALR
Subjt:  DWDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALR

Query:  NKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSST
        NKGNN FCKRLTTEGKTNCLNAAV TIT+TARLE +EIV+ARSI++V+YR+NDARVY KKILTV+KG+AINNTEVADKV  KFRYEKKVE +W SSVSST
Subjt:  NKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSST

Query:  FGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFD
        FG++T+  +KIPTVG LKFE   EV++  +    E+EKSF ETAETIT+PPMSKVKFS ++TQA CDVPFSYTQ+DTLKDG+QV+HR EDGIF GVTT+D
Subjt:  FGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFD

Query:  YKFETEKLPL
        YKFETEK PL
Subjt:  YKFETEKLPL

A0A6J1JVU2 uncharacterized protein LOC1114883382.1e-23767.16Show/hide
Query:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA
        +RG+GKAGTD LGGV+ GAGKLVE VG+    A IVGG +G V+E TG+AIE++GE TED GE +F K EN PK   D + D+D DND+D          
Subjt:  MRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSA

Query:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTEP
                        IDEAEKKLM  E     + SD ++  E  +K IP++FSL+  RN +YLRYI+E E+SDGLLR+S KN+VGPYSKFAIRASKT+P
Subjt:  EYKDLLKNCGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTEP

Query:  GFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTVD
        G VHIRCCYNNKFWV  SEDS YIAA+ANEEE+D+SKWSCTLFEPI +P+K  +Y RHVQLNTFLC+AE DP+PY DC+ AR+ED++TID++LVL + +D
Subjt:  GFVHIRCCYNNKFWVCRSEDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTVD

Query:  WDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALRN
        WDSIFILPKYVAFK NN  YLEPSGKYLKFSASNVED S+ FE+I   DGY+ +KHVNSG Y +RDPNWIWC S N  +D+PN LFWPVKVD+  VALRN
Subjt:  WDSIFILPKYVAFKCNNDCYLEPSGKYLKFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALRN

Query:  KGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTF
        KGNN FCKRLTTEGKTNCLNAAV TIT+TARLE +EIV+ARSI++V+YR+NDARVY KKILTV+KG+AINNTEVADKV  KFRYEKKVE +W SSVSSTF
Subjt:  KGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTF

Query:  GLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDY
        G++T+  +KIPTVG LKFE   EV++  +    E+EKSF ET ETIT+PPMSKVKFS ++TQA CDVPFSYTQ+DTLKDG+QV+HR EDGIF GVTT+DY
Subjt:  GLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDY

Query:  KFETEKLPL
        KFETEKLPL
Subjt:  KFETEKLPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G30695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 253 Blast hits to 253 proteins in 72 species: Archae - 0; Bacteria - 138; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink).1.3e-2661.22Show/hide
Query:  SSSQFEDFSVTDATNINENKELKIRVEVLGTKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPKDILLEILGPSRVYKQVIKKVINSTVAAYVEK
        +SS+ E   +T       N E+K+ V+V G KT+ +FN+VF++MVA AQPIPGFRRVKGGKTPNIPKD+LLEILG S+VYKQVIKK+INS +  YV++
Subjt:  SSSQFEDFSVTDATNINENKELKIRVEVLGTKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPKDILLEILGPSRVYKQVIKKVINSTVAAYVEK

AT2G30695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).1.3e-2661.22Show/hide
Query:  SSSQFEDFSVTDATNINENKELKIRVEVLGTKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPKDILLEILGPSRVYKQVIKKVINSTVAAYVEK
        +SS+ E   +T       N E+K+ V+V G KT+ +FN+VF++MVA AQPIPGFRRVKGGKTPNIPKD+LLEILG S+VYKQVIKK+INS +  YV++
Subjt:  SSSQFEDFSVTDATNINENKELKIRVEVLGTKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPKDILLEILGPSRVYKQVIKKVINSTVAAYVEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGGTTTAGCTCGGGTTGTAGGAAGAATAATTGAAGAAGAAGAGGTCCAAGTAGTCGAAGATGATGCCCAAGCCTTGAGCAGAGAAGATGTGAGTGTTTCTTCTTC
TCAGTTTGAAGATTTCTCTGTCACTGATGCTACTAATATCAATGAGAATAAAGAACTAAAGATTCGTGTAGAGGTGTTGGGTACCAAAACTCGAGCAATTTTCAACAATG
TTTTTGATGAAATGGTTGCTGAAGCACAGCCTATTCCAGGTTTTCGAAGAGTGAAAGGAGGAAAGACGCCGAACATACCCAAAGACATTCTATTAGAGATACTTGGACCA
TCTAGGGTTTACAAGCAAGTTATCAAGAAAGTTATCAACTCTACTGTTGCTGCATATGTGGAAAAGCACGTCAACTCTGGCAATTACTGCATTCGTGACCCCGATTGGAT
CCATTGTAAATCAATCAACACCGAAAAAGACGACCCCAACACTTTGTTTTGGCCTGTCAAAGTTGATAACAAATTTGTAGCTCTTCGCAATAAAGGCAACAATCGTTTCT
GCAAGAGGCTCACAACAGAAGGAAAGACAAATTGTCTTAATGCCGCCGTTTCAACCATTACCGAGACCGCACGTCTGGAAGCGATAGAGATTGTCATTGCTAGGAGCATT
GATAATGTAGACTATCGCCTTAACGATGCTAGGGTTTATAATAAGAAGATCCTCACCGTGGCTAAAGGAATTGCCATTAACAACACAGAAGTTGCAGACAAAGTAACCTT
CAAATTCCGATACGAAAAGAAGGTGGAAAGAACTTGGGGTTCCTCTGTCTCGTCGACTTTTGGCATCGCCACAAGATTTAAATCGAAGATTCCACACGAAGACACACGAG
AAGAAACTGAGAAGGAGAAATCGTTCGAAGAAACTGCAGAAACCATTACTGTACCTCCAATGTCGAAAGTGAAGTTCAGTGGCATAATAACGCAAGCATCCTGTGATGTT
CCTTTCTCCTACACTCAACGTGACACTCTGAAAGATGGGAAACAAGTGACTCATCGCTTTGAAGATGGGATTTTCACCGGCGTTACCACCTTCGACTACAAATTTGAGAC
AGAGAAACTACCACTACTGATTGGAGATAATAATGGAGTTATCACTTCTTGCCCCCACTCTTATCTTCAACGTGCAAAAAATCTCAAAGGTGAGCTGAAGCTGCTACCAC
ACACTTACCCTATAAAAAAGCTTCCTCCATGGAGCTCCAATGGCAGAGACACCCGCAGAGCATTTACAAAGAGAGAGAGAGAGAAGAAGAAGAAGAAGGAGAAGATGGCT
TTCTTCATGAGGGGAATAGGGAAAGCGGGGACGGACATTTTGGGCGGCGTAGTGCATGGAGCAGGGAAACTCGTGGAAGACGTCGGGGAGGCGACTGGGTCGGCGGGCAT
TGTCGGCGGCGTGGTCGGCGGCGTCATGGAGGGAACTGGAAAGGCGATCGAATCGGTGGGGGAGGCGACGGAGGATCTCGGCGAAGCAGTGTTTGGGAAGAAAGAAAACA
AGCCGAAAACTTCGAAAGATCTTGTGGCCGACGAAGATGGAGATAATGATGATGACATTAAAATGGAAGACTTTAAAGACAGCGCTGAATACAAAGATTTATTGAAGAAC
TGCGGAGGAAAAGAAGATGATGAGATAGATGAAGCAGAAAAGAAGTTGATGAGGTCTGAAGAGGGAACAACTGTAGAACAATCTGATGACGAAGAAGTTGGAGAAGAGGC
CTCAAAGGTAATCCCAAAGCATTTCTCCCTCAGATGCAACCGCAACAAGAGATACCTTCGCTACATAAATGAAGATGAGGAATCCGATGGACTCCTCCGGTACTCCAGTA
AGAACGTTGTCGGTCCGTATTCAAAATTTGCGATTCGCGCTTCCAAAACCGAGCCGGGTTTCGTCCATATAAGGTGCTGTTACAACAACAAGTTCTGGGTTTGTCGGTCG
GAGGACTCTAAGTATATTGCTGCTGTTGCCAACGAAGAAGAAGATGACAAATCAAAATGGTCGTGCACGTTGTTCGAACCCATTTTAGTGCCTGAGAAAAATGGATACTA
TTTTCGTCACGTCCAACTCAACACTTTTCTCTGTTTAGCTGAAGGGGATCCTGCACCATACAAAGACTGTTTAGTAGCGAGAGTTGAAGATCTAACCACCATTGACGAGG
ACCTTGTTCTCTCATCTACCGTCGACTGGGACTCCATATTTATACTGCCGAAATATGTAGCCTTCAAGTGTAACAACGACTGTTATCTCGAACCGTCTGGCAAATATTTG
AAATTTTCCGCTTCTAATGTCGAAGATCCATCTGTTGCCTTCGAGGTCATACCCATGGCGGACGGGTATATTCGCATGAAGCATGTCAACTCTGGCAATTACTGCATTCG
TGACCCCAACTGGATATGGTGTCAATCAATCAACACCGAAAAAGACGACCCCAACACTTTGTTTTGGCCTGTCAAAGTTGATAACAAATTTGTAGCTCTTCGCAATAAAG
GCAACAATCGTTTCTGCAAGAGGCTCACAACAGAAGGAAAGACAAATTGTCTTAATGCCGCCGTTTCAACCATTACCGAGACCGCACGTCTGGAAGCGATAGAGATTGTC
ATTGCTAGGAGCATTGATAATGTAGACTATCGCCTTAACGATGCTAGGGTTTATAATAAGAAGATCCTCACCGTGGCTAAAGGAATTGCCATTAACAACACAGAAGTTGC
AGACAAAGTAACCTTCAAATTCCGATACGAAAAGAAGGTGGAAAGAACTTGGGGTTCCTCCGTCTCGTCGACTTTTGGCCTCGCCACAAGATTTAAATCGAAGATTCCAA
CAGTTGGAAGTTTAAAGTTTGAATTTGAATTCGAGGTTACCAGAGAAGACACACGAGAAGAAACTGAGAAGGAGAAATCGTTCGAAGAAACTGCAGAAACCATTACTGTA
CCTCCAATGTCGAAAGTGAAGTTCAGTGGCATAATAACGCAAGCATCCTGTGATGTTCCTTTCTCCTACACTCAACGTGACACTCTGAAAGATGGGAAACAAGTGACTCA
TCGCTTTGAAGATGGGATTTTCACCGGCGTTACCACCTTTGACTACAAATTTGAGACGGAGAAACTACCATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGGTTTAGCTCGGGTTGTAGGAAGAATAATTGAAGAAGAAGAGGTCCAAGTAGTCGAAGATGATGCCCAAGCCTTGAGCAGAGAAGATGTGAGTGTTTCTTCTTC
TCAGTTTGAAGATTTCTCTGTCACTGATGCTACTAATATCAATGAGAATAAAGAACTAAAGATTCGTGTAGAGGTGTTGGGTACCAAAACTCGAGCAATTTTCAACAATG
TTTTTGATGAAATGGTTGCTGAAGCACAGCCTATTCCAGGTTTTCGAAGAGTGAAAGGAGGAAAGACGCCGAACATACCCAAAGACATTCTATTAGAGATACTTGGACCA
TCTAGGGTTTACAAGCAAGTTATCAAGAAAGTTATCAACTCTACTGTTGCTGCATATGTGGAAAAGCACGTCAACTCTGGCAATTACTGCATTCGTGACCCCGATTGGAT
CCATTGTAAATCAATCAACACCGAAAAAGACGACCCCAACACTTTGTTTTGGCCTGTCAAAGTTGATAACAAATTTGTAGCTCTTCGCAATAAAGGCAACAATCGTTTCT
GCAAGAGGCTCACAACAGAAGGAAAGACAAATTGTCTTAATGCCGCCGTTTCAACCATTACCGAGACCGCACGTCTGGAAGCGATAGAGATTGTCATTGCTAGGAGCATT
GATAATGTAGACTATCGCCTTAACGATGCTAGGGTTTATAATAAGAAGATCCTCACCGTGGCTAAAGGAATTGCCATTAACAACACAGAAGTTGCAGACAAAGTAACCTT
CAAATTCCGATACGAAAAGAAGGTGGAAAGAACTTGGGGTTCCTCTGTCTCGTCGACTTTTGGCATCGCCACAAGATTTAAATCGAAGATTCCACACGAAGACACACGAG
AAGAAACTGAGAAGGAGAAATCGTTCGAAGAAACTGCAGAAACCATTACTGTACCTCCAATGTCGAAAGTGAAGTTCAGTGGCATAATAACGCAAGCATCCTGTGATGTT
CCTTTCTCCTACACTCAACGTGACACTCTGAAAGATGGGAAACAAGTGACTCATCGCTTTGAAGATGGGATTTTCACCGGCGTTACCACCTTCGACTACAAATTTGAGAC
AGAGAAACTACCACTACTGATTGGAGATAATAATGGAGTTATCACTTCTTGCCCCCACTCTTATCTTCAACGTGCAAAAAATCTCAAAGGTGAGCTGAAGCTGCTACCAC
ACACTTACCCTATAAAAAAGCTTCCTCCATGGAGCTCCAATGGCAGAGACACCCGCAGAGCATTTACAAAGAGAGAGAGAGAGAAGAAGAAGAAGAAGGAGAAGATGGCT
TTCTTCATGAGGGGAATAGGGAAAGCGGGGACGGACATTTTGGGCGGCGTAGTGCATGGAGCAGGGAAACTCGTGGAAGACGTCGGGGAGGCGACTGGGTCGGCGGGCAT
TGTCGGCGGCGTGGTCGGCGGCGTCATGGAGGGAACTGGAAAGGCGATCGAATCGGTGGGGGAGGCGACGGAGGATCTCGGCGAAGCAGTGTTTGGGAAGAAAGAAAACA
AGCCGAAAACTTCGAAAGATCTTGTGGCCGACGAAGATGGAGATAATGATGATGACATTAAAATGGAAGACTTTAAAGACAGCGCTGAATACAAAGATTTATTGAAGAAC
TGCGGAGGAAAAGAAGATGATGAGATAGATGAAGCAGAAAAGAAGTTGATGAGGTCTGAAGAGGGAACAACTGTAGAACAATCTGATGACGAAGAAGTTGGAGAAGAGGC
CTCAAAGGTAATCCCAAAGCATTTCTCCCTCAGATGCAACCGCAACAAGAGATACCTTCGCTACATAAATGAAGATGAGGAATCCGATGGACTCCTCCGGTACTCCAGTA
AGAACGTTGTCGGTCCGTATTCAAAATTTGCGATTCGCGCTTCCAAAACCGAGCCGGGTTTCGTCCATATAAGGTGCTGTTACAACAACAAGTTCTGGGTTTGTCGGTCG
GAGGACTCTAAGTATATTGCTGCTGTTGCCAACGAAGAAGAAGATGACAAATCAAAATGGTCGTGCACGTTGTTCGAACCCATTTTAGTGCCTGAGAAAAATGGATACTA
TTTTCGTCACGTCCAACTCAACACTTTTCTCTGTTTAGCTGAAGGGGATCCTGCACCATACAAAGACTGTTTAGTAGCGAGAGTTGAAGATCTAACCACCATTGACGAGG
ACCTTGTTCTCTCATCTACCGTCGACTGGGACTCCATATTTATACTGCCGAAATATGTAGCCTTCAAGTGTAACAACGACTGTTATCTCGAACCGTCTGGCAAATATTTG
AAATTTTCCGCTTCTAATGTCGAAGATCCATCTGTTGCCTTCGAGGTCATACCCATGGCGGACGGGTATATTCGCATGAAGCATGTCAACTCTGGCAATTACTGCATTCG
TGACCCCAACTGGATATGGTGTCAATCAATCAACACCGAAAAAGACGACCCCAACACTTTGTTTTGGCCTGTCAAAGTTGATAACAAATTTGTAGCTCTTCGCAATAAAG
GCAACAATCGTTTCTGCAAGAGGCTCACAACAGAAGGAAAGACAAATTGTCTTAATGCCGCCGTTTCAACCATTACCGAGACCGCACGTCTGGAAGCGATAGAGATTGTC
ATTGCTAGGAGCATTGATAATGTAGACTATCGCCTTAACGATGCTAGGGTTTATAATAAGAAGATCCTCACCGTGGCTAAAGGAATTGCCATTAACAACACAGAAGTTGC
AGACAAAGTAACCTTCAAATTCCGATACGAAAAGAAGGTGGAAAGAACTTGGGGTTCCTCCGTCTCGTCGACTTTTGGCCTCGCCACAAGATTTAAATCGAAGATTCCAA
CAGTTGGAAGTTTAAAGTTTGAATTTGAATTCGAGGTTACCAGAGAAGACACACGAGAAGAAACTGAGAAGGAGAAATCGTTCGAAGAAACTGCAGAAACCATTACTGTA
CCTCCAATGTCGAAAGTGAAGTTCAGTGGCATAATAACGCAAGCATCCTGTGATGTTCCTTTCTCCTACACTCAACGTGACACTCTGAAAGATGGGAAACAAGTGACTCA
TCGCTTTGAAGATGGGATTTTCACCGGCGTTACCACCTTTGACTACAAATTTGAGACGGAGAAACTACCATTGTAA
Protein sequenceShow/hide protein sequence
MLGLARVVGRIIEEEEVQVVEDDAQALSREDVSVSSSQFEDFSVTDATNINENKELKIRVEVLGTKTRAIFNNVFDEMVAEAQPIPGFRRVKGGKTPNIPKDILLEILGP
SRVYKQVIKKVINSTVAAYVEKHVNSGNYCIRDPDWIHCKSINTEKDDPNTLFWPVKVDNKFVALRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIVIARSI
DNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTFGIATRFKSKIPHEDTREETEKEKSFEETAETITVPPMSKVKFSGIITQASCDV
PFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDYKFETEKLPLLIGDNNGVITSCPHSYLQRAKNLKGELKLLPHTYPIKKLPPWSSNGRDTRRAFTKREREKKKKKEKMA
FFMRGIGKAGTDILGGVVHGAGKLVEDVGEATGSAGIVGGVVGGVMEGTGKAIESVGEATEDLGEAVFGKKENKPKTSKDLVADEDGDNDDDIKMEDFKDSAEYKDLLKN
CGGKEDDEIDEAEKKLMRSEEGTTVEQSDDEEVGEEASKVIPKHFSLRCNRNKRYLRYINEDEESDGLLRYSSKNVVGPYSKFAIRASKTEPGFVHIRCCYNNKFWVCRS
EDSKYIAAVANEEEDDKSKWSCTLFEPILVPEKNGYYFRHVQLNTFLCLAEGDPAPYKDCLVARVEDLTTIDEDLVLSSTVDWDSIFILPKYVAFKCNNDCYLEPSGKYL
KFSASNVEDPSVAFEVIPMADGYIRMKHVNSGNYCIRDPNWIWCQSINTEKDDPNTLFWPVKVDNKFVALRNKGNNRFCKRLTTEGKTNCLNAAVSTITETARLEAIEIV
IARSIDNVDYRLNDARVYNKKILTVAKGIAINNTEVADKVTFKFRYEKKVERTWGSSVSSTFGLATRFKSKIPTVGSLKFEFEFEVTREDTREETEKEKSFEETAETITV
PPMSKVKFSGIITQASCDVPFSYTQRDTLKDGKQVTHRFEDGIFTGVTTFDYKFETEKLPL