CuGenDBv2

Sgr030369 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr030369
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein RETICULATA-RELATED 4, chloroplastic-like
Genome location: tig00153640:2890964..2902810
RNA-Seq Expression: Sgr030369
Synteny: Sgr030369
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR021825 - Protein RETICULATA-related
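The genome location field follows a `contig:start..end` pattern. As a minimal sketch, the Python snippet below splits such a string into its parts and derives the span length. The helper names (`GenomeLocation`, `parse_location`) are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Contig name, a colon, then two integers separated by '..'.
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'tig00153640:2890964..2902810'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomeLocation(match["contig"], start, end)


loc = parse_location("tig00153640:2890964..2902810")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 11,847 bp on contig tig00153640.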


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6581562.1 Protein RETICULATA-RELATED 4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]; e value 8.5e-102; identity 73.85%
Query:  LPVEARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFK
        + V   S   + P    GG  GGGGGG G  E  EEEDRE+NRAEAI VLAEAGRS ESLPKDLAAAI AGRVPGVIVERF+E+EKSA  RWL+QFGGFK
Subjt:  LPVEARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFK

Query:  ERLLADDLFLAKVAMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYS
        ERLLADDLFLAK+AMECGVGIFTK                       IMA++ADFMLVWLPAPTVSLKPPLAISAG L K FYGCPENAFQVAFAGTSYS
Subjt:  ERLLADDLFLAKVAMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYS

Query:  FLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        FLQRVGAVVRNGAKLFAVG+ AS+VGTG+TN+LIN+RKL DKSYA+EAED+PVL TSI YG YM+VSSNLRYQ+LAGVIEQR+
Subjt:  FLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

TYJ99870.1 protein RETICULATA-RELATED 4 [Cucumis melo var. makuwa]; e value 1.7e-105; identity 70.03%
Query:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED
        A +  S+LS PP FLS        SP +P    F+ +L     S T S P++ L    S   P + L +     S       GG   GGGG G GD GE+
Subjt:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED

Query:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTKIMALIADFM
        +EEEDRE+NRAEAI VLAEAGRS+ESLPKDLA AI AGRVP VIVERF+ELEKSAA RWL+QFGGFKER+LADDLFLAKVAMECGVGIFTKIMA++ADFM
Subjt:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTKIMALIADFM

Query:  LVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPT
        LVWLPAPTVSLKP LAISAGPL KFFYGCPENAFQVA AGTS+SFLQRVGAVVRNGAKLFAVG+ AS+VGTG+TN LINIRK  DKSYA+EAED+PVL T
Subjt:  LVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPT

Query:  SIAYGVYMAVSSNLRYQVLAGVIEQRI
        SI YGVYM+VSSNLRYQ++AGVIEQR+
Subjt:  SIAYGVYMAVSSNLRYQVLAGVIEQRI

XP_008461823.1 PREDICTED: protein RETICULATA-RELATED 4, chloroplastic-like [Cucumis melo]; e value 1.5e-101; identity 65.43%
Query:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED
        A +  S+LS PP FLS        SP +P    F+ +L     S T S P++ L    S   P + L +     S       GG   GGGG G GD GE+
Subjt:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED

Query:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK---------
        +EEEDRE+NRAEAI VLAEAGRS+ESLPKDLA AI AGRVP VIVERF+ELEKSAA RWL+QFGGFKER+LADDLFLAKVAMECGVGIFTK         
Subjt:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK---------

Query:  --------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNAL
                      IMA++ADFMLVWLPAPTVSLKP LAISAGPL KFFYGCPENAFQVA AGTS+SFLQRVGAVVRNGAKLFAVG+ AS+VGTG+TN L
Subjt:  --------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNAL

Query:  INIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        INIRK  DKSYA+EAED+PVL TSI YGVYM+VSSNLRYQ++AGVIEQR+
Subjt:  INIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

XP_018817211.1 protein RETICULATA-RELATED 4, chloroplastic-like isoform X2 [Juglans regia]; e value 1.6e-103; identity 81.56%
Query:  GGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAME
        GGG  GGGGGGD DG DD+ EDRE+NR EA++ L EAGRSVES+ KDL AAIEAGRVPG IV R+ ELE+SA +RWLLQFGGFKERLLADDLFLAKVAME
Subjt:  GGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAME

Query:  CGVGIFTKIMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKL
        CGVGIFTK+MA+IADFMLVWLPAPTVSL+PPLA+SAGP+AKFFYGCP+NAFQVA AGTSYS LQR+GA++RNGAKLFAVGT ASLVGTG+TN LIN RK 
Subjt:  CGVGIFTKIMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKL

Query:  IDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        +DKS+A EAED+PVL TS+AYGVYMAVSSNLRYQVLAG+IEQRI
Subjt:  IDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

XP_022152568.1 protein RETICULATA-RELATED 4, chloroplastic-like [Momordica charantia]; e value 6.5e-110; identity 74.92%
Query:  LTSSSPAVPLPLFASAASPPQPLPVE--ARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGV
        + +++P++ L   ++A SPP+   V   A +++   PP  GGG  GGGGGG GDG    EEDRE+N+AEAIVVLAE GRSVESLPKDLAAAIE+GRVP V
Subjt:  LTSSSPAVPLPLFASAASPPQPLPVE--ARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGV

Query:  IVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAG
        IVERF+ELEKSA WRWLLQFGGFKERLLADDLF+AKVAMECGVGIFTK                       IMA+IADFMLVWLPAPTVSLKPPLAISAG
Subjt:  IVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAG

Query:  PLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLA
        PLAKFFYGCPENAFQVA AGTSYSFLQRVGAVVRNGAKLF VGT+ASLVGTGVTNALIN+RK++DKSYAVEAEDLPV+ TSIAYGVYMAVSSNLRYQVLA
Subjt:  PLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLA

Query:  GVIEQRI
        GVIEQRI
Subjt:  GVIEQRI

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A1S3CG26 protein RETICULATA-RELATED 4, chloroplastic-like; e value 7.0e-102; identity 65.43%
Query:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED
        A +  S+LS PP FLS        SP +P    F+ +L     S T S P++ L    S   P + L +     S       GG   GGGG G GD GE+
Subjt:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED

Query:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK---------
        +EEEDRE+NRAEAI VLAEAGRS+ESLPKDLA AI AGRVP VIVERF+ELEKSAA RWL+QFGGFKER+LADDLFLAKVAMECGVGIFTK         
Subjt:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK---------

Query:  --------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNAL
                      IMA++ADFMLVWLPAPTVSLKP LAISAGPL KFFYGCPENAFQVA AGTS+SFLQRVGAVVRNGAKLFAVG+ AS+VGTG+TN L
Subjt:  --------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNAL

Query:  INIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        INIRK  DKSYA+EAED+PVL TSI YGVYM+VSSNLRYQ++AGVIEQR+
Subjt:  INIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

A0A2I4ECU0 protein RETICULATA-RELATED 4, chloroplastic-like isoform X2; e value 7.5e-104; identity 81.56%
Query:  GGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAME
        GGG  GGGGGGD DG DD+ EDRE+NR EA++ L EAGRSVES+ KDL AAIEAGRVPG IV R+ ELE+SA +RWLLQFGGFKERLLADDLFLAKVAME
Subjt:  GGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAME

Query:  CGVGIFTKIMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKL
        CGVGIFTK+MA+IADFMLVWLPAPTVSL+PPLA+SAGP+AKFFYGCP+NAFQVA AGTSYS LQR+GA++RNGAKLFAVGT ASLVGTG+TN LIN RK 
Subjt:  CGVGIFTKIMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKL

Query:  IDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        +DKS+A EAED+PVL TS+AYGVYMAVSSNLRYQVLAG+IEQRI
Subjt:  IDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

A0A5A7TXM9 Protein RETICULATA-RELATED 4; e value 7.0e-102; identity 65.43%
Query:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED
        A +  S+LS PP FLS        SP +P    F+ +L     S T S P++ L    S   P + L +     S       GG   GGGG G GD GE+
Subjt:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED

Query:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK---------
        +EEEDRE+NRAEAI VLAEAGRS+ESLPKDLA AI AGRVP VIVERF+ELEKSAA RWL+QFGGFKER+LADDLFLAKVAMECGVGIFTK         
Subjt:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK---------

Query:  --------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNAL
                      IMA++ADFMLVWLPAPTVSLKP LAISAGPL KFFYGCPENAFQVA AGTS+SFLQRVGAVVRNGAKLFAVG+ AS+VGTG+TN L
Subjt:  --------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNAL

Query:  INIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        INIRK  DKSYA+EAED+PVL TSI YGVYM+VSSNLRYQ++AGVIEQR+
Subjt:  INIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

A0A5D3BJ78 Protein RETICULATA-RELATED 4; e value 8.0e-106; identity 70.03%
Query:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED
        A +  S+LS PP FLS        SP +P    F+ +L     S T S P++ L    S   P + L +     S       GG   GGGG G GD GE+
Subjt:  AEKRLSDLS-PPIFLSLQQWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGD-GED

Query:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTKIMALIADFM
        +EEEDRE+NRAEAI VLAEAGRS+ESLPKDLA AI AGRVP VIVERF+ELEKSAA RWL+QFGGFKER+LADDLFLAKVAMECGVGIFTKIMA++ADFM
Subjt:  DEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTKIMALIADFM

Query:  LVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPT
        LVWLPAPTVSLKP LAISAGPL KFFYGCPENAFQVA AGTS+SFLQRVGAVVRNGAKLFAVG+ AS+VGTG+TN LINIRK  DKSYA+EAED+PVL T
Subjt:  LVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPT

Query:  SIAYGVYMAVSSNLRYQVLAGVIEQRI
        SI YGVYM+VSSNLRYQ++AGVIEQR+
Subjt:  SIAYGVYMAVSSNLRYQVLAGVIEQRI

A0A6J1DF73 protein RETICULATA-RELATED 4, chloroplastic-like; e value 3.2e-110; identity 74.92%
Query:  LTSSSPAVPLPLFASAASPPQPLPVE--ARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGV
        + +++P++ L   ++A SPP+   V   A +++   PP  GGG  GGGGGG GDG    EEDRE+N+AEAIVVLAE GRSVESLPKDLAAAIE+GRVP V
Subjt:  LTSSSPAVPLPLFASAASPPQPLPVE--ARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGV

Query:  IVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAG
        IVERF+ELEKSA WRWLLQFGGFKERLLADDLF+AKVAMECGVGIFTK                       IMA+IADFMLVWLPAPTVSLKPPLAISAG
Subjt:  IVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAG

Query:  PLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLA
        PLAKFFYGCPENAFQVA AGTSYSFLQRVGAVVRNGAKLF VGT+ASLVGTGVTNALIN+RK++DKSYAVEAEDLPV+ TSIAYGVYMAVSSNLRYQVLA
Subjt:  PLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLA

Query:  GVIEQRI
        GVIEQRI
Subjt:  GVIEQRI

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q8RWG3 Protein RETICULATA-RELATED 6, chloroplastic; e value 4.2e-27; identity 33.09%
Query:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFTKIMA-------------
        +RAE   V+  AGR  ++LP D+   ++ G V   +++   +LE+      L Q F GF+ERLLAD  FL ++A+E  + I T ++A             
Subjt:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFTKIMA-------------

Query:  ----------LIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN
                   + DF  VWLPAPT+S       + GP     L       P+NAFQ + AG  ++   R+ +V+  G KL  VG  +S    G +NAL  
Subjt:  ----------LIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN

Query:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRIWSLCCTNKSWHSVQSALSFELATLS
         RK+I     V  + +  P+L T++ YG ++  S+NLRYQ++AG+IE R   L     S   + +A+SF + TL+
Subjt:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRIWSLCCTNKSWHSVQSALSFELATLS

Q94CJ5 Protein RETICULATA-RELATED 4, chloroplastic; e value 9.5e-88; identity 62.96%
Query:  GGGRRGGGGGGDG---DGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKV
        GGG  GGGGGGDG   DG+   +EDR++NR EA+++L E+G  +ESLPKDLAAAIEAGR+PG ++ RF+EL+KSA  RWL+QFGGF+ERLLADDLF+AK+
Subjt:  GGGRRGGGGGGDG---DGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKV

Query:  AMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGA
        AMECGVGIFTK                        MA+IADFMLV+LPAPTVSL+PPLA++AG ++KFF+ CP+NAFQVA +GTSY+ LQR+GA+ RNGA
Subjt:  AMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGA

Query:  KLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        KLFAVGT++SLVGT +TNA I  RK +D++   E E +P++ TS+AYGVYMAVSSNLRYQ++AGVIEQR+
Subjt:  KLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

Q9C9Z2 Protein RETICULATA-RELATED 3, chloroplastic; e value 7.7e-13; identity 28.46%
Query:  EKSAAWRWL-LQFGGFKERLLADDLFLAKVAMECGVGI-----------------------FTKIMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFY
        E+S+ W  + L   G++ R+ AD  F  KV ME  VG+                        T ++  I +F+L+++ APT +       S+  L   F 
Subjt:  EKSAAWRWL-LQFGGFKERLLADDLFLAKVAMECGVGI-----------------------FTKIMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFY

Query:  GCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAG------
         CP +     F   S++ + R G +V  G    +VG +A LVGT ++N LI +RK +D S+    +  P +  S+ +  +M VS+N RYQ L G      
Subjt:  GCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAG------

Query:  ------VIEQRIWSLCCTNKSWHSVQSALSFELATLSWVGGLCKMDRGSEIERVEREKDN
              V +  +  L C N    +V   +SF L  L+ + G   ++  +EI   E+EKD+
Subjt:  ------VIEQRIWSLCCTNKSWHSVQSALSFELATLSWVGGLCKMDRGSEIERVEREKDN

Q9C9Z3 Protein RETICULATA-RELATED 2, chloroplastic; e value 2.6e-13; identity 32.81%
Query:  ELEKSAAWRWL-LQFGGFKERLLADDLFLAKVAMECGVGIFTKIMALIA----------DFMLVWLPAPTV------SLKPPLAISAGP---LAKFFYGC
        E E+S+ W  L L   G++ R+ AD  F  KV ME  VG+   ++  +A          DF+   L   ++       L  P AIS G    L   F  C
Subjt:  ELEKSAAWRWL-LQFGGFKERLLADDLFLAKVAMECGVGIFTKIMALIA----------DFMLVWLPAPTV------SLKPPLAISAGP---LAKFFYGC

Query:  PENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAG
        P +     F   +++ + R G +V  G     VG +A LVGT ++N LI +RK ID S+    +  P L  S+ +  +M VS+N+RYQ L G
Subjt:  PENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAG

Q9SIY5 Protein RETICULATA-RELATED 5, chloroplastic; e value 2.0e-29; identity 35.6%
Query:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFT-----------------
        +RAE   V+  AGR  ++LP+D+   ++ G V   I++ F +LE+      L Q F GF+ERLLAD  FL ++A+E  + I T                 
Subjt:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFT-----------------

Query:  ------KIMALIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN
               + A + DF  VWLPAPT+S       + GP     L       P+NAFQ +  G  ++   R+ +V+  G KL  VG  +S    G +NAL  
Subjt:  ------KIMALIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN

Query:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        IRK I     V  +A+  P+L T++ YG Y+  SSN+RYQ++AG+IE RI
Subjt:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G37860.3 Protein of unknown function (DUF3411); e value 4.6e-13; identity 26.35%
Query:  SPISPQRLRFSAVLLLR---RNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQN---RAEAIVVLAE
        SPI PQ   F+A   ++    NS+        L          + L ++          +  GG  GGG GG+GDGE ++ E++E     + E ++   E
Subjt:  SPISPQRLRFSAVLLLR---RNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQN---RAEAIVVLAE

Query:  AGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRW--LLQFGGFKERLLADDLFLAKVAMEC---------------GVGIFTKIMALIADFM--
        A  +  +LP D+  A +   +  V++ R+++L+ SA      +  +   + R+LAD  FL K+  E                G   + +    +AD +  
Subjt:  AGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRW--LLQFGGFKERLLADDLFLAKVAMEC---------------GVGIFTKIMALIADFM--

Query:  ------LVWLPAPTVSLKPPLAISAGPLAKFFY---GCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVE
              LV + AP V    P A S G L +  +     P + F+    G  +S  QR+      G    AVG    +VG G+ N ++  ++ I+KS    
Subjt:  ------LVWLPAPTVSLKPPLAISAGPLAKFFY---GCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVE

Query:  AEDLPVLP---TSIAYGVYMAVSSNLRYQVLAGV
         E++PV P   ++  +GV+++VSSN RYQ++ G+
Subjt:  AEDLPVLP---TSIAYGVYMAVSSNLRYQVLAGV

AT2G40400.1 Protein of unknown function (DUF399 and DUF3411); e value 1.4e-30; identity 35.6%
Query:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFT-----------------
        +RAE   V+  AGR  ++LP+D+   ++ G V   I++ F +LE+      L Q F GF+ERLLAD  FL ++A+E  + I T                 
Subjt:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFT-----------------

Query:  ------KIMALIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN
               + A + DF  VWLPAPT+S       + GP     L       P+NAFQ +  G  ++   R+ +V+  G KL  VG  +S    G +NAL  
Subjt:  ------KIMALIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN

Query:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        IRK I     V  +A+  P+L T++ YG Y+  SSN+RYQ++AG+IE RI
Subjt:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

AT2G40400.2 Protein of unknown function (DUF399 and DUF3411); e value 1.4e-30; identity 35.6%
Query:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFT-----------------
        +RAE   V+  AGR  ++LP+D+   ++ G V   I++ F +LE+      L Q F GF+ERLLAD  FL ++A+E  + I T                 
Subjt:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFT-----------------

Query:  ------KIMALIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN
               + A + DF  VWLPAPT+S       + GP     L       P+NAFQ +  G  ++   R+ +V+  G KL  VG  +S    G +NAL  
Subjt:  ------KIMALIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN

Query:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        IRK I     V  +A+  P+L T++ YG Y+  SSN+RYQ++AG+IE RI
Subjt:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI

AT3G56140.1 Protein of unknown function (DUF399 and DUF3411); e value 3.0e-28; identity 33.09%
Query:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFTKIMA-------------
        +RAE   V+  AGR  ++LP D+   ++ G V   +++   +LE+      L Q F GF+ERLLAD  FL ++A+E  + I T ++A             
Subjt:  NRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQ-FGGFKERLLADDLFLAKVAMECGVGIFTKIMA-------------

Query:  ----------LIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN
                   + DF  VWLPAPT+S       + GP     L       P+NAFQ + AG  ++   R+ +V+  G KL  VG  +S    G +NAL  
Subjt:  ----------LIADFMLVWLPAPTVSLKPPLAISAGP-----LAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALIN

Query:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRIWSLCCTNKSWHSVQSALSFELATLS
         RK+I     V  + +  P+L T++ YG ++  S+NLRYQ++AG+IE R   L     S   + +A+SF + TL+
Subjt:  IRKLIDKSYAV--EAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRIWSLCCTNKSWHSVQSALSFELATLS

AT5G12470.1 Protein of unknown function (DUF3411); e value 6.7e-89; identity 62.96%
Query:  GGGRRGGGGGGDG---DGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKV
        GGG  GGGGGGDG   DG+   +EDR++NR EA+++L E+G  +ESLPKDLAAAIEAGR+PG ++ RF+EL+KSA  RWL+QFGGF+ERLLADDLF+AK+
Subjt:  GGGRRGGGGGGDG---DGEDDEEEDREQNRAEAIVVLAEAGRSVESLPKDLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKV

Query:  AMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGA
        AMECGVGIFTK                        MA+IADFMLV+LPAPTVSL+PPLA++AG ++KFF+ CP+NAFQVA +GTSY+ LQR+GA+ RNGA
Subjt:  AMECGVGIFTK-----------------------IMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFAGTSYSFLQRVGAVVRNGA

Query:  KLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI
        KLFAVGT++SLVGT +TNA I  RK +D++   E E +P++ TS+AYGVYMAVSSNLRYQ++AGVIEQR+
Subjt:  KLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRI


Sequences
CDS sequence
ATGGGAAAAACATACTATGGCAGCAAGCAAGTTTATGACAAGGTCTTCTTCGCTGAAGTCAGCTTTTCCAAACGCCGGAAATGGGCTTCAGCTGTTTCTACAAACTTTGT
GCTAGTCATTTCCTTGAGCAGAGATATTGCTGTCTTGAGGGGCAACGCGGCTTTCCCTTTCTTGGGTTTAGGCGCAGCGACAGTAGCAGCCGCCGCATCGAGCCCCTCCT
CTCCTTTTGCCTCCGCCTCGGCGACGTCCGCTTCGGCGGCTAGAGCAGCGACAATCACATGGTGCCTCGCTCGTCTCTGGAAATTAAGGAAGACCCAGTCGGTTGACCTC
CTCGCCCTGCAGAGGAAGTGGCAGGAGGAGAAGGTTTTGATGGTGGAAAATTGTGGCTTCGGCTTGAATGAAACAATGGAAGAATTGAGGTCCTGGGGGTGTACTGAAGA
AGAAGCAGAAGTGAGCATGAGGGAAGAAGGAGAAGCGGGAGTGCAGGCAGCCATTGATGGAGACATGGCGACTTTTGACATTTCAGCAACTTTCAGTCTCTCTGTTATTT
CAGCAGTTTCTCAGCCACCAACACAGAGAGATTGTGAACAGAGGAGGCAGAAACCAACCGCCGAGAAACGGCTCTCTGATCTCTCTCCGCCAATCTTCCTCTCCCTGCAA
CAATGGCGATCAGAAACTTCACCAATCTCTCCGCAACGCCTCCGTTTCTCCGCCGTTCTTCTTCTCCGCAGAAATTCGCTTACATCCTCATCACCTGCCGTTCCGCTCCC
TCTCTTCGCCTCCGCGGCCTCTCCACCGCAGCCTCTCCCCGTCGAAGCTCGGTCGCATTCGCCACTACGCCCCCCGGAGGCGGGGGGGGGCCGAAGAGGAGGAGGCGGGG
GCGGGGACGGAGATGGAGAAGACGATGAGGAGGAAGATCGAGAGCAGAACAGGGCGGAGGCGATAGTGGTGCTGGCGGAGGCCGGGAGATCGGTGGAGAGCTTACCGAAG
GACCTGGCGGCGGCGATTGAGGCTGGGAGAGTGCCGGGAGTGATCGTGGAGAGGTTCGTGGAGCTGGAGAAGTCGGCGGCGTGGCGGTGGTTGCTTCAGTTCGGCGGGTT
CAAAGAGCGGCTGCTGGCGGATGACTTGTTCTTGGCTAAGGTTGCCATGGAATGTGGCGTTGGAATTTTCACAAAGATAATGGCTCTTATAGCTGATTTCATGCTTGTTT
GGCTTCCTGCTCCCACCGTCTCTCTCAAACCTCCTCTAGCAATCAGTGCTGGACCTCTCGCCAAGTTCTTCTATGGCTGCCCTGAAAACGCCTTCCAGGTGGCTTTTGCG
GGTACCTCGTATTCGTTTCTTCAGAGAGTTGGTGCAGTAGTGCGTAATGGGGCCAAGCTATTTGCTGTAGGCACTAGTGCATCTCTGGTGGGTACGGGTGTAACGAACGC
CTTGATAAATATAAGGAAGTTGATTGACAAATCATATGCCGTGGAAGCGGAGGATCTGCCTGTCCTACCAACGAGCATTGCCTATGGAGTTTACATGGCAGTTTCTAGCA
ACCTAAGATACCAAGTTCTTGCTGGAGTAATAGAACAGAGGATCTGGAGCCTTTGCTGCACCAACAAAAGTTGGCACTCAGTGCAATCTGCTTTGTCGTTCGAACTGGCA
ACACTTTCTTGGGTGGGTGGATTATGCAAGATGGATCGGGGTTCAGAGATCGAGAGAGTAGAGCGGGAAAAAGACAACACTTTTGTTCGATAA
mRNA sequence
ATGGGAAAAACATACTATGGCAGCAAGCAAGTTTATGACAAGGTCTTCTTCGCTGAAGTCAGCTTTTCCAAACGCCGGAAATGGGCTTCAGCTGTTTCTACAAACTTTGT
GCTAGTCATTTCCTTGAGCAGAGATATTGCTGTCTTGAGGGGCAACGCGGCTTTCCCTTTCTTGGGTTTAGGCGCAGCGACAGTAGCAGCCGCCGCATCGAGCCCCTCCT
CTCCTTTTGCCTCCGCCTCGGCGACGTCCGCTTCGGCGGCTAGAGCAGCGACAATCACATGGTGCCTCGCTCGTCTCTGGAAATTAAGGAAGACCCAGTCGGTTGACCTC
CTCGCCCTGCAGAGGAAGTGGCAGGAGGAGAAGGTTTTGATGGTGGAAAATTGTGGCTTCGGCTTGAATGAAACAATGGAAGAATTGAGGTCCTGGGGGTGTACTGAAGA
AGAAGCAGAAGTGAGCATGAGGGAAGAAGGAGAAGCGGGAGTGCAGGCAGCCATTGATGGAGACATGGCGACTTTTGACATTTCAGCAACTTTCAGTCTCTCTGTTATTT
CAGCAGTTTCTCAGCCACCAACACAGAGAGATTGTGAACAGAGGAGGCAGAAACCAACCGCCGAGAAACGGCTCTCTGATCTCTCTCCGCCAATCTTCCTCTCCCTGCAA
CAATGGCGATCAGAAACTTCACCAATCTCTCCGCAACGCCTCCGTTTCTCCGCCGTTCTTCTTCTCCGCAGAAATTCGCTTACATCCTCATCACCTGCCGTTCCGCTCCC
TCTCTTCGCCTCCGCGGCCTCTCCACCGCAGCCTCTCCCCGTCGAAGCTCGGTCGCATTCGCCACTACGCCCCCCGGAGGCGGGGGGGGGCCGAAGAGGAGGAGGCGGGG
GCGGGGACGGAGATGGAGAAGACGATGAGGAGGAAGATCGAGAGCAGAACAGGGCGGAGGCGATAGTGGTGCTGGCGGAGGCCGGGAGATCGGTGGAGAGCTTACCGAAG
GACCTGGCGGCGGCGATTGAGGCTGGGAGAGTGCCGGGAGTGATCGTGGAGAGGTTCGTGGAGCTGGAGAAGTCGGCGGCGTGGCGGTGGTTGCTTCAGTTCGGCGGGTT
CAAAGAGCGGCTGCTGGCGGATGACTTGTTCTTGGCTAAGGTTGCCATGGAATGTGGCGTTGGAATTTTCACAAAGATAATGGCTCTTATAGCTGATTTCATGCTTGTTT
GGCTTCCTGCTCCCACCGTCTCTCTCAAACCTCCTCTAGCAATCAGTGCTGGACCTCTCGCCAAGTTCTTCTATGGCTGCCCTGAAAACGCCTTCCAGGTGGCTTTTGCG
GGTACCTCGTATTCGTTTCTTCAGAGAGTTGGTGCAGTAGTGCGTAATGGGGCCAAGCTATTTGCTGTAGGCACTAGTGCATCTCTGGTGGGTACGGGTGTAACGAACGC
CTTGATAAATATAAGGAAGTTGATTGACAAATCATATGCCGTGGAAGCGGAGGATCTGCCTGTCCTACCAACGAGCATTGCCTATGGAGTTTACATGGCAGTTTCTAGCA
ACCTAAGATACCAAGTTCTTGCTGGAGTAATAGAACAGAGGATCTGGAGCCTTTGCTGCACCAACAAAAGTTGGCACTCAGTGCAATCTGCTTTGTCGTTCGAACTGGCA
ACACTTTCTTGGGTGGGTGGATTATGCAAGATGGATCGGGGTTCAGAGATCGAGAGAGTAGAGCGGGAAAAAGACAACACTTTTGTTCGATAA
Protein sequence
MGKTYYGSKQVYDKVFFAEVSFSKRRKWASAVSTNFVLVISLSRDIAVLRGNAAFPFLGLGAATVAAAASSPSSPFASASATSASAARAATITWCLARLWKLRKTQSVDL
LALQRKWQEEKVLMVENCGFGLNETMEELRSWGCTEEEAEVSMREEGEAGVQAAIDGDMATFDISATFSLSVISAVSQPPTQRDCEQRRQKPTAEKRLSDLSPPIFLSLQ
QWRSETSPISPQRLRFSAVLLLRRNSLTSSSPAVPLPLFASAASPPQPLPVEARSHSPLRPPEAGGGRRGGGGGGDGDGEDDEEEDREQNRAEAIVVLAEAGRSVESLPK
DLAAAIEAGRVPGVIVERFVELEKSAAWRWLLQFGGFKERLLADDLFLAKVAMECGVGIFTKIMALIADFMLVWLPAPTVSLKPPLAISAGPLAKFFYGCPENAFQVAFA
GTSYSFLQRVGAVVRNGAKLFAVGTSASLVGTGVTNALINIRKLIDKSYAVEAEDLPVLPTSIAYGVYMAVSSNLRYQVLAGVIEQRIWSLCCTNKSWHSVQSALSFELA
TLSWVGGLCKMDRGSEIERVEREKDNTFVR