CuGenDBv2

Carg26508 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg26508
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Nicalin
Genome location: Carg_Chr13:4869665..4881252
RNA-Seq Expression: Carg26508
Synteny: Carg26508
Gene Ontology terms:
  GO:0009966 - regulation of signal transduction (biological process)
  GO:0005789 - endoplasmic reticulum membrane (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
  IPR016574 - Nicalin
  IPR018247 - EF-Hand 1, calcium-binding site
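
As a small illustration of how the Genome location field can be read, the sketch below splits Carg_Chr13:4869665..4881252 into sequence name, start and end, and derives the span length. It assumes the coordinates are 1-based and inclusive, which is a common convention for this notation but is not stated on this page. The function names parse_location and span_length are hypothetical.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr13:4869665..4881252")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 11588 bp on Carg_Chr13.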


Homology
GenBank top hits | e value | % identity
KAG6583712.1 Nicalin-1, partial [Cucurbita argyrosperma subsp. sororia] | 4.9e-204 | 100
Query:  MNQQEPRADSISMCASTPILLRSNSLSLQRPSPPPTELLQILHPSMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGS
        MNQQEPRADSISMCASTPILLRSNSLSLQRPSPPPTELLQILHPSMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGS
Subjt:  MNQQEPRADSISMCASTPILLRSNSLSLQRPSPPPTELLQILHPSMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGS

Query:  RAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYP
        RAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYP
Subjt:  RAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYP

Query:  VYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIV
        VYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIV
Subjt:  VYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIV

Query:  ALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRIRESIDYAICLNSIG
        ALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRIRESIDYAICLNSIG
Subjt:  ALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRIRESIDYAICLNSIG

KAG7019358.1 Nicalin-1 [Cucurbita argyrosperma subsp. argyrosperma] | 6.8e-206 | 100
Query:  MNQQEPRADSISMCASTPILLRSNSLSLQRPSPPPTELLQILHPSMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGS
        MNQQEPRADSISMCASTPILLRSNSLSLQRPSPPPTELLQILHPSMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGS
Subjt:  MNQQEPRADSISMCASTPILLRSNSLSLQRPSPPPTELLQILHPSMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGS

Query:  RAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYP
        RAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYP
Subjt:  RAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYP

Query:  VYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIV
        VYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIV
Subjt:  VYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIV

Query:  ALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRIRESIDYAICLNSIGFGL
        ALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRIRESIDYAICLNSIGFGL
Subjt:  ALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHRIRESIDYAICLNSIGFGL

XP_022927448.1 nicalin-1-like [Cucurbita moschata] | 1.3e-175 | 98.75
Query:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
        MA RKPREPQVLESFYPLLALVFLLVA TELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
Subjt:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS

Query:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK
        QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVS AEPRK
Subjt:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK

Query:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
        LVSSTITNIQGWLPGLKSDGDA+QLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Subjt:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS

Query:  FDHRIRESIDYAICLNSIG
        FDHRIRESIDYAICLNSIG
Subjt:  FDHRIRESIDYAICLNSIG
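
The %identity value can be cross-checked against the alignment itself. In the alignments on this page the unlabelled middle line carries a letter where query and subject agree, a + for a conservative substitution, and a space for a mismatch or gap. The sketch below counts the letters in that line. The helper name percent_identity is hypothetical, and the short strings are excerpts of the alignment above rather than the full rows.

```python
def percent_identity(rows):
    """rows: list of (query_line, middle_line) pairs from one alignment."""
    aligned = 0
    identical = 0
    for query, middle in rows:
        # Use the query line for the column count: trailing mismatches are
        # spaces in the middle line and may be lost when text is trimmed.
        aligned += len(query)
        # Only letters mark identical residues; '+' and ' ' do not.
        identical += sum(1 for ch in middle if ch.isalpha())
    return 100.0 * identical / aligned


rows = [
    ("MAPRKPREPQ", "MA RKPREPQ"),  # one mismatch (space)
    ("DGDASQLPTI", "DGDA+QLPTI"),  # one conservative substitution (+)
]
print(round(percent_identity(rows), 2))
```

Applied by hand to the full XP_022927448.1 alignment, the same count gives 315 identical positions out of 319 aligned, which is consistent with the 98.75 reported for that hit.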

XP_022973118.1 nicalin-1-like [Cucurbita maxima] | 2.8e-175 | 97.81
Query:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
        MAPRKPREPQVLESFYPLLALVFLLVACTELCDAA VVDVYRLIHYDIS VPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKEC+S
Subjt:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS

Query:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK
        QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVS AEPRK
Subjt:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK

Query:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
        LVSSTITNIQGWLPGLK DGDASQLPTIAIVASYDTFGA+PELSVGSDSNGSGIVALLEIARLFSLLYS+PKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Subjt:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS

Query:  FDHRIRESIDYAICLNSIG
        FDHRIRESIDYAICLNSIG
Subjt:  FDHRIRESIDYAICLNSIG

XP_023520704.1 nicalin-1-like [Cucurbita pepo subsp. pepo] | 6.9e-174 | 97.81
Query:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
        MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDIS VPFGSRAASLNHHAASLHFPP   AADLSRTVFIIPLCELNFTFVKECIS
Subjt:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS

Query:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK
        QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVS AEPRK
Subjt:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK

Query:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
        LVSSTITNIQGWLPGLKSDGDA+QLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYS+PKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Subjt:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS

Query:  FDHRIRESIDYAICLNSIG
        FDHRIRESIDYAICLNSIG
Subjt:  FDHRIRESIDYAICLNSIG

TrEMBL top hits | e value | % identity
A0A0A0LYB7 Uncharacterized protein | 1.7e-157 | 87.81
Query:  SMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECI
        SMAPRKPREPQV +SFYP+LALVF+LVAC ELCDAATVVDVYRLI YDIS VPFGSRAA+LNHHA+SLHFP     ADLSRTV IIPLCELN TF++ECI
Subjt:  SMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECI

Query:  SQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPR
        SQ+KRLGGLL+LLP+ILGS+  KNDD KCP NG+G+IK L VELERLL+H+T+PYPVYFASEGEDI+AVLADVK+NDATGQLATATTGGYKLVVS AEPR
Subjt:  SQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPR

Query:  KLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ
        KLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAP+LSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ
Subjt:  KLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ

Query:  SFDHRIRESIDYAICLNSIG
        SFDHR+RE IDYAICLNSIG
Subjt:  SFDHRIRESIDYAICLNSIG

A0A1S3CJR5 nicalin-1 | 3.7e-157 | 87.77
Query:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
        MAPRKPREPQVL+SFYP+LALVF+LVAC ELCDAATVVDVYRLI YDIS VPFGSRAA+LNHHA+SLHFP   + ADLSRTV IIPLCEL  TF++ECIS
Subjt:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS

Query:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK
        Q+KRLGGLL+LLP+ILGS+  KNDD KC  NG+G+IKDLLVELERLLIH+T+PYPVYFAS+GEDI+AVLADVK+NDATGQLATATTGGYKLVVS AEP+K
Subjt:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK

Query:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
        L+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Subjt:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS

Query:  FDHRIRESIDYAICLNSIG
        FDHR+RE IDYAICLNSIG
Subjt:  FDHRIRESIDYAICLNSIG

A0A5A7T1R9 Nicalin-1 | 3.7e-157 | 87.77
Query:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
        MAPRKPREPQVL+SFYP+LALVF+LVAC ELCDAATVVDVYRLI YDIS VPFGSRAA+LNHHA+SLHFP   + ADLSRTV IIPLCEL  TF++ECIS
Subjt:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS

Query:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK
        Q+KRLGGLL+LLP+ILGS+  KNDD KC  NG+G+IKDLLVELERLLIH+T+PYPVYFAS+GEDI+AVLADVK+NDATGQLATATTGGYKLVVS AEP+K
Subjt:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK

Query:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
        L+SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Subjt:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS

Query:  FDHRIRESIDYAICLNSIG
        FDHR+RE IDYAICLNSIG
Subjt:  FDHRIRESIDYAICLNSIG

A0A6J1EL15 nicalin-1-like | 6.1e-176 | 98.75
Query:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
        MA RKPREPQVLESFYPLLALVFLLVA TELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
Subjt:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS

Query:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK
        QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVS AEPRK
Subjt:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK

Query:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
        LVSSTITNIQGWLPGLKSDGDA+QLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Subjt:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS

Query:  FDHRIRESIDYAICLNSIG
        FDHRIRESIDYAICLNSIG
Subjt:  FDHRIRESIDYAICLNSIG

A0A6J1IC52 nicalin-1-like | 1.4e-175 | 97.81
Query:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS
        MAPRKPREPQVLESFYPLLALVFLLVACTELCDAA VVDVYRLIHYDIS VPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKEC+S
Subjt:  MAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECIS

Query:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK
        QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVS AEPRK
Subjt:  QRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK

Query:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
        LVSSTITNIQGWLPGLK DGDASQLPTIAIVASYDTFGA+PELSVGSDSNGSGIVALLEIARLFSLLYS+PKTRGRYNLLFGLTSGGPYNYNGTHKWLQS
Subjt:  LVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQS

Query:  FDHRIRESIDYAICLNSIG
        FDHRIRESIDYAICLNSIG
Subjt:  FDHRIRESIDYAICLNSIG

SwissProt top hits | e value | % identity
Q5XIA1 Nicalin | 1.4e-39 | 34.89
Query:  FYPLLALVFLLVA-CTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP
        F   L  V LLVA      DAA    VYR+  YD+   P+G+R A LN  A ++       A  LSR   ++ L + ++   ++ +  R+  G ++I+LP
Subjt:  FYPLLALVFLLVA-CTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP

Query:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST
        + + +          PQ+    +    +E+E  ++      PVYFA E E + ++    ++  A+          L TAT  G+++V S A+ + +    
Subjt:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST

Query:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR
        IT+++G L GL  +     LPTI IVA YD FG AP LS+G+DSNGSGI  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ S DH 
Subjt:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR

Query:  ----IRESIDYAICLNSIGFG
            +++++ + +CL+++G G
Subjt:  ----IRESIDYAICLNSIGFG

Q5ZJH2 Nicalin | 5.7e-38 | 32.71
Query:  SFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP
        SF   +  V LL+      +AA    VYR+  Y++   P+G+R+A LN  A ++       A  LSR   ++ L + ++   ++ +  R+  G ++I+LP
Subjt:  SFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP

Query:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST
        + + S          PQ+   ++K  + E+E  ++      PVYFA E +++ ++    ++  A+          L TAT  G+++V S A+ + +    
Subjt:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST

Query:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR
        I +++G L GL  +     LPT+ IVA YD+FG AP LS G+DSNGSG+  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ + DH 
Subjt:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR

Query:  ----IRESIDYAICLNSIGFG
            +++++ + +CL+++G G
Subjt:  ----IRESIDYAICLNSIGFG

Q6NZ07 Nicalin-1 | 1.1e-41 | 35.71
Query:  VLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLI
        +L+  +PL  ++FL++ C    +AA    VYR+  YD+    +GSR A LN  A ++       A  LSR   ++ L +  F++ K   + R+  G ++I
Subjt:  VLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLI

Query:  LLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVK--------SNDATGQLATATTGGYKLVVSVAEPRKLV
        +LP  + +          PQ+    I    +ELE  L+      PVYFA E E++ ++    +        S+ A   L TAT  G+++V S A+ + + 
Subjt:  LLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVK--------SNDATGQLATATTGGYKLVVSVAEPRKLV

Query:  SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SF
           IT+++G L G  S G+   LPTI +VA YD+FG AP LS G+DSNGSG+  LLE+ARLFS LYS  +T   YNLLF L+ GG +NY GT +WL+ + 
Subjt:  SSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SF

Query:  DHR----IRESIDYAICLNSIG
        DH     +++++ + +CL+++G
Subjt:  DHR----IRESIDYAICLNSIG

Q8VCM8 Nicalin | 1.4e-39 | 34.89
Query:  FYPLLALVFLLVA-CTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP
        F   L  V LLVA      DAA    VYR+  YD+   P+G+R A LN  A ++       A  LSR   ++ L + ++   ++ +  R+  G ++I+LP
Subjt:  FYPLLALVFLLVA-CTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP

Query:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST
        + + +          PQ+    +    +E+E  ++      PVYFA E E + ++    ++  A+          L TAT  G+++V S A+ + +    
Subjt:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST

Query:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR
        IT+++G L GL  +     LPTI IVA YD FG AP LS+G+DSNGSGI  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ S DH 
Subjt:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR

Query:  ----IRESIDYAICLNSIGFG
            +++++ + +CL+++G G
Subjt:  ----IRESIDYAICLNSIGFG

Q969V3 Nicalin | 4.0e-39 | 34.58
Query:  FYPLLALVFLLVA-CTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP
        F   L  V LLVA      DAA    VYR+  YD+   P+G+R A LN  A ++      AA  LSR   ++ L  L+F++ +   + R+  G ++I+LP
Subjt:  FYPLLALVFLLVA-CTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLP

Query:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST
        + + +          PQ+    +    +E+E  ++      PVYFA E E + ++    ++  A+          L TAT  G+++V S  + + +    
Subjt:  KILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATG--------QLATATTGGYKLVVSVAEPRKLVSST

Query:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR
        I +++G L GL  +     LPTI IVA YD FG AP LS+G+DSNGSG+  LLE+ARLFS LY+  +T   YNLLF  + GG +NY GT +WL+ + DH 
Subjt:  ITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQ-SFDHR

Query:  ----IRESIDYAICLNSIGFG
            +++++ + +CL+++G G
Subjt:  ----IRESIDYAICLNSIGFG

Arabidopsis top hits | e value | % identity
AT3G44330.1 INVOLVED IN: protein processing; LOCATED IN: mitochondrion, endoplasmic reticulum, plasma membrane, vacuole; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nicalin (InterPro:IPR016574), EF-Hand 1, calcium-binding site (InterPro:IPR018247), Nicastrin (InterPro:IPR008710); Has 245 Blast hits to 243 proteins in 99 species: Archae - 6; Bacteria - 10; Metazoa - 139; Fungi - 0; Plants - 46; Viruses - 0; Other Eukaryotes - 44 (source: NCBI BLink). | 1.4e-119 | 68.99
Query:  VLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLI
        V ES YP+LAL+ +LVAC ELCDAATVVDVYRLI YDIS VPFGSR +SLNHHAASL F      ADLSR+V I+PL EL+  FV++ ISQ++ LGGLLI
Subjt:  VLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAASLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLI

Query:  LLPKIL-------GSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVS
        LLP+         GS   +ND F          + LL +LE+LL+H  +P+PVYFA E E+ +A+LADVK NDA GQ ATATTGGYKLV+SV+EPRK+ S
Subjt:  LLPKIL-------GSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRKLVS

Query:  STITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH
         TITNIQGWLPGL+++GD+SQLPTIA+VASYDTFGAAP LSVGSDSNGSG+VALLE+ARLFS+LYSNPKTRG+YNLLF LTSGGPYNY GT KWL+S D 
Subjt:  STITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDH

Query:  RIRESIDYAICLNSIG
        R+RESIDYAICLNS+G
Subjt:  RIRESIDYAICLNSIG

AT3G52640.1 Zn-dependent exopeptidases superfamily protein | 9.4e-04 | 25.13
Query:  YPVYFASEG--EDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK------LVSSTITNIQGW-----LPGLKSDGDASQLPTIAIVASYDTFGAA
        +PVY  SE     ++ +L+  K    T    T+    + +V+   +         L   T   + G+     LP +      ++ P +  VAS DT    
Subjt:  YPVYFASEG--EDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK------LVSSTITNIQGW-----LPGLKSDGDASQLPTIAIVASYDTFGAA

Query:  PELSVGSDSNGSGIVALLEIARLFSLL--YSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR-------IRESIDYAICLNSIGFGL
         + S G+DS  SG+VALL      S +   SN K +    L+F + +G  + Y G+ ++L   D            SI+  + + S+G GL
Subjt:  PELSVGSDSNGSGIVALLEIARLFSLL--YSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR-------IRESIDYAICLNSIGFGL

AT3G52640.2 Zn-dependent exopeptidases superfamily protein | 9.4e-04 | 25.13
Query:  YPVYFASEG--EDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK------LVSSTITNIQGW-----LPGLKSDGDASQLPTIAIVASYDTFGAA
        +PVY  SE     ++ +L+  K    T    T+    + +V+   +         L   T   + G+     LP +      ++ P +  VAS DT    
Subjt:  YPVYFASEG--EDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK------LVSSTITNIQGW-----LPGLKSDGDASQLPTIAIVASYDTFGAA

Query:  PELSVGSDSNGSGIVALLEIARLFSLL--YSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR-------IRESIDYAICLNSIGFGL
         + S G+DS  SG+VALL      S +   SN K +    L+F + +G  + Y G+ ++L   D            SI+  + + S+G GL
Subjt:  PELSVGSDSNGSGIVALLEIARLFSLL--YSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR-------IRESIDYAICLNSIGFGL

AT3G52640.3 Zn-dependent exopeptidases superfamily protein | 9.4e-04 | 25.13
Query:  YPVYFASEG--EDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK------LVSSTITNIQGW-----LPGLKSDGDASQLPTIAIVASYDTFGAA
        +PVY  SE     ++ +L+  K    T    T+    + +V+   +         L   T   + G+     LP +      ++ P +  VAS DT    
Subjt:  YPVYFASEG--EDINAVLADVKSNDATGQLATATTGGYKLVVSVAEPRK------LVSSTITNIQGW-----LPGLKSDGDASQLPTIAIVASYDTFGAA

Query:  PELSVGSDSNGSGIVALLEIARLFSLL--YSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR-------IRESIDYAICLNSIGFGL
         + S G+DS  SG+VALL      S +   SN K +    L+F + +G  + Y G+ ++L   D            SI+  + + S+G GL
Subjt:  PELSVGSDSNGSGIVALLEIARLFSLL--YSNPKTRGRYNLLFGLTSGGPYNYNGTHKWLQSFDHR-------IRESIDYAICLNSIGFGL


Sequences
CDS sequence
ATGAACCAGCAAGAGCCACGAGCTGATTCCATTTCCATGTGCGCATCCACACCCATATTATTGCGATCAAATTCCCTATCCCTCCAACGCCCTTCCCCTCCCCCAACTGA
GCTCCTCCAGATCTTGCATCCATCCATGGCTCCTCGTAAACCCCGCGAGCCTCAAGTCCTCGAATCCTTCTACCCTCTCCTTGCTCTTGTCTTCCTTCTAGTTGCCTGCA
CCGAGCTCTGTGACGCCGCCACTGTCGTCGATGTCTATCGCCTCATTCACTACGATATCTCTGCCGTTCCCTTTGGATCCCGCGCCGCTTCACTCAATCACCATGCTGCT
TCTCTTCATTTTCCCCCTGCTGCTGCTGCTGCTGATCTCTCTCGCACCGTTTTCATCATTCCTCTTTGTGAACTCAACTTCACCTTCGTCAAAGAATGTATATCTCAAAG
AAAGCGTCTGGGAGGTCTCCTGATCTTGCTTCCCAAGATTCTTGGCTCGGATGGCCCGAAAAATGATGACTTTAAATGTCCACAAAATGGAGATGGGATGATTAAGGATT
TACTGGTTGAACTTGAACGGTTGCTCATACATGCTACTCTACCTTACCCTGTATATTTTGCTTCGGAAGGTGAAGATATTAATGCTGTTTTGGCTGATGTCAAGAGCAAT
GATGCCACTGGTCAGCTTGCTACTGCAACTACTGGCGGGTACAAGCTTGTTGTTTCAGTAGCAGAACCAAGGAAACTTGTATCTTCCACGATTACAAATATTCAGGGTTG
GCTGCCTGGACTAAAATCTGATGGAGATGCTAGTCAACTCCCAACAATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGAATTATCTGTGGGAAGTGATA
GCAACGGAAGTGGAATTGTTGCACTTCTTGAAATTGCAAGGTTATTTTCTCTTCTTTATTCCAATCCTAAGACACGAGGAAGGTACAATCTACTTTTTGGGCTCACTTCT
GGCGGACCTTACAACTACAATGGGACTCACAAGTGGCTTCAAAGCTTTGATCACCGTATCCGTGAGAGTATTGACTATGCCATTTGCTTAAATAGTATTGGATTTGGGCT
TTAA
mRNA sequence
CTGAACGGCGAGATGAACCAGCAAGAGCCACGAGCTGATTCCATTTCCATGTGCGCATCCACACCCATATTATTGCGATCAAATTCCCTATCCCTCCAACGCCCTTCCCC
TCCCCCAACTGAGCTCCTCCAGATCTTGCATCCATCCATGGCTCCTCGTAAACCCCGCGAGCCTCAAGTCCTCGAATCCTTCTACCCTCTCCTTGCTCTTGTCTTCCTTC
TAGTTGCCTGCACCGAGCTCTGTGACGCCGCCACTGTCGTCGATGTCTATCGCCTCATTCACTACGATATCTCTGCCGTTCCCTTTGGATCCCGCGCCGCTTCACTCAAT
CACCATGCTGCTTCTCTTCATTTTCCCCCTGCTGCTGCTGCTGCTGATCTCTCTCGCACCGTTTTCATCATTCCTCTTTGTGAACTCAACTTCACCTTCGTCAAAGAATG
TATATCTCAAAGAAAGCGTCTGGGAGGTCTCCTGATCTTGCTTCCCAAGATTCTTGGCTCGGATGGCCCGAAAAATGATGACTTTAAATGTCCACAAAATGGAGATGGGA
TGATTAAGGATTTACTGGTTGAACTTGAACGGTTGCTCATACATGCTACTCTACCTTACCCTGTATATTTTGCTTCGGAAGGTGAAGATATTAATGCTGTTTTGGCTGAT
GTCAAGAGCAATGATGCCACTGGTCAGCTTGCTACTGCAACTACTGGCGGGTACAAGCTTGTTGTTTCAGTAGCAGAACCAAGGAAACTTGTATCTTCCACGATTACAAA
TATTCAGGGTTGGCTGCCTGGACTAAAATCTGATGGAGATGCTAGTCAACTCCCAACAATTGCTATTGTAGCATCATATGATACATTTGGCGCTGCTCCTGAATTATCTG
TGGGAAGTGATAGCAACGGAAGTGGAATTGTTGCACTTCTTGAAATTGCAAGGTTATTTTCTCTTCTTTATTCCAATCCTAAGACACGAGGAAGGTACAATCTACTTTTT
GGGCTCACTTCTGGCGGACCTTACAACTACAATGGGACTCACAAGTGGCTTCAAAGCTTTGATCACCGTATCCGTGAGAGTATTGACTATGCCATTTGCTTAAATAGTAT
TGGATTTGGGCTTTAAAGTTGATCTGAAGCACAAGAAGATTAATATTTCAAAAAGACTGTTCTTGGTTTCCCTTAGCATAGTATATGATTTTTACTTTTATTTTGTGATT
ATCGTAGCCCACTCGATAAATCCTTTGTATGCTGTAGGTAGCCTGGGAGCACGAACAGTTTTCAAGATTGAGAGTAACTGCTGCTACCCTTTCTGGAATCTCTGCTGCTC
CTGAGCTTTTGGAAAGGACTGGAGGTTTAGCTGATAACAGATTGTTTTTGAACGAGAGTGCAATTGCCAAGAGTATCAAGTTAGTCGCAGAGAGTCTTGCAAGGCATATT
TACAGATATGAAGGAAAAAATATACAAGTATTTGCAGATGATAGTAGTTTGGCAATCAATCCAACTTATATTCGATCATGGTTGGATCTTTTGTCAAGAACACCTCGAGT
TGCTCCATTTCTGTCGAAAGATGACCCTTTCATATCAGCATTAAAAAAGGAACTGGAGGTCCATACCCATGATGTGAGCTTGCAACATGAGCCATTTGACGGGATGTTCA
CCTTTTATGATTCAACTGCAGCTAAACTTCACATATACCAGGTTGCTAGCGTGACATTCGACTTGGTTTTGCTTTTGGTCTTGGGATCATATTTAGTTTTACTCTTCTGT
TTCCTTGTGATCACAACCAGGGGTCTCGACGATCTGATTGGTCTATTTAGACGCCCTCCTTCCCGAAAAGTAAAAACAGCTTGATGACTGCAGTAGATTTTGATGCTTGG
ATTTGCATCATTGCCACCCGAATTTTTTCCCATCTAGATTAGACAGGCCTTACGATGGGGTCCAATGTGGCATAAGAGATCGAGGAACTGAAGATTTTATGTGGCATTCC
TCTCAGGATTCTCTCTTTCTTCTTTTTGACATCCTTACATAATAACAGTTATGAGAACTCTCGAGGTGCGATAATTTTCGTTTTCTCATTTAAATTCGCACAGTTTAACC
CTCATTTGTGGTTTCCACTTTGGGCAATGATTTTTTCCTTAGAATTTGTTGGTGTCCATTATTTCTTGCTCAGTTGATGTTAGAGACAGGCAATAGATTCAGTTCGAACA
GTTCTGAAAATCTCTTAGAATATATTGATGTTATTTATCATTATTATTTCACAGTTGCCTGATGGCTTTGGAAT
Protein sequence
MNQQEPRADSISMCASTPILLRSNSLSLQRPSPPPTELLQILHPSMAPRKPREPQVLESFYPLLALVFLLVACTELCDAATVVDVYRLIHYDISAVPFGSRAASLNHHAA
SLHFPPAAAAADLSRTVFIIPLCELNFTFVKECISQRKRLGGLLILLPKILGSDGPKNDDFKCPQNGDGMIKDLLVELERLLIHATLPYPVYFASEGEDINAVLADVKSN
DATGQLATATTGGYKLVVSVAEPRKLVSSTITNIQGWLPGLKSDGDASQLPTIAIVASYDTFGAAPELSVGSDSNGSGIVALLEIARLFSLLYSNPKTRGRYNLLFGLTS
GGPYNYNGTHKWLQSFDHRIRESIDYAICLNSIGFGL
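
The CDS and protein sequences listed above can be related by a straightforward translation. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly. The function name translate is hypothetical, and the two inputs are the first 30 and last 24 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAACCAGCAAGAGCCACGAGCTGATTCC"))  # start of the CDS
print(translate("AATAGTATTGGATTTGGGCTTTAA"))        # end of the CDS, incl. stop
```

The two fragments translate to MNQQEPRADS and NSIGFGL, which match the first ten and last seven residues of the protein sequence shown above.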