CuGenDBv2

Sed0017602 (gene) of Chayote v1 genome

Gene ID: Sed0017602
Organism: Sechium edule (Chayote v1)
Description: basic 7S globulin-like
Genome location: LG12:4062085..4064519
RNA-Seq Expression: Sed0017602
Synteny: Sed0017602
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
  IPR001461 - Aspartic peptidase A1 family
  IPR021109 - Aspartic peptidase domain superfamily
  IPR032799 - Xylanase inhibitor, C-terminal


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597463.1 putative aspartic proteinase GIP2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.5e-97; % identity: 52.8)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVPV+LTVDLG +F WVDC R Y+SSTYKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD+VSVSSTDGFNPT+ VT+ NFLFVC +TFLL+GLAGG TGMA FGRN IS+P+QF+AAFSFNRK A+CLS S + PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SIL+ S++VPLN TLL+ID    GGTKISTV+PYTVLESSIY+AV++TF   + +VP  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF ACF A+  S TR+GPGVP I+L      V    +GA +     D VLCLGFVD G N RTSIVIGAHQIED+LLEFD+ TSR GFS+TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T CANFNFTSK
Subjt:  TNCANFNFTSK

KAG7028922.1 Basic 7S globulin, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.9e-97; % identity: 52.8)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVPV+LTVDLG +F WVDC R Y+SSTYKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD+VSVSSTDGFNPT+ VT+ NFLFVC +TFLL+GLAGG TGMA FGRN IS+P+QF+AAFSFNRK A+CLS S + PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SI++ S++VPLN TLL+ID    GGTKISTV+PYTVLESSIY+AV++TF   + +VP  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF ACF A+  S TR+GPGVP I+L      V    +GA +     D VLCLGFVD G N RTSIVIGAHQIED+LLEFDL TSR GFS+TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T CANFNFTSK
Subjt:  TNCANFNFTSK

XP_022936844.1 basic 7S globulin-like [Cucurbita moschata] (e value: 3.3e-97; % identity: 52.55)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVPV+LTVDLG +F WVDC R Y+SSTYKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD+VSVSSTDGFNPT+ VT+ NFLFVC +TFLL+GLAGG TGMA FGRN IS+P+QF+AAFSFNRK A+CLS S + PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SI++ S++VPLN TLL+ID    GGTKISTV+PYTVLESSIY+AV++TF   + +VP  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF ACF A+  S TR+GPGVP I+L      V    +GA +     D VLCLGFVD G N RTSIVIGAHQIED+LLEFD+ TSR GFS+TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T CANFNFTSK
Subjt:  TNCANFNFTSK

XP_022973769.1 basic 7S globulin-like [Cucurbita maxima] (e value: 2.5e-97; % identity: 52.8)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVPV+LTVDLG +F WVDC R Y+SSTYKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD+VSVSSTDGFNPT+ VT+ NFLFVC TTFLL+GLAGG TGMA FGRN IS+P+QF+AAFSFNRK AICLS S   PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SI++ S++VPLN TLL+ID    GGTKISTV+PYTVLESSIY+AV++TF   + ++P  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF ACF A+  S TR+GPGVP I+L      V    +GA +     D VLCLGFVD G N RTSIVIGAHQIE++LLEFDL TSR GFS+TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T CANFNFTSK
Subjt:  TNCANFNFTSK

XP_038896289.1 probable aspartic proteinase GIP2 [Benincasa hispida] (e value: 3.9e-98; % identity: 53.55)
Query:  TSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY----------------------------------
        TSFRPK LVLP              QRTPLVPV+LTVDLGG+F WVDC R YVSSTYKPARC SA                                   
Subjt:  TSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY----------------------------------

Query:  ------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG-----------
              GE+ASD+VSVSSTDGFNPTR+V+L NFLFVC +TFLLEGLAGG TGMA FGR  IS+P+QFAAAFSFNRK A+CLS S + PG           
Subjt:  ------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG-----------

Query:  ----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP---
                                          YFI V+SI++ S++VPLN TLL+ID    GGTKISTVNPYTVLESSIY+AVV+TF   +  VP   
Subjt:  ----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP---

Query:  -VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRT
         VAPF  C+ ++ FS TR+GPGVP IDL      V    +GA      ND VLCLGFVD G   RT+IVIGAHQIED+LLEFDL TSR GFSSTLL R T
Subjt:  -VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRT

Query:  NCANFNFTS
         CANFNFTS
Subjt:  NCANFNFTS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U0M7 Basic 7S globulin-like (e value: 1.1e-93; % identity: 51.71)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVPV+LTVDLGG+F WVDC R YVSS+YKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GE+ASD+VSVSST+GFNPTR+V++ NFLFVC +TFLLEGLA G TGMA FGRN IS+P+QFAAAFSFNRK A+CLS S + PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V SI+V S+ VPLN TLL+ID    GGTKISTVNP+TVLESSIY A+V+ F   V  VP  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF+ C+ ++ F  TR+G GVP IDL      V    +GA      ND VLCLGFVD G + RT+IVIGAHQIED LLEFDL TSR GF+ TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTS
        T CANFNFTS
Subjt:  TNCANFNFTS

A0A6J1F9F8 basic 7S globulin-like (e value: 1.6e-97; % identity: 52.55)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVPV+LTVDLG +F WVDC R Y+SSTYKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD+VSVSSTDGFNPT+ VT+ NFLFVC +TFLL+GLAGG TGMA FGRN IS+P+QF+AAFSFNRK A+CLS S + PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SI++ S++VPLN TLL+ID    GGTKISTV+PYTVLESSIY+AV++TF   + +VP  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF ACF A+  S TR+GPGVP I+L      V    +GA +     D VLCLGFVD G N RTSIVIGAHQIED+LLEFD+ TSR GFS+TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T CANFNFTSK
Subjt:  TNCANFNFTSK

A0A6J1IFL5 basic 7S globulin-like (e value: 1.2e-97; % identity: 52.8)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVPV+LTVDLG +F WVDC R Y+SSTYKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD+VSVSSTDGFNPT+ VT+ NFLFVC TTFLL+GLAGG TGMA FGRN IS+P+QF+AAFSFNRK AICLS S   PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SI++ S++VPLN TLL+ID    GGTKISTV+PYTVLESSIY+AV++TF   + ++P  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF ACF A+  S TR+GPGVP I+L      V    +GA +     D VLCLGFVD G N RTSIVIGAHQIE++LLEFDL TSR GFS+TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T CANFNFTSK
Subjt:  TNCANFNFTSK

A0A6J1IGY1 basic 7S globulin-like (e value: 2.7e-97; % identity: 52.55)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK LVLP              QRTPLVP++LTVDLG +F WVDC R Y+SSTYKPARC SA                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD+VSVSSTDGFNPT+ VT+ NFLFVC TTFLL+GLAGG TGMA FGRN IS+P+QF+AAFSFNRK AICLS S   PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SI++ S++VPLN TLL+ID    GGTKISTV+PYTVLESSIY+AV++TF   + ++P  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF ACF A+  S TR+GPGVP I+L      V    +GA +     D VLCLGFVD G N RTSIVIGAHQIE++LLEFDL TSR GFS+TLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAGN-----DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T CANFNFTSK
Subjt:  TNCANFNFTSK

A0A6J1IWG3 basic 7S globulin-like (e value: 1.3e-94; % identity: 51.58)
Query:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        ATSFRPK L+LP              QRTPLVPV+LTVDLGG+F WVDC R+Y SSTYKPARC S+                                  
Subjt:  ATSFRPKGLVLP--------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GE+ASD+VSVSST+GFNPT  V++ NFLFVC TTFLL+GLAGG TGMA FGR  IS+P+QFAAAFSFNRK AICLS S K PG          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--
                                           YFI V+SI++ S++VPLN TLL+I+    GGTKISTVNPYTVLESSIY AV++T    +G +P  
Subjt:  -----------------------------------YFIAVESILVGSRSVPLNATLLQIDP--TGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP--

Query:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR
          VAPF+ACF A  F  TR+GPG+P IDL     +V    +GA      N+ VLCLGFVD G   RTSIVIGAHQIED LLEFDL TSR GF STLL R 
Subjt:  --VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARR

Query:  TNCANFNFTSK
        T C+NFNFT+K
Subjt:  TNCANFNFTSK

SwissProt top hits (e value, % identity, alignment)
I1JNS6 Probable aspartic proteinase GIP1 (e value: 1.1e-42; % identity: 33.6)
Query:  RTPLVPVRLTVDLGGKFTWVDCARSYVSST--YKPAR---CSSAYGELASDLVSVSSTDGFNP-TRS----------------------VTLRNFLFVCA
        +TPL P +L + LG   +WV C  +Y SS+  + P     C+S      S+  S+ +    NP TR+                      V + +F+F CA
Subjt:  RTPLVPVRLTVDLGGKFTWVDCARSYVSST--YKPAR---CSSAYGELASDLVSVSSTDGFNP-TRS----------------------VTLRNFLFVCA

Query:  TTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG-------------------------------------------YFIAVE
        T  LL+GLA  A G+AS GR+  S+P Q + + +  R   +CL  S  + G                                           YFI + 
Subjt:  TTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG-------------------------------------------YFIAVE

Query:  SILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFA------AVVGDVPVAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGE
        SI +  + + +N+++L +D T  GGTKIST  PYTVLE+SIY   V+ F        +     V PF  C+ A   + TRVGP VP +DL     DV+  
Subjt:  SILVGSRSVPLNATLLQIDPT--GGTKISTVNPYTVLESSIYDAVVRTFA------AVVGDVPVAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGE

Query:  FYGA--------GNDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCAN
         +G         G   V CLGFVD G   RT IVIG HQ+ED+L++FDL ++RFGF+STLL +   C+N
Subjt:  FYGA--------GNDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCAN

P0DO21 Probable aspartic proteinase GIP2 (e value: 5.8e-81; % identity: 44.77)
Query:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        TSFRPKGL+LP               QRTPLVPV LT+DLGG+F WVDC + YVSSTY+PARC SA                                  
Subjt:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------
               GELASD V V S++G NP R V+ ++FLFVC +TFLLEGLA G  GMA  GR +IS+P+QF+A FSF RK A+CLS S    G          
Subjt:  -------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPSCKHPG----------

Query:  ------------------------------------YFIAVESILVGSRSVPLNATLLQID--PTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDV--
                                            YFI V+SI +  + VP+N TLL ID    GGTKISTVNPYT+LE+SIY+AV   F   + ++  
Subjt:  ------------------------------------YFIAVESILVGSRSVPLNATLLQID--PTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDV--

Query:  --PVAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLAR
           VAPF+ACF +   + TRVGP VP IDL     +V+   +GA      ++ VLCLGFVD G + RTSIV+G + IED+LL+FDL  SR GF+S++L R
Subjt:  --PVAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLAR

Query:  RTNCANFNFTS
        +T CANFNFTS
Subjt:  RTNCANFNFTS

P82952 Gamma conglutin 1 (e value: 3.0e-45; % identity: 35.47)
Query:  QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARC----------------------------------------SSAYGELASDLVSVSSTDGFNPT
        +RTPLV     +DL G+F  V+C   Y SSTYK   C                                         SA GELA D++ + ST G +P 
Subjt:  QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARC----------------------------------------SSAYGELASDLVSVSSTDGFNPT

Query:  RSVTLRNFLFVCATTFLLE-GLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS--------------PSCKHPG-------------------YF
          VT  +FLF CA + +L+ GL     G+A  G + IS+P Q A+ F F  K A+CL+              P    PG                   Y+
Subjt:  RSVTLRNFLFVCATTFLLE-GLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS--------------PSCKHPG-------------------YF

Query:  IAVESILVGSRSVPLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDV----PVAPFKACFRAEGFSGTRVGPGVPWIDL------DVWG
        I V+S  + +  +P     +     GG  IST  PYT L++ I+ A+ + F   +  V    PVAPF ACF A     +++GP VP IDL      ++  
Subjt:  IAVESILVGSRSVPLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDV----PVAPFKACFRAEGFSGTRVGPGVPWIDL------DVWG

Query:  EFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCANFNF-TSKT
          +GA        GV+CL FVD G   +  IVIG  Q+ED+LL+FDL  SR GFSS+LL RRTNCANFNF TS T
Subjt:  EFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCANFNF-TSKT

Q42369 Gamma conglutin 1 (e value: 1.6e-30; % identity: 29.2)
Query:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARC-------------------------------------
        TS +P  LVLP               +RTPL+ V L +DL GK  WV C++ Y SSTY+   C                                     
Subjt:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARC-------------------------------------

Query:  ----SSAYGELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLE-GLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS---------------
             S  GELA D++++ ST G      V +  FLF CA +FL + GL     G    G+  IS+  Q  + F   R+ ++CLS               
Subjt:  ----SSAYGELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLE-GLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS---------------

Query:  -----------------------PSCKHPGYFIAVESILVGSRSV---------PLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP
                                  K   YFI V +I V    V         P + +       GG  I+T +PYTVL  SI++   + FA    ++P
Subjt:  -----------------------PSCKHPGYFIAVESILVGSRSV---------PLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVP

Query:  -------VAPFKACFRAEGFSGTRVGPGVPWIDL------DVW----GEFYGAGNDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGF-SS
               V PF  C+ +   SG     G P +DL       VW      F     DGV CLGFVD G +AR  I +GAH +E++L+ FDL  SR GF S+
Subjt:  -------VAPFKACFRAEGFSGTRVGPGVPWIDL------DVW----GEFYGAGNDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGF-SS

Query:  TLLARRTNCAN
        +L +    C+N
Subjt:  TLLARRTNCAN

Q9FSH9 Gamma conglutin 1 (e value: 4.2e-31; % identity: 29.06)
Query:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARC-------------------------------------
        +S +P  LVLP               +RTPL+ V + +DL GK  WV C++ Y SSTY+   C                                     
Subjt:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARC-------------------------------------

Query:  ----SSAYGELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLE-GLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS---------------
             S  GELA D++++ ST G      V +  FLF CA TFL + GL     G    G   IS+P Q  + F   R+  +CLS               
Subjt:  ----SSAYGELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLE-GLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS---------------

Query:  -----------------------PSCKHPGYFIAVESILVGSRSV-----------PLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGD
                                  K   YFI V +I V    V             +++  +    GG  I+T NPYTVL  SI++   + FA    +
Subjt:  -----------------------PSCKHPGYFIAVESILVGSRSV-----------PLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGD

Query:  VP-------VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYG-----AGNDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGF-
        VP       V PF  C+  +  SG     GVP +DL     DV     G        DGV CLGFVD G + R  I +G HQ+E++L+ FDL  SR GF 
Subjt:  VP-------VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYG-----AGNDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGF-

Query:  SSTLLARRTNCAN
        +++L +   +C+N
Subjt:  SSTLLARRTNCAN

Arabidopsis top hits (e value, % identity, alignment)
AT1G03220.1 Eukaryotic aspartyl protease family protein (e value: 1.6e-73; % identity: 42.4)
Query:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------
        T FRPK L+LP               QRTPLVP  +  DLGG+  WVDC + YVSSTY+  RC+SA                                  
Subjt:  TSFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY---------------------------------

Query:  ------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS-----------PSCKHPG
              GE A D+VS+ ST+G NP R V + N +F C  TFLL+GLA G  GMA  GR+ I +P+QFAAAFSF+RK A+CL+           P    PG
Subjt:  ------GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLS-----------PSCKHPG

Query:  ------------------------------YFIAVESILVGSRSVPLNATLLQIDPT---GGTKISTVNPYTVLESSIYDAVVRTFA------AVVGDVP
                                      YFI V +I +  ++VP+N TLL+I+ +   GGTKIS+VNPYTVLESSIY+A    F       ++     
Subjt:  ------------------------------YFIAVESILVGSRSVPLNATLLQIDPT---GGTKISTVNPYTVLESSIYDAVVRTFA------AVVGDVP

Query:  VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTN
        V PF ACF  +    TR+G  VP I+L     DV    +GA      +D V+CLGFVD G NARTS+VIG  Q+ED+L+EFDL +++FGFSSTLL R+TN
Subjt:  VAPFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTN

Query:  CANFNFTS
        CANFNFTS
Subjt:  CANFNFTS

AT1G03230.1 Eukaryotic aspartyl protease family protein (e value: 1.6e-70; % identity: 41.28)
Query:  SFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY----------------------------------
        SFRPK L+LP               QRTPLVP  +  DLGG+  WVDC + YVS+TY+  RC+SA                                   
Subjt:  SFRPKGLVLP---------------QRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAY----------------------------------

Query:  -----GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICL--------------------
             GE A D+VS+ ST+G NP R V + N +F C +T LL+GLA GA GMA  GR+ I +P QFAAAFSFNRK A+CL                    
Subjt:  -----GELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICL--------------------

Query:  ---------------------SPSCKHPGYFIAVESILVGSRSVPLNATLLQIDPT---GGTKISTVNPYTVLESSIYDAVVRTF------AAVVGDVPV
                             S   K P YFI V +I +  +++P++ TLL+I+ +   GGTKIS+VNPYTVLESSIY A    F       ++     V
Subjt:  ---------------------SPSCKHPGYFIAVESILVGSRSVPLNATLLQIDPT---GGTKISTVNPYTVLESSIYDAVVRTF------AAVVGDVPV

Query:  APFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNC
         PF ACF  +    TR+G  VP I L     DV    +GA      +D V+CLGFVD G N   S+VIG  Q+ED+L+EFDL +++FGFSSTLL R+TNC
Subjt:  APFKACFRAEGFSGTRVGPGVPWIDL-----DVWGEFYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNC

Query:  ANFNFTS
        ANFNFTS
Subjt:  ANFNFTS

AT5G19100.1 Eukaryotic aspartyl protease family protein (e value: 6.1e-25; % identity: 42.42)
Query:  YFIAVESILVGSRSVPLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFA---AVVGDVPVAPFKACFRAEGFSGTRVGPGVPWIDLDVWG----E
        Y I V+SI +G+++VP+        P G TKIST+ PYTV ++S+Y A++  F     +     V PF ACF + G      G GVP IDL + G     
Subjt:  YFIAVESILVGSRSVPLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFA---AVVGDVPVAPFKACFRAEGFSGTRVGPGVPWIDLDVWG----E

Query:  FYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCA
         YG+      N  V+CLGFVD G   +  IVIG  Q+ED+L+EFDL  S+F FSS+LL   T+C+
Subjt:  FYGAG-----NDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCA

AT5G19110.1 Eukaryotic aspartyl protease family protein (e value: 6.1e-33; % identity: 30.9)
Query:  PVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSA-----------------------------YGELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFL
        PV L +DLG   TW+DC +    S+ +   C S+                              G +  D  S+ +TDG      V++R+F F CA    
Subjt:  PVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSA-----------------------------YGELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFL

Query:  LEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPS---------------------------------CKHPGYFIAVESILVGSRSVPLNAT
        L+GL     G+ +      S   Q  +AF+   K ++CL  S                                      Y I V+SI VG  ++ LN  
Subjt:  LEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSPS---------------------------------CKHPGYFIAVESILVGSRSVPLNAT

Query:  LLQIDPTGGTKISTVNPYTVLESSIYDAVVRTF-----AAVVGDVP-VAPFKACF--RAEGFSGTRVGPGVPWIDLDV--------WGEFYGAG-----N
        LL    TGG K+STV  YTVL++ IY+A+ ++F     A  +  VP VAPFK CF  R  G + T  GP VP I++ +        WG FYGA       
Subjt:  LLQIDPTGGTKISTVNPYTVLESSIYDAVVRTF-----AAVVGDVP-VAPFKACF--RAEGFSGTRVGPGVPWIDLDV--------WGEFYGAG-----N

Query:  DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCANF
        + V+CL F+D G   +  +VIG HQ++DH+LEFD   +   FS +LL   T+C+ +
Subjt:  DGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCANF

AT5G19120.1 Eukaryotic aspartyl protease family protein (e value: 3.4e-28; % identity: 31.23)
Query:  PVRLTVDLGGKFTWVDCARSYVSSTY-----------------------------KPARCS----------SAYGELASDLVSVSSTDGFNPTRSVTLRN
        PV+L VDL G   W DC+  +VSS+                              + A C           +A GEL SD++SV S        S    +
Subjt:  PVRLTVDLGGKFTWVDCARSYVSSTY-----------------------------KPARCS----------SAYGELASDLVSVSSTDGFNPTRSVTLRN

Query:  FLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSP-------SCKHPGYFIAVE------SILVGS--------RSVPLNATLL
         LF C   +LL GLA GA G+   GR QIS+P+Q AA  +  R+L + LSP       S     + +A         +L GS        +S+ +N   L
Subjt:  FLFVCATTFLLEGLAGGATGMASFGRNQISIPTQFAAAFSFNRKLAICLSP-------SCKHPGYFIAVE------SILVGS--------RSVPLNATLL

Query:  QIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDV----PVAPFKACFRAEGFSGTRVGPGVPWIDLDVWGE-----FYGAG-----NDGVLCLGFV
         ++     ++STV PYT+LESSIY      +A   G+     PVAPF  CF ++           P +DL +  E      +G         GV C G V
Subjt:  QIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDV----PVAPFKACFRAEGFSGTRVGPGVPWIDLDVWGE-----FYGAG-----NDGVLCLGFV

Query:  DCGANARTSIVIGAHQIEDHLLEFDLGTSRFGF
        D G++    IV+G  Q+E  +L+FDLG S  GF
Subjt:  DCGANARTSIVIGAHQIEDHLLEFDLGTSRFGF
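
The alignments above follow the usual BLAST pairwise layout: in the unlabeled middle row, a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or a gap. As a rough sketch of how one such row can be tallied, the Python below counts these three classes. The function name `midline_stats` is hypothetical and not part of CuGenDB or BLAST, and the reported % identity values are computed over the whole alignment, so a single row will not reproduce them.

```python
def midline_stats(midline: str) -> dict:
    """Tally one BLAST-style middle row (hypothetical helper for illustration).

    Letters are identical residues, '+' are conservative substitutions,
    and blanks are mismatches or gap columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return {
        "identities": identities,
        "positives": identities + conservative,  # BLAST "positives" include identities
        "columns": len(midline),
    }


# Middle row of the final block of the first GenBank hit (query TNCANFNFTSK).
stats = midline_stats("T CANFNFTSK")
print(stats)
```

Summing `identities` and `columns` over every row of one hit and dividing would give a per-hit identity fraction, assuming the rows are copied with their leading and trailing blanks intact.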


Sequences
CDS sequence
ATGTTTGGCGCAACCTCATTCCGGCCCAAGGGCCTGGTTCTCCCTCAGCGGACTCCACTAGTTCCTGTGAGGCTGACCGTGGACCTAGGCGGGAAGTTCACGTGGGTGGA
CTGTGCCCGCAGCTACGTTTCTTCCACCTACAAGCCTGCCCGGTGCAGCTCGGCTTACGGCGAACTCGCTTCCGACCTCGTTTCTGTTTCCTCCACTGACGGCTTTAATC
CCACCAGGTCGGTAACTTTACGGAATTTCCTATTCGTCTGCGCTACAACGTTTCTCCTAGAAGGCCTAGCCGGCGGCGCCACCGGAATGGCCAGTTTCGGAAGGAACCAA
ATCTCAATACCCACCCAATTCGCCGCCGCCTTCAGCTTCAACCGAAAACTCGCCATTTGCCTTTCCCCCTCATGCAAACACCCCGGCTATTTCATCGCCGTCGAATCCAT
CCTCGTCGGCTCCAGATCCGTCCCCCTCAACGCCACCCTCCTTCAAATTGACCCCACCGGCGGCACCAAAATCAGCACTGTCAATCCCTACACCGTGTTGGAATCCTCGA
TCTATGACGCCGTCGTGAGAACCTTCGCCGCCGTGGTCGGGGACGTTCCGGTGGCGCCGTTCAAGGCGTGTTTTAGGGCGGAGGGGTTTTCGGGTACTCGGGTCGGGCCG
GGTGTGCCGTGGATTGATTTAGATGTTTGGGGGGAATTCTATGGTGCGGGAAACGACGGCGTTTTGTGCTTGGGATTTGTGGATTGTGGGGCCAACGCGCGAACCTCGAT
TGTGATTGGGGCGCACCAGATTGAGGACCATTTGCTTGAATTTGATTTGGGCACTTCCAGATTTGGATTTAGTTCCACTCTTTTGGCTAGAAGGACTAATTGTGCTAATT
TTAACTTTACTTCTAAAACTTGA
mRNA sequence
ATGTTTGGCGCAACCTCATTCCGGCCCAAGGGCCTGGTTCTCCCTCAGCGGACTCCACTAGTTCCTGTGAGGCTGACCGTGGACCTAGGCGGGAAGTTCACGTGGGTGGA
CTGTGCCCGCAGCTACGTTTCTTCCACCTACAAGCCTGCCCGGTGCAGCTCGGCTTACGGCGAACTCGCTTCCGACCTCGTTTCTGTTTCCTCCACTGACGGCTTTAATC
CCACCAGGTCGGTAACTTTACGGAATTTCCTATTCGTCTGCGCTACAACGTTTCTCCTAGAAGGCCTAGCCGGCGGCGCCACCGGAATGGCCAGTTTCGGAAGGAACCAA
ATCTCAATACCCACCCAATTCGCCGCCGCCTTCAGCTTCAACCGAAAACTCGCCATTTGCCTTTCCCCCTCATGCAAACACCCCGGCTATTTCATCGCCGTCGAATCCAT
CCTCGTCGGCTCCAGATCCGTCCCCCTCAACGCCACCCTCCTTCAAATTGACCCCACCGGCGGCACCAAAATCAGCACTGTCAATCCCTACACCGTGTTGGAATCCTCGA
TCTATGACGCCGTCGTGAGAACCTTCGCCGCCGTGGTCGGGGACGTTCCGGTGGCGCCGTTCAAGGCGTGTTTTAGGGCGGAGGGGTTTTCGGGTACTCGGGTCGGGCCG
GGTGTGCCGTGGATTGATTTAGATGTTTGGGGGGAATTCTATGGTGCGGGAAACGACGGCGTTTTGTGCTTGGGATTTGTGGATTGTGGGGCCAACGCGCGAACCTCGAT
TGTGATTGGGGCGCACCAGATTGAGGACCATTTGCTTGAATTTGATTTGGGCACTTCCAGATTTGGATTTAGTTCCACTCTTTTGGCTAGAAGGACTAATTGTGCTAATT
TTAACTTTACTTCTAAAACTTGA
Protein sequence
MFGATSFRPKGLVLPQRTPLVPVRLTVDLGGKFTWVDCARSYVSSTYKPARCSSAYGELASDLVSVSSTDGFNPTRSVTLRNFLFVCATTFLLEGLAGGATGMASFGRNQ
ISIPTQFAAAFSFNRKLAICLSPSCKHPGYFIAVESILVGSRSVPLNATLLQIDPTGGTKISTVNPYTVLESSIYDAVVRTFAAVVGDVPVAPFKACFRAEGFSGTRVGP
GVPWIDLDVWGEFYGAGNDGVLCLGFVDCGANARTSIVIGAHQIEDHLLEFDLGTSRFGFSSTLLARRTNCANFNFTSKT
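
The CDS and protein sequences above are related by translation. The following is a minimal Python sketch, assuming the standard genetic code and a CDS that starts in frame at its first base. The `translate` helper is hypothetical and is not how CuGenDB derives its protein records.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 24 bases and last 12 bases of the CDS listed above.
print(translate("ATGTTTGGCGCAACCTCATTCCGG"))  # MFGATSFR
print(translate("TCTAAAACTTGA"))              # SKT
```

The two printed fragments match the start (MFGATSFR) and end (SKT) of the protein sequence listed above; passing the full wrapped CDS text works the same way because whitespace is removed first.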