; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G001070 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G001070
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionaspartic proteinase PCS1-like
Genome locationCmo_Chr15:519475..525423
RNA-Seq ExpressionCmoCh15G001070
SyntenyCmoCh15G001070
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8539480.1 hypothetical protein F0562_026172 [Nyssa sinensis]6.7e-26374.84Show/hide
Query:  LLQLLICCVSFKQGLCFSATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSP
        LLQLL+ C+  +   C S+T T++LPLKT +   G   +P NKLSFHHNV+LTV+LT+G+PPQPVTMV+DTGSELSWL+CKKTPN  S+F+PL SSSYSP
Subjt:  LLQLLICCVSFKQGLCFSATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSP

Query:  VPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS
        +PC+SP CRTRTRD   PV+CDPKKLCH  +SYADASS+EGNLASDTF +G+S  PGT FG MDSG SSN EED+KTTGL+GMNRGSLSFVTQ+  PKFS
Subjt:  VPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS

Query:  YCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFV
        YCISGRDSSG+LLFG+AS  WL  L YTPLVQ+STPLPY+DRVAYTVQL+GI+V  K+LA+PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYT L+NEF+
Subjt:  YCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFV

Query:  VQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEF
         QTKG+L  L DPNFVFQGAMDLCYRV   +  LPPLP VSLMFRGAEM V GE LMY+VPG  RG D V+C TFGNSDLLGIEA+VIGHHHQQN+WMEF
Subjt:  VQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEF

Query:  DLVKSRVGFVET-------RCDLAA--------------VATTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNS
        DLVKSRVG  E        R DL+                     MSFVFRGTR  DIE G  GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFM+LNS
Subjt:  DLVKSRVGFVET-------RCDLAA--------------VATTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNS

Query:  HQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQAR--AMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAP
        HQM PNFLLWLV G+FLMAT+LRMYATCQQLQAQAQA+  A AASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P
Subjt:  HQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQAR--AMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAP

Query:  TTPSMSEEQINALPVHKYKVSGPQ
        T+PSMSEE+INALPVHKYKV+ PQ
Subjt:  TTPSMSEEQINALPVHKYKVSGPQ

KAF4351959.1 hypothetical protein G4B88_020587 [Cannabis sativa]3.2e-27360.77Show/hide
Query:  LRLLQLLIC--CVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYS
        L+L+ +++C            S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+GSPPQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYS
Subjt:  LRLLQLLIC--CVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYS

Query:  PVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKF
        PVPC+S +CRT+TRD   PV+CDPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTGL+GMNRGSLSFV+Q+GL KF
Subjt:  PVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKF

Query:  SYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEF
        SYCISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPYYDRVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF
Subjt:  SYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEF

Query:  VVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWME
          QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+VPGM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+E
Subjt:  VVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWME

Query:  FDLVKSRVGFVETRCDLAA------------------------------------------VATTA----------------------------------
        FDL KSRVG  E RCDLA+                                          V  T                                   
Subjt:  FDLVKSRVGFVETRCDLAA------------------------------------------VATTA----------------------------------

Query:  -------------------------------------------------------------------------------DMSFVFRGTRVPDIENGLSGF
                                                                                        MSFVFRGTR  DIE+G +GF
Subjt:  -------------------------------------------------------------------------------DMSFVFRGTRVPDIENGLSGF

Query:  IPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRG
        +PERR MR+H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQA A AA GLLGHTELRLH+PPSI+LATRG
Subjt:  IPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRG

Query:  RLQGLRLQLALLDREFDDL--DYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCL
        RLQGLRLQLALLDREFDDL  DYETLRALD+DNAPT  SM+EE+INALPVHKYKV   QS  S  QQASSS S EK Q S +AVG+ KASEDELTCSVCL
Subjt:  RLQGLRLQLALLDREFDDL--DYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCL

Query:  EQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        EQVNVGE+         FHANCIDPWLRQQGTCPVCKF+A S W E G+GE DA
Subjt:  EQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

KAF4354473.1 hypothetical protein G4B88_019942 [Cannabis sativa]1.3e-28266.2Show/hide
Query:  LRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPV
        L+L+ +++C    +     S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+GSPPQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYSPV
Subjt:  LRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPV

Query:  PCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSY
        PC+S +CRT+TRD   PV+CDPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTGL+GMNRGSLSFV+Q+GL KFSY
Subjt:  PCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSY

Query:  CISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVV
        CISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPYYDRVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF  
Subjt:  CISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVV

Query:  QTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFD
        QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+VPGM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+EFD
Subjt:  QTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFD

Query:  LVKSRVGFVETRCDLAA------------------------------------VATTA------------------------------------------
        L KSRVG  E RCDLA+                                    VA  A                                          
Subjt:  LVKSRVGFVETRCDLAA------------------------------------VATTA------------------------------------------

Query:  -----------DMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA
                    MSFVFRGTR  DIE+G +GF+PERR MR+H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA
Subjt:  -----------DMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA

Query:  QAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASS
        QAQA A AA GLLGHTELRLH+PPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALD+DNAPT  SM+EE+INALPVHKYKV+  Q+  S  QQASS
Subjt:  QAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASS

Query:  SESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        S S EK Q S +AVG+ KASEDELTCSVCLEQVNVGE+         FHANCIDPWLRQQGTCPVCKF+A S W E G+GE DA
Subjt:  SESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

RXI08936.1 hypothetical protein DVH24_023080 [Malus domestica]2.3e-26365.54Show/hide
Query:  LRLLQLLICCVSFKQGLCFS--ATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSS
        L LLQLL         LCFS    +T++LPLKTQ    G   + +NKLSFHHNVTLT+SL++GSPPQ VTMVLDTGSELSWL CKK PN NSVFNPL+S 
Subjt:  LRLLQLLICCVSFKQGLCFS--ATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSS

Query:  SYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGL
        SYSPVPC+SPVCRTRTRD P PV+CDPKKLCH  +SY DASS+EGNLA +TF +GSSAQPGT FGCMDSG SSN+EEDAKTTGLMGMNRGSLSFVTQ+G 
Subjt:  SYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGL

Query:  PKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK
        PKFSYCISGRDSSGVLLFG+A   WL  L YTPLV +STPLPY+DRVAYTVQL+GIRVG K+L LPKS+F PDH+GAGQTMVDSGTQFTFLLGPVYTALK
Subjt:  PKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK

Query:  NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNV
         EF  QTK +L  L DPNFVFQGA+DLC++VP  +  LP LP V+LMFRGAEM V GE L+Y+VPGMVRGG+QV+C T+GNSDLLGIEAFVIGH+HQQNV
Subjt:  NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNV

Query:  WMEFDLVKSRVGFVE--------------------------------------------------------TRCDLAAVATTADMSFVFRGTRVPDIENG
        WMEFDL KSRVG  E                                                         R     +  + +MSFVFRGTR  DIE+G
Subjt:  WMEFDLVKSRVGFVE--------------------------------------------------------TRCDLAAVATTADMSFVFRGTRVPDIENG

Query:  LSGFIPERRAMRVHAARPVNSNSL-------------AFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGH
           FIPERRAM +    P N + L             A  +   LLF+            LWLVLGVFLMATTLRMYATCQQLQAQAQ  A AASGLLGH
Subjt:  LSGFIPERRAMRVHAARPVNSNSL-------------AFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGH

Query:  TELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVG
        TELRL MPPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P   SMSEE+INALPVHKYK  GPQ+  S  QQASSS  +E  Q++ +AVG
Subjt:  TELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVG

Query:  STKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        STKA EDELTCSVCLEQV VGEL         FHA+CIDPWL+QQGTCPVCKFRA SG  E GQ   DA
Subjt:  STKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

XP_022938661.1 aspartic proteinase PCS1-like [Cucurbita moschata]5.4e-244100Show/hide
Query:  MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSS
        MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSS
Subjt:  MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSS

Query:  YSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP
        YSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP
Subjt:  YSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP

Query:  KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN
        KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN
Subjt:  KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN

Query:  EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVW
        EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVW
Subjt:  EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVW

Query:  MEFDLVKSRVGFVETRCDLA
        MEFDLVKSRVGFVETRCDLA
Subjt:  MEFDLVKSRVGFVETRCDLA

TrEMBL top hitse value%identityAlignment
A0A498KN45 Uncharacterized protein1.1e-26365.54Show/hide
Query:  LRLLQLLICCVSFKQGLCFS--ATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSS
        L LLQLL         LCFS    +T++LPLKTQ    G   + +NKLSFHHNVTLT+SL++GSPPQ VTMVLDTGSELSWL CKK PN NSVFNPL+S 
Subjt:  LRLLQLLICCVSFKQGLCFS--ATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSS

Query:  SYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGL
        SYSPVPC+SPVCRTRTRD P PV+CDPKKLCH  +SY DASS+EGNLA +TF +GSSAQPGT FGCMDSG SSN+EEDAKTTGLMGMNRGSLSFVTQ+G 
Subjt:  SYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGL

Query:  PKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK
        PKFSYCISGRDSSGVLLFG+A   WL  L YTPLV +STPLPY+DRVAYTVQL+GIRVG K+L LPKS+F PDH+GAGQTMVDSGTQFTFLLGPVYTALK
Subjt:  PKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK

Query:  NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNV
         EF  QTK +L  L DPNFVFQGA+DLC++VP  +  LP LP V+LMFRGAEM V GE L+Y+VPGMVRGG+QV+C T+GNSDLLGIEAFVIGH+HQQNV
Subjt:  NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNV

Query:  WMEFDLVKSRVGFVE--------------------------------------------------------TRCDLAAVATTADMSFVFRGTRVPDIENG
        WMEFDL KSRVG  E                                                         R     +  + +MSFVFRGTR  DIE+G
Subjt:  WMEFDLVKSRVGFVE--------------------------------------------------------TRCDLAAVATTADMSFVFRGTRVPDIENG

Query:  LSGFIPERRAMRVHAARPVNSNSL-------------AFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGH
           FIPERRAM +    P N + L             A  +   LLF+            LWLVLGVFLMATTLRMYATCQQLQAQAQ  A AASGLLGH
Subjt:  LSGFIPERRAMRVHAARPVNSNSL-------------AFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGH

Query:  TELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVG
        TELRL MPPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P   SMSEE+INALPVHKYK  GPQ+  S  QQASSS  +E  Q++ +AVG
Subjt:  TELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVG

Query:  STKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        STKA EDELTCSVCLEQV VGEL         FHA+CIDPWL+QQGTCPVCKFRA SG  E GQ   DA
Subjt:  STKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

A0A5J5BAH9 Peptidase A1 domain-containing protein3.3e-26374.84Show/hide
Query:  LLQLLICCVSFKQGLCFSATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSP
        LLQLL+ C+  +   C S+T T++LPLKT +   G   +P NKLSFHHNV+LTV+LT+G+PPQPVTMV+DTGSELSWL+CKKTPN  S+F+PL SSSYSP
Subjt:  LLQLLICCVSFKQGLCFSATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSP

Query:  VPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS
        +PC+SP CRTRTRD   PV+CDPKKLCH  +SYADASS+EGNLASDTF +G+S  PGT FG MDSG SSN EED+KTTGL+GMNRGSLSFVTQ+  PKFS
Subjt:  VPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS

Query:  YCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFV
        YCISGRDSSG+LLFG+AS  WL  L YTPLVQ+STPLPY+DRVAYTVQL+GI+V  K+LA+PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYT L+NEF+
Subjt:  YCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFV

Query:  VQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEF
         QTKG+L  L DPNFVFQGAMDLCYRV   +  LPPLP VSLMFRGAEM V GE LMY+VPG  RG D V+C TFGNSDLLGIEA+VIGHHHQQN+WMEF
Subjt:  VQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEF

Query:  DLVKSRVGFVET-------RCDLAA--------------VATTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNS
        DLVKSRVG  E        R DL+                     MSFVFRGTR  DIE G  GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFM+LNS
Subjt:  DLVKSRVGFVET-------RCDLAA--------------VATTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNS

Query:  HQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQAR--AMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAP
        HQM PNFLLWLV G+FLMAT+LRMYATCQQLQAQAQA+  A AASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P
Subjt:  HQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQAR--AMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAP

Query:  TTPSMSEEQINALPVHKYKVSGPQ
        T+PSMSEE+INALPVHKYKV+ PQ
Subjt:  TTPSMSEEQINALPVHKYKVSGPQ

A0A6J1FDS6 aspartic proteinase PCS1-like2.6e-244100Show/hide
Query:  MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSS
        MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSS
Subjt:  MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSS

Query:  YSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP
        YSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP
Subjt:  YSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP

Query:  KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN
        KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN
Subjt:  KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN

Query:  EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVW
        EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVW
Subjt:  EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVW

Query:  MEFDLVKSRVGFVETRCDLA
        MEFDLVKSRVGFVETRCDLA
Subjt:  MEFDLVKSRVGFVETRCDLA

A0A7J6E2Q5 Uncharacterized protein1.6e-27360.77Show/hide
Query:  LRLLQLLIC--CVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYS
        L+L+ +++C            S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+GSPPQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYS
Subjt:  LRLLQLLIC--CVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYS

Query:  PVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKF
        PVPC+S +CRT+TRD   PV+CDPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTGL+GMNRGSLSFV+Q+GL KF
Subjt:  PVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKF

Query:  SYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEF
        SYCISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPYYDRVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF
Subjt:  SYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEF

Query:  VVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWME
          QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+VPGM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+E
Subjt:  VVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWME

Query:  FDLVKSRVGFVETRCDLAA------------------------------------------VATTA----------------------------------
        FDL KSRVG  E RCDLA+                                          V  T                                   
Subjt:  FDLVKSRVGFVETRCDLAA------------------------------------------VATTA----------------------------------

Query:  -------------------------------------------------------------------------------DMSFVFRGTRVPDIENGLSGF
                                                                                        MSFVFRGTR  DIE+G +GF
Subjt:  -------------------------------------------------------------------------------DMSFVFRGTRVPDIENGLSGF

Query:  IPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRG
        +PERR MR+H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQA A AA GLLGHTELRLH+PPSI+LATRG
Subjt:  IPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRG

Query:  RLQGLRLQLALLDREFDDL--DYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCL
        RLQGLRLQLALLDREFDDL  DYETLRALD+DNAPT  SM+EE+INALPVHKYKV   QS  S  QQASSS S EK Q S +AVG+ KASEDELTCSVCL
Subjt:  RLQGLRLQLALLDREFDDL--DYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCL

Query:  EQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        EQVNVGE+         FHANCIDPWLRQQGTCPVCKF+A S W E G+GE DA
Subjt:  EQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

A0A7J6E7N2 Uncharacterized protein6.3e-28366.2Show/hide
Query:  LRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPV
        L+L+ +++C    +     S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+GSPPQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYSPV
Subjt:  LRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPV

Query:  PCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSY
        PC+S +CRT+TRD   PV+CDPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTGL+GMNRGSLSFV+Q+GL KFSY
Subjt:  PCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSY

Query:  CISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVV
        CISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPYYDRVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF  
Subjt:  CISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVV

Query:  QTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFD
        QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+VPGM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+EFD
Subjt:  QTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFD

Query:  LVKSRVGFVETRCDLAA------------------------------------VATTA------------------------------------------
        L KSRVG  E RCDLA+                                    VA  A                                          
Subjt:  LVKSRVGFVETRCDLAA------------------------------------VATTA------------------------------------------

Query:  -----------DMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA
                    MSFVFRGTR  DIE+G +GF+PERR MR+H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA
Subjt:  -----------DMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA

Query:  QAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASS
        QAQA A AA GLLGHTELRLH+PPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALD+DNAPT  SM+EE+INALPVHKYKV+  Q+  S  QQASS
Subjt:  QAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASS

Query:  SESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        S S EK Q S +AVG+ KASEDELTCSVCLEQVNVGE+         FHANCIDPWLRQQGTCPVCKF+A S W E G+GE DA
Subjt:  SESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

SwissProt top hitse value%identityAlignment
Q766C2 Aspartic proteinase nepenthesin-21.3e-3831.34Show/hide
Query:  VSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNS----VFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFR
        +++ +G+P    + ++DTGS+L W  C+      S    +FNP  SSS+S +PC S  C    +DLP+  TC+  + C     Y D S+ +G +A++TF 
Subjt:  VSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNS----VFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFR

Query:  VGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTV
          +S+ P   FGC   G  +         GL+GM  G LS  +QLG+ +FSYC++  G  S   L  G A+         T L+  S    Y     Y +
Subjt:  VGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTV

Query:  QLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGA
         L GI VG   L +P S F     G G  ++DSGT  T+L    Y A+   F   T  I +P  D +      +  C++ P   G    +P +S+ F G 
Subjt:  QLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGA

Query:  EMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
         + +G + +      ++   + V CL  G+S  LGI  F  G+  QQ   + +DL    V FV T+C
Subjt:  EMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC

Q766C3 Aspartic proteinase nepenthesin-15.3e-3730.16Show/hide
Query:  VSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNL----NSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFR
        ++L++G+P QP + ++DTGS+L W  C+           +FNP  SSS+S +PC+S +C+     L +P TC     C     Y D S  +G++ ++T  
Subjt:  VSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNL----NSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFR

Query:  VGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTV
         GS + P   FGC   G ++         GL+GM RG LS  +QL + KFSYC++  G  +   LL G  + S       T L+Q S+ +P +    Y +
Subjt:  VGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTV

Query:  QLDGIRVGNKILALPKSIFA-PDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRG
         L+G+ VG+  L +  S FA   + G G  ++DSGT  T+ +   Y +++ EF+ Q   I +P+ + +       DLC++ P     L  +P   + F G
Subjt:  QLDGIRVGNKILALPKSIFA-PDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRG

Query:  AEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
         ++ +  E         +   + + CL  G+S   G+  F  G+  QQN+ + +D   S V F   +C
Subjt:  AEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC

Q9LNJ3 Aspartyl protease family protein 23.4e-3131.89Show/hide
Query:  LTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNS----VFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVG
        L +G+P + V MVLDTGS++ WL C       S    +F+P  S +Y+ +PC+SP CR     L +      +K C   VSY D S   G+ +++T    
Subjt:  LTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNS----VFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVG

Query:  SSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCISGRDSS---GVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAY
         +   G   GC       N        GL+G+ +G LSF  Q G     KFSYC+  R +S     ++FG+A++S +    +TPL+      P  D   Y
Subjt:  SSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCISGRDSS---GVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAY

Query:  TVQLDGIRV-GNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMF
         V L GI V G ++  +  S+F  D  G G  ++DSGT  T L+ P Y A+++ F V  K +      P+F      D C+ +         +P V L F
Subjt:  TVQLDGIRV-GNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMF

Query:  RGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
        RGA+  V      Y +P    G     C  F  + + G+   +IG+  QQ   + +DL  SRVGF    C
Subjt:  RGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC

Q9LZL3 Aspartic proteinase PCS11.8e-14661.26Show/hide
Query:  SFKQGLCFSATQTMVLPLKTQMGVTS-RPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSV--FNPLSSSSYSPVPCASPVCR
        SF      S++QT+VLPLKT++  T  RP++KL FHHNVTLTV+LT+G+PPQ ++MV+DTGSELSWL C ++ N N V  F+P  SSSYSP+PC+SP CR
Subjt:  SFKQGLCFSATQTMVLPLKTQMGVTS-RPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSV--FNPLSSSSYSPVPCASPVCR

Query:  TRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGT-FFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS
        TRTRD   P +CD  KLCH  +SYADASS EGNLA++ F  G+S       FGCM S   S+ EED KTTGL+GMNRGSLSF++Q+G PKFSYCISG D 
Subjt:  TRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGT-FFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS

Query:  -SGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGIL
          G LL GD++ +WL  L YTPL+++STPLPY+DRVAYTVQL GI+V  K+L +PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYTAL++ F+ +T GIL
Subjt:  -SGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGIL

Query:  VPLGDPNFVFQGAMDLCYRVPE---KQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVK
            DP+FVFQG MDLCYR+     + G L  LP VSL+F GAE+ V G+ L+Y+VP +  G D V+C TFGNSDL+G+EA+VIGHHHQQN+W+EFDL +
Subjt:  VPLGDPNFVFQGAMDLCYRVPE---KQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVK

Query:  SRVGFVETRCDLA
        SR+G     CD++
Subjt:  SRVGFVETRCDLA

Q9M2S6 E3 ubiquitin-protein ligase SDIR11.2e-10578.02Show/hide
Query:  MSFVFRGTRVPDIENGLS-GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASG
        MSFVFRG+R  D+E+G S GF+PERRAMRVH ARPVNSNSLAFLVTVLLLFM+LNSHQM PNFLLWLVLGVFLMATTLRMYATCQQLQA AQA+A AASG
Subjt:  MSFVFRGTRVPDIENGLS-GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASG

Query:  LLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSA
        L  HTELRLH+PPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN  TT SMSEE+INALPVHKYKV  P++  S+ +QAS+S S EK  DSA
Subjt:  LLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSA

Query:  NAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        N   S K +EDELTCSVCLEQV VGE+         FHA CIDPWLRQQGTCPVCKFRA SGW EQ + + DA
Subjt:  NAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

Arabidopsis top hitse value%identityAlignment
AT1G66180.1 Eukaryotic aspartyl protease family protein6.4e-6236.36Show/hide
Query:  TSRPSN-KLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHC---KKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVS
        +S P N +  F +++ L +SL +G+PPQ   MVLDTGS+LSW+ C   K  P   + F+P  SSS+S +PC+ P+C+ R  D   P +CD  +LCH    
Subjt:  TSRPSN-KLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHC---KKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVS

Query:  YADASSLEGNLASDTFRVGSS-AQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI------SGRDSSGVLLFGDASLSWLGNL
        YAD +  EGNL  +     ++   P    GC        + E +   G++GMNRG LSFV+Q  + KFSYCI       G   +G    GD   S     
Subjt:  YADASSLEGNLASDTFRVGSS-AQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI------SGRDSSGVLLFGDASLSWLGNL

Query:  TYTPLVQM--STPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDL
         Y  L+    S  +P  D +AYTV + GIR G K L +  S+F PD  G+GQTMVDSG++FT L+   Y  ++ E + +    L       +V+ G  D+
Subjt:  TYTPLVQM--STPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDL

Query:  CYRVPEKQGKLPPLP-----VVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
        C+      G +  +P     +V +  RG E++V  E ++  V      G  +HC+  G S +LG  + +IG+ HQQN+W+EFD+   RVGF +  C
Subjt:  CYRVPEKQGKLPPLP-----VVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC

AT2G39710.1 Eukaryotic aspartyl protease family protein2.3e-16870.52Show/hide
Query:  FLRLLQLLICCVSFKQGLC--FSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSY
        FLR+  LL+    F    C   S  QT++  LKTQ  +    S+KLSF HNVTLTV+L +G PPQ ++MVLDTGSELSWLHCKK+PNL SVFNP+SSS+Y
Subjt:  FLRLLQLLICCVSFKQGLC--FSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSY

Query:  SPVPCASPVCRTRTRDLPNPVTCDPK-KLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP
        SPVPC+SP+CRTRTRDLP P +CDPK  LCHV +SYADA+S+EGNLA +TF +GS  +PGT FGCMDSG SSNSEEDAK+TGLMGMNRGSLSFV QLG  
Subjt:  SPVPCASPVCRTRTRDLPNPVTCDPK-KLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP

Query:  KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN
        KFSYCISG DSSG LL GDAS SWLG + YTPLV  STPLPY+DRVAYTVQL+GIRVG+KIL+LPKS+F PDHTGAGQTMVDSGTQFTFL+GPVYTALKN
Subjt:  KFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN

Query:  EFVVQTKGILVPLGDPNFVFQGAMDLCYRV-PEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGM-VRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQN
        EF+ QTK +L  + DP+FVFQG MDLCY+V    +     LP+VSLMFRGAEM V G+ L+Y+V G    G ++V+C TFGNSDLLGIEAFVIGHHHQQN
Subjt:  EFVVQTKGILVPLGDPNFVFQGAMDLCYRV-PEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGM-VRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQN

Query:  VWMEFDLVKSRVGFV-ETRCDLAA
        VWMEFDL KSRVGF    RCDLA+
Subjt:  VWMEFDLVKSRVGFV-ETRCDLAA

AT3G55530.1 RING/U-box superfamily protein8.6e-10778.02Show/hide
Query:  MSFVFRGTRVPDIENGLS-GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASG
        MSFVFRG+R  D+E+G S GF+PERRAMRVH ARPVNSNSLAFLVTVLLLFM+LNSHQM PNFLLWLVLGVFLMATTLRMYATCQQLQA AQA+A AASG
Subjt:  MSFVFRGTRVPDIENGLS-GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASG

Query:  LLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSA
        L  HTELRLH+PPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN  TT SMSEE+INALPVHKYKV  P++  S+ +QAS+S S EK  DSA
Subjt:  LLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSA

Query:  NAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA
        N   S K +EDELTCSVCLEQV VGE+         FHA CIDPWLRQQGTCPVCKFRA SGW EQ + + DA
Subjt:  NAVGSTKASEDELTCSVCLEQVNVGEL---------FHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA

AT5G02190.1 Eukaryotic aspartyl protease family protein1.3e-14761.26Show/hide
Query:  SFKQGLCFSATQTMVLPLKTQMGVTS-RPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSV--FNPLSSSSYSPVPCASPVCR
        SF      S++QT+VLPLKT++  T  RP++KL FHHNVTLTV+LT+G+PPQ ++MV+DTGSELSWL C ++ N N V  F+P  SSSYSP+PC+SP CR
Subjt:  SFKQGLCFSATQTMVLPLKTQMGVTS-RPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSV--FNPLSSSSYSPVPCASPVCR

Query:  TRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGT-FFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS
        TRTRD   P +CD  KLCH  +SYADASS EGNLA++ F  G+S       FGCM S   S+ EED KTTGL+GMNRGSLSF++Q+G PKFSYCISG D 
Subjt:  TRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGT-FFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS

Query:  -SGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGIL
          G LL GD++ +WL  L YTPL+++STPLPY+DRVAYTVQL GI+V  K+L +PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYTAL++ F+ +T GIL
Subjt:  -SGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGIL

Query:  VPLGDPNFVFQGAMDLCYRVPE---KQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVK
            DP+FVFQG MDLCYR+     + G L  LP VSL+F GAE+ V G+ L+Y+VP +  G D V+C TFGNSDL+G+EA+VIGHHHQQN+W+EFDL +
Subjt:  VPLGDPNFVFQGAMDLCYRVPE---KQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVK

Query:  SRVGFVETRCDLA
        SR+G     CD++
Subjt:  SRVGFVETRCDLA

AT5G37540.1 Eukaryotic aspartyl protease family protein9.0e-6437.06Show/hide
Query:  SRPSNKLSFHHNV----TLTVSLTLGSPPQPVTMVLDTGSELSWLHC------KKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLC
        S PS+  +F  N+     L +SL +G+P Q   +VLDTGS+LSW+ C      K  P   + F+P  SSS+S +PC+ P+C+ R  D   P +CD  +LC
Subjt:  SRPSNKLSFHHNV----TLTVSLTLGSPPQPVTMVLDTGSELSWLHC------KKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLC

Query:  HVFVSYADASSLEGNLASDTFRVGSS-AQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRD------SSGVLLFGD----
        H    YAD +  EGNL  + F   +S   P    GC        ++E     G++GMN G LSF++Q  + KFSYCI  R       S+G    GD    
Subjt:  HVFVSYADASSLEGNLASDTFRVGSS-AQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRD------SSGVLLFGD----

Query:  ASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFV
            ++  LT+      S  +P  D +AYTV L GIR+G K L +P S+F PD  G+GQTMVDSG++FT L+   Y  +K E +V+  G  +  G   +V
Subjt:  ASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFV

Query:  FQGAMDLCYRVPEKQ--GKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
        +    D+C+        G+L    +  L+F   E   G E+L+ K   +V  G  +HC+  G S +LG  + +IG+ HQQN+W+EFD+   RVGF +  C
Subjt:  FQGAMDLCYRVPEKQ--GKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC

Query:  DL
         L
Subjt:  DL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTTCTTCCTCCGCCTGCTGCAGCTTCTTATCTGCTGCGTCTCTTTCAAACAGGGCCTCTGTTTTTCTGCGACTCAGACCATGGTTTTGCCCCTCAAAACACAGAT
GGGTGTCACTTCTCGGCCTTCCAATAAGCTCAGTTTTCACCATAATGTCACTTTGACTGTTTCCTTAACGCTTGGCTCGCCTCCTCAACCCGTTACTATGGTTCTCGATA
CAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAAACCCCAAATTTGAACTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCCAGTCCCCTGTGCTTCCCCTGTT
TGCCGGACCCGAACCCGAGATTTACCCAACCCGGTTACCTGCGACCCAAAGAAACTCTGCCACGTCTTTGTCTCTTATGCCGACGCCTCGTCGCTCGAGGGTAATCTCGC
GTCGGATACGTTTCGAGTCGGGTCATCGGCTCAACCCGGAACTTTTTTTGGGTGTATGGATTCGGGTTTCAGTTCGAATTCGGAGGAGGACGCGAAGACCACTGGGCTGA
TGGGGATGAACAGGGGCTCGCTCTCGTTTGTCACCCAATTGGGTTTGCCCAAATTCTCTTATTGCATATCGGGTCGTGATTCTTCTGGGGTTCTGCTTTTCGGCGACGCG
AGTCTTTCTTGGCTTGGGAATTTGACCTACACGCCTTTGGTTCAAATGTCTACGCCATTGCCGTATTACGACCGAGTCGCCTACACGGTCCAACTAGACGGAATCAGAGT
AGGGAACAAAATTCTGGCACTCCCGAAGTCAATATTCGCACCAGACCACACCGGCGCCGGGCAAACCATGGTAGATTCAGGGACCCAGTTCACGTTTCTTCTGGGACCAG
TGTACACGGCTTTAAAGAACGAGTTTGTGGTACAAACGAAGGGCATTTTGGTCCCACTGGGTGATCCAAATTTCGTGTTTCAAGGAGCGATGGACTTGTGCTACAGAGTA
CCCGAGAAACAGGGGAAACTGCCGCCACTGCCGGTAGTGAGTCTGATGTTCCGTGGGGCGGAGATGGTGGTTGGCGGAGAGGTGCTAATGTATAAAGTACCGGGAATGGT
AAGGGGTGGTGACCAAGTGCATTGCTTGACGTTTGGGAATTCGGATTTGTTAGGAATAGAAGCATTTGTGATTGGGCATCATCATCAACAAAACGTGTGGATGGAATTCG
ACTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGATTTGGCGGCTGTCGCAACTACTGCAGACATGAGTTTTGTTTTCCGTGGAACAAGAGTTCCAGACATA
GAAAATGGTCTATCTGGATTTATACCTGAACGGCGTGCAATGCGGGTTCATGCAGCACGTCCTGTTAACTCAAACTCACTTGCCTTTCTTGTCACAGTCCTTTTGTTGTT
CATGATGTTAAATTCTCACCAGATGTCACCGAACTTTCTGCTCTGGCTTGTGCTTGGTGTGTTTTTGATGGCTACAACATTAAGGATGTATGCGACCTGTCAACAACTTC
AGGCTCAAGCCCAAGCTAGAGCTATGGCAGCCAGTGGCCTTCTTGGGCACACTGAATTGAGGTTACATATGCCACCATCAATAGCACTTGCTACTAGAGGACGTTTACAA
GGGCTAAGGCTTCAACTTGCTCTGCTTGATCGGGAATTTGATGATTTAGATTATGAAACTTTGAGAGCTTTGGATTCTGACAATGCTCCGACAACACCTTCTATGAGTGA
GGAACAAATAAATGCTCTTCCTGTTCATAAATACAAGGTTTCTGGTCCTCAAAGCGACCCCTCTGTGAACCAGCAGGCTTCATCTTCAGAGTCTAATGAGAAGAGACAAG
ATTCAGCTAATGCAGTTGGCAGTACCAAGGCCTCGGAGGATGAACTTACGTGCAGTGTTTGCTTGGAGCAAGTAAATGTTGGTGAACTCTTCCATGCCAACTGCATAGAT
CCATGGCTGCGACAGCAGGGCACGTGCCCTGTTTGTAAATTCAGAGCGGTGTCTGGGTGGTCAGAACAGGGACAAGGGGAAACCGATGCGTATTCGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTTCTTCCTCCGCCTGCTGCAGCTTCTTATCTGCTGCGTCTCTTTCAAACAGGGCCTCTGTTTTTCTGCGACTCAGACCATGGTTTTGCCCCTCAAAACACAGAT
GGGTGTCACTTCTCGGCCTTCCAATAAGCTCAGTTTTCACCATAATGTCACTTTGACTGTTTCCTTAACGCTTGGCTCGCCTCCTCAACCCGTTACTATGGTTCTCGATA
CAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAAACCCCAAATTTGAACTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCCAGTCCCCTGTGCTTCCCCTGTT
TGCCGGACCCGAACCCGAGATTTACCCAACCCGGTTACCTGCGACCCAAAGAAACTCTGCCACGTCTTTGTCTCTTATGCCGACGCCTCGTCGCTCGAGGGTAATCTCGC
GTCGGATACGTTTCGAGTCGGGTCATCGGCTCAACCCGGAACTTTTTTTGGGTGTATGGATTCGGGTTTCAGTTCGAATTCGGAGGAGGACGCGAAGACCACTGGGCTGA
TGGGGATGAACAGGGGCTCGCTCTCGTTTGTCACCCAATTGGGTTTGCCCAAATTCTCTTATTGCATATCGGGTCGTGATTCTTCTGGGGTTCTGCTTTTCGGCGACGCG
AGTCTTTCTTGGCTTGGGAATTTGACCTACACGCCTTTGGTTCAAATGTCTACGCCATTGCCGTATTACGACCGAGTCGCCTACACGGTCCAACTAGACGGAATCAGAGT
AGGGAACAAAATTCTGGCACTCCCGAAGTCAATATTCGCACCAGACCACACCGGCGCCGGGCAAACCATGGTAGATTCAGGGACCCAGTTCACGTTTCTTCTGGGACCAG
TGTACACGGCTTTAAAGAACGAGTTTGTGGTACAAACGAAGGGCATTTTGGTCCCACTGGGTGATCCAAATTTCGTGTTTCAAGGAGCGATGGACTTGTGCTACAGAGTA
CCCGAGAAACAGGGGAAACTGCCGCCACTGCCGGTAGTGAGTCTGATGTTCCGTGGGGCGGAGATGGTGGTTGGCGGAGAGGTGCTAATGTATAAAGTACCGGGAATGGT
AAGGGGTGGTGACCAAGTGCATTGCTTGACGTTTGGGAATTCGGATTTGTTAGGAATAGAAGCATTTGTGATTGGGCATCATCATCAACAAAACGTGTGGATGGAATTCG
ACTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGATTTGGCGGCTGTCGCAACTACTGCAGACATGAGTTTTGTTTTCCGTGGAACAAGAGTTCCAGACATA
GAAAATGGTCTATCTGGATTTATACCTGAACGGCGTGCAATGCGGGTTCATGCAGCACGTCCTGTTAACTCAAACTCACTTGCCTTTCTTGTCACAGTCCTTTTGTTGTT
CATGATGTTAAATTCTCACCAGATGTCACCGAACTTTCTGCTCTGGCTTGTGCTTGGTGTGTTTTTGATGGCTACAACATTAAGGATGTATGCGACCTGTCAACAACTTC
AGGCTCAAGCCCAAGCTAGAGCTATGGCAGCCAGTGGCCTTCTTGGGCACACTGAATTGAGGTTACATATGCCACCATCAATAGCACTTGCTACTAGAGGACGTTTACAA
GGGCTAAGGCTTCAACTTGCTCTGCTTGATCGGGAATTTGATGATTTAGATTATGAAACTTTGAGAGCTTTGGATTCTGACAATGCTCCGACAACACCTTCTATGAGTGA
GGAACAAATAAATGCTCTTCCTGTTCATAAATACAAGGTTTCTGGTCCTCAAAGCGACCCCTCTGTGAACCAGCAGGCTTCATCTTCAGAGTCTAATGAGAAGAGACAAG
ATTCAGCTAATGCAGTTGGCAGTACCAAGGCCTCGGAGGATGAACTTACGTGCAGTGTTTGCTTGGAGCAAGTAAATGTTGGTGAACTCTTCCATGCCAACTGCATAGAT
CCATGGCTGCGACAGCAGGGCACGTGCCCTGTTTGTAAATTCAGAGCGGTGTCTGGGTGGTCAGAACAGGGACAAGGGGAAACCGATGCGTATTCGGTTTGATTGAAGGC
AAGGCAAGTTATCGAACTATGCTTGATGCCTGCTTAAAGCGCAGCGTATATGGACAGACATGGTGTAATTAGGATGAGGGTAGGAATTACAGGAAGTGCGATTGCCTTCT
TTGAACTTCTACAAGCCATTTATTAATTGTATTCTTGAAGTTGATCATTTGTTATATAGTCTCTGACCATTGCGAGTGAGCAGAATTTTGTTCTTTCACTATCTTCCATG
TAGTCCAAACCTTACCAGCAAGTGCTTGTTATATCTAGAACTCTTTCATGATTCTCTAATGCGTTCAAATCTCC
Protein sequenceShow/hide protein sequence
MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPV
CRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDA
SLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRV
PEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAAVATTADMSFVFRGTRVPDI
ENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRGRLQ
GLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGELFHANCID
PWLRQQGTCPVCKFRAVSGWSEQGQGETDAYSV