; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G17440 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G17440
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationClcChr08:27778178..27781594
RNA-Seq ExpressionClc08G17440
SyntenyClc08G17440
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598306.1 hypothetical protein SDJN03_08084, partial [Cucurbita argyrosperma subsp. sororia]8.5e-21182.57Show/hide
Query:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT
        +++P+ RREF MGKRF     +  MLVVTL V+V G VEG SLSKQKSL    +MNSLRKQA  SI+SEDGDIIDCV+IYDQPAFDHPALRNHTIQ+APT
Subjt:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT

Query:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY
        YD T+D+HSKKATAEREG EEK+SMVVKQTWRKSGSCP+GTIPIRRI+KHVLLKA+SIERYGRK+P  S E  QL  +RSSH LL N SKAFLL  GNNY
Subjt:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP++N+EAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPIST  GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL

Query:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV
        PYEI M LFRD +T+NWWVQYGESINIGYWP ELF AL  TAETVQWGGEVYSTK+G PPHTRT MGSGRFPDFISGTSGWVKR+RVRDNSM+L FPGWV
Subjt:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

XP_004143037.3 uncharacterized protein LOC101203978 [Cucumis sativus]1.2e-21285.88Show/hide
Query:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK
        MGK   CNGLMFKMLV  LTV+V GVVE  SLSK K+    KKM+SLRKQAT SIQSEDGDIIDCV+IYDQPAFDHPALRNHTIQMAPTYD T+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK

Query:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN
        ATAE EGM EK SM VKQ WRKSGSCPK TIPIRRIRKHV LKA+S+  YG+KRPT  LEI QLSNSRSSHFLL NHSKA LLA+G+N+N AKGDIKVCN
Subjt:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN

Query:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD
        P VEFDDEYSTSQVALLTGP++NYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTST LP+EITM LFRD
Subjt:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD

Query:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD
         ETNNWWVQYGESINIGYWPSELFKALK+TAETVQWGGEVYSTKLGGPPHT TGMG+G+FPD+ISG SGWVKRIRVRDNSMILKFP +VEHYSDEYDCYD
Subjt:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD

Query:  VDFIRDYLEDPELYYGGPGKNPKCP
        VDFIR+YL+DPELYYGGPGKN +CP
Subjt:  VDFIRDYLEDPELYYGGPGKNPKCP

XP_022961958.1 uncharacterized protein LOC111462577 isoform X1 [Cucurbita moschata]1.4e-21082.34Show/hide
Query:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT
        +++PE  REF MGKRF     +  ML+VTLTV+V G VEG SLSKQKS+    +MNSLRKQA  SI+SEDGDIIDCV+IYDQPAFDHPALRNHTIQ+APT
Subjt:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT

Query:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY
        YD T+D+HSKKATAEREG EEK+SMVVKQTWRKSGSCP+GTIPIRRIRKHVLLKA SIERYGRK+P  S E  QL  +RSSH LL N SKAFLL  GNNY
Subjt:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP++N+EAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPIST  GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL

Query:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV
        PYEI M LFRD +T+NWWVQYGESINIGYWP ELF AL  TAETVQWGGEVYST +G PPHTRT MGSGRFPDFISGTSGWVKR+RVRDNSM+L FPGWV
Subjt:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

XP_022997646.1 uncharacterized protein LOC111492515 isoform X1 [Cucurbita maxima]3.8e-21182.8Show/hide
Query:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT
        +++PE RREF MGKRF     +  MLVVTLTV+V G VEG SLSKQKSL    +MNSLRKQA  SI+SEDGDIIDCV+IYDQPAFDHPAL NHTIQ+APT
Subjt:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT

Query:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY
        YD T+D+HSKKATAEREG EEK+SMVVKQTWRKSGSCP+GTIPIRRIRKHVLLKA+S+ERYGRK+P  S E  QL  +RSSH LL N SKAFLL  GNNY
Subjt:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP++N+EAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPIST  GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL

Query:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV
        PYEI M LFRD +T+NWWVQYGESINIGYWP ELF AL  TAETVQWGGEVYSTK+G PPHTRT MGSGRFPDFISGTSGWVKR+RVRDNSM+L FPGWV
Subjt:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

XP_038886336.1 uncharacterized protein LOC120076548 [Benincasa hispida]4.1e-22989.44Show/hide
Query:  SMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSK
        S  KRFSCN LMFKMLVVTLTV+V G+VEG SL+KQKSL +EKKMNSLRKQAT SIQS+DGDIIDC+NIYDQPAFDHPALRNHTIQMAPTYD T+D+HSK
Subjt:  SMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSK

Query:  KATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVC
        KATAEREGME KDSMVVKQTWRKSGSCPKGTIPIRRI+KH+L KADS+E YGRKRP SS EI QLSNSRSSH+LL NHSKA LLALGNNYN AKGDIKVC
Subjt:  KATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVC

Query:  NPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFR
        NPKVEFDDEYSTSQVALLTGP++NYEA+ESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITM LFR
Subjt:  NPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFR

Query:  DLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCY
        DL+TNNWWVQYGESI+IGYWPSELF ALK+TAETVQWGGEVYSTKLGGPPHTRTGMG+G+FPD+ISG SGWVKRIRVRDNSM+LKFP WVEHYSDEYDCY
Subjt:  DLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCY

Query:  DVDFIRDYLEDPELYYGGPGKNPKCP
        D+DFIRDYL+DPELYYGGPGKNPKCP
Subjt:  DVDFIRDYLEDPELYYGGPGKNPKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LQZ6 Uncharacterized protein5.7e-21385.88Show/hide
Query:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK
        MGK   CNGLMFKMLV  LTV+V GVVE  SLSK K+    KKM+SLRKQAT SIQSEDGDIIDCV+IYDQPAFDHPALRNHTIQMAPTYD T+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK

Query:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN
        ATAE EGM EK SM VKQ WRKSGSCPK TIPIRRIRKHV LKA+S+  YG+KRPT  LEI QLSNSRSSHFLL NHSKA LLA+G+N+N AKGDIKVCN
Subjt:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN

Query:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD
        P VEFDDEYSTSQVALLTGP++NYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTST LP+EITM LFRD
Subjt:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD

Query:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD
         ETNNWWVQYGESINIGYWPSELFKALK+TAETVQWGGEVYSTKLGGPPHT TGMG+G+FPD+ISG SGWVKRIRVRDNSMILKFP +VEHYSDEYDCYD
Subjt:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD

Query:  VDFIRDYLEDPELYYGGPGKNPKCP
        VDFIR+YL+DPELYYGGPGKN +CP
Subjt:  VDFIRDYLEDPELYYGGPGKNPKCP

A0A1S3BAI4 uncharacterized protein LOC1034877983.8e-20984.71Show/hide
Query:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK
        MGK    +GLMFKMLVV LTV+V GVVE  SLSK +S    KKMNSLRKQAT SIQSEDGDIIDCV+IYDQPAFDHPALRNHTIQMAPTYD T+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK

Query:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN
        ATAE+EGM EK+SM VKQ WR SGSCPK TIPIRRIRKH    A+S+  YG+KRPT  LEI QLSNSRSSHFLL NHSKA LLA+GNN+N AKGDIKVCN
Subjt:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN

Query:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD
        P VEFDDEYSTSQVALLTGP++NYEAIESGWAVNPGVYGDRQTRLFVYWT+DASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTS  LP+EITM LFRD
Subjt:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD

Query:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD
         ETNNWWVQYGESINIGYWPSELFKALK+TAETVQWGGEVYSTKLGGPPHT TGMG+G+FPD+ISG SGWVKRIRVRDNSMILKFP +VEHYSDEYDCYD
Subjt:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD

Query:  VDFIRDYLEDPELYYGGPGKNPKCP
        VDFIR+YL+DPELYYGGPGKN +CP
Subjt:  VDFIRDYLEDPELYYGGPGKNPKCP

A0A5A7V3A1 Uncharacterized protein3.8e-20984.71Show/hide
Query:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK
        MGK    +GLMFKMLVV LTV+V GVVE  SLSK +S    KKMNSLRKQAT SIQSEDGDIIDCV+IYDQPAFDHPALRNHTIQMAPTYD T+DKHSKK
Subjt:  MGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKK

Query:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN
        ATAE+EGM EK+SM VKQ WR SGSCPK TIPIRRIRKH    A+S+  YG+KRPT  LEI QLSNSRSSHFLL NHSKA LLA+GNN+N AKGDIKVCN
Subjt:  ATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCN

Query:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD
        P VEFDDEYSTSQVALLTGP++NYEAIESGWAVNPGVYGDRQTRLFVYWT+DASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTS  LP+EITM LFRD
Subjt:  PKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRD

Query:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD
         ETNNWWVQYGESINIGYWPSELFKALK+TAETVQWGGEVYSTKLGGPPHT TGMG+G+FPD+ISG SGWVKRIRVRDNSMILKFP +VEHYSDEYDCYD
Subjt:  LETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYD

Query:  VDFIRDYLEDPELYYGGPGKNPKCP
        VDFIR+YL+DPELYYGGPGKN +CP
Subjt:  VDFIRDYLEDPELYYGGPGKNPKCP

A0A6J1HDC5 uncharacterized protein LOC111462577 isoform X17.0e-21182.34Show/hide
Query:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT
        +++PE  REF MGKRF     +  ML+VTLTV+V G VEG SLSKQKS+    +MNSLRKQA  SI+SEDGDIIDCV+IYDQPAFDHPALRNHTIQ+APT
Subjt:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT

Query:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY
        YD T+D+HSKKATAEREG EEK+SMVVKQTWRKSGSCP+GTIPIRRIRKHVLLKA SIERYGRK+P  S E  QL  +RSSH LL N SKAFLL  GNNY
Subjt:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP++N+EAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPIST  GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL

Query:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV
        PYEI M LFRD +T+NWWVQYGESINIGYWP ELF AL  TAETVQWGGEVYST +G PPHTRT MGSGRFPDFISGTSGWVKR+RVRDNSM+L FPGWV
Subjt:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

A0A6J1K820 uncharacterized protein LOC111492515 isoform X11.8e-21182.8Show/hide
Query:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT
        +++PE RREF MGKRF     +  MLVVTLTV+V G VEG SLSKQKSL    +MNSLRKQA  SI+SEDGDIIDCV+IYDQPAFDHPAL NHTIQ+APT
Subjt:  IKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPT

Query:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY
        YD T+D+HSKKATAEREG EEK+SMVVKQTWRKSGSCP+GTIPIRRIRKHVLLKA+S+ERYGRK+P  S E  QL  +RSSH LL N SKAFLL  GNNY
Subjt:  YDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNY

Query:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL
        NAAKGDIKVCNP+VEFDDEYSTSQVALLTGP++N+EAIESGWAVNPGVYGDRQTRLFVYWT DAS KTGCFDLTCPGFVQTSNEIALGSAIYPIST  GL
Subjt:  NAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGL

Query:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV
        PYEI M LFRD +T+NWWVQYGESINIGYWP ELF AL  TAETVQWGGEVYSTK+G PPHTRT MGSGRFPDFISGTSGWVKR+RVRDNSM+L FPGWV
Subjt:  PYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWV

Query:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        EHYSDEYDCYDVDFIRDYL+DPELYYGGPGKNP+CP
Subjt:  EHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.2e-8240.91Show/hide
Query:  LSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTI
        +SKQK  E++K +N L K A  SIQS DGD+IDCV I  QPAFDHP L++H IQM P Y         K +A +   +E     + Q W + G C +GTI
Subjt:  LSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTI

Query:  PIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMN-HSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPH-HNYEAIES
        P+RR ++  +L+A S++RYG+K+  S      L  S     +  + H  A     G+ Y  AK  I V  PK++  +E+S SQ+ LL G    +  +IE+
Subjt:  PIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMN-HSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPH-HNYEAIES

Query:  GWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKF
        GW V+P +YGD  TRLF YWT DA   TGC++L C GF+Q +++IA+G++I P+S      Y+I++L+++D +  +WW+Q+G    +GYWPS LF  L  
Subjt:  GWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKF

Query:  TAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP
        +A  ++WGGEV +++  G  HT T MGSG+FP+     + + + I+V D S  LK P  +  ++++ +CYDV    +       YYGGPGKN KCP
Subjt:  TAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPKCP

AT2G44210.1 Protein of Unknown Function (DUF239)1.6e-8240.53Show/hide
Query:  FKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYD-STVDKHSKKATAEREGMEE
        F  LV+T+ +L   VV G   +    L+I   +  L K A  SI+S DGD+IDCV I DQPAF HP L NHT+QM P+ +  +V   SK ++  +     
Subjt:  FKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYD-STVDKHSKKATAEREGMEE

Query:  KDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMN-HSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEY
        + S  + Q W  +G CPK TIPIRR R+  L +A S+E YG K   S   I +  +S   + L  N H  A +      +  AK  I V  P VE  +E+
Subjt:  KDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMN-HSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEY

Query:  STSQVALLTGP-HHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWV
        S +Q+ +L G  + +  +IE+GW V+P +YGD +TRLF YWT DA   TGC++L C GFVQ + EIA+G +I P+S      Y+IT+L+++D +  +WW+
Subjt:  STSQVALLTGP-HHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWV

Query:  QYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIRDYL
        Q+GE   IGYWP+ LF  L  +A  ++WGGEV +++     HT T MGSGRF +   G + + K ++V D S  L+ P  ++ ++D+ +CY+V       
Subjt:  QYGESINIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIRDYL

Query:  EDPELYYGGPGKNPKCP
             YYGGPG+NP CP
Subjt:  EDPELYYGGPGKNPKCP

AT5G25950.1 Protein of Unknown Function (DUF239)1.9e-9142.93Show/hide
Query:  SLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRI
        SL+I+ K+ +L K A  +I+SEDGDIIDC++IY Q AFDHPAL+NH IQM P    +V   +KK T    G  E    +  Q W KSG CP GTIP+RR+
Subjt:  SLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRI

Query:  RKHVLLKADSIERYGRKRP------------TSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNY
         +  + +A S   +GRK P              +  IT    + +   L    S+AF++ALG N+  A+ DI + NP      +YST+Q+ L+ G   N+
Subjt:  RKHVLLKADSIERYGRKRP------------TSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNY

Query:  EAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGESINIGYWPSELF
        E++E GW VNP V+GD +TRLF+ WT D   KTGC +L C GFVQTS + ALG+ + P+S+S+   Y IT+ +F D  + NWW+    ++ +GYWP  LF
Subjt:  EAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGESINIGYWPSELF

Query:  KALKFTAETVQWGGEVYSTKLG-GPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIR-DYLEDPELYYGGPGKNP
          LK +A  VQWGGEV+S  +    PHT T MGSG++  +I   + +   +R++D SM LK+P ++  Y+DEY+CY     R  Y+ +P  Y+GGPG+N 
Subjt:  KALKFTAETVQWGGEVYSTKLG-GPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIR-DYLEDPELYYGGPGKNP

Query:  KCP
        +CP
Subjt:  KCP

AT5G25960.1 Protein of Unknown Function (DUF239)1.0e-8442.56Show/hide
Query:  SLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRI
        SL+I+ K+ SL K +  +I+SEDGDIIDC++IY Q AFDHPALRNH IQM P    +VD  +KK T    G  E+   +  Q W KSG+CPKGTIP    
Subjt:  SLEIEKKMNSLRKQA-TSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDSTVDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRI

Query:  RKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPG
                                                  +A L+ALG N+  A+ DI V NP      +YS++Q+ LL G    +E+IE+GWAVNP 
Subjt:  RKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQVALLTGPHHNYEAIESGWAVNPG

Query:  VYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQW
        V+GD +TRLF YWT D   KTGC +L C GFVQT+ + ALG+AI P+ST++   + IT     D  + NWW+    ++ IGYWP  LF  LK +A  VQ 
Subjt:  VYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGESINIGYWPSELFKALKFTAETVQW

Query:  GGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIR-DYLEDPELYYGGPGKNPKCP
        GGEV+S  +G  PHTRT MGSG++  ++   + +   IR++D S+ +K+P ++  Y+DEY+CY     R  Y+ +P  Y+GGPG+N +CP
Subjt:  GGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIR-DYLEDPELYYGGPGKNPKCP

AT5G56530.1 Protein of Unknown Function (DUF239)4.7e-8240.15Show/hide
Query:  WGVVE-----GVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTY-DSTVDKHSKKATAEREGMEEKDSMVVK
        WG++       +S+S+Q + E+ K +N L K A  SIQS DGDIIDCV+I  QPAFDHP L++H IQM P+Y   ++   SK +   +E +       + 
Subjt:  WGVVE-----GVSLSKQKSLEIEKKMNSLRKQAT-SIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTY-DSTVDKHSKKATAEREGMEEKDSMVVK

Query:  QTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMN---HSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQV
        Q W ++G C +GTIP+RR +K  +L+A S++RYG+K+  S      +   RS+   L+N   H  A     G  +  AK  I V  PKV+  +E+S SQ+
Subjt:  QTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMN---HSKAFLLALGNNYNAAKGDIKVCNPKVEFDDEYSTSQV

Query:  ALLTGPH-HNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGES
         +L G    +  +IE+GW V+P +YGD  TRLF YWT DA   TGC++L C GF+Q +++IA+G++I P+S      Y+I++ +++D +  +WW+Q+G+ 
Subjt:  ALLTGPH-HNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGES

Query:  INIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIRDYLEDPEL
          +GYWPS LF  L  +A  V+WGGEV + +  G  HT T MGSG+FPD     + + + I+V D+S  LK P  +  ++++ +CYDV+  ++       
Subjt:  INIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIRDYLEDPEL

Query:  YYGGPGKNPKC
        YYGGPG+NP C
Subjt:  YYGGPGKNPKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTTAGTAGAGTTGGCATCAAGAAACCCGAGTTTCGTCGTGAGTTTTCGATGGGGAAGAGATTCTCTTGCAATGGTTTGATGTTCAAAATGTTGGTTGTGACATT
GACAGTCTTAGTTTGGGGAGTTGTGGAGGGTGTTTCACTTTCTAAACAGAAGAGTTTGGAAATTGAGAAGAAAATGAACTCTCTTAGGAAGCAAGCAACAAGCATTCAGA
GCGAAGATGGCGACATCATAGATTGCGTTAACATCTACGACCAACCTGCTTTTGATCATCCTGCCTTGAGAAATCACACCATCCAGATGGCACCCACTTATGATTCCACC
GTGGATAAGCATTCAAAGAAAGCAACAGCAGAAAGGGAAGGGATGGAGGAAAAGGATTCCATGGTTGTGAAACAAACATGGAGGAAAAGTGGCAGTTGTCCTAAAGGAAC
AATACCAATTCGAAGGATCCGAAAACATGTCCTACTCAAAGCTGATTCAATAGAACGCTATGGAAGAAAGAGACCGACGTCTTCGCTAGAAATCACTCAGCTTTCCAACA
GCCGAAGTTCACACTTTCTACTAATGAATCATTCGAAGGCATTTCTGCTTGCTTTAGGAAACAACTACAATGCAGCCAAAGGAGACATTAAAGTATGCAACCCCAAGGTC
GAATTTGACGATGAATACAGTACTTCCCAAGTGGCTCTATTAACCGGCCCTCACCATAATTACGAGGCTATCGAATCCGGTTGGGCAGTAAATCCAGGTGTTTACGGGGA
CCGACAGACTCGACTGTTCGTGTATTGGACTGTTGATGCTTCCCACAAAACAGGCTGCTTTGATCTCACTTGCCCTGGTTTTGTTCAAACAAGCAATGAAATTGCTCTTG
GTTCTGCTATTTATCCAATCTCAACCTCAACTGGGCTTCCATATGAAATAACTATGCTACTCTTTAGAGATTTAGAGACAAATAATTGGTGGGTACAATATGGTGAAAGC
ATTAACATAGGTTATTGGCCTTCTGAGTTATTCAAAGCTCTAAAATTTACTGCAGAAACAGTACAATGGGGAGGAGAAGTTTACAGCACAAAGTTGGGAGGACCTCCTCA
CACTAGAACAGGTATGGGCAGTGGAAGGTTTCCTGACTTCATTTCTGGTACTTCAGGTTGGGTAAAACGGATAAGAGTTCGAGATAACTCGATGATTTTGAAGTTTCCTG
GTTGGGTTGAGCACTACTCTGATGAATATGATTGTTACGATGTCGATTTTATCCGAGATTATTTAGAAGATCCTGAACTGTACTATGGAGGTCCTGGTAAAAATCCAAAG
TGCCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTTAGTAGAGTTGGCATCAAGAAACCCGAGTTTCGTCGTGAGTTTTCGATGGGGAAGAGATTCTCTTGCAATGGTTTGATGTTCAAAATGTTGGTTGTGACATT
GACAGTCTTAGTTTGGGGAGTTGTGGAGGGTGTTTCACTTTCTAAACAGAAGAGTTTGGAAATTGAGAAGAAAATGAACTCTCTTAGGAAGCAAGCAACAAGCATTCAGA
GCGAAGATGGCGACATCATAGATTGCGTTAACATCTACGACCAACCTGCTTTTGATCATCCTGCCTTGAGAAATCACACCATCCAGATGGCACCCACTTATGATTCCACC
GTGGATAAGCATTCAAAGAAAGCAACAGCAGAAAGGGAAGGGATGGAGGAAAAGGATTCCATGGTTGTGAAACAAACATGGAGGAAAAGTGGCAGTTGTCCTAAAGGAAC
AATACCAATTCGAAGGATCCGAAAACATGTCCTACTCAAAGCTGATTCAATAGAACGCTATGGAAGAAAGAGACCGACGTCTTCGCTAGAAATCACTCAGCTTTCCAACA
GCCGAAGTTCACACTTTCTACTAATGAATCATTCGAAGGCATTTCTGCTTGCTTTAGGAAACAACTACAATGCAGCCAAAGGAGACATTAAAGTATGCAACCCCAAGGTC
GAATTTGACGATGAATACAGTACTTCCCAAGTGGCTCTATTAACCGGCCCTCACCATAATTACGAGGCTATCGAATCCGGTTGGGCAGTAAATCCAGGTGTTTACGGGGA
CCGACAGACTCGACTGTTCGTGTATTGGACTGTTGATGCTTCCCACAAAACAGGCTGCTTTGATCTCACTTGCCCTGGTTTTGTTCAAACAAGCAATGAAATTGCTCTTG
GTTCTGCTATTTATCCAATCTCAACCTCAACTGGGCTTCCATATGAAATAACTATGCTACTCTTTAGAGATTTAGAGACAAATAATTGGTGGGTACAATATGGTGAAAGC
ATTAACATAGGTTATTGGCCTTCTGAGTTATTCAAAGCTCTAAAATTTACTGCAGAAACAGTACAATGGGGAGGAGAAGTTTACAGCACAAAGTTGGGAGGACCTCCTCA
CACTAGAACAGGTATGGGCAGTGGAAGGTTTCCTGACTTCATTTCTGGTACTTCAGGTTGGGTAAAACGGATAAGAGTTCGAGATAACTCGATGATTTTGAAGTTTCCTG
GTTGGGTTGAGCACTACTCTGATGAATATGATTGTTACGATGTCGATTTTATCCGAGATTATTTAGAAGATCCTGAACTGTACTATGGAGGTCCTGGTAAAAATCCAAAG
TGCCCCTAAGAAACTTCTAAAAAAATTGGACGATTGTCTCTTGTTTTCATTAGATTCCTTCATGTAATGAATAATTTAGTAGTATCAAGAAGAGAGTTCTTTATTTTGTT
TCAAAATTCTACTTGTAAGTTGCAATAGTGTTGTCTTCCTTCCTAGGCCACTTCTATATCCAGAAAGAAAGAACTTCCTTTCAGATACTTGATTACACATAAC
Protein sequenceShow/hide protein sequence
MNFSRVGIKKPEFRREFSMGKRFSCNGLMFKMLVVTLTVLVWGVVEGVSLSKQKSLEIEKKMNSLRKQATSIQSEDGDIIDCVNIYDQPAFDHPALRNHTIQMAPTYDST
VDKHSKKATAEREGMEEKDSMVVKQTWRKSGSCPKGTIPIRRIRKHVLLKADSIERYGRKRPTSSLEITQLSNSRSSHFLLMNHSKAFLLALGNNYNAAKGDIKVCNPKV
EFDDEYSTSQVALLTGPHHNYEAIESGWAVNPGVYGDRQTRLFVYWTVDASHKTGCFDLTCPGFVQTSNEIALGSAIYPISTSTGLPYEITMLLFRDLETNNWWVQYGES
INIGYWPSELFKALKFTAETVQWGGEVYSTKLGGPPHTRTGMGSGRFPDFISGTSGWVKRIRVRDNSMILKFPGWVEHYSDEYDCYDVDFIRDYLEDPELYYGGPGKNPK
CP