; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008407 (gene) of Snake gourd v1 genome

Gene IDTan0008407
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionFiber Fb32-like protein isoform 3
Genome locationLG01:108259457..108264054
RNA-Seq ExpressionTan0008407
SyntenyTan0008407
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146096.1 uncharacterized protein LOC101204627 [Cucumis sativus]3.1e-27657.87Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVC EVDNIINQD VKYVENQVSSAS NVK+LYS VVQG+ PP  DP+  EA A AQRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHG--ITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNA---------------VNKSSVGHGITDQIDNHSQASC--KVSFVNEEVAQVPNH
        VNKSSVGHG   TDQIDN SQA C+V FVNEEVAQVPNH  LE NA                 +S   + + + +  ++  SC  K++ +  + +    H
Subjt:  VNKSSVGHG--ITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNA---------------VNKSSVGHGITDQIDNHSQASC--KVSFVNEEVAQVPNH

Query:  SFLEWNVDL--------------PLQKKDDVLL---------DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----
        S    N D+               +Q +DDVLL         +KD  ++ KEN VNELLSEK+D   TDKLS MES ASDPLSHSL NVS+ IND+    
Subjt:  SFLEWNVDL--------------PLQKKDDVLL---------DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----

Query:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL
                                                                                      ++K+AS VCDD D+QLEDDVLL
Subjt:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL

Query:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH
          NNDG LTDKD  KSSEED +M+FNASDPLKHMAN T C+VKVTN E  LI DNSHLP+ESS LS KN+ + S+E+S EFLKK + ME N +D+LN NH
Subjt:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH

Query:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE
          L+HVWS TNFV +EADDSN LLKS V SGR+DH +MD+ FN+S +K AI  FED P S+LLNLP H NG+ FT EEAIMV D NHLQLE E+ AR N+
Subjt:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE

Query:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI
        D LT K+SNESL +DTIL LEHDA YPLKNQPRCTS+  ++K EEVSSVSNDS  KL S +  GKN +AL DKASD SC+ QANLELSTEL  HC EESI
Subjt:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI

Query:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSS
        KE+L S  NEC+ DIVT  G    TSIH ADVES HNV QASS LVN+L+ FS   ETTSKYLENG G SSNA DATSSE  S+VLTSGETVEE++PVSS
Subjt:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSS

Query:  LKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESL-GTHASRANESAFVPKLHASGQVAFS
        LK L K SF A RS V N SS TV+HEKP E  A  E  SR  F V  N S  N A  MK  SS SSLSSMESL GTHASRAN++ F+PK     Q   S
Subjt:  LKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESL-GTHASRANESAFVPKLHASGQVAFS

Query:  EFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTD
        + TSS   S STE G  ++S   ILDAE+ETVDLGHKV+ +D+CD +DYKALHA+SRR QKLRSYKK+IQDAF SKKRLAKEYEQLAIWYGD D+E ST+
Subjt:  EFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTD

Query:  SSQKLD---TTTSYASDSEWEIL
        S QKL+    +T+Y SDSEWE+L
Subjt:  SSQKLD---TTTSYASDSEWEIL

XP_008463725.1 PREDICTED: uncharacterized protein LOC103501804 isoform X1 [Cucumis melo]5.1e-27155.18Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVC EVDNIINQD VKYVENQVSSAS NVK+LYS VVQG+ PP+ DP++ EA A AQRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGI--TDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWN---------------AVNKSSVGHGITDQID--NHSQASCKVSFVNEEVAQVPNH
        VN SSVGHG   TDQIDN SQASC+V FVNEEVAQVPN S LE N                +++S   + +++ +   N    + K++ +    +   NH
Subjt:  VNKSSVGHGI--TDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWN---------------AVNKSSVGHGITDQID--NHSQASCKVSFVNEEVAQVPNH

Query:  SFLEWNVDLPLQKK-----------------------DDVLLDKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----
        S    N D+   KK                       D VL +KD  ++ K+N  +ELLSEK+D   TDKLS ME  ASDPLSHSL NVS+ IND+    
Subjt:  SFLEWNVDLPLQKK-----------------------DDVLLDKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----

Query:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL
                                                                                      ++K+AS VCDD D+QLEDDVLL
Subjt:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL

Query:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH
         GNN G LTDKD  KSSEED TM+ NASDPLKHMAN TSC+VKVTN E  LI DNSHLP+ESS LS KND++ S+E+SDEFLKK + ME N +D+LN NH
Subjt:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH

Query:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE
           +HVWS TNFV +EADDSN LLKS VLSG +DH +MD+ F+ S +K AI  FED P S+LLNLP H NG+ FT EE IMVSD NHLQL  E+ AR N+
Subjt:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE

Query:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI
        DALT K+SNESLK DTIL LEHDA+YPLKNQPRCTSS  K+K EEVSSVSNDS LKL+S +  GKN +ALIDKASD SC+ QANLELSTEL  HC EESI
Subjt:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI

Query:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSEL-------------------
        KETL S  NE + D+VT  GG   T IH  DVES H   Q S+  VN+L+ FS  METTSKYLENG  CSSNA DATSSEL                   
Subjt:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSEL-------------------

Query:  -----------------------------ISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASN
                                      S+VLTSGETVEE++PVSSLK L K SF A      N SS TV+HEKP E  A  E  SR  FEV  + S 
Subjt:  -----------------------------ISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASN

Query:  ENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKAL
         N A  MK VSS SSLSSMESL  THASRAN++ F+PK +   Q   S+ TSSG  S ST VG  ++S   ILDAEMETVDLGHKVT ++ECDV+DYKAL
Subjt:  ENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKAL

Query:  HAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL
        HAVSRR QKLRSYKK+IQDAF SKKRLAKEYEQLAIWYGD D+E ST+SSQKL+   T+T+Y SDSEWE+L
Subjt:  HAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL

XP_038898347.1 uncharacterized protein LOC120086024 isoform X1 [Benincasa hispida]1.1e-28155.92Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVCQEVDNIINQD VKYVENQVSSAS NVK+LYS VVQGL PP+ DP++ EA    QRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL
         NKSSVGH I                                             DQIDN SQASC V FVNEEVAQVPNHS LE N DLPL+K DDVLL
Subjt:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL

Query:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME
        DKDLYENMKEN V+ELLSEK+D   TDKL+ MES ASDPLS SL NVS++IND  +KRASSVCD  D++LEDDVLL GN+D  LTDKD  KSSEEDITM+
Subjt:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME

Query:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL
        FNASDPLKHMAN TSC+VKVTN E  LI DNSHLP+ESSR   KND D S+ENSDEFLKKV+ MEPN +D+LN NH  LSHVWS TNFVS+EADDSNL  
Subjt:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL

Query:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA
        +S VLS RI H++MD+ FN+SP+K+AI  FED PNSYLLNLP H NGV FT EEAIMV D N L+LE E+  R N+DALT KYSNESLK DTIL LEHDA
Subjt:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA

Query:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEI--------------------------------YC-----------------------------
        +Y LKN+PRCTSS +++KNEEVSSVSN S LKLESE+                                +C                             
Subjt:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEI--------------------------------YC-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------GKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLVT
                                              GKNS ALIDKASDASC+ QANLELSTEL  HC EESIKETL S  NE + DIVTS G P  T
Subjt:  --------------------------------------GKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLVT

Query:  SIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVI
        SIH ADV+S HNV QAS I+VN+ V  SP+ ETTSKYLENG   SSNA DAT     SLVLT GETVEE+ PVSSLK L K SF A RSLV N S+ TVI
Subjt:  SIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVI

Query:  HEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSIL
        HEKP EQ A IE ESRP FEV+ + S  NKA KMKFVSS SSLSS+E L   HASRAN++AF+PK +   Q  FS+ TSSG  SLSTE G  ++S     
Subjt:  HEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSIL

Query:  DAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL
         ++METVDLGHKVTL+DE DVVDYK LHAVSRR QKLRSYKK+IQDAF+SKKRLAKEYEQLAIWYGD DLE ST++SQKL+    +TSY SDSEWE+L
Subjt:  DAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL

XP_038898348.1 uncharacterized protein LOC120086024 isoform X2 [Benincasa hispida]7.8e-28055.83Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVCQEVDNIINQD VKYVENQVSSAS NVK+LYS VVQGL PP+ DP++ EA    QRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL
         NKSSVGH I                                             DQIDN SQASC V FVNEEVAQVPNHS LE N DLPL+K DDVLL
Subjt:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL

Query:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME
        DKDLYENMKEN V+ELLSEK+D   TDKL+ MES ASDPLS SL NVS++IND  +KRASSVCD  D++LEDDVLL GN+D  LTDKD  KSSEEDITM+
Subjt:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME

Query:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL
        FNASDPLKHMAN TSC+VKVTN E  LI DNSHLP+ESSR   KND D S+ENSDEFLKKV+ MEPN +D+LN NH  LSHVWS TNFVS+EADDSNL  
Subjt:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL

Query:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA
        +S VLS RI H++MD+ FN+SP+K+AI  FED PNSYLLNLP H NGV FT EEAIMV D N L+LE E+  R N+DALT KYSNESLK DTIL LEHDA
Subjt:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA

Query:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEI--------------------------------YC-----------------------------
        +Y LKN+PRCTSS +++KNEEVSSVSN S LKLESE+                                +C                             
Subjt:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEI--------------------------------YC-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------GKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLVT
                                              GKNS ALIDKASDASC+ QANLELSTEL  HC EESIKETL S  NE + DIVTS G P  T
Subjt:  --------------------------------------GKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLVT

Query:  SIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVI
        SIH ADV+S HNV QAS I+VN+ V  SP+ ETTSKYLENG   SSNA DAT     SLVLT GETVEE+ PVSSLK L K SF A RSLV N S+ TVI
Subjt:  SIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVI

Query:  HEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSIL
        HEKP EQ A IE ESRP FEV+ + S  NKA KMKFVSS SSLSS+E L   HASRAN++AF+PK +   Q  FS+ TSSG  SLSTE G  ++S     
Subjt:  HEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSIL

Query:  DAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL
         ++METVDLGHKVTL+DE DVVDYK LHAVSRR QKLRSY K+IQDAF+SKKRLAKEYEQLAIWYGD DLE ST++SQKL+    +TSY SDSEWE+L
Subjt:  DAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL

XP_038898349.1 uncharacterized protein LOC120086024 isoform X3 [Benincasa hispida]2.2e-27454.86Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVCQEVDNIINQD VKYVENQVSSAS NVK+LYS VVQGL PP+ DP++ EA    QRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL
         NKSSVGH I                                             DQIDN SQASC V FVNEEVAQVPNHS LE N DLPL+K DDVLL
Subjt:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL

Query:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME
        DKDLYENMKEN V+ELLSEK+D   TDKL+ MES ASDPLS SL NVS++IND  +KRASSVCD  D++LEDDVLL GN+D  LTDKD  KSSEEDITM+
Subjt:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME

Query:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL
        FNASDPLKHMAN TSC+VKVTN E  LI DNSHLP+ESSR   KND D S+ENSDEFLKKV+ MEPN +D+LN NH  LSHVWS TNFVS+EADDSNL  
Subjt:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL

Query:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA
        +S VLS RI H++MD+ FN+SP+K+AI  FED PNSYLLNLP H NGV FT EEAIMV D N L+LE E+  R N+DALT KYSNESLK DTIL LEHDA
Subjt:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA

Query:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEI--------------------------------YC-----------------------------
        +Y LKN+PRCTSS +++KNEEVSSVSN S LKLESE+                                +C                             
Subjt:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEI--------------------------------YC-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------GKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLVT
                                              GKNS ALIDKASDASC+ QANLELSTEL  HC EESIKETL S  NE + DIVTS G P  T
Subjt:  --------------------------------------GKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLVT

Query:  SIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVI
        SIH ADV+S HNV QAS I+VN+ V  SP+ ETTSKYLENG   SSNA DAT     SLVLT GETVEE+ PVSSLK L K SF A RSLV N S+ TVI
Subjt:  SIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVI

Query:  HEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGC---
        HEKP EQ A IE ESRP FEV+ + S  NKA KMKFVSS SSLSS+E L   HASRAN++AF+PK +                        T + GC   
Subjt:  HEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGC---

Query:  SILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEI
        S   ++METVDLGHKVTL+DE DVVDYK LHAVSRR QKLRSYKK+IQDAF+SKKRLAKEYEQLAIWYGD DLE ST++SQKL+    +TSY SDSEWE+
Subjt:  SILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEI

Query:  L
        L
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A0A0KZJ5 Uncharacterized protein1.5e-27657.87Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVC EVDNIINQD VKYVENQVSSAS NVK+LYS VVQG+ PP  DP+  EA A AQRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHG--ITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNA---------------VNKSSVGHGITDQIDNHSQASC--KVSFVNEEVAQVPNH
        VNKSSVGHG   TDQIDN SQA C+V FVNEEVAQVPNH  LE NA                 +S   + + + +  ++  SC  K++ +  + +    H
Subjt:  VNKSSVGHG--ITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNA---------------VNKSSVGHGITDQIDNHSQASC--KVSFVNEEVAQVPNH

Query:  SFLEWNVDL--------------PLQKKDDVLL---------DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----
        S    N D+               +Q +DDVLL         +KD  ++ KEN VNELLSEK+D   TDKLS MES ASDPLSHSL NVS+ IND+    
Subjt:  SFLEWNVDL--------------PLQKKDDVLL---------DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----

Query:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL
                                                                                      ++K+AS VCDD D+QLEDDVLL
Subjt:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL

Query:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH
          NNDG LTDKD  KSSEED +M+FNASDPLKHMAN T C+VKVTN E  LI DNSHLP+ESS LS KN+ + S+E+S EFLKK + ME N +D+LN NH
Subjt:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH

Query:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE
          L+HVWS TNFV +EADDSN LLKS V SGR+DH +MD+ FN+S +K AI  FED P S+LLNLP H NG+ FT EEAIMV D NHLQLE E+ AR N+
Subjt:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE

Query:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI
        D LT K+SNESL +DTIL LEHDA YPLKNQPRCTS+  ++K EEVSSVSNDS  KL S +  GKN +AL DKASD SC+ QANLELSTEL  HC EESI
Subjt:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI

Query:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSS
        KE+L S  NEC+ DIVT  G    TSIH ADVES HNV QASS LVN+L+ FS   ETTSKYLENG G SSNA DATSSE  S+VLTSGETVEE++PVSS
Subjt:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSS

Query:  LKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESL-GTHASRANESAFVPKLHASGQVAFS
        LK L K SF A RS V N SS TV+HEKP E  A  E  SR  F V  N S  N A  MK  SS SSLSSMESL GTHASRAN++ F+PK     Q   S
Subjt:  LKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESL-GTHASRANESAFVPKLHASGQVAFS

Query:  EFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTD
        + TSS   S STE G  ++S   ILDAE+ETVDLGHKV+ +D+CD +DYKALHA+SRR QKLRSYKK+IQDAF SKKRLAKEYEQLAIWYGD D+E ST+
Subjt:  EFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTD

Query:  SSQKLD---TTTSYASDSEWEIL
        S QKL+    +T+Y SDSEWE+L
Subjt:  SSQKLD---TTTSYASDSEWEIL

A0A1S3CJX6 uncharacterized protein LOC103501804 isoform X12.4e-27155.18Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVC EVDNIINQD VKYVENQVSSAS NVK+LYS VVQG+ PP+ DP++ EA A AQRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGI--TDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWN---------------AVNKSSVGHGITDQID--NHSQASCKVSFVNEEVAQVPNH
        VN SSVGHG   TDQIDN SQASC+V FVNEEVAQVPN S LE N                +++S   + +++ +   N    + K++ +    +   NH
Subjt:  VNKSSVGHGI--TDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWN---------------AVNKSSVGHGITDQID--NHSQASCKVSFVNEEVAQVPNH

Query:  SFLEWNVDLPLQKK-----------------------DDVLLDKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----
        S    N D+   KK                       D VL +KD  ++ K+N  +ELLSEK+D   TDKLS ME  ASDPLSHSL NVS+ IND+    
Subjt:  SFLEWNVDLPLQKK-----------------------DDVLLDKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----

Query:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL
                                                                                      ++K+AS VCDD D+QLEDDVLL
Subjt:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL

Query:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH
         GNN G LTDKD  KSSEED TM+ NASDPLKHMAN TSC+VKVTN E  LI DNSHLP+ESS LS KND++ S+E+SDEFLKK + ME N +D+LN NH
Subjt:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH

Query:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE
           +HVWS TNFV +EADDSN LLKS VLSG +DH +MD+ F+ S +K AI  FED P S+LLNLP H NG+ FT EE IMVSD NHLQL  E+ AR N+
Subjt:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE

Query:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI
        DALT K+SNESLK DTIL LEHDA+YPLKNQPRCTSS  K+K EEVSSVSNDS LKL+S +  GKN +ALIDKASD SC+ QANLELSTEL  HC EESI
Subjt:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI

Query:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSEL-------------------
        KETL S  NE + D+VT  GG   T IH  DVES H   Q S+  VN+L+ FS  METTSKYLENG  CSSNA DATSSEL                   
Subjt:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSEL-------------------

Query:  -----------------------------ISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASN
                                      S+VLTSGETVEE++PVSSLK L K SF A      N SS TV+HEKP E  A  E  SR  FEV  + S 
Subjt:  -----------------------------ISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASN

Query:  ENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKAL
         N A  MK VSS SSLSSMESL  THASRAN++ F+PK +   Q   S+ TSSG  S ST VG  ++S   ILDAEMETVDLGHKVT ++ECDV+DYKAL
Subjt:  ENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKAL

Query:  HAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL
        HAVSRR QKLRSYKK+IQDAF SKKRLAKEYEQLAIWYGD D+E ST+SSQKL+   T+T+Y SDSEWE+L
Subjt:  HAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLD---TTTSYASDSEWEIL

A0A5A7VK64 Fiber Fb32-like protein isoform 35.3e-26652.86Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVC EVDNIINQD VKYVENQVSSAS NVK+LYS VVQG+ PP+ DP++ EA A AQRG V  NAYF+S   NEGKAA+N 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGI--TDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWN---------------AVNKSSVGHGITDQID--NHSQASCKVSFVNEEVAQVPNH
        VN SSVGHG   TDQIDN SQASC+V FVNEEVAQVPN S LE N                +++S   + +++ +   N    + K++ +    +   NH
Subjt:  VNKSSVGHGI--TDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWN---------------AVNKSSVGHGITDQID--NHSQASCKVSFVNEEVAQVPNH

Query:  SFLEWNVDLPLQKK-----------------------DDVLLDKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----
        S    N D+   KK                       D VL +KD  ++ K+N  +ELLSEK+D   TDKLS ME  ASDPLSHSL NVS+ IND+    
Subjt:  SFLEWNVDLPLQKK-----------------------DDVLLDKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDI----

Query:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL
                                                                                      ++K+AS VCDD D+QLEDDVLL
Subjt:  ------------------------------------------------------------------------------ASKRASSVCDDSDLQLEDDVLL

Query:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH
         GNN G LTDKD  KSSEED TM+ NASDPLKHMAN TSC+VKVTN E  LI DNSHLP+ESS LS KND++ S+E+SDEFLKK + ME N +D+LN NH
Subjt:  AGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNH

Query:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE
           +HVWS TNFV +EADDSN LLKS VLSG +DH +MD+ F+ S +K AI  FED P S+LLNLP H NG+ FT EE IMVSD NHLQL  E+ AR N+
Subjt:  LILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNE

Query:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI
        DALT K+SNESLK DTIL LEHDA+YPLKNQPRCTSS  K+K EEVSSVSNDS LKL+S +  GKN +ALIDKASD SC+ QANLELSTEL  HC EESI
Subjt:  DALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESI

Query:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTH----------------------------------------------------------------
        KETL S  NE + D+VT  GG   T IH  DVES H                                                                
Subjt:  KETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTH----------------------------------------------------------------

Query:  ------------------------NVNQASSILV-------NSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLP
                                +  QAS +L        N+L+ FS  METT KYLENG GCSSNA DATSSE  S+VLTSGETVEE++PVSSLK L 
Subjt:  ------------------------NVNQASSILV-------NSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLP

Query:  KSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSS
        K SF A      N SS TV+HEKP E  A  E  SR  FEV  + S  N A  MK VSS SSLSSMESL  THASRAN++ F+PK +   Q   S+ TSS
Subjt:  KSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLG-THASRANESAFVPKLHASGQVAFSEFTSS

Query:  GTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKL
        G  S ST VG  ++S   ILDAEMETVDLGHKVT ++ECDV+DYKALHAVSRR QKLRSYKK+IQDAF SKKRLAKEYEQLAIWYGD D+E ST+SSQKL
Subjt:  GTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKL

Query:  D---TTTSYASDSEWEIL
        +   T+T+Y SDSEWE+L
Subjt:  D---TTTSYASDSEWEIL

A0A6J1IGV6 uncharacterized protein LOC111476834 isoform X21.8e-26161.9Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVCQEVDNIINQD V+YVENQVSSAS NVK+LYS VVQGL PP E  V+ EA A A RGRV  N YFKS   NE K+ANN 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL
         NKSSVGHG      NH QASCKV FVNEEVA+ PNHS L  NA                                                        
Subjt:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL

Query:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME
           L+EN KEN VNELLSEKSD   TDK + MES A DPLS SLRNVS ++NDI +K  SSVCDDSDL L D+VLL GNN+G LT+ DA KSS+ED T+E
Subjt:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME

Query:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL
        FNASDPL H AN  SCQVKVTNGEE  I DNSHLP+ESSRLS KND+D S+EN+DEF+KKV IMEPNA+D+LN+ H  LSHVWS TNFVS+EAD+SN+LL
Subjt:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL

Query:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA
        KSEV S RIDH+L+D+ FNESP+KDAI  FED   SYLLNLPS          E  MVS+ NHLQ+EPEL A  N+DALTD YSNE L++DTIL +E+DA
Subjt:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA

Query:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLV
        SYPLKNQPR  SS +K+KNEEVSSVS                    IDKASDASC+ QANLELSTEL  HCDEESIK       NE + DI T TG P  
Subjt:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLV

Query:  TSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTV
        TSIHGADVES H V + SSI  N+LV+ SP+MET  +Y E       N P ATSSEL S+VL SGETV+E++ VSSLK LPK S  ASRS V +FSS TV
Subjt:  TSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTV

Query:  IHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLGTHASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSIL
         HEKP ++RA IE  S P FEVVT+ASN NKA + +F SS SSLSS ESLGT              HAS QV FS+ T SG LS STEVG  Y+    IL
Subjt:  IHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLGTHASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSIL

Query:  DAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLDTTTSYASDSEWEIL
        D EMETVDLGHKVT++DEC V+DYKALHAVSRR QKLRSYKK+IQDAF +KKRLAK+YEQLAIWYGD DL+  TDSSQK D     ASDSEWE+L
Subjt:  DAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLDTTTSYASDSEWEIL

A0A6J1IJF7 uncharacterized protein LOC111476834 isoform X14.3e-26061.83Show/hide
Query:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA
        MDLKHKGI+WVGNMFQKFEAVCQEVDNIINQD V+YVENQVSSAS NVK+LYS VVQGL PP E  V+ EA A A RGRV  N YFKS   NE K+ANN 
Subjt:  MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNA

Query:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL
         NKSSVGHG      NH QASCKV FVNEEVA+ PNHS L  NA                                                        
Subjt:  VNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLL

Query:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME
           L+EN KEN VNELLSEKSD   TDK + MES A DPLS SLRNVS ++NDI +K  SSVCDDSDL L D+VLL GNN+G LT+ DA KSS+ED T+E
Subjt:  DKDLYENMKENDVNELLSEKSD---TDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITME

Query:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL
        FNASDPL H AN  SCQVKVTNGEE  I DNSHLP+ESSRLS KND+D S+EN+DEF+KKV IMEPNA+D+LN+ H  LSHVWS TNFVS+EAD+SN+LL
Subjt:  FNASDPLKHMANRTSCQVKVTNGEE-LISDNSHLPLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLL

Query:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA
        KSEV S RIDH+L+D+ FNESP+KDAI  FED   SYLLNLPS          E  MVS+ NHLQ+EPEL A  N+DALTD YSNE L++DTIL +E+DA
Subjt:  KSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSHGNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDA

Query:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLV
        SYPLKNQPR  SS +K+KNEEVSSVS                    IDKASDASC+ QANLELSTEL  HCDEESIK       NE + DI T TG P  
Subjt:  SYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDASCEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLV

Query:  TSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTV
        TSIHGADVES H V + SSI  N+LV+ SP+MET  +Y E       N P ATSSEL S+VL SGETV+E++ VSSLK LPK S  ASRS V +FSS TV
Subjt:  TSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTSGETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTV

Query:  IHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESL-GTHASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSI
         HEKP ++RA IE  S P FEVVT+ASN NKA + +F SS SSLSS ESL GT              HAS QV FS+ T SG LS STEVG  Y+    I
Subjt:  IHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESL-GTHASRANESAFVPKLHASGQVAFSEFTSSGTLSLSTEVGRTYNSGCSI

Query:  LDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLDTTTSYASDSEWEIL
        LD EMETVDLGHKVT++DEC V+DYKALHAVSRR QKLRSYKK+IQDAF +KKRLAK+YEQLAIWYGD DL+  TDSSQK D     ASDSEWE+L
Subjt:  LDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLDTTTSYASDSEWEIL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G17780.2 unknown protein1.6e-1242.72Show/hide
Query:  METVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDL--ELSTDSSQKLDTTTSYAS---------DSEW
        M+T+DL + +T +++    D   L+A+  R ++LRS+K+KI DA  SK+R  KEYEQLAIW+GD D+  +L  D  Q   +  S +S         DSEW
Subjt:  METVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDL--ELSTDSSQKLDTTTSYAS---------DSEW

Query:  EIL
        EIL
Subjt:  EIL

AT1G73130.1 unknown protein7.3e-1044.58Show/hide
Query:  VDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDL--ELST-DSSQKLDTTTSYAS------DSEWEIL
        V+   L+A+  R +KLRS+K+K+ D   SK+R  KEYEQL IWYGD  +  +L+T + SQ+++ T S +S      DS+WE+L
Subjt:  VDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDL--ELST-DSSQKLDTTTSYAS------DSEWEIL

AT2G16575.1 unknown protein1.3e-1139.81Show/hide
Query:  METVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDL-----------ELSTDSSQKLDTTTSYASDSEW
        M T+DL + +T +++    D   L+A+  R ++LRS+K+KI DA  SK+R  KEYEQLAIW+GD D+             S DS          + DSEW
Subjt:  METVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDL-----------ELSTDSSQKLDTTTSYASDSEW

Query:  EIL
        E+L
Subjt:  EIL

AT2G31130.1 unknown protein1.2e-1262.5Show/hide
Query:  KGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRP
        KGI WVGN++QKFEA+C EV+ II QDT KYVENQV +   +VKK  S VV  L P
Subjt:  KGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGAAACATAAAGGTATCACGTGGGTTGGAAACATGTTCCAGAAGTTTGAAGCAGTGTGCCAGGAAGTGGATAATATTATAAACCAGGATACAGTTAAATATGT
TGAAAACCAGGTTAGTTCAGCAAGTGAAAATGTGAAGAAATTATACTCTGGTGTTGTTCAAGGTTTACGTCCCCCTGTAGAAGATCCTGTGGAATGTGAAGCTACAGCAA
CGGCTCAGAGGGGACGTGTTGCAAGAAATGCATATTTCAAGTCATCATTACTCAATGAAGGAAAAGCTGCTAATAATGCTGTTAATAAATCATCTGTGGGCCATGGTATT
ACTGATCAGATAGATAACCACAGTCAAGCATCTTGTAAAGTTTCCTTTGTAAATGAAGAAGTTGCTCAAGTTCCTAATCATTCTTTTCTAGAGTGGAATGCTGTTAATAA
ATCATCTGTGGGCCATGGTATTACTGATCAGATAGATAACCACAGTCAAGCATCTTGTAAAGTTTCCTTTGTAAATGAAGAAGTTGCTCAAGTTCCTAATCATTCTTTTC
TAGAGTGGAATGTTGATTTACCTTTGCAAAAGAAAGATGATGTCTTGTTAGATAAAGACTTATATGAGAACATGAAAGAAAATGACGTCAATGAACTACTTTCAGAGAAA
AGTGATACAGATAAGCTTTCCTTCATGGAGTCAGCTGCTAGTGATCCTTTGAGTCATTCACTGAGAAATGTAAGTAGTGACATTAATGATATAGCAAGCAAAAGAGCTTC
TTCAGTTTGTGATGACTCTGATCTGCAATTGGAGGATGATGTACTTTTAGCAGGGAATAATGATGGGGGTTTGACAGATAAAGATGCAGGCAAGAGTTCTGAAGAGGATA
TCACTATGGAGTTCAATGCTAGTGATCCTTTGAAGCATATGGCTAATCGAACATCTTGTCAAGTTAAAGTTACAAATGGAGAAGAACTGATTTCGGATAATTCTCATCTA
CCTTTGGAATCTTCCAGACTCTCAAGGAAGAATGACAATGACTCGTCAGATGAGAACTCAGATGAGTTTTTAAAGAAGGTTATCATCATGGAGCCTAATGCTTCTGATAA
TTTGAACAACAACCATCTTATTCTTAGCCATGTATGGAGTGAGACAAACTTCGTAAGTGAAGAAGCTGATGATTCTAATTTGCTTTTGAAGTCTGAGGTACTTTCAGGCA
GAATTGATCATTCCTTGATGGATGAAGGCTTTAATGAGAGTCCTATAAAGGATGCTATCTTTGACTTTGAGGATGTTCCTAACAGTTATTTGTTGAATCTGCCCAGTCAT
GGAAATGGAGTTGGCTTCACCTATGAAGAAGCTATTATGGTTTCTGATAGTAACCATCTGCAGTTGGAGCCTGAGCTACATGCTAGAGTGAATGAAGATGCCTTGACAGA
TAAGTACTCTAATGAAAGTTTAAAAGAAGATACCATCTTGGTGTTGGAGCATGATGCAAGTTATCCTTTAAAGAACCAGCCAAGATGTACATCAAGCTGCATAAAACATA
AAAATGAAGAAGTTTCTTCAGTTTCAAATGATTCCCATCTAAAGTTGGAGAGTGAGATTTATTGTGGGAAGAATAGCGAAGCTTTAATAGATAAAGCATCAGATGCAAGT
TGTGAAGGACAAGCCAATTTAGAATTATCAACTGAGTTAATTTCTCATTGTGACGAAGAGTCAATTAAGGAGACTTTGGGAAGTCGTGATAATGAATGTAAAGAGGATAT
TGTGACCTCAACTGGAGGTCCACTGGTAACTTCGATTCATGGTGCAGATGTTGAATCCACCCATAATGTAAACCAAGCTTCTAGCATCTTGGTAAACAGTTTAGTTAACT
TCTCACCAAAGATGGAGACAACTTCTAAGTACTTGGAAAATGGAACTGGTTGTTCTTCTAATGCTCCAGATGCTACTTCTTCTGAATTGATTTCACTAGTATTAACTAGT
GGGGAAACTGTAGAGGAGTCGAGGCCAGTCTCCTCTTTGAAATCCCTACCAAAGAGTTCTTTTCCTGCTTCCAGAAGTTTGGTTAAAAACTTTTCTAGTTGTACTGTTAT
TCATGAAAAACCTGCTGAACAGCGTGCATCTATTGAATATGAATCTCGTCCATGTTTTGAAGTGGTCACTAATGCATCTAATGAAAACAAGGCTTTGAAGATGAAATTTG
TCTCTTCCAGCAGTTCCTTATCATCAATGGAATCACTTGGGACCCATGCTTCTAGAGCCAATGAAAGTGCTTTTGTTCCTAAACTCCATGCCAGTGGCCAGGTTGCATTT
TCCGAATTTACTAGTTCTGGGACTCTAAGTCTCTCTACTGAAGTAGGTCGTACATACAATTCTGGATGTTCTATTCTGGATGCTGAAATGGAAACAGTGGATTTGGGACA
TAAGGTGACCCTTAAAGACGAATGTGACGTTGTTGACTATAAAGCTCTCCATGCTGTCTCTCGCAGAGCCCAAAAGCTCCGTTCTTACAAGAAGAAAATCCAGGATGCTT
TCAATTCCAAGAAGAGGTTGGCTAAGGAGTATGAACAGCTAGCAATCTGGTATGGAGATATTGATCTGGAACTCAGTACAGACAGTTCACAGAAGTTGGACACAACAACT
AGTTATGCATCTGACTCTGAGTGGGAGATACTGTAA
mRNA sequenceShow/hide mRNA sequence
ATTCTCTCTGAGTCTGCCATAAAAAAAAAACAGCTAAAACCCTGTTCCTGTGCCGCTGCCGATGAGGAAGAGCAGCTTTCCAACCCTGTGATGATTTACTGATCTCGACG
GCTGCTCAGGTGAATAAAAATGGATTTGAAACATAAAGGTATCACGTGGGTTGGAAACATGTTCCAGAAGTTTGAAGCAGTGTGCCAGGAAGTGGATAATATTATAAACC
AGGATACAGTTAAATATGTTGAAAACCAGGTTAGTTCAGCAAGTGAAAATGTGAAGAAATTATACTCTGGTGTTGTTCAAGGTTTACGTCCCCCTGTAGAAGATCCTGTG
GAATGTGAAGCTACAGCAACGGCTCAGAGGGGACGTGTTGCAAGAAATGCATATTTCAAGTCATCATTACTCAATGAAGGAAAAGCTGCTAATAATGCTGTTAATAAATC
ATCTGTGGGCCATGGTATTACTGATCAGATAGATAACCACAGTCAAGCATCTTGTAAAGTTTCCTTTGTAAATGAAGAAGTTGCTCAAGTTCCTAATCATTCTTTTCTAG
AGTGGAATGCTGTTAATAAATCATCTGTGGGCCATGGTATTACTGATCAGATAGATAACCACAGTCAAGCATCTTGTAAAGTTTCCTTTGTAAATGAAGAAGTTGCTCAA
GTTCCTAATCATTCTTTTCTAGAGTGGAATGTTGATTTACCTTTGCAAAAGAAAGATGATGTCTTGTTAGATAAAGACTTATATGAGAACATGAAAGAAAATGACGTCAA
TGAACTACTTTCAGAGAAAAGTGATACAGATAAGCTTTCCTTCATGGAGTCAGCTGCTAGTGATCCTTTGAGTCATTCACTGAGAAATGTAAGTAGTGACATTAATGATA
TAGCAAGCAAAAGAGCTTCTTCAGTTTGTGATGACTCTGATCTGCAATTGGAGGATGATGTACTTTTAGCAGGGAATAATGATGGGGGTTTGACAGATAAAGATGCAGGC
AAGAGTTCTGAAGAGGATATCACTATGGAGTTCAATGCTAGTGATCCTTTGAAGCATATGGCTAATCGAACATCTTGTCAAGTTAAAGTTACAAATGGAGAAGAACTGAT
TTCGGATAATTCTCATCTACCTTTGGAATCTTCCAGACTCTCAAGGAAGAATGACAATGACTCGTCAGATGAGAACTCAGATGAGTTTTTAAAGAAGGTTATCATCATGG
AGCCTAATGCTTCTGATAATTTGAACAACAACCATCTTATTCTTAGCCATGTATGGAGTGAGACAAACTTCGTAAGTGAAGAAGCTGATGATTCTAATTTGCTTTTGAAG
TCTGAGGTACTTTCAGGCAGAATTGATCATTCCTTGATGGATGAAGGCTTTAATGAGAGTCCTATAAAGGATGCTATCTTTGACTTTGAGGATGTTCCTAACAGTTATTT
GTTGAATCTGCCCAGTCATGGAAATGGAGTTGGCTTCACCTATGAAGAAGCTATTATGGTTTCTGATAGTAACCATCTGCAGTTGGAGCCTGAGCTACATGCTAGAGTGA
ATGAAGATGCCTTGACAGATAAGTACTCTAATGAAAGTTTAAAAGAAGATACCATCTTGGTGTTGGAGCATGATGCAAGTTATCCTTTAAAGAACCAGCCAAGATGTACA
TCAAGCTGCATAAAACATAAAAATGAAGAAGTTTCTTCAGTTTCAAATGATTCCCATCTAAAGTTGGAGAGTGAGATTTATTGTGGGAAGAATAGCGAAGCTTTAATAGA
TAAAGCATCAGATGCAAGTTGTGAAGGACAAGCCAATTTAGAATTATCAACTGAGTTAATTTCTCATTGTGACGAAGAGTCAATTAAGGAGACTTTGGGAAGTCGTGATA
ATGAATGTAAAGAGGATATTGTGACCTCAACTGGAGGTCCACTGGTAACTTCGATTCATGGTGCAGATGTTGAATCCACCCATAATGTAAACCAAGCTTCTAGCATCTTG
GTAAACAGTTTAGTTAACTTCTCACCAAAGATGGAGACAACTTCTAAGTACTTGGAAAATGGAACTGGTTGTTCTTCTAATGCTCCAGATGCTACTTCTTCTGAATTGAT
TTCACTAGTATTAACTAGTGGGGAAACTGTAGAGGAGTCGAGGCCAGTCTCCTCTTTGAAATCCCTACCAAAGAGTTCTTTTCCTGCTTCCAGAAGTTTGGTTAAAAACT
TTTCTAGTTGTACTGTTATTCATGAAAAACCTGCTGAACAGCGTGCATCTATTGAATATGAATCTCGTCCATGTTTTGAAGTGGTCACTAATGCATCTAATGAAAACAAG
GCTTTGAAGATGAAATTTGTCTCTTCCAGCAGTTCCTTATCATCAATGGAATCACTTGGGACCCATGCTTCTAGAGCCAATGAAAGTGCTTTTGTTCCTAAACTCCATGC
CAGTGGCCAGGTTGCATTTTCCGAATTTACTAGTTCTGGGACTCTAAGTCTCTCTACTGAAGTAGGTCGTACATACAATTCTGGATGTTCTATTCTGGATGCTGAAATGG
AAACAGTGGATTTGGGACATAAGGTGACCCTTAAAGACGAATGTGACGTTGTTGACTATAAAGCTCTCCATGCTGTCTCTCGCAGAGCCCAAAAGCTCCGTTCTTACAAG
AAGAAAATCCAGGATGCTTTCAATTCCAAGAAGAGGTTGGCTAAGGAGTATGAACAGCTAGCAATCTGGTATGGAGATATTGATCTGGAACTCAGTACAGACAGTTCACA
GAAGTTGGACACAACAACTAGTTATGCATCTGACTCTGAGTGGGAGATACTGTAAATAAAACAGCTAATTCAGTTTATCTCTGCAATCAGAATTGTTTCCAGGTAGAGGT
GAATCCTATATGCTGGAGATGAAGAGGAAGCTAGTCTGTAAATAAACACTCAAGATTAAGATTCATTACTTTATCTATTGAAGTGCATAACACCAGGTATAAGTTACCTT
GGAAATTCCAAAATAAACGAGTTGCAAGAAATTTTGCACATATTGGCACTCTTTCTGGGCTGTGAACCTATGATGTTTTAATTTATTTATTTAAAGTTATATATCCTGGT
AAATAAAGCTGTCCTGTTTTTTGTGTTTGGAA
Protein sequenceShow/hide protein sequence
MDLKHKGITWVGNMFQKFEAVCQEVDNIINQDTVKYVENQVSSASENVKKLYSGVVQGLRPPVEDPVECEATATAQRGRVARNAYFKSSLLNEGKAANNAVNKSSVGHGI
TDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNAVNKSSVGHGITDQIDNHSQASCKVSFVNEEVAQVPNHSFLEWNVDLPLQKKDDVLLDKDLYENMKENDVNELLSEK
SDTDKLSFMESAASDPLSHSLRNVSSDINDIASKRASSVCDDSDLQLEDDVLLAGNNDGGLTDKDAGKSSEEDITMEFNASDPLKHMANRTSCQVKVTNGEELISDNSHL
PLESSRLSRKNDNDSSDENSDEFLKKVIIMEPNASDNLNNNHLILSHVWSETNFVSEEADDSNLLLKSEVLSGRIDHSLMDEGFNESPIKDAIFDFEDVPNSYLLNLPSH
GNGVGFTYEEAIMVSDSNHLQLEPELHARVNEDALTDKYSNESLKEDTILVLEHDASYPLKNQPRCTSSCIKHKNEEVSSVSNDSHLKLESEIYCGKNSEALIDKASDAS
CEGQANLELSTELISHCDEESIKETLGSRDNECKEDIVTSTGGPLVTSIHGADVESTHNVNQASSILVNSLVNFSPKMETTSKYLENGTGCSSNAPDATSSELISLVLTS
GETVEESRPVSSLKSLPKSSFPASRSLVKNFSSCTVIHEKPAEQRASIEYESRPCFEVVTNASNENKALKMKFVSSSSSLSSMESLGTHASRANESAFVPKLHASGQVAF
SEFTSSGTLSLSTEVGRTYNSGCSILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRAQKLRSYKKKIQDAFNSKKRLAKEYEQLAIWYGDIDLELSTDSSQKLDTTT
SYASDSEWEIL