CuGenDBv2

Moc02g16440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g16440
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr2:12387354..12397272
RNA-Seq Expression: Moc02g16440
Synteny: Moc02g16440
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | e value: 2.4e-106 | % identity: 45.05
Query:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR
        R E D LR Q   ++E LK +C   + P    +  E  F+  +L+APIPP+FK   +  YDGS DP  YV+VFE  MDF A SDA+KCRAF+IAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR

Query:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML
         WYR+L   SI +Y QLR                      ++Q++ E+L EY+ RF +E +KV  C+DD AM YF TGL    LT++ G   PA+  E+L
Subjt:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML

Query:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEKD------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRR
         +A++ IDG        G+P   +  G +   I   D K        SGR           R   +++FTP    I+EI    E++ +E L   PEKLR 
Subjt:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEKD------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRR

Query:  PPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREA
         P +R K  YCRFH++HGH+TS  + LK Q+E+LIQ GY KK+VG+   +  E    + +R +S  P R+ DRPA+INTI GGPSGG+SG+KRK LAR A
Subjt:  PPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREA

Query:  AHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSLPVT
          EVC    + P  PI FD  D E VH+PHNDALVIAPLIDHV V RVL+DGG SANILS  TY AL                  GESV  EG + LPVT
Subjt:  AHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSLPVT

Query:  IGEGDQQVTKVAEFV
        +G+   QVT++AEFV
Subjt:  IGEGDQQVTKVAEFV

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia] | e value: 3.2e-106 | % identity: 66.67
Query:  MDFLAVSDAMKCRAFQIALEGSARFWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFT
        MDFLA SDA+KCRAFQIALEGS R WY+QLKP SIDSYQQLR                      VKQRD+ESLTEYIAR MDEHVKVVSCTDDIAMMYFT
Subjt:  MDFLAVSDAMKCRAFQIALEGSARFWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNARNLTIEFGSRPPASLNEMLTRARQYIDGL-----------SCGKPMELGEA------------------TMSIGRRDEKDPSGRRGPKFDKFTPL
        TGLN RNLTIEFGSRPPASLN+ML RARQYIDGL           S GK  +   +                    S G+ DE+  S R GPKFDKFTPL
Subjt:  TGLNARNLTIEFGSRPPASLNEMLTRARQYIDGL-----------SCGKPMELGEA------------------TMSIGRRDEKDPSGRRGPKFDKFTPL

Query:  NASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKED
        NAS+AEIYA  E+TD++ALF AP+KL RP GKRDKRLYCRFHKDHGH++SRCFHLKEQV+DLI+RGYLKKYVG RERA+PEGS  E KR++S PP RKED
Subjt:  NASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKED

Query:  RPAIINTIHGGPSGGRSG
        RPA+INTIHGGPSG +SG
Subjt:  RPAIINTIHGGPSGGRSG

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | e value: 6.9e-109 | % identity: 43.77
Query:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR
        R E D LR +   ++E LK +C   + P    +  E  F+  +L+AP         + SYDGS DP  YV+VFEG MDF A SDA+KCRAFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR

Query:  FWYRQLKPWSIDSYQQLRVKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDG-----LSCGKPME
         W                               F ++ +KV   +DD AM YF TGL    LT++ G   PA+  E+L +A++ IDG        G+P E
Subjt:  FWYRQLKPWSIDSYQQLRVKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDG-----LSCGKPME

Query:  LGEATMSIGRRDEKD---------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTS
         G      G+ ++ D          SGR           R   +++FTP    I+EI    E++ +E L   PEKLR  P +R+K  YCRFH++H H+TS
Subjt:  LGEATMSIGRRDEKD---------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTS

Query:  RCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDK--SHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDD
          + LK Q+EDLIQ  Y KK+VG+     P  S+ E K ++  S  P R+ DRPA+INTI GGPSGG+SG KRK LAR A  EVC    + P  PI FD 
Subjt:  RCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDK--SHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDD

Query:  RDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRS
         D E VH+PHNDALVIAPLIDHV VRRVL+D G SANI+S  TY ALG                  ESV  EGC+ LPVT+G    QVT++AEFVVID  
Subjt:  RDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRS

Query:  SAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRGP-----------------CADEPEPSR---------------GTPTEELELVPLL
        SAYNAI GRP+IH  RA+PST HQVLKY T  GV  VRG                  CA E   SR                 PTEELELVPLL
Subjt:  SAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRGP-----------------CADEPEPSR---------------GTPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | e value: 3.2e-114 | % identity: 45.02
Query:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR
        R E D L+ +F  ++E LK +C +        +  E SFS  IL+A IPP+FK   M  YDGS DP  YV+VFE  MDF A +DA+KC AFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR

Query:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML
         WYR+L    I +Y QLR                      ++Q++ E+L EY+ RF +E +KV  C+DD AM YF TGL    LT++     PA+  E+L
Subjt:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML

Query:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEK------------------DPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLR
         + ++ IDG        G+P   ++ G A    G+ D K                  + S  +   ++ +TP    I EI    E+T +E L   PEKLR
Subjt:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEK------------------DPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLR

Query:  RPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHK--RDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALA
          P KR+   YCRFH+DHGH+TS  + LK Q+EDLIQ GY KK+VG+     P  ++ E K  R +   P R++DRPA+IN             K+K LA
Subjt:  RPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHK--RDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALA

Query:  REAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSL
        REA  EVC    + P   I F+  D E VH+PHNDALVIAPLID V VRR+L+DGGASANILS STY AL                  GES+S+EGC+ L
Subjt:  REAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSL

Query:  PVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRG
        PV+I + D QVT++AEFVVID  SAYNAI GRP+IH  RAVPST HQVLKY T  GV TVRG
Subjt:  PVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRG

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia] | e value: 6.8e-141 | % identity: 61.69
Query:  MDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDGLSCGK-----------------------------PMELGEATMSIGR
        MDEHVKVVSCTDDIAMMYFTTGLN RNLTIEF SRPPASLNEM  RARQYIDGL   K                                  +   S  R
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDGLSCGK-----------------------------PMELGEATMSIGR

Query:  RDEKDPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEP
        RDE+  S RRGPKFDKFTPLNASIAEIYA  EDTD+E LFA+PEKLRRP GKR+KRLYCRFHKDHGHDTSRCFHLKEQVEDLI+ GYLKKYVG RE+AE 
Subjt:  RDEKDPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEP

Query:  EGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDG
        EGSA E KR++S PPR KEDRPA+INTIHGGPSG +SGQKRKALARE AHEVCTSYP+ P MPILFD++DGERVHMPHNDALVIAPLIDHVKVRRV +DG
Subjt:  EGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDG

Query:  GASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSA
        GASANI SFSTYTALG                  ESVS EGC+SLPVTI EG+ QVT+VAEFVVIDRSSAY  +       + ++  +TY +    P  A
Subjt:  GASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSA

Query:  GVATVRGPCADEPEPSRGTPTEELELVPLLGSEKQVP--------------RFENSNADALA
                            +   ELVPLLG ++QV               RF  SN+D  A
Subjt:  GVATVRGPCADEPEPSRGTPTEELELVPLLGSEKQVP--------------RFENSNADALA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 | e value: 1.2e-106 | % identity: 45.05
Query:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR
        R E D LR Q   ++E LK +C   + P    +  E  F+  +L+APIPP+FK   +  YDGS DP  YV+VFE  MDF A SDA+KCRAF+IAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR

Query:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML
         WYR+L   SI +Y QLR                      ++Q++ E+L EY+ RF +E +KV  C+DD AM YF TGL    LT++ G   PA+  E+L
Subjt:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML

Query:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEKD------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRR
         +A++ IDG        G+P   +  G +   I   D K        SGR           R   +++FTP    I+EI    E++ +E L   PEKLR 
Subjt:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEKD------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRR

Query:  PPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREA
         P +R K  YCRFH++HGH+TS  + LK Q+E+LIQ GY KK+VG+   +  E    + +R +S  P R+ DRPA+INTI GGPSGG+SG+KRK LAR A
Subjt:  PPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREA

Query:  AHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSLPVT
          EVC    + P  PI FD  D E VH+PHNDALVIAPLIDHV V RVL+DGG SANILS  TY AL                  GESV  EG + LPVT
Subjt:  AHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSLPVT

Query:  IGEGDQQVTKVAEFV
        +G+   QVT++AEFV
Subjt:  IGEGDQQVTKVAEFV

A0A6J1D5T3 uncharacterized protein LOC111017548 | e value: 1.5e-106 | % identity: 66.67
Query:  MDFLAVSDAMKCRAFQIALEGSARFWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFT
        MDFLA SDA+KCRAFQIALEGS R WY+QLKP SIDSYQQLR                      VKQRD+ESLTEYIAR MDEHVKVVSCTDDIAMMYFT
Subjt:  MDFLAVSDAMKCRAFQIALEGSARFWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNARNLTIEFGSRPPASLNEMLTRARQYIDGL-----------SCGKPMELGEA------------------TMSIGRRDEKDPSGRRGPKFDKFTPL
        TGLN RNLTIEFGSRPPASLN+ML RARQYIDGL           S GK  +   +                    S G+ DE+  S R GPKFDKFTPL
Subjt:  TGLNARNLTIEFGSRPPASLNEMLTRARQYIDGL-----------SCGKPMELGEA------------------TMSIGRRDEKDPSGRRGPKFDKFTPL

Query:  NASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKED
        NAS+AEIYA  E+TD++ALF AP+KL RP GKRDKRLYCRFHKDHGH++SRCFHLKEQV+DLI+RGYLKKYVG RERA+PEGS  E KR++S PP RKED
Subjt:  NASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKED

Query:  RPAIINTIHGGPSGGRSG
        RPA+INTIHGGPSG +SG
Subjt:  RPAIINTIHGGPSGGRSG

A0A6J1D9E1 uncharacterized protein LOC111018823 | e value: 3.3e-109 | % identity: 43.77
Query:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR
        R E D LR +   ++E LK +C   + P    +  E  F+  +L+AP         + SYDGS DP  YV+VFEG MDF A SDA+KCRAFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCRPVD-PHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR

Query:  FWYRQLKPWSIDSYQQLRVKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDG-----LSCGKPME
         W                               F ++ +KV   +DD AM YF TGL    LT++ G   PA+  E+L +A++ IDG        G+P E
Subjt:  FWYRQLKPWSIDSYQQLRVKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDG-----LSCGKPME

Query:  LGEATMSIGRRDEKD---------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTS
         G      G+ ++ D          SGR           R   +++FTP    I+EI    E++ +E L   PEKLR  P +R+K  YCRFH++H H+TS
Subjt:  LGEATMSIGRRDEKD---------PSGR-----------RGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTS

Query:  RCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDK--SHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDD
          + LK Q+EDLIQ  Y KK+VG+     P  S+ E K ++  S  P R+ DRPA+INTI GGPSGG+SG KRK LAR A  EVC    + P  PI FD 
Subjt:  RCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHKRDK--SHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDD

Query:  RDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRS
         D E VH+PHNDALVIAPLIDHV VRRVL+D G SANI+S  TY ALG                  ESV  EGC+ LPVT+G    QVT++AEFVVID  
Subjt:  RDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRS

Query:  SAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRGP-----------------CADEPEPSR---------------GTPTEELELVPLL
        SAYNAI GRP+IH  RA+PST HQVLKY T  GV  VRG                  CA E   SR                 PTEELELVPLL
Subjt:  SAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRGP-----------------CADEPEPSR---------------GTPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC111020479 | e value: 1.5e-114 | % identity: 45.02
Query:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR
        R E D L+ +F  ++E LK +C +        +  E SFS  IL+A IPP+FK   M  YDGS DP  YV+VFE  MDF A +DA+KC AFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDPISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSAR

Query:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML
         WYR+L    I +Y QLR                      ++Q++ E+L EY+ RF +E +KV  C+DD AM YF TGL    LT++     PA+  E+L
Subjt:  FWYRQLKPWSIDSYQQLR----------------------VKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEML

Query:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEK------------------DPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLR
         + ++ IDG        G+P   ++ G A    G+ D K                  + S  +   ++ +TP    I EI    E+T +E L   PEKLR
Subjt:  TRARQYIDG-----LSCGKP---MELGEATMSIGRRDEK------------------DPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLR

Query:  RPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHK--RDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALA
          P KR+   YCRFH+DHGH+TS  + LK Q+EDLIQ GY KK+VG+     P  ++ E K  R +   P R++DRPA+IN             K+K LA
Subjt:  RPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEPEGSAWEHK--RDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALA

Query:  REAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSL
        REA  EVC    + P   I F+  D E VH+PHNDALVIAPLID V VRR+L+DGGASANILS STY AL                  GES+S+EGC+ L
Subjt:  REAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDGGASANILSFSTYTAL------------------GESVSVEGCVSL

Query:  PVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRG
        PV+I + D QVT++AEFVVID  SAYNAI GRP+IH  RAVPST HQVLKY T  GV TVRG
Subjt:  PVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRG

A0A6J1E0L8 uncharacterized protein LOC111025310 | e value: 3.3e-141 | % identity: 61.69
Query:  MDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDGLSCGK-----------------------------PMELGEATMSIGR
        MDEHVKVVSCTDDIAMMYFTTGLN RNLTIEF SRPPASLNEM  RARQYIDGL   K                                  +   S  R
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNEMLTRARQYIDGLSCGK-----------------------------PMELGEATMSIGR

Query:  RDEKDPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEP
        RDE+  S RRGPKFDKFTPLNASIAEIYA  EDTD+E LFA+PEKLRRP GKR+KRLYCRFHKDHGHDTSRCFHLKEQVEDLI+ GYLKKYVG RE+AE 
Subjt:  RDEKDPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIQRGYLKKYVGRRERAEP

Query:  EGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDG
        EGSA E KR++S PPR KEDRPA+INTIHGGPSG +SGQKRKALARE AHEVCTSYP+ P MPILFD++DGERVHMPHNDALVIAPLIDHVKVRRV +DG
Subjt:  EGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHVKVRRVLIDG

Query:  GASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSA
        GASANI SFSTYTALG                  ESVS EGC+SLPVTI EG+ QVT+VAEFVVIDRSSAY  +       + ++  +TY +    P  A
Subjt:  GASANILSFSTYTALG------------------ESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSA

Query:  GVATVRGPCADEPEPSRGTPTEELELVPLLGSEKQVP--------------RFENSNADALA
                            +   ELVPLLG ++QV               RF  SN+D  A
Subjt:  GVATVRGPCADEPEPSRGTPTEELELVPLLGSEKQVP--------------RFENSNADALA

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCCTCTAAACACTGATCTTGCCTGTTTCGAGGAGAAGAGGGCGCCCAAGACAGGTGATCGCTTCTACTTGTGTGCTCGGAAGATCTTAGGTACCGTCATCGACGGTCC
TTCCTCCGTGAAGAAATGGAAGGAAAAGTCGTTCTACGCTTTTGTTGGGACTTTCTCTACCTTTAAAATGATTAGAGACTCGTGCAAGCGCACACGGTTACAAGTAATAC
TTGGGAGTAATCCAAGGTCGAATTCTCATGGACTTGTTGCTTTTAAAACTGTTTTTAAGAAGTCTACTGATACATATTTCTGCAGTAAAAAGTCACCCCTAACAACTTTA
GAGGATTGGCTCGCTCTGGACGAGCTAGACGAGCAAGTGTCTTCAAAGCCAATACTGGAGCTGACTGAGCAGAGTGTCAAGATCCTTCTTAACTTTCAAAACTCTGAAAC
TTGCTCTGAAGAGCCTGAGACTTCAAACCTCGCCATGATGTCTCTTTATGCAGAGAAAGTGAGGAGAAAACGTCCTTCAACCAAGTCGACCAAGTCTGGAGCTGGACCAT
TAGACTCTTGGCATTGCCCCTCTTCAAGTCGTGGTGCAGGAAGAGCCTTCTCTAAAACGGTTGAAGATAAAGAAGAGGAAGGTGACATGACCAGACGTTTTGTTCATGAA
GGCCGAGCTGGAAACTCGAAACCAGCTGGTGGAGAAAGACCATATGGCCTCACTGTCGGTCCAAGAGAAGGCAAAGGCCGCATGAAGGATATTCTTCTAGAGAAGGCGTA
CACCATGGTGACCAATCTGCGCAATGAAAGTAGTTTCCTAGTCGAAAAGGGTAAGACCAGAGAGGCTGTCATAGGCTCGACCAAGGATGAGCTAAGGAAGCTGAAAGCCA
ACCTCAAGTACGTTGAATTCATAGAGATCTCCTTCAAGAAACTCTCTAAGTTTGATGAGTTTGCCAATGCATTCTCAGATTCTTGGTTGAGGTCTCAAGAAGTGCTTAGA
GAAGGTGTGGGACCATTGGCCATAGTTTGTCCTGAGCGTACCGATCTAAATTGTATCCGAGCTCATCAATTTCGGGCTCAAACATCTGGCGCCATCTGTGGGGAAGACAC
ATCTGCAAGTCAGCCTCACCTTTTGCAATCTGAGCCAATGGCCCAAACCCGCTCCCAGCAGTCACGATCTCAACTTTCGCCTATCTGCTCTCCGATGAGGGATGCTGAAA
ATGGAGGTCGGCCGGCCATATCCGATCCAGTAGCAGTCCGGGACTTCCACCTCGCCTCAGATCAGTTCCCGCCACTGCAACCTCAGAAAAACGGGTTGCCGCCCCGCGCA
CCTCGCCTCCGTGGTTGGGGAAACACAGGTGCGCGTTCCGGGGCAAGTGCTGATGCGGGCGTGGACCCCATCATAGTAGCCAACGTGATCACCGAGCTCAGGGAAGTCAT
GGCAAGACTCAAGGCGGTCGAGAGAGGCAGCGAGGTGTCTGGCTCTTCCGTCTCCAGGGATCCCATCCGAGGAAAGGGACCGATGCATCCTACCCAAAGAACGGATGCCC
TGGAAGCACAGGTTCGCGACCACCCTCAGCAGGACAATCGGGTCGAGGGCCGGCGCCCGAGGATCCGACCAATTCGGACTCCCTTGGCCTCTTTCGATAGCTCCAACGCC
CATCAGGGTCGAGGTGCCGAGACGCCGAGGCGACAAGTAGTTGCACCCGGAGATCGGGAATATTTGGTTGACGATGAGGAGGAAAGCCCAGTGGTCGACGTTCAAGAGAG
GTCCTCCTACGCTGACCATTCGTTCCGGTCTGAAGTGGACCTTCTCCGAGATCAGTTTCAAAGGGAGATAGAAGATCTCAAGCGACAGTGCAGGCCTGTGGATCCGCATC
GCGTGGCCGAGCAAGAGGAACCGTCTTTCTCCCAAGCGATCTTGGATGCGCCTATCCCACCGAGGTTCAAGCCTCTAGTCATGAGTTCTTACGACGGGTCTGGAGATCCG
ATCTCCTACGTAAAGGTGTTCGAGGGGAAGATGGATTTCCTGGCCGTAAGCGACGCTATGAAGTGCCGAGCATTTCAAATAGCCTTGGAAGGCTCGGCAAGATTTTGGTA
CCGACAGTTGAAGCCCTGGTCCATCGACAGTTACCAACAGCTGAGAGTAAAGCAACGAGACAGCGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGACGAGCATGTCA
AAGTGGTAAGTTGCACCGATGACATTGCCATGATGTACTTCACCACGGGCTTGAACGCCAGGAACCTGACGATAGAGTTCGGAAGCCGACCACCGGCCTCCCTGAACGAG
ATGCTTACTAGAGCTCGCCAGTACATCGACGGCTTGAGTTGTGGAAAGCCAATGGAGCTCGGGGAAGCAACCATGAGTATAGGCCGCCGAGACGAGAAGGACCCTTCGGG
CCGTCGAGGGCCAAAGTTCGACAAGTTCACTCCGCTGAACGCCTCAATCGCGGAGATCTACGCAGCGGCCGAAGATACCGACCTGGAGGCGCTTTTCGCAGCCCCAGAAA
AGCTCCGCCGACCTCCAGGGAAACGAGACAAGCGACTCTATTGCCGATTCCACAAGGATCATGGCCACGACACTTCACGTTGTTTCCACTTGAAGGAACAGGTTGAGGAT
CTGATCCAGAGGGGTTATCTGAAAAAATACGTCGGCAGGCGTGAAAGGGCGGAGCCAGAAGGGTCGGCTTGGGAGCACAAGCGAGATAAGTCGCACCCGCCGAGACGGAA
GGAAGATCGTCCCGCCATTATAAATACCATCCATGGGGGCCCGAGTGGGGGACGGTCGGGGCAGAAGAGAAAAGCTCTGGCTCGGGAAGCGGCACACGAGGTTTGTACCT
CGTACCCTAGGGAGCCTGCAATGCCGATCTTATTTGACGACCGAGATGGCGAAAGAGTGCACATGCCCCATAATGATGCCCTGGTAATCGCCCCACTCATAGATCATGTG
AAGGTGAGAAGAGTTCTTATCGACGGAGGAGCGTCGGCCAACATCTTATCGTTCTCGACCTACACGGCCCTGGGGGAGTCAGTCAGCGTGGAAGGGTGTGTCTCGCTCCC
TGTTACCATCGGCGAGGGAGATCAACAAGTAACTAAGGTTGCAGAATTTGTCGTGATAGATCGGAGCTCTGCGTACAACGCCATAATCGGTCGGCCCTTGATTCATGATC
TCAGGGCAGTTCCATCTACTTACCACCAGGTCTTGAAGTATCCCACCTCGGCCGGAGTTGCGACAGTCCGGGGGCCGTGTGCCGACGAACCAGAGCCGAGCCGTGGCACC
CCAACTGAAGAACTAGAACTTGTCCCCCTGCTGGGGTCGGAAAAGCAGGTGCCGAGGTTCGAAAACTCCAATGCCGACGCACTGGCTCGCCTGGCCTCGGCATACGAGAC
CGATCTACCAAGAACGGTTCCAGTTGAAATACTTTCTGAGTCGTCCATCGACCGGCCTGAGGTAATAGAGATCCAATCAGCTGAGCCTACATGGATGAACCCAATTAAGG
ACTTCCTGGTCAATGGCGCTGTTCTCGCCGATCCGAGGCAGGCTAGGAAGCTCCGACGCCAAGCTGCTCACTACTTGATGCAAGAAGGCAAGCTCTTCAAGAGGGGATAT
TCCCTACCATTACTGCGAGTGGTTACCACTACCGAGCTACATCAAATGGGGGCCAAGGAGGACTACGTGGTATGA
mRNA sequence
ATGCCTCTAAACACTGATCTTGCCTGTTTCGAGGAGAAGAGGGCGCCCAAGACAGGTGATCGCTTCTACTTGTGTGCTCGGAAGATCTTAGGTACCGTCATCGACGGTCC
TTCCTCCGTGAAGAAATGGAAGGAAAAGTCGTTCTACGCTTTTGTTGGGACTTTCTCTACCTTTAAAATGATTAGAGACTCGTGCAAGCGCACACGGTTACAAGTAATAC
TTGGGAGTAATCCAAGGTCGAATTCTCATGGACTTGTTGCTTTTAAAACTGTTTTTAAGAAGTCTACTGATACATATTTCTGCAGTAAAAAGTCACCCCTAACAACTTTA
GAGGATTGGCTCGCTCTGGACGAGCTAGACGAGCAAGTGTCTTCAAAGCCAATACTGGAGCTGACTGAGCAGAGTGTCAAGATCCTTCTTAACTTTCAAAACTCTGAAAC
TTGCTCTGAAGAGCCTGAGACTTCAAACCTCGCCATGATGTCTCTTTATGCAGAGAAAGTGAGGAGAAAACGTCCTTCAACCAAGTCGACCAAGTCTGGAGCTGGACCAT
TAGACTCTTGGCATTGCCCCTCTTCAAGTCGTGGTGCAGGAAGAGCCTTCTCTAAAACGGTTGAAGATAAAGAAGAGGAAGGTGACATGACCAGACGTTTTGTTCATGAA
GGCCGAGCTGGAAACTCGAAACCAGCTGGTGGAGAAAGACCATATGGCCTCACTGTCGGTCCAAGAGAAGGCAAAGGCCGCATGAAGGATATTCTTCTAGAGAAGGCGTA
CACCATGGTGACCAATCTGCGCAATGAAAGTAGTTTCCTAGTCGAAAAGGGTAAGACCAGAGAGGCTGTCATAGGCTCGACCAAGGATGAGCTAAGGAAGCTGAAAGCCA
ACCTCAAGTACGTTGAATTCATAGAGATCTCCTTCAAGAAACTCTCTAAGTTTGATGAGTTTGCCAATGCATTCTCAGATTCTTGGTTGAGGTCTCAAGAAGTGCTTAGA
GAAGGTGTGGGACCATTGGCCATAGTTTGTCCTGAGCGTACCGATCTAAATTGTATCCGAGCTCATCAATTTCGGGCTCAAACATCTGGCGCCATCTGTGGGGAAGACAC
ATCTGCAAGTCAGCCTCACCTTTTGCAATCTGAGCCAATGGCCCAAACCCGCTCCCAGCAGTCACGATCTCAACTTTCGCCTATCTGCTCTCCGATGAGGGATGCTGAAA
ATGGAGGTCGGCCGGCCATATCCGATCCAGTAGCAGTCCGGGACTTCCACCTCGCCTCAGATCAGTTCCCGCCACTGCAACCTCAGAAAAACGGGTTGCCGCCCCGCGCA
CCTCGCCTCCGTGGTTGGGGAAACACAGGTGCGCGTTCCGGGGCAAGTGCTGATGCGGGCGTGGACCCCATCATAGTAGCCAACGTGATCACCGAGCTCAGGGAAGTCAT
GGCAAGACTCAAGGCGGTCGAGAGAGGCAGCGAGGTGTCTGGCTCTTCCGTCTCCAGGGATCCCATCCGAGGAAAGGGACCGATGCATCCTACCCAAAGAACGGATGCCC
TGGAAGCACAGGTTCGCGACCACCCTCAGCAGGACAATCGGGTCGAGGGCCGGCGCCCGAGGATCCGACCAATTCGGACTCCCTTGGCCTCTTTCGATAGCTCCAACGCC
CATCAGGGTCGAGGTGCCGAGACGCCGAGGCGACAAGTAGTTGCACCCGGAGATCGGGAATATTTGGTTGACGATGAGGAGGAAAGCCCAGTGGTCGACGTTCAAGAGAG
GTCCTCCTACGCTGACCATTCGTTCCGGTCTGAAGTGGACCTTCTCCGAGATCAGTTTCAAAGGGAGATAGAAGATCTCAAGCGACAGTGCAGGCCTGTGGATCCGCATC
GCGTGGCCGAGCAAGAGGAACCGTCTTTCTCCCAAGCGATCTTGGATGCGCCTATCCCACCGAGGTTCAAGCCTCTAGTCATGAGTTCTTACGACGGGTCTGGAGATCCG
ATCTCCTACGTAAAGGTGTTCGAGGGGAAGATGGATTTCCTGGCCGTAAGCGACGCTATGAAGTGCCGAGCATTTCAAATAGCCTTGGAAGGCTCGGCAAGATTTTGGTA
CCGACAGTTGAAGCCCTGGTCCATCGACAGTTACCAACAGCTGAGAGTAAAGCAACGAGACAGCGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGACGAGCATGTCA
AAGTGGTAAGTTGCACCGATGACATTGCCATGATGTACTTCACCACGGGCTTGAACGCCAGGAACCTGACGATAGAGTTCGGAAGCCGACCACCGGCCTCCCTGAACGAG
ATGCTTACTAGAGCTCGCCAGTACATCGACGGCTTGAGTTGTGGAAAGCCAATGGAGCTCGGGGAAGCAACCATGAGTATAGGCCGCCGAGACGAGAAGGACCCTTCGGG
CCGTCGAGGGCCAAAGTTCGACAAGTTCACTCCGCTGAACGCCTCAATCGCGGAGATCTACGCAGCGGCCGAAGATACCGACCTGGAGGCGCTTTTCGCAGCCCCAGAAA
AGCTCCGCCGACCTCCAGGGAAACGAGACAAGCGACTCTATTGCCGATTCCACAAGGATCATGGCCACGACACTTCACGTTGTTTCCACTTGAAGGAACAGGTTGAGGAT
CTGATCCAGAGGGGTTATCTGAAAAAATACGTCGGCAGGCGTGAAAGGGCGGAGCCAGAAGGGTCGGCTTGGGAGCACAAGCGAGATAAGTCGCACCCGCCGAGACGGAA
GGAAGATCGTCCCGCCATTATAAATACCATCCATGGGGGCCCGAGTGGGGGACGGTCGGGGCAGAAGAGAAAAGCTCTGGCTCGGGAAGCGGCACACGAGGTTTGTACCT
CGTACCCTAGGGAGCCTGCAATGCCGATCTTATTTGACGACCGAGATGGCGAAAGAGTGCACATGCCCCATAATGATGCCCTGGTAATCGCCCCACTCATAGATCATGTG
AAGGTGAGAAGAGTTCTTATCGACGGAGGAGCGTCGGCCAACATCTTATCGTTCTCGACCTACACGGCCCTGGGGGAGTCAGTCAGCGTGGAAGGGTGTGTCTCGCTCCC
TGTTACCATCGGCGAGGGAGATCAACAAGTAACTAAGGTTGCAGAATTTGTCGTGATAGATCGGAGCTCTGCGTACAACGCCATAATCGGTCGGCCCTTGATTCATGATC
TCAGGGCAGTTCCATCTACTTACCACCAGGTCTTGAAGTATCCCACCTCGGCCGGAGTTGCGACAGTCCGGGGGCCGTGTGCCGACGAACCAGAGCCGAGCCGTGGCACC
CCAACTGAAGAACTAGAACTTGTCCCCCTGCTGGGGTCGGAAAAGCAGGTGCCGAGGTTCGAAAACTCCAATGCCGACGCACTGGCTCGCCTGGCCTCGGCATACGAGAC
CGATCTACCAAGAACGGTTCCAGTTGAAATACTTTCTGAGTCGTCCATCGACCGGCCTGAGGTAATAGAGATCCAATCAGCTGAGCCTACATGGATGAACCCAATTAAGG
ACTTCCTGGTCAATGGCGCTGTTCTCGCCGATCCGAGGCAGGCTAGGAAGCTCCGACGCCAAGCTGCTCACTACTTGATGCAAGAAGGCAAGCTCTTCAAGAGGGGATAT
TCCCTACCATTACTGCGAGTGGTTACCACTACCGAGCTACATCAAATGGGGGCCAAGGAGGACTACGTGGTATGA
Protein sequence
MPLNTDLACFEEKRAPKTGDRFYLCARKILGTVIDGPSSVKKWKEKSFYAFVGTFSTFKMIRDSCKRTRLQVILGSNPRSNSHGLVAFKTVFKKSTDTYFCSKKSPLTTL
EDWLALDELDEQVSSKPILELTEQSVKILLNFQNSETCSEEPETSNLAMMSLYAEKVRRKRPSTKSTKSGAGPLDSWHCPSSSRGAGRAFSKTVEDKEEEGDMTRRFVHE
GRAGNSKPAGGERPYGLTVGPREGKGRMKDILLEKAYTMVTNLRNESSFLVEKGKTREAVIGSTKDELRKLKANLKYVEFIEISFKKLSKFDEFANAFSDSWLRSQEVLR
EGVGPLAIVCPERTDLNCIRAHQFRAQTSGAICGEDTSASQPHLLQSEPMAQTRSQQSRSQLSPICSPMRDAENGGRPAISDPVAVRDFHLASDQFPPLQPQKNGLPPRA
PRLRGWGNTGARSGASADAGVDPIIVANVITELREVMARLKAVERGSEVSGSSVSRDPIRGKGPMHPTQRTDALEAQVRDHPQQDNRVEGRRPRIRPIRTPLASFDSSNA
HQGRGAETPRRQVVAPGDREYLVDDEEESPVVDVQERSSYADHSFRSEVDLLRDQFQREIEDLKRQCRPVDPHRVAEQEEPSFSQAILDAPIPPRFKPLVMSSYDGSGDP
ISYVKVFEGKMDFLAVSDAMKCRAFQIALEGSARFWYRQLKPWSIDSYQQLRVKQRDSESLTEYIARFMDEHVKVVSCTDDIAMMYFTTGLNARNLTIEFGSRPPASLNE
MLTRARQYIDGLSCGKPMELGEATMSIGRRDEKDPSGRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAAPEKLRRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVED
LIQRGYLKKYVGRRERAEPEGSAWEHKRDKSHPPRRKEDRPAIINTIHGGPSGGRSGQKRKALAREAAHEVCTSYPREPAMPILFDDRDGERVHMPHNDALVIAPLIDHV
KVRRVLIDGGASANILSFSTYTALGESVSVEGCVSLPVTIGEGDQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTSAGVATVRGPCADEPEPSRGT
PTEELELVPLLGSEKQVPRFENSNADALARLASAYETDLPRTVPVEILSESSIDRPEVIEIQSAEPTWMNPIKDFLVNGAVLADPRQARKLRRQAAHYLMQEGKLFKRGY
SLPLLRVVTTTELHQMGAKEDYVV