; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024601 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024601
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00002486:1097411..1103558
RNA-Seq ExpressionSgr024601
SyntenySgr024601
Gene Ontology termsNA
InterPro domainsIPR027902 - Protein of unknown function DUF4487


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK17380.1 uncharacterized protein E5676_scaffold434G002400 [Cucumis melo var. makuwa]0.0e+0079.75Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK--------YL---VWKASKDDAFVNPGVTGRRAFQPV--------------------FA
        M+G  SS DLQSI+DAISSSDVVESRIQLL KLEDLDLS K        +L   V KA KDDA+VNPGVTGRRAF+PV                    F+
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK--------YL---VWKASKDDAFVNPGVTGRRAFQPV--------------------FA

Query:  ALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCEL
        ALAR P SEDK LM+TVENFTLEQLNL+ ESVSEIQ I +FG EILKAVQMVIDA I F E H QALDWE S E+F+KTS + NH INV+KCIIEKLCEL
Subjt:  ALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCEL

Query:  GTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAY
        GTIAAKGGGGLVT+LNVSWKGVFT+LQ GN+V+ SKVN+A IIL LVSLVIEP+K AAATWSS MKE VSAT+ARRIFLP+KFFLINAVKIS LCPCQAY
Subjt:  GTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAY

Query:  LVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMN
        LV KEII C+L+I TYKV +SNEKLLETV EAITELLEQ CLDLVKC LNST+LKQDLK +I+DLLFTTERCS+PD   SACF  DP + IF+ NCEG N
Subjt:  LVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMN

Query:  DARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSF
        DA+ILLLGRINFLLNL+++S DLSDDAKL IT+KLNWLLDILVQEDVYASVLLL+VPF Y SG+TTELKW PL S LLHALKTFMVAVS +YAW+ELQ F
Subjt:  DARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSF

Query:  LLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVA
        LL+NLLHPHFLCWD+VMELWCFMLRYAD+ LVN VIS  FSVMK LASSEPVL+YSS LRKMARS+TMLLTYGAH+K NEIFESI IQDKS+LSTV+WVA
Subjt:  LLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVA

Query:  LILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLIS
        LILEGF LNLLSEK KNI +Q+ I DYL+FIG+F+ETSML SSS  IGLPV SAS  IQSMKLS SDIDVRTLKFLLALLRSYKISGVEQ KG C KLIS
Subjt:  LILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLIS

Query:  ETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAAR
        ETLGIIS ++HLYA+NEMEEVILELEKLFISGPTASDALLYECKS LAPFLAGLAHIK+TETD+NAKSCAVWELYHMLFKERHWAFIHL LTAFGYFAAR
Subjt:  ETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAAR

Query:  TSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPE
        TSCDELWRFVPQNAALSYDLESGKQV+E+GFM+EFKIFLEKEMALLTVTP +EQLALLMKEGL+LKDMLN  LK CG G +CKSME DEGPSSRKRKLPE
Subjt:  TSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPE

Query:  GMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD
        G+SKGMELLKNGLKVMRQG+SLLEE+HVDSRELH+KL SHFSGLEDE+ RL SQGG D
Subjt:  GMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD

XP_022146998.1 uncharacterized protein LOC111016053 isoform X1 [Momordica charantia]0.0e+0081.04Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---
        MEGR SS DLQSI+DAISSSDVVESRIQLL KLEDLDLSSK           + W                       +  KD A     F+  GV    
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---

Query:  ---------------------GRRAFQPV-------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAM
                                 FQ V             F+ALARYPL EDK LM TVENFTLEQLNL+NESVSEIQRIQEFGSEILKAVQM+IDAM
Subjt:  ---------------------GRRAFQPV-------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAM

Query:  INFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKC
        I FCEVHSQALD EFSDE  DKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAG ILILVSLVIEPLKC
Subjt:  INFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKC

Query:  AAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQ
        AAATWSS  KEAVSA EARRIFLP+KFFLINAVKIS LCPCQAYLVHKEIILCVL IFTYK SLS+EKLLETVAEAITELLE  CLDLVKC +NST+LKQ
Subjt:  AAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQ

Query:  DLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRV
        DLKLEI+ LLFTTERCS+PD DPSACFRIDPM+ +FSINCEGMN+ + LLLGRIN LLNLLRYS DLSDDAKL IT+KL+WLLD LVQEDVYASVLLL+V
Subjt:  DLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRV

Query:  PFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYS
        PFLY SG+TTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWD+VMELWCFMLRYADNGLVNGVISN  SVMK+L SSEPVL+YS
Subjt:  PFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYS

Query:  SVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASA
        S LRKMARSI M+LTYGAHSKLNEI E+I IQDKSRLSTV+ VALILEGFPLNLLSEK K+IA+Q ++HDYLSFIG+FDETSMLTSSSG+IGLPV SASA
Subjt:  SVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASA

Query:  TIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAH
        TIQSMKLS SDIDVRTLKFLLALLR YKISGV +VKGFC KLISETLGIIS MKHLYASNEMEEVILELEKLFISGPTASD LLYECKSGLAPFLAGLAH
Subjt:  TIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAH

Query:  IKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLA
        IK+TETD+NAKSCAVWELYHMLFKERHWA IHL LTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFM+EFKIFLEKEMALLTVT S +QLA
Subjt:  IKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLA

Query:  LLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG
        LLMKEGLMLKD+LNKILKSC KGIECKSM+TDEGPSSRKRKLPEG+SKGM+LLKNGLKVMRQG+SLLE SHVDSRELHNKL SHF GLEDEI RL +QGG
Subjt:  LLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG

Query:  SD
         D
Subjt:  SD

XP_022147009.1 uncharacterized protein LOC111016053 isoform X2 [Momordica charantia]0.0e+0081.18Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---
        MEGR SS DLQSI+DAISSSDVVESRIQLL KLEDLDLSSK           + W                       +  KD A     F+  GV    
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---

Query:  ---------------------GRRAFQPV----------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF
                                 FQ +          F+ALARYPL EDK LM TVENFTLEQLNL+NESVSEIQRIQEFGSEILKAVQM+IDAMI F
Subjt:  ---------------------GRRAFQPV----------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF

Query:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA
        CEVHSQALD EFSDE  DKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAG ILILVSLVIEPLKCAAA
Subjt:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA

Query:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK
        TWSS  KEAVSA EARRIFLP+KFFLINAVKIS LCPCQAYLVHKEIILCVL IFTYK SLS+EKLLETVAEAITELLE  CLDLVKC +NST+LKQDLK
Subjt:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK

Query:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL
        LEI+ LLFTTERCS+PD DPSACFRIDPM+ +FSINCEGMN+ + LLLGRIN LLNLLRYS DLSDDAKL IT+KL+WLLD LVQEDVYASVLLL+VPFL
Subjt:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL

Query:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL
        Y SG+TTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWD+VMELWCFMLRYADNGLVNGVISN  SVMK+L SSEPVL+YSS L
Subjt:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL

Query:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ
        RKMARSI M+LTYGAHSKLNEI E+I IQDKSRLSTV+ VALILEGFPLNLLSEK K+IA+Q ++HDYLSFIG+FDETSMLTSSSG+IGLPV SASATIQ
Subjt:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ

Query:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL
        SMKLS SDIDVRTLKFLLALLR YKISGV +VKGFC KLISETLGIIS MKHLYASNEMEEVILELEKLFISGPTASD LLYECKSGLAPFLAGLAHIK+
Subjt:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL

Query:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM
        TETD+NAKSCAVWELYHMLFKERHWA IHL LTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFM+EFKIFLEKEMALLTVT S +QLALLM
Subjt:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM

Query:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD
        KEGLMLKD+LNKILKSC KGIECKSM+TDEGPSSRKRKLPEG+SKGM+LLKNGLKVMRQG+SLLE SHVDSRELHNKL SHF GLEDEI RL +QGG D
Subjt:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD

XP_022949176.1 uncharacterized protein LOC111452603 [Cucurbita moschata]0.0e+0079.04Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK---------------------------------YLVWKASKDDA---------FVNPGV
        MEG  S+  LQSIIDAISSSDVVESRIQLL KLEDLDLS K                                   V +  KD A          V   V
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK---------------------------------YLVWKASKDDA---------FVNPGV

Query:  TGRRAFQPV------------------------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF
          R+  +                                F+ALARYPLSEDKALM+TVENF LEQLNL+NESVSEIQRI+EFG EILKAVQMVIDAMI F
Subjt:  TGRRAFQPV------------------------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF

Query:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA
        CEVHSQALD EFS E FD TSSA NH INVHK IIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQ GN VL SKVNIAGIIL LVSLV+EPLKCAAA
Subjt:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA

Query:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK
        TWSS   EAVSA+EARRIFLP+KFFLINAVKIS L PCQAYLVHKEIILCVL I TYKVSLSNEKLL TVAEAITELLE  CLDLVKC LN+T+LKQDLK
Subjt:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK

Query:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL
        L I+DLLFT+ERCS+PD DPS CFRIDPM+ IF+ NCE M DA+ LLLGRINFLLNLLR+S DLSDD KL IT+KL+ LLDILVQEDVYASVLLL+VPFL
Subjt:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL

Query:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL
        YFSG+TTELKWQPLFSSL+HALKTFMVAVSSS AW+ELQSFLL+NLLHPHFLCWD+VMELWCFMLRYAD+GLVNGVISNFFSVMK LASSE VL +SS L
Subjt:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL

Query:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ
        RKMAR ITMLLTYGAHSKLNEI ESI IQDKSR STV+W ALILEGFPLNLLSEK KNIA+Q++IHDYL+FIGSFDETSML  SS +IGLPV SAS TIQ
Subjt:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ

Query:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL
        S+KLS SDIDVRTLKFLLALLRSYK+SGVE+VKGFC KLISETL IIS MKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGL PFLAGLAHIK+
Subjt:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL

Query:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM
         ETD+NAKSCAVWELYHMLFKERHWAFIHL LTAFGYFAARTSCDELWRFVPQNAALSYDLESGK VNEEGFM+EFKIFLEKEMALLTVTPSAEQLALLM
Subjt:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM

Query:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG
        KEGL+LKD  N +LKSCGKGIECKSME DEGPSSRKRKLPEG+SKGMELLKNGLK MRQG+SLLE +HVDSRELHNKLLSHFSGLEDEI RLDSQGG
Subjt:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG

XP_023540692.1 uncharacterized protein LOC111800982 [Cucurbita pepo subsp. pepo]0.0e+0078.82Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW----------------------KASKDDA---------FVNPGVT
        MEG  S+ DLQSIIDAISSSDVVESRIQLL KLEDLDLS K           + W                      K  +D A          V   V 
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW----------------------KASKDDA---------FVNPGVT

Query:  GRRAFQPV------------------------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFC
         R+  +                                F+ALARYPLSEDKALM+TVENF LEQLNL+NESVSEIQRI+EFG EILKAVQMVIDAMI FC
Subjt:  GRRAFQPV------------------------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFC

Query:  EVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAAT
        EVHSQALD EFS E FD TSSA NH INVHK IIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQ GN VL SKVNIAGIIL LVSLV+EPLKCAAAT
Subjt:  EVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAAT

Query:  WSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKL
        WSS   EAVSA+EARRIFLP+KFFLINAVKIS L PCQAYLVHKEIILCVL I TYKVSLSNEKLL TVAEAI ELLE  CLDLVKC LN+T+LKQDLKL
Subjt:  WSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKL

Query:  EIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLY
         I+DLLFT+ERCS+PD DPS CFRIDPM  IF+ NCE M DA+ LLLGRINFLLNLLR+S DLSDD KL IT+KL+ LLDILVQEDVYASVLLL+VPFLY
Subjt:  EIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLY

Query:  FSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLR
        FSG+TTELKWQPLFSSL+HALKTFMVAVSSS AW+ELQSFLL+NLLHPHFLCWD+VMELWCFMLRYAD+GLVNGVISN FSVMK LASSE VL++SS LR
Subjt:  FSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLR

Query:  KMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQS
        KMAR ITMLLTYGAHSKLNEI ESI IQDKSR STV+W ALILEGFPLNLLSEK KNI +Q++IHDYL+FIGSFDETSML  SS +IGLPV SAS TIQS
Subjt:  KMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQS

Query:  MKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLT
        +KLS SDIDVRTLKFLLALLRSYK+SGVE+VKGFC KLISETL IIS MKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGL PFLAGLAHIK+ 
Subjt:  MKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLT

Query:  ETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMK
        ET++NAKSCAVWELYHMLFKERHWAFIHL LTAFGYFAARTSCDELWRFVPQNAALSYDLESGK VNEEGFM+EFKIFLEKEMALLTVTPSAEQLALLMK
Subjt:  ETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMK

Query:  EGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG
        EGL+LKD  N +LKSCGKGIECKSME DEGPSSRKRKLPEG+SKGMELLKNGLK MRQG+SLLE +HVDSRELHNKLLSHFSGLEDEI RLDSQGG
Subjt:  EGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG

TrEMBL top hitse value%identityAlignment
A0A5D3D190 Uncharacterized protein0.0e+0079.75Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK--------YL---VWKASKDDAFVNPGVTGRRAFQPV--------------------FA
        M+G  SS DLQSI+DAISSSDVVESRIQLL KLEDLDLS K        +L   V KA KDDA+VNPGVTGRRAF+PV                    F+
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK--------YL---VWKASKDDAFVNPGVTGRRAFQPV--------------------FA

Query:  ALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCEL
        ALAR P SEDK LM+TVENFTLEQLNL+ ESVSEIQ I +FG EILKAVQMVIDA I F E H QALDWE S E+F+KTS + NH INV+KCIIEKLCEL
Subjt:  ALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCEL

Query:  GTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAY
        GTIAAKGGGGLVT+LNVSWKGVFT+LQ GN+V+ SKVN+A IIL LVSLVIEP+K AAATWSS MKE VSAT+ARRIFLP+KFFLINAVKIS LCPCQAY
Subjt:  GTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAY

Query:  LVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMN
        LV KEII C+L+I TYKV +SNEKLLETV EAITELLEQ CLDLVKC LNST+LKQDLK +I+DLLFTTERCS+PD   SACF  DP + IF+ NCEG N
Subjt:  LVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMN

Query:  DARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSF
        DA+ILLLGRINFLLNL+++S DLSDDAKL IT+KLNWLLDILVQEDVYASVLLL+VPF Y SG+TTELKW PL S LLHALKTFMVAVS +YAW+ELQ F
Subjt:  DARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSF

Query:  LLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVA
        LL+NLLHPHFLCWD+VMELWCFMLRYAD+ LVN VIS  FSVMK LASSEPVL+YSS LRKMARS+TMLLTYGAH+K NEIFESI IQDKS+LSTV+WVA
Subjt:  LLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVA

Query:  LILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLIS
        LILEGF LNLLSEK KNI +Q+ I DYL+FIG+F+ETSML SSS  IGLPV SAS  IQSMKLS SDIDVRTLKFLLALLRSYKISGVEQ KG C KLIS
Subjt:  LILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLIS

Query:  ETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAAR
        ETLGIIS ++HLYA+NEMEEVILELEKLFISGPTASDALLYECKS LAPFLAGLAHIK+TETD+NAKSCAVWELYHMLFKERHWAFIHL LTAFGYFAAR
Subjt:  ETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAAR

Query:  TSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPE
        TSCDELWRFVPQNAALSYDLESGKQV+E+GFM+EFKIFLEKEMALLTVTP +EQLALLMKEGL+LKDMLN  LK CG G +CKSME DEGPSSRKRKLPE
Subjt:  TSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPE

Query:  GMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD
        G+SKGMELLKNGLKVMRQG+SLLEE+HVDSRELH+KL SHFSGLEDE+ RL SQGG D
Subjt:  GMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD

A0A6J1CYX4 uncharacterized protein LOC111016053 isoform X10.0e+0081.04Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---
        MEGR SS DLQSI+DAISSSDVVESRIQLL KLEDLDLSSK           + W                       +  KD A     F+  GV    
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---

Query:  ---------------------GRRAFQPV-------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAM
                                 FQ V             F+ALARYPL EDK LM TVENFTLEQLNL+NESVSEIQRIQEFGSEILKAVQM+IDAM
Subjt:  ---------------------GRRAFQPV-------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAM

Query:  INFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKC
        I FCEVHSQALD EFSDE  DKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAG ILILVSLVIEPLKC
Subjt:  INFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKC

Query:  AAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQ
        AAATWSS  KEAVSA EARRIFLP+KFFLINAVKIS LCPCQAYLVHKEIILCVL IFTYK SLS+EKLLETVAEAITELLE  CLDLVKC +NST+LKQ
Subjt:  AAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQ

Query:  DLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRV
        DLKLEI+ LLFTTERCS+PD DPSACFRIDPM+ +FSINCEGMN+ + LLLGRIN LLNLLRYS DLSDDAKL IT+KL+WLLD LVQEDVYASVLLL+V
Subjt:  DLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRV

Query:  PFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYS
        PFLY SG+TTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWD+VMELWCFMLRYADNGLVNGVISN  SVMK+L SSEPVL+YS
Subjt:  PFLYFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYS

Query:  SVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASA
        S LRKMARSI M+LTYGAHSKLNEI E+I IQDKSRLSTV+ VALILEGFPLNLLSEK K+IA+Q ++HDYLSFIG+FDETSMLTSSSG+IGLPV SASA
Subjt:  SVLRKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASA

Query:  TIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAH
        TIQSMKLS SDIDVRTLKFLLALLR YKISGV +VKGFC KLISETLGIIS MKHLYASNEMEEVILELEKLFISGPTASD LLYECKSGLAPFLAGLAH
Subjt:  TIQSMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAH

Query:  IKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLA
        IK+TETD+NAKSCAVWELYHMLFKERHWA IHL LTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFM+EFKIFLEKEMALLTVT S +QLA
Subjt:  IKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLA

Query:  LLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG
        LLMKEGLMLKD+LNKILKSC KGIECKSM+TDEGPSSRKRKLPEG+SKGM+LLKNGLKVMRQG+SLLE SHVDSRELHNKL SHF GLEDEI RL +QGG
Subjt:  LLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG

Query:  SD
         D
Subjt:  SD

A0A6J1CZS0 uncharacterized protein LOC111016053 isoform X20.0e+0081.18Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---
        MEGR SS DLQSI+DAISSSDVVESRIQLL KLEDLDLSSK           + W                       +  KD A     F+  GV    
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK----------YLVW-----------------------KASKDDA-----FVNPGVT---

Query:  ---------------------GRRAFQPV----------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF
                                 FQ +          F+ALARYPL EDK LM TVENFTLEQLNL+NESVSEIQRIQEFGSEILKAVQM+IDAMI F
Subjt:  ---------------------GRRAFQPV----------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF

Query:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA
        CEVHSQALD EFSDE  DKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAG ILILVSLVIEPLKCAAA
Subjt:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA

Query:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK
        TWSS  KEAVSA EARRIFLP+KFFLINAVKIS LCPCQAYLVHKEIILCVL IFTYK SLS+EKLLETVAEAITELLE  CLDLVKC +NST+LKQDLK
Subjt:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK

Query:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL
        LEI+ LLFTTERCS+PD DPSACFRIDPM+ +FSINCEGMN+ + LLLGRIN LLNLLRYS DLSDDAKL IT+KL+WLLD LVQEDVYASVLLL+VPFL
Subjt:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL

Query:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL
        Y SG+TTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWD+VMELWCFMLRYADNGLVNGVISN  SVMK+L SSEPVL+YSS L
Subjt:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL

Query:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ
        RKMARSI M+LTYGAHSKLNEI E+I IQDKSRLSTV+ VALILEGFPLNLLSEK K+IA+Q ++HDYLSFIG+FDETSMLTSSSG+IGLPV SASATIQ
Subjt:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ

Query:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL
        SMKLS SDIDVRTLKFLLALLR YKISGV +VKGFC KLISETLGIIS MKHLYASNEMEEVILELEKLFISGPTASD LLYECKSGLAPFLAGLAHIK+
Subjt:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL

Query:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM
        TETD+NAKSCAVWELYHMLFKERHWA IHL LTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFM+EFKIFLEKEMALLTVT S +QLALLM
Subjt:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM

Query:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD
        KEGLMLKD+LNKILKSC KGIECKSM+TDEGPSSRKRKLPEG+SKGM+LLKNGLKVMRQG+SLLE SHVDSRELHNKL SHF GLEDEI RL +QGG D
Subjt:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD

A0A6J1GBD2 uncharacterized protein LOC1114526030.0e+0079.04Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK---------------------------------YLVWKASKDDA---------FVNPGV
        MEG  S+  LQSIIDAISSSDVVESRIQLL KLEDLDLS K                                   V +  KD A          V   V
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK---------------------------------YLVWKASKDDA---------FVNPGV

Query:  TGRRAFQPV------------------------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF
          R+  +                                F+ALARYPLSEDKALM+TVENF LEQLNL+NESVSEIQRI+EFG EILKAVQMVIDAMI F
Subjt:  TGRRAFQPV------------------------------FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF

Query:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA
        CEVHSQALD EFS E FD TSSA NH INVHK IIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQ GN VL SKVNIAGIIL LVSLV+EPLKCAAA
Subjt:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA

Query:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK
        TWSS   EAVSA+EARRIFLP+KFFLINAVKIS L PCQAYLVHKEIILCVL I TYKVSLSNEKLL TVAEAITELLE  CLDLVKC LN+T+LKQDLK
Subjt:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK

Query:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL
        L I+DLLFT+ERCS+PD DPS CFRIDPM+ IF+ NCE M DA+ LLLGRINFLLNLLR+S DLSDD KL IT+KL+ LLDILVQEDVYASVLLL+VPFL
Subjt:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL

Query:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL
        YFSG+TTELKWQPLFSSL+HALKTFMVAVSSS AW+ELQSFLL+NLLHPHFLCWD+VMELWCFMLRYAD+GLVNGVISNFFSVMK LASSE VL +SS L
Subjt:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL

Query:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ
        RKMAR ITMLLTYGAHSKLNEI ESI IQDKSR STV+W ALILEGFPLNLLSEK KNIA+Q++IHDYL+FIGSFDETSML  SS +IGLPV SAS TIQ
Subjt:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ

Query:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL
        S+KLS SDIDVRTLKFLLALLRSYK+SGVE+VKGFC KLISETL IIS MKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGL PFLAGLAHIK+
Subjt:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL

Query:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM
         ETD+NAKSCAVWELYHMLFKERHWAFIHL LTAFGYFAARTSCDELWRFVPQNAALSYDLESGK VNEEGFM+EFKIFLEKEMALLTVTPSAEQLALLM
Subjt:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM

Query:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG
        KEGL+LKD  N +LKSCGKGIECKSME DEGPSSRKRKLPEG+SKGMELLKNGLK MRQG+SLLE +HVDSRELHNKLLSHFSGLEDEI RLDSQGG
Subjt:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG

A0A6J1IF25 uncharacterized protein LOC111473635 isoform X10.0e+0077.83Show/hide
Query:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK---------------------------------YLVWKASKDDA---------FVNPGV
        ME   S+ DLQSIIDAISSSDVVESRIQLL KLEDLDLS K                                   V +  KD A          V   V
Subjt:  MEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSK---------------------------------YLVWKASKDDA---------FVNPGV

Query:  TGRRAFQ------------------------------PVFAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF
          R+  +                               +F+ALARYPLSEDKALM+ VENF LEQLNL+NE VSEIQRI+EFG +ILKAVQMVIDAMI F
Subjt:  TGRRAFQ------------------------------PVFAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINF

Query:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA
        CEVHSQAL   FS E FD TSSA NH INVHK IIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQ GN VL SKVNIAGIIL LVSLV+EPLKCA+A
Subjt:  CEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAA

Query:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK
        TWSS   EAVSA+EARRIFLP+KFFLINAVKIS L PCQAYLVHKEIILCVL I TYKVSLSNEKLL TVAEAITELLE  CLDLVKC LN+T+LKQDLK
Subjt:  TWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLK

Query:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL
        L I+DLLFT+ERCS+PD DPS CFRIDPM++IF+ NCE M DA+ LLLGRINFLLNLLR+S DLSDD KL IT+KL+ LLDILVQEDVYASVLLL+VPFL
Subjt:  LEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFL

Query:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL
        YFSG+TTELKWQPLFSSL+HALKTFMVAVSSS AW+ELQSFLL+NL HPHFLCWD+VMELWCFMLRYAD+GLVNGVISNFFSVMK LASSE VL++SS L
Subjt:  YFSGRTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVL

Query:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ
        RKMAR ITMLLTYGAHSKLNEI ESI IQDKSR STV+W ALILEGFPLNLLS K KNIA+Q++IHDYL+FIGSFDETSML  SS +IGLPV SAS TIQ
Subjt:  RKMARSITMLLTYGAHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQ

Query:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL
        S+KLS SDIDVRTLKFLLAL+RSYK+SGVE+VKGFC KLISETLGIIS MKHLYA NEMEEVILELEKLFISGPTASDALLYECKSGL PFLAGLAHIK+
Subjt:  SMKLSISDIDVRTLKFLLALLRSYKISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKL

Query:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM
        TET++NAKSCAVWELYHMLFKERHWAFIHL LTAFGY+AARTSCDELWRFVPQNAALSYDLESGK VNEEGFM+EFKIFLEKEMALLTVTPSAEQL LLM
Subjt:  TETDNNAKSCAVWELYHMLFKERHWAFIHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLM

Query:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG
        KEGL+LKD  N +LKS GKGIECKSME DEGPSSRKRKLPEG+SKGMELLKNGLK MRQG+SLLE +HVDSRELHNKLLSHFSGLEDEI RLDSQGG
Subjt:  KEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04650.1 unknown protein5.2e-19244.05Show/hide
Query:  FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFCEVHSQALDWEFSDEKFD--------KTSSAANHAINVH
        F A+ +     D A   TV  F  EQLNL  E +   ++++ F SEI KAVQ+VID+ +  C+ +SQ ++ E S+ K          +  +A  + +++ 
Subjt:  FAALARYPLSEDKALMDTVENFTLEQLNLINESVSEIQRIQEFGSEILKAVQMVIDAMINFCEVHSQALDWEFSDEKFD--------KTSSAANHAINVH

Query:  KCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAATWSSEMKEAVSATEARRIFLPLKFFLINAVK
           ++ L ELG +AA+ GG LV ILN SWKGV TLLQ     L+SKV++  IIL L+SL+ + L+ AA  WS  +KE +SATEARR+FLP+KF+LINAVK
Subjt:  KCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGIILILVSLVIEPLKCAAATWSSEMKEAVSATEARRIFLPLKFFLINAVK

Query:  ISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKLEIVDLLFTTE----RCSYPDEDPSACFRID
        +  L P QA +V K+I LC+LMI  +KVSLS +   ++ +E +T+LLE+  +DL+   LN+  L Q+ +L ++D LF  E     C     D        
Subjt:  ISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTNLKQDLKLEIVDLLFTTE----RCSYPDEDPSACFRID

Query:  PMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLYFSGRTTELKWQPLFSSLLHALKTFMV
         +  I S++ E    AR LLL R+    +++RYS +L  DAKL IT+KL WLLDIL  ++VY+SVL  ++P    SG+   + W+ ++S+LL +LKT M+
Subjt:  PMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLYFSGRTTELKWQPLFSSLLHALKTFMV

Query:  AVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLRKMARSITMLLTYGAHSKLNEIFESIL
         +SS+ AW EL++FLL+NLLHPHFLCW +VMELWCF +R+A + LV  +I+   + + ++ SSE  L   SVLR+  +SI  LLT+   S   ++++ I 
Subjt:  AVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLRKMARSITMLLTYGAHSKLNEIFESIL

Query:  IQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQSMKLSISDIDVRTLKFLLALLRSYKIS
         + +S  +  V++AL+L+GFPLN L ++ KN A + I  D+ +FI  FDE    +S   ++G PV + SA ++ +K+SIS+ID +TL F++AL++ Y+ S
Subjt:  IQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQSMKLSISDIDVRTLKFLLALLRSYKIS

Query:  GVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLTETDNNAKSCAVWELYHMLFKERHWAF
          E  K    +++SETL IIS  + LY   EM+ VI EL+KLF S        L + K  LA FL+GL+  +++ET    KS AVWELYHML ++RHWA 
Subjt:  GVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLTETDNNAKSCAVWELYHMLFKERHWAF

Query:  IHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMKEGLMLKDMLNKILKSCGKGIECKSME
        +H A+TAFGYF ARTSC++LWRFVP++AAL++D+ SGK+   E FM E K+FLEKE ALL++TPS E+L LL KEG  +K  + K+L    +G   +SME
Subjt:  IHLALTAFGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMKEGLMLKDMLNKILKSCGKGIECKSME

Query:  TDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRL
         ++ P ++KRKLPEG+ +GMELL+NG+K + +G++ L     +S E    L + FS LED ++ L
Subjt:  TDEGPSSRKRKLPEGMSKGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAGAAAGAGAAAGAGGTGCGATGGAGGGAAGGGGCTCAAGCGACGATTTGCAGAGCATTATCGATGCTATTTCGTCCTCAGATGTTGTCGAGAGTCGTATACAGCT
GCTCAATAAACTGGAGGATTTGGATTTATCAAGCAAGTATCTGGTGTGGAAAGCATCTAAAGATGACGCTTTTGTCAATCCAGGAGTCACAGGAAGAAGAGCATTCCAAC
CTGTTTTTGCAGCTTTGGCAAGATATCCTCTTTCTGAAGATAAGGCGTTGATGGATACAGTTGAGAATTTCACCTTGGAACAGTTGAATTTAATAAATGAATCAGTATCA
GAAATTCAGAGGATTCAAGAATTTGGCTCAGAAATATTGAAAGCTGTGCAGATGGTCATTGATGCGATGATAAACTTTTGTGAAGTTCACTCACAAGCTTTGGATTGGGA
GTTCTCTGATGAAAAATTTGATAAAACCAGCAGTGCTGCTAACCATGCCATCAACGTACACAAGTGTATAATTGAGAAGTTATGTGAATTGGGGACCATTGCAGCCAAAG
GTGGTGGAGGATTGGTGACAATTCTTAATGTGTCATGGAAAGGAGTGTTTACCTTGCTTCAACATGGAAATGTGGTATTATTATCAAAGGTGAACATAGCAGGCATTATT
CTAATCCTAGTTTCACTTGTCATCGAGCCTCTGAAATGTGCAGCAGCGACTTGGTCTTCTGAAATGAAAGAAGCTGTCTCTGCAACTGAAGCCAGACGAATATTTCTTCC
ATTAAAATTTTTCCTGATTAATGCTGTGAAAATATCCCGCCTTTGTCCTTGCCAAGCATATCTAGTACATAAGGAGATCATTCTCTGTGTCCTAATGATCTTCACATATA
AAGTTTCACTAAGCAATGAAAAGTTATTGGAAACTGTAGCTGAAGCGATCACTGAACTTTTGGAGCAAATGTGCTTGGATTTAGTCAAGTGTTTCCTAAATTCCACTAAT
CTAAAACAAGACCTGAAACTTGAGATTGTGGACTTGTTATTCACTACCGAGAGGTGTTCCTATCCAGATGAAGATCCTAGTGCTTGTTTTAGGATTGATCCAATGGATCA
AATTTTTTCCATTAATTGTGAAGGCATGAATGATGCCAGAATCTTATTGCTTGGTCGCATTAATTTTCTACTTAATTTACTGAGGTATTCTTCTGATCTCAGTGATGATG
CAAAACTACCGATCACCTCAAAACTCAATTGGCTTTTGGACATTTTAGTTCAAGAAGATGTATATGCATCTGTTCTTCTCCTGCGAGTTCCTTTCTTATATTTCTCTGGA
AGAACTACAGAGCTGAAATGGCAGCCTCTGTTTTCTTCTCTTTTGCATGCGTTGAAGACTTTCATGGTTGCAGTCTCTTCAAGTTATGCTTGGGTGGAACTGCAATCCTT
CTTACTTGAAAATCTCTTGCATCCTCATTTTCTTTGCTGGGACGTTGTTATGGAACTTTGGTGCTTTATGTTGCGCTATGCTGATAATGGCCTGGTGAATGGTGTGATCT
CTAATTTTTTTTCAGTAATGAAGGCGTTGGCATCATCGGAGCCAGTTCTTATTTACAGTTCTGTTTTGAGAAAAATGGCAAGGTCTATAACCATGCTACTTACATATGGT
GCACATTCTAAACTAAATGAGATTTTTGAGTCTATTCTTATTCAGGACAAATCTCGGTTATCCACAGTGGTATGGGTCGCCTTGATCTTGGAAGGTTTTCCCCTAAACTT
ACTCTCCGAGAAGAATAAAAATATTGCAATGCAGAATATCATTCATGATTATTTGAGCTTCATTGGGAGTTTTGATGAGACTTCAATGTTAACTTCCTCTTCTGGGATTA
TTGGGTTGCCAGTAATTTCTGCATCTGCCACAATACAATCCATGAAGTTAAGCATCTCTGATATTGATGTTAGAACGTTGAAGTTCCTACTCGCTCTTCTCCGTAGCTAC
AAAATTTCTGGAGTGGAACAAGTAAAGGGATTCTGTGGAAAATTAATATCGGAAACATTGGGAATCATCTCGTTCATGAAACACCTTTATGCATCTAATGAGATGGAGGA
AGTCATCTTGGAGCTTGAAAAGCTTTTTATCTCAGGACCAACAGCCTCAGATGCACTGTTATATGAATGCAAATCAGGTTTGGCACCTTTCTTGGCAGGACTTGCGCACA
TAAAATTGACAGAAACTGATAATAATGCAAAAAGCTGTGCTGTGTGGGAGTTATATCACATGTTATTTAAGGAGCGGCACTGGGCATTCATACATTTGGCGCTAACAGCA
TTTGGATATTTTGCTGCACGTACTTCTTGCGATGAGTTGTGGAGGTTTGTGCCACAGAATGCAGCTCTTTCATACGATTTAGAATCAGGAAAACAGGTAAATGAAGAGGG
GTTTATGGTAGAGTTTAAGATATTTCTTGAGAAGGAAATGGCTCTCCTCACAGTAACACCGAGCGCTGAGCAGCTAGCACTGCTTATGAAAGAAGGATTAATGTTAAAGG
ATATGCTCAATAAGATATTAAAATCATGTGGAAAGGGTATCGAATGTAAGAGCATGGAGACTGATGAAGGACCATCTAGCAGGAAAAGAAAGCTTCCTGAAGGAATGAGC
AAGGGAATGGAATTGCTAAAGAATGGGTTAAAGGTTATGCGCCAGGGTATCTCACTGTTGGAAGAAAGTCATGTGGATTCCAGAGAACTTCATAACAAACTTTTGAGTCA
CTTTTCTGGCCTTGAAGATGAAATAACTCGCCTGGACAGCCAGGGTGGGAGTGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGAGAAAGAGAAAGAGGTGCGATGGAGGGAAGGGGCTCAAGCGACGATTTGCAGAGCATTATCGATGCTATTTCGTCCTCAGATGTTGTCGAGAGTCGTATACAGCT
GCTCAATAAACTGGAGGATTTGGATTTATCAAGCAAGTATCTGGTGTGGAAAGCATCTAAAGATGACGCTTTTGTCAATCCAGGAGTCACAGGAAGAAGAGCATTCCAAC
CTGTTTTTGCAGCTTTGGCAAGATATCCTCTTTCTGAAGATAAGGCGTTGATGGATACAGTTGAGAATTTCACCTTGGAACAGTTGAATTTAATAAATGAATCAGTATCA
GAAATTCAGAGGATTCAAGAATTTGGCTCAGAAATATTGAAAGCTGTGCAGATGGTCATTGATGCGATGATAAACTTTTGTGAAGTTCACTCACAAGCTTTGGATTGGGA
GTTCTCTGATGAAAAATTTGATAAAACCAGCAGTGCTGCTAACCATGCCATCAACGTACACAAGTGTATAATTGAGAAGTTATGTGAATTGGGGACCATTGCAGCCAAAG
GTGGTGGAGGATTGGTGACAATTCTTAATGTGTCATGGAAAGGAGTGTTTACCTTGCTTCAACATGGAAATGTGGTATTATTATCAAAGGTGAACATAGCAGGCATTATT
CTAATCCTAGTTTCACTTGTCATCGAGCCTCTGAAATGTGCAGCAGCGACTTGGTCTTCTGAAATGAAAGAAGCTGTCTCTGCAACTGAAGCCAGACGAATATTTCTTCC
ATTAAAATTTTTCCTGATTAATGCTGTGAAAATATCCCGCCTTTGTCCTTGCCAAGCATATCTAGTACATAAGGAGATCATTCTCTGTGTCCTAATGATCTTCACATATA
AAGTTTCACTAAGCAATGAAAAGTTATTGGAAACTGTAGCTGAAGCGATCACTGAACTTTTGGAGCAAATGTGCTTGGATTTAGTCAAGTGTTTCCTAAATTCCACTAAT
CTAAAACAAGACCTGAAACTTGAGATTGTGGACTTGTTATTCACTACCGAGAGGTGTTCCTATCCAGATGAAGATCCTAGTGCTTGTTTTAGGATTGATCCAATGGATCA
AATTTTTTCCATTAATTGTGAAGGCATGAATGATGCCAGAATCTTATTGCTTGGTCGCATTAATTTTCTACTTAATTTACTGAGGTATTCTTCTGATCTCAGTGATGATG
CAAAACTACCGATCACCTCAAAACTCAATTGGCTTTTGGACATTTTAGTTCAAGAAGATGTATATGCATCTGTTCTTCTCCTGCGAGTTCCTTTCTTATATTTCTCTGGA
AGAACTACAGAGCTGAAATGGCAGCCTCTGTTTTCTTCTCTTTTGCATGCGTTGAAGACTTTCATGGTTGCAGTCTCTTCAAGTTATGCTTGGGTGGAACTGCAATCCTT
CTTACTTGAAAATCTCTTGCATCCTCATTTTCTTTGCTGGGACGTTGTTATGGAACTTTGGTGCTTTATGTTGCGCTATGCTGATAATGGCCTGGTGAATGGTGTGATCT
CTAATTTTTTTTCAGTAATGAAGGCGTTGGCATCATCGGAGCCAGTTCTTATTTACAGTTCTGTTTTGAGAAAAATGGCAAGGTCTATAACCATGCTACTTACATATGGT
GCACATTCTAAACTAAATGAGATTTTTGAGTCTATTCTTATTCAGGACAAATCTCGGTTATCCACAGTGGTATGGGTCGCCTTGATCTTGGAAGGTTTTCCCCTAAACTT
ACTCTCCGAGAAGAATAAAAATATTGCAATGCAGAATATCATTCATGATTATTTGAGCTTCATTGGGAGTTTTGATGAGACTTCAATGTTAACTTCCTCTTCTGGGATTA
TTGGGTTGCCAGTAATTTCTGCATCTGCCACAATACAATCCATGAAGTTAAGCATCTCTGATATTGATGTTAGAACGTTGAAGTTCCTACTCGCTCTTCTCCGTAGCTAC
AAAATTTCTGGAGTGGAACAAGTAAAGGGATTCTGTGGAAAATTAATATCGGAAACATTGGGAATCATCTCGTTCATGAAACACCTTTATGCATCTAATGAGATGGAGGA
AGTCATCTTGGAGCTTGAAAAGCTTTTTATCTCAGGACCAACAGCCTCAGATGCACTGTTATATGAATGCAAATCAGGTTTGGCACCTTTCTTGGCAGGACTTGCGCACA
TAAAATTGACAGAAACTGATAATAATGCAAAAAGCTGTGCTGTGTGGGAGTTATATCACATGTTATTTAAGGAGCGGCACTGGGCATTCATACATTTGGCGCTAACAGCA
TTTGGATATTTTGCTGCACGTACTTCTTGCGATGAGTTGTGGAGGTTTGTGCCACAGAATGCAGCTCTTTCATACGATTTAGAATCAGGAAAACAGGTAAATGAAGAGGG
GTTTATGGTAGAGTTTAAGATATTTCTTGAGAAGGAAATGGCTCTCCTCACAGTAACACCGAGCGCTGAGCAGCTAGCACTGCTTATGAAAGAAGGATTAATGTTAAAGG
ATATGCTCAATAAGATATTAAAATCATGTGGAAAGGGTATCGAATGTAAGAGCATGGAGACTGATGAAGGACCATCTAGCAGGAAAAGAAAGCTTCCTGAAGGAATGAGC
AAGGGAATGGAATTGCTAAAGAATGGGTTAAAGGTTATGCGCCAGGGTATCTCACTGTTGGAAGAAAGTCATGTGGATTCCAGAGAACTTCATAACAAACTTTTGAGTCA
CTTTTCTGGCCTTGAAGATGAAATAACTCGCCTGGACAGCCAGGGTGGGAGTGATTAA
Protein sequenceShow/hide protein sequence
MRERERGAMEGRGSSDDLQSIIDAISSSDVVESRIQLLNKLEDLDLSSKYLVWKASKDDAFVNPGVTGRRAFQPVFAALARYPLSEDKALMDTVENFTLEQLNLINESVS
EIQRIQEFGSEILKAVQMVIDAMINFCEVHSQALDWEFSDEKFDKTSSAANHAINVHKCIIEKLCELGTIAAKGGGGLVTILNVSWKGVFTLLQHGNVVLLSKVNIAGII
LILVSLVIEPLKCAAATWSSEMKEAVSATEARRIFLPLKFFLINAVKISRLCPCQAYLVHKEIILCVLMIFTYKVSLSNEKLLETVAEAITELLEQMCLDLVKCFLNSTN
LKQDLKLEIVDLLFTTERCSYPDEDPSACFRIDPMDQIFSINCEGMNDARILLLGRINFLLNLLRYSSDLSDDAKLPITSKLNWLLDILVQEDVYASVLLLRVPFLYFSG
RTTELKWQPLFSSLLHALKTFMVAVSSSYAWVELQSFLLENLLHPHFLCWDVVMELWCFMLRYADNGLVNGVISNFFSVMKALASSEPVLIYSSVLRKMARSITMLLTYG
AHSKLNEIFESILIQDKSRLSTVVWVALILEGFPLNLLSEKNKNIAMQNIIHDYLSFIGSFDETSMLTSSSGIIGLPVISASATIQSMKLSISDIDVRTLKFLLALLRSY
KISGVEQVKGFCGKLISETLGIISFMKHLYASNEMEEVILELEKLFISGPTASDALLYECKSGLAPFLAGLAHIKLTETDNNAKSCAVWELYHMLFKERHWAFIHLALTA
FGYFAARTSCDELWRFVPQNAALSYDLESGKQVNEEGFMVEFKIFLEKEMALLTVTPSAEQLALLMKEGLMLKDMLNKILKSCGKGIECKSMETDEGPSSRKRKLPEGMS
KGMELLKNGLKVMRQGISLLEESHVDSRELHNKLLSHFSGLEDEITRLDSQGGSD