; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018996 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018996
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPlant protein of unknown function (DUF639)
Genome locationChr04:13504207..13533894
RNA-Seq ExpressionHG10018996
SyntenyHG10018996
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011660245.2 uncharacterized protein LOC101209123 isoform X1 [Cucumis sativus]8.7e-18268.57Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG
        ML KLPSTYLKPSTAGLDPSIS H D   F CFTRNVP+PKYRFKLVG+SMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTS RGKTSKNKDHIPA 
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG

Query:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
        AY TTE EDIV  E TVNIRTPNGLLSS AVVSIEQFSRMNGLTG KMQRIFKALVHESVYNDARSL+EYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
Subjt:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM

Query:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA
        LAWENPYHEH NVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLK                           
Subjt:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA

Query:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER
                                VHEGRKLYRVRDN QFFGENILC+GSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                     
Subjt:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER

Query:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSV
                                                                   AVGIFGQKDIMRLDL KDGV+VDKAKVGPFGSILFDSAVSV
Subjt:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSV

Query:  ASSSEATVW
        +S+SE   W
Subjt:  ASSSEATVW

XP_022963792.1 uncharacterized protein LOC111463987 [Cucurbita moschata]2.2e-17767.84Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA
        MLCKLPST LK S+AGLDPSIS HG  RKFGC TR NVPE KYRFK+VGLS GDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTS RGK SKNKDHIP 
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA

Query:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT
        GA  +T+IED+VMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTG KMQRIFKALV ESVYNDARSLVEYCCFRFLSRDSSN+HPSLSEPTFQRLIFIT
Subjt:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT

Query:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD
        MLAWENPYHEHTN SEEI+FQKMLV EEAFTRIAPAISGVADRSTVH+LFKALAGDEQSIS SLWLKYVDELLK                          
Subjt:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD

Query:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE
                                 VHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                    
Subjt:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE

Query:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS
                                                                    AVGIFGQKDI+RLDL KDGVQVDKAKVGPFGSILFDSA+S
Subjt:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS

Query:  VASSSEATVW
        VASSSE   W
Subjt:  VASSSEATVW

XP_022967569.1 uncharacterized protein LOC111467032 [Cucurbita maxima]9.0e-17968.24Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA
        MLCKLPST LK S+AGLDPSIS HG  RKFGC TR NVPEPKYRFK+VGLS GDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTS RGK SKNKDHIP 
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA

Query:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT
        GA  +T+IED+VMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTG KMQRIFKALV ESVYNDARSLVEYCCFRFLSRDSSN+HPSLSEPTFQRLIFIT
Subjt:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT

Query:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD
        MLAWENPYHEHTN SEEI+FQKMLV EEAFTRIAPAISGVADRSTVH+LFKALAGDEQSIS SLWLKYVDELLK                          
Subjt:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD

Query:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE
                                 VHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                    
Subjt:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE

Query:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS
                                                                    AVGIFGQKDI+RLDL KDGVQVDKAKVGPFGSILFDSAVS
Subjt:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS

Query:  VASSSEATVW
        VASSSE   W
Subjt:  VASSSEATVW

XP_038887911.1 uncharacterized protein LOC120077886 isoform X1 [Benincasa hispida]6.4e-19371.71Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG
        MLCKLPSTYLKPSTAGLDPSIS HGD RKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG

Query:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
        A+GTTEIED VMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTG KMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEP+FQRLIFITM
Subjt:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM

Query:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA
        LAWENPYH+HTN+SEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLK                           
Subjt:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA

Query:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER
                                VHEGRKLYRVRDNRQF GENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                     
Subjt:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER

Query:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSV
                                                                   AVGIFGQKDIMRLDL KDGVQVDKAKVGPFGSILFDSAVSV
Subjt:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSV

Query:  ASSSEATVW
        ASSSE   W
Subjt:  ASSSEATVW

XP_038887912.1 uncharacterized protein LOC120077886 isoform X2 [Benincasa hispida]6.4e-19371.71Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG
        MLCKLPSTYLKPSTAGLDPSIS HGD RKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG

Query:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
        A+GTTEIED VMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTG KMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEP+FQRLIFITM
Subjt:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM

Query:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA
        LAWENPYH+HTN+SEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLK                           
Subjt:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA

Query:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER
                                VHEGRKLYRVRDNRQF GENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                     
Subjt:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER

Query:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSV
                                                                   AVGIFGQKDIMRLDL KDGVQVDKAKVGPFGSILFDSAVSV
Subjt:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSV

Query:  ASSSEATVW
        ASSSE   W
Subjt:  ASSSEATVW

TrEMBL top hitse value%identityAlignment
A0A1S3B8Z7 uncharacterized protein LOC1034874776.9e-17768.03Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG
        ML KLPSTYLKPSTAGLDPSIS   D   FGCFTRNVPE KYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTS R KTSKNK+HIPAG
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG

Query:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
        AYGTTE EDIV  E TVNIRTPNGLLSS AVVSIEQFSRMNGLTG KMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
Subjt:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM

Query:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA
        LAWENPYH+H +VSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGD++SISLSLWLKYVDEL+                            
Subjt:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA

Query:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER
                               RVHEGRKLYRVRDN QFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE  +FL     Q      +N +
Subjt:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER

Query:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRS---QFLGLDIAVGIFG-QKDIMRLDLAKDGVQVDKAKVGPFGSILFDS
          A  +    S+  +     +     L        + +A F+ L   L   S   + L  +  V   G    +  + + KDGVQVDKAKVGPFGSILFDS
Subjt:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRS---QFLGLDIAVGIFG-QKDIMRLDLAKDGVQVDKAKVGPFGSILFDS

Query:  AVSVASSSEATVW
        AVSV+SSSE   W
Subjt:  AVSVASSSEATVW

A0A251RJ64 Uncharacterized protein5.0e-12751.27Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPE-PKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA
        ML K+  T+LK S +      S HG+ R+FG   RN  +  K RFK+VG S+GD+W LN+IDANAVQ+ LN WLLKTQNFLNEVTS   +TS+ +  +  
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPE-PKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA

Query:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT
         A+ T ++EDI MAE T+N RTPNG+LS  A+VSIEQFSRMNGLTG KMQRIFKALV ES YNDAR+LVEYCCFRFLSRD+S+IHPSL EP FQRLIFIT
Subjt:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT

Query:  MLAWENPYHEH-TNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTT
        MLAWENPY E   N SE+ SFQ  LVREEAF R+APAISGVADRST HNLFKALAGDEQ ISLSLWL YVDEL+K                         
Subjt:  MLAWENPYHEH-TNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTT

Query:  DASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRN
                                  VHEGRK Y+ R +     E ILCIGSS+KRPVLKWENN+AWPGK+TLTDKA+YFE                   
Subjt:  DASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRN

Query:  ERSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAV
                                                                     AVGI GQKD +RLDL K G++V+KAKVGPFGS LFDSAV
Subjt:  ERSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAV

Query:  SVASSSEATVW
        S++   ++  W
Subjt:  SVASSSEATVW

A0A6J1DTB7 uncharacterized protein LOC111023786 isoform X13.6e-17366.54Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG
        MLCKLPST+LK S AGL+P IS HGD RKFGC TRN+PEPK+RFKLVGLSMGDKW L DIDANAVQQNLNKWLLKTQNFLNEVTS  GKTSKNKDHIPAG
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAG

Query:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
        A+ + EIE+IVMAE+TVNI TPNGLLSSTAVVSIEQFSRMNGLTG KMQRIFKAL  ESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM
Subjt:  AYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITM

Query:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA
        LAWENPYHE    SEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLK                           
Subjt:  LAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDA

Query:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER
                                VHEGRKLYRVRDNRQF GENIL IGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                     
Subjt:  SVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNER

Query:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFG--QKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAV
                                                                   AVGIFG  QKD+ RLDL KDGVQVDKAKVGPFGS+LFDSAV
Subjt:  SRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFG--QKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAV

Query:  SVASSSEATVW
        SV+SSSE   W
Subjt:  SVASSSEATVW

A0A6J1HG64 uncharacterized protein LOC1114639871.1e-17767.84Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA
        MLCKLPST LK S+AGLDPSIS HG  RKFGC TR NVPE KYRFK+VGLS GDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTS RGK SKNKDHIP 
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA

Query:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT
        GA  +T+IED+VMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTG KMQRIFKALV ESVYNDARSLVEYCCFRFLSRDSSN+HPSLSEPTFQRLIFIT
Subjt:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT

Query:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD
        MLAWENPYHEHTN SEEI+FQKMLV EEAFTRIAPAISGVADRSTVH+LFKALAGDEQSIS SLWLKYVDELLK                          
Subjt:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD

Query:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE
                                 VHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                    
Subjt:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE

Query:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS
                                                                    AVGIFGQKDI+RLDL KDGVQVDKAKVGPFGSILFDSA+S
Subjt:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS

Query:  VASSSEATVW
        VASSSE   W
Subjt:  VASSSEATVW

A0A6J1HX34 uncharacterized protein LOC1114670324.3e-17968.24Show/hide
Query:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA
        MLCKLPST LK S+AGLDPSIS HG  RKFGC TR NVPEPKYRFK+VGLS GDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTS RGK SKNKDHIP 
Subjt:  MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTR-NVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPA

Query:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT
        GA  +T+IED+VMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTG KMQRIFKALV ESVYNDARSLVEYCCFRFLSRDSSN+HPSLSEPTFQRLIFIT
Subjt:  GAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFIT

Query:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD
        MLAWENPYHEHTN SEEI+FQKMLV EEAFTRIAPAISGVADRSTVH+LFKALAGDEQSIS SLWLKYVDELLK                          
Subjt:  MLAWENPYHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTD

Query:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE
                                 VHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFE                    
Subjt:  ASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNE

Query:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS
                                                                    AVGIFGQKDI+RLDL KDGVQVDKAKVGPFGSILFDSAVS
Subjt:  RSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVS

Query:  VASSSEATVW
        VASSSE   W
Subjt:  VASSSEATVW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48840.1 Plant protein of unknown function (DUF639)1.0e-1526.98Show/hide
Query:  DIDANAVQQNLNKWLL-KTQNFLNEVTSSRGKTSKNKDHIPAGAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVH
        D+    V+ +  KWLL K  +F  E+       S   + IP                           LS  A V I + S++ G+   ++Q  FK    
Subjt:  DIDANAVQQNLNKWLL-KTQNFLNEVTSSRGKTSKNKDHIPAGAYGTTEIEDIVMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVH

Query:  ESVYNDA---RSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENP-YHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALA
        ESV   +   R+ +EYCCFR L+  S  +   LS+ +F+RL F  M+AWE P     T +S +   +   V  EAF+RIAPA+  +AD     NLF  L 
Subjt:  ESVYNDA---RSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENP-YHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALA

Query:  GDEQSISLSLWL--KYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCI-GS
            S+ L  ++  KY+  L + +                 K+   +++S+  G                            VR      GE IL + G+
Subjt:  GDEQSISLSLWL--KYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCI-GS

Query:  SKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQR
           +PVL+      WPG+L LTD ++YFE    +S  + +R
Subjt:  SKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQR

AT1G71240.1 Plant protein of unknown function (DUF639)2.6e-8339.92Show/hide
Query:  FTRN-VPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIP-AGAYGTTEIEDIVMAEYTVNIRTPNGLLSSTA
        F+RN     K R ++V      KW LNDID N VQ+  ++W+ K+Q  L++VTS   K S++   I         ++E+++  E TV   TP G LS  A
Subjt:  FTRN-VPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIP-AGAYGTTEIEDIVMAEYTVNIRTPNGLLSSTA

Query:  VVSIEQF-SRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENPYHEHTN----VSEEISFQKMLVR
        ++SIEQF SRMNG+TG KMQ IF+ +V  ++  DAR LVEYCCFRFLSRDSS  HP L EP FQRLIFITMLAW NPY +  N     S + SFQ   + 
Subjt:  VVSIEQF-SRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENPYHEHTN----VSEEISFQKMLVR

Query:  EEAFTRIAPAISGVADRSTVHNLFKAL--AGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADVDHE
        EEAF RIAPAISG+ADR+TVHNLFKAL  A D++ ISL +WL Y+ EL+K                                                  
Subjt:  EEAFTRIAPAISGVADRSTVHNLFKAL--AGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADVDHE

Query:  RVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNERSRAALALKKGSTNNLLDLRISGR
         +HEGRK ++  D  Q   E +LC+ +++K PVLKWENN+AWPGKLTLTDKA+YFE                                            
Subjt:  RVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNERSRAALALKKGSTNNLLDLRISGR

Query:  RNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSVASSSEATVW
                                             V I G K ++RLDLA D   V+KAKVGP G  LFDSAVSV+S      W
Subjt:  RNHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSVASSSEATVW

AT1G71240.2 Plant protein of unknown function (DUF639)1.0e-8440Show/hide
Query:  FTRN-VPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIP-AGAYGTTEIEDIVMAEYTVNIRTPNGLLSSTA
        F+RN     K R ++V      KW LNDID N VQ+  ++W+ K+Q  L++VTS   K S++   I         ++E+++  E TV   TP G LS  A
Subjt:  FTRN-VPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIP-AGAYGTTEIEDIVMAEYTVNIRTPNGLLSSTA

Query:  VVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENPYHEHTN----VSEEISFQKMLVRE
        ++SIEQFSRMNG+TG KMQ IF+ +V  ++  DAR LVEYCCFRFLSRDSS  HP L EP FQRLIFITMLAW NPY +  N     S + SFQ   + E
Subjt:  VVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENPYHEHTN----VSEEISFQKMLVRE

Query:  EAFTRIAPAISGVADRSTVHNLFKAL--AGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADVDHER
        EAF RIAPAISG+ADR+TVHNLFKAL  A D++ ISL +WL Y+ EL+K                                                   
Subjt:  EAFTRIAPAISGVADRSTVHNLFKAL--AGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADVDHER

Query:  VHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNERSRAALALKKGSTNNLLDLRISGRR
        +HEGRK ++  D  Q   E +LC+ +++K PVLKWENN+AWPGKLTLTDKA+YFE                                             
Subjt:  VHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNERSRAALALKKGSTNNLLDLRISGRR

Query:  NHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSVASSSEATVW
                                            V I G K ++RLDLA D   V+KAKVGP G  LFDSAVSV+S      W
Subjt:  NHLEMDGEVAVEASARFRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSVASSSEATVW

AT3G18350.1 Plant protein of unknown function (DUF639)1.5e-1427.94Show/hide
Query:  LSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDA---RSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENP-YHEHTNVSEEISFQK
        LS  A V + + S++ G++ ++++  FK    ES+   +   R+ +EYCCFR LS  S  +   L++  F+RL F  M+ WE P       +S E   + 
Subjt:  LSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDA---RSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENP-YHEHTNVSEEISFQK

Query:  MLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQS-ISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADV
          V  EAF+RIAPA+  +AD     NLF+ L       +  S++ KY+  L + +                 K+   +++S+  G               
Subjt:  MLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQS-ISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADV

Query:  DHERVHEGRKLYRVRDNRQFFGENILCI-GSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQR
                     VR  R    E IL I G+   +PVL+      WPG+L LTD ++YFE    +S  + +R
Subjt:  DHERVHEGRKLYRVRDNRQFFGENILCI-GSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQR

AT5G23390.1 Plant protein of unknown function (DUF639)3.1e-1226.21Show/hide
Query:  LSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESV---YNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENP----------------
        LS  A   + + S++  +    +Q  F   + ESV      AR+ +E+C F+ L +        LS+  F++L+F  MLAWE P                
Subjt:  LSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESV---YNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENP----------------

Query:  ------------YHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDE----QSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGF
                    Y   TN++ ++  +K  V +EAF RIAP    +AD  TVHNLF AL          I    +L+ +D++ K                 
Subjt:  ------------YHEHTNVSEEISFQKMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDE----QSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGF

Query:  GLKLTPTTDASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFET
           L P+  A++ + +  +VL       D+D                           G++   PVLK     AWPGKLTLT+ A+YF++
Subjt:  GLKLTPTTDASVFVGRFSLVLETLVLPADVDHERVHEGRKLYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTGTAAACTCCCTTCCACCTACTTGAAACCATCTACCGCAGGATTGGATCCTTCAATTTCTCGTCATGGTGATATACGCAAGTTTGGGTGTTTCACTAGAAATGT
TCCGGAGCCCAAATATCGGTTTAAGCTTGTGGGTCTGTCTATGGGAGATAAATGGCCTCTCAATGACATTGATGCGAATGCAGTGCAACAAAACCTAAACAAATGGCTGC
TGAAGACGCAGAACTTCTTAAATGAAGTCACATCTTCCCGGGGAAAAACCAGTAAGAACAAAGATCATATTCCTGCAGGAGCCTATGGTACCACCGAAATAGAGGATATA
GTTATGGCGGAGTATACTGTTAATATCAGGACACCAAATGGCCTTCTCTCTTCTACTGCTGTTGTATCCATTGAGCAATTTAGCAGGATGAATGGCTTGACTGGGCATAA
AATGCAGAGGATATTTAAAGCTCTTGTGCATGAATCTGTTTACAATGATGCTCGCAGTCTGGTAGAGTATTGCTGTTTTAGATTCTTGTCAAGGGACAGCTCAAATATTC
ATCCTTCACTCAGTGAACCCACATTTCAGAGATTGATATTCATAACAATGCTTGCTTGGGAAAATCCATATCACGAGCATACTAATGTTTCAGAGGAAATTTCTTTTCAG
AAGATGTTAGTTAGAGAAGAGGCATTTACGCGTATTGCACCAGCTATTTCTGGTGTAGCAGATCGATCCACGGTACATAATCTATTCAAGGCCCTTGCGGGTGATGAACA
GAGCATCTCTTTGAGTTTGTGGCTCAAATATGTTGATGAACTGCTCAAATTACTTCCTTTTCCTGATAGGGGTGATCATTGGCCGGTCAACGTCGGTTTTGGGCTCAAAC
TGACGCCAACCACTGATGCGTCGGTTTTTGTCGGTCGGTTCTCATTGGTGTTGGAGACTTTGGTGCTACCAGCCGACGTTGACCATGAGCGGGTCCATGAAGGACGAAAA
TTATATCGAGTTCGAGATAACAGACAGTTCTTTGGTGAGAATATCCTGTGTATTGGTTCCAGCAAGAAGAGACCCGTTTTAAAATGGGAGAATAATATTGCATGGCCAGG
AAAACTTACTCTTACCGATAAAGCTGTCTATTTCGAGACCCCATCATTTTTATCTCTACAATCCAAGCAAAGAAGACCAGGAGATAAAAGAAACGAGAGATCAAGAGCAG
CATTGGCATTGAAGAAAGGAAGTACGAACAATCTCCTGGATTTGAGGATATCTGGAAGAAGAAATCATTTAGAAATGGATGGAGAAGTAGCTGTTGAAGCAAGTGCAAGA
TTTAGATATCTTAAGTATGGTCTTCACAGTAGGTCTCAGTTTCTGGGTTTAGATATTGCGGTTGGAATCTTTGGCCAGAAAGATATCATGAGATTGGATCTTGCAAAAGA
TGGAGTGCAGGTGGACAAGGCGAAAGTAGGACCTTTTGGTTCTATTCTTTTCGACTCTGCTGTTTCAGTAGCATCCAGCTCAGAAGCGACCGTCTGGGCCAACGTTGACG
GCCGCCAGTGGAAGACCGGAAAACTCTTGGTGACGACGATCACAAGAGGGAGCTTCTTCAAGAATGGATTGATTTCATCTACAAAAGAAAACACAAATGAATCAAAGTTG
GAAACTTTTCAATTAGAAAATCTTGAAGATAAAATTGGAAAAAATCTTGTTGAGACCCATGTAAGTAATGGAGGTGATTGGGACAAAAATCATGAGAACGAAGTAGAAGA
AGAAAAAGAAGAATTTAAAGATGTTAATTCGGAAATTATCTCAAGTGAAGAAGAAAAATTCAACGAAGAAGATGGAATAGAGGTATCGAACACTGAGCAAGAATCGAGCA
AGAACAATGACACATTGATTGCGTTCGAAATGGATCACCAAACTGTCGAAGAACTTGAAGCGAATGATAAAGAAGACGTGATTCTTGAACAATTCTCTTTGGAAGGTGGT
TTTTTAGTGACTGGTCCTTTGAATACGGTTTTGAATGAGCTTGAAGGAAACATCTTTTTTGGGTTTATGGCCAATGGAGAAGTTCATCTTTTGGTGGCAATAATGGATTT
TTTGTCTCGAATTTATCACTTTGATTATAATATTTTTGTTCAAAGTGTAGGAGACCAGTTGACCAACCCTAAGATAGCTTCAAGCAACCATGTTCAACCCCCGATTTTGC
ACAAAGGGGTGGCTATTAGACTTGAGGATAAAAATGGTATTGGTTACGGTCCAAAGAAATCCTTTGTAGAGGTCGTTCGCTCTTCGCCAAGCACCACTAAGGCCGTGTTA
CATCCAAAAGTTGTTGAGAGGAATTCGTATTGGATTCGAAAGGATCATGATATTGTTGAATTAAATCTTCAAGAGTCTTTAGTTGTCACCAAATTGATGTCACATTACTC
GTGGGGAAAAATTAAAGCATCGTTTGAAGATCTTTTAAAGACAAAGGTTTCCATTAATCCGATTATGGATGATAAAGCTTTATTTCACGTGAATATAGTCTGGGTTCCCT
CATCACTATGGATAAAGTTTATTAAAAGTTATGGAGGAAGAATCTGTATTAAAAATTTGCCCCTACAGTACTGGAAGCGTTTAGTTTTTGAAGCAATTGGCACTCATTTA
GGTGGATTAATTGTAATCTCATCTCATACCCTAAATTGTATTGATGCTTCCTCAGCTTTGATTCATGTTAAGAGGAACTCTTGTGGCTTTATCCCTGCATCCCTTGAGAT
TACAGATTATGTTCTTGAGAATTTTAGGATCTTGCTTGAAGATCGTGGTTATGCTGCATCGGAGATTACTCAACCATCAATTACCCCTGTCTTAAATCCGAATGCCTATT
CAAATTCCATAGATCAAGACCGAATTCGAAAAGTCCTGGCAGATGAAGAGTTGGATTTAAATTCAGACATTTTGGCGCCATCCGTTTTAAATTCTCCAAGCAAGTTGATT
TGGCCTTTTGAGTCCATGCCAAACGTCACATCTCTTCTCGAGGAAGTTATAGCAATGAAGAGTCCTAATGAAAATAAAGTTCTCCATCAAGAAGTTAATGAGAATTTAAT
TCATTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTGTAAACTCCCTTCCACCTACTTGAAACCATCTACCGCAGGATTGGATCCTTCAATTTCTCGTCATGGTGATATACGCAAGTTTGGGTGTTTCACTAGAAATGT
TCCGGAGCCCAAATATCGGTTTAAGCTTGTGGGTCTGTCTATGGGAGATAAATGGCCTCTCAATGACATTGATGCGAATGCAGTGCAACAAAACCTAAACAAATGGCTGC
TGAAGACGCAGAACTTCTTAAATGAAGTCACATCTTCCCGGGGAAAAACCAGTAAGAACAAAGATCATATTCCTGCAGGAGCCTATGGTACCACCGAAATAGAGGATATA
GTTATGGCGGAGTATACTGTTAATATCAGGACACCAAATGGCCTTCTCTCTTCTACTGCTGTTGTATCCATTGAGCAATTTAGCAGGATGAATGGCTTGACTGGGCATAA
AATGCAGAGGATATTTAAAGCTCTTGTGCATGAATCTGTTTACAATGATGCTCGCAGTCTGGTAGAGTATTGCTGTTTTAGATTCTTGTCAAGGGACAGCTCAAATATTC
ATCCTTCACTCAGTGAACCCACATTTCAGAGATTGATATTCATAACAATGCTTGCTTGGGAAAATCCATATCACGAGCATACTAATGTTTCAGAGGAAATTTCTTTTCAG
AAGATGTTAGTTAGAGAAGAGGCATTTACGCGTATTGCACCAGCTATTTCTGGTGTAGCAGATCGATCCACGGTACATAATCTATTCAAGGCCCTTGCGGGTGATGAACA
GAGCATCTCTTTGAGTTTGTGGCTCAAATATGTTGATGAACTGCTCAAATTACTTCCTTTTCCTGATAGGGGTGATCATTGGCCGGTCAACGTCGGTTTTGGGCTCAAAC
TGACGCCAACCACTGATGCGTCGGTTTTTGTCGGTCGGTTCTCATTGGTGTTGGAGACTTTGGTGCTACCAGCCGACGTTGACCATGAGCGGGTCCATGAAGGACGAAAA
TTATATCGAGTTCGAGATAACAGACAGTTCTTTGGTGAGAATATCCTGTGTATTGGTTCCAGCAAGAAGAGACCCGTTTTAAAATGGGAGAATAATATTGCATGGCCAGG
AAAACTTACTCTTACCGATAAAGCTGTCTATTTCGAGACCCCATCATTTTTATCTCTACAATCCAAGCAAAGAAGACCAGGAGATAAAAGAAACGAGAGATCAAGAGCAG
CATTGGCATTGAAGAAAGGAAGTACGAACAATCTCCTGGATTTGAGGATATCTGGAAGAAGAAATCATTTAGAAATGGATGGAGAAGTAGCTGTTGAAGCAAGTGCAAGA
TTTAGATATCTTAAGTATGGTCTTCACAGTAGGTCTCAGTTTCTGGGTTTAGATATTGCGGTTGGAATCTTTGGCCAGAAAGATATCATGAGATTGGATCTTGCAAAAGA
TGGAGTGCAGGTGGACAAGGCGAAAGTAGGACCTTTTGGTTCTATTCTTTTCGACTCTGCTGTTTCAGTAGCATCCAGCTCAGAAGCGACCGTCTGGGCCAACGTTGACG
GCCGCCAGTGGAAGACCGGAAAACTCTTGGTGACGACGATCACAAGAGGGAGCTTCTTCAAGAATGGATTGATTTCATCTACAAAAGAAAACACAAATGAATCAAAGTTG
GAAACTTTTCAATTAGAAAATCTTGAAGATAAAATTGGAAAAAATCTTGTTGAGACCCATGTAAGTAATGGAGGTGATTGGGACAAAAATCATGAGAACGAAGTAGAAGA
AGAAAAAGAAGAATTTAAAGATGTTAATTCGGAAATTATCTCAAGTGAAGAAGAAAAATTCAACGAAGAAGATGGAATAGAGGTATCGAACACTGAGCAAGAATCGAGCA
AGAACAATGACACATTGATTGCGTTCGAAATGGATCACCAAACTGTCGAAGAACTTGAAGCGAATGATAAAGAAGACGTGATTCTTGAACAATTCTCTTTGGAAGGTGGT
TTTTTAGTGACTGGTCCTTTGAATACGGTTTTGAATGAGCTTGAAGGAAACATCTTTTTTGGGTTTATGGCCAATGGAGAAGTTCATCTTTTGGTGGCAATAATGGATTT
TTTGTCTCGAATTTATCACTTTGATTATAATATTTTTGTTCAAAGTGTAGGAGACCAGTTGACCAACCCTAAGATAGCTTCAAGCAACCATGTTCAACCCCCGATTTTGC
ACAAAGGGGTGGCTATTAGACTTGAGGATAAAAATGGTATTGGTTACGGTCCAAAGAAATCCTTTGTAGAGGTCGTTCGCTCTTCGCCAAGCACCACTAAGGCCGTGTTA
CATCCAAAAGTTGTTGAGAGGAATTCGTATTGGATTCGAAAGGATCATGATATTGTTGAATTAAATCTTCAAGAGTCTTTAGTTGTCACCAAATTGATGTCACATTACTC
GTGGGGAAAAATTAAAGCATCGTTTGAAGATCTTTTAAAGACAAAGGTTTCCATTAATCCGATTATGGATGATAAAGCTTTATTTCACGTGAATATAGTCTGGGTTCCCT
CATCACTATGGATAAAGTTTATTAAAAGTTATGGAGGAAGAATCTGTATTAAAAATTTGCCCCTACAGTACTGGAAGCGTTTAGTTTTTGAAGCAATTGGCACTCATTTA
GGTGGATTAATTGTAATCTCATCTCATACCCTAAATTGTATTGATGCTTCCTCAGCTTTGATTCATGTTAAGAGGAACTCTTGTGGCTTTATCCCTGCATCCCTTGAGAT
TACAGATTATGTTCTTGAGAATTTTAGGATCTTGCTTGAAGATCGTGGTTATGCTGCATCGGAGATTACTCAACCATCAATTACCCCTGTCTTAAATCCGAATGCCTATT
CAAATTCCATAGATCAAGACCGAATTCGAAAAGTCCTGGCAGATGAAGAGTTGGATTTAAATTCAGACATTTTGGCGCCATCCGTTTTAAATTCTCCAAGCAAGTTGATT
TGGCCTTTTGAGTCCATGCCAAACGTCACATCTCTTCTCGAGGAAGTTATAGCAATGAAGAGTCCTAATGAAAATAAAGTTCTCCATCAAGAAGTTAATGAGAATTTAAT
TCATTCCTAA
Protein sequenceShow/hide protein sequence
MLCKLPSTYLKPSTAGLDPSISRHGDIRKFGCFTRNVPEPKYRFKLVGLSMGDKWPLNDIDANAVQQNLNKWLLKTQNFLNEVTSSRGKTSKNKDHIPAGAYGTTEIEDI
VMAEYTVNIRTPNGLLSSTAVVSIEQFSRMNGLTGHKMQRIFKALVHESVYNDARSLVEYCCFRFLSRDSSNIHPSLSEPTFQRLIFITMLAWENPYHEHTNVSEEISFQ
KMLVREEAFTRIAPAISGVADRSTVHNLFKALAGDEQSISLSLWLKYVDELLKLLPFPDRGDHWPVNVGFGLKLTPTTDASVFVGRFSLVLETLVLPADVDHERVHEGRK
LYRVRDNRQFFGENILCIGSSKKRPVLKWENNIAWPGKLTLTDKAVYFETPSFLSLQSKQRRPGDKRNERSRAALALKKGSTNNLLDLRISGRRNHLEMDGEVAVEASAR
FRYLKYGLHSRSQFLGLDIAVGIFGQKDIMRLDLAKDGVQVDKAKVGPFGSILFDSAVSVASSSEATVWANVDGRQWKTGKLLVTTITRGSFFKNGLISSTKENTNESKL
ETFQLENLEDKIGKNLVETHVSNGGDWDKNHENEVEEEKEEFKDVNSEIISSEEEKFNEEDGIEVSNTEQESSKNNDTLIAFEMDHQTVEELEANDKEDVILEQFSLEGG
FLVTGPLNTVLNELEGNIFFGFMANGEVHLLVAIMDFLSRIYHFDYNIFVQSVGDQLTNPKIASSNHVQPPILHKGVAIRLEDKNGIGYGPKKSFVEVVRSSPSTTKAVL
HPKVVERNSYWIRKDHDIVELNLQESLVVTKLMSHYSWGKIKASFEDLLKTKVSINPIMDDKALFHVNIVWVPSSLWIKFIKSYGGRICIKNLPLQYWKRLVFEAIGTHL
GGLIVISSHTLNCIDASSALIHVKRNSCGFIPASLEITDYVLENFRILLEDRGYAASEITQPSITPVLNPNAYSNSIDQDRIRKVLADEELDLNSDILAPSVLNSPSKLI
WPFESMPNVTSLLEEVIAMKSPNENKVLHQEVNENLIHS