; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009961 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009961
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF3741 domain-containing protein/DUF4378 domain-containing protein
Genome locationchr9:43591909..43597221
RNA-Seq ExpressionLag0009961
SyntenyLag0009961
Gene Ontology termsNA
InterPro domainsIPR025486 - Domain of unknown function DUF4378
IPR044257 - Protein TRM32-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570707.1 Protein TRM32, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0060.45Show/hide
Query:  KGDWIWNPFQFRRLGEEIGSSQNRSANPCVHLMKTCPEVEVMEFGPGEETRAGTKRTRIHQFWVRWVSTEPPPSEPAITCKNLLTSQRKPHGPHLFLSSC
        +G+WIW+PFQFRRLG EIGSSQ            T PEVEVMEFGPG+ETR+GTK TRIH  WV WVSTEPPPSEP+I C+                   
Subjt:  KGDWIWNPFQFRRLGEEIGSSQNRSANPCVHLMKTCPEVEVMEFGPGEETRAGTKRTRIHQFWVRWVSTEPPPSEPAITCKNLLTSQRKPHGPHLFLSSC

Query:  YCYLRELGRLLIMFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCS
             ELGRLLIMF+MEKH +R QDS++QFNRNVPGCFWSIFHTIDYHPWH+VKKMLPYR+HS+SKG PKST+NNHHVAE  +QINDENKPL CTAESC 
Subjt:  YCYLRELGRLLIMFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCS

Query:  FDRRPGE-----------------------------------VNFSECHGY-------------------------EDYFIQRKIAIRSTSFTEKSNGVK
           +PGE                                   ++  E   Y                         E+YF +RKI  + TS TEKSNGV+
Subjt:  FDRRPGE-----------------------------------VNFSECHGY-------------------------EDYFIQRKIAIRSTSFTEKSNGVK

Query:  KNLETNQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQ
        K LE+ Q++RNI+S SFKED HIQEIF+AN+KLFAELLQGA    TLQ P+N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQNESFPK +KS   PQ
Subjt:  KNLETNQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQ

Query:  PSKLVESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDR
        PSKLVESESPKNFHED++PCDSDSTS  NI  QT+SSS GPNRGL++GGWNQLV+KRFNFIKQKIRHSFKERKKGNNQKT+KG P  T DPSGHELPL R
Subjt:  PSKLVESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDR

Query:  EESHESSGITAREDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGS-----------NKN--------------------------
        EE+ ES G     DG GIRGYSET +SEN+NL +GVQTKT T    ASLERYS+LS+  S           N N                          
Subjt:  EESHESSGITAREDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGS-----------NKN--------------------------

Query:  REAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP---------------------------------------------------
        REAK +HSQSLRL  E+ I  IE+ RKS+GRNLSLPDIDLFCTLFTDP                                                    
Subjt:  REAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP---------------------------------------------------

Query:  --------HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPG
                 GSLNGV+ +EGAAWAD+LEE+I  LD SD +H QVL GSEC +E   V ETVD V +L   ++QV   ETC  DDET K S+SEG+ILN  
Subjt:  --------HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPG

Query:  CSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECY
        CS ANELEP DDQP +AS EAL A E+IVNHEII D EKISNYL L+SE GR+NNADFNYMRYILQLSSFIE+GHTID PLSSSIF+GEEA FYKKLECY
Subjt:  CSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECY

Query:  WEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLE
        WEKV++DSDHQLL DLVYET HN FEKS   FLK+FS + Q+RPMPLGQYLLE+VREK+  YLCLGPELDQSLDDVVG+DL KGDDWMNLQ+E EY +LE
Subjt:  WEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLE

Query:  SEDLILDELLDEVL
         EDLILDELL+EV+
Subjt:  SEDLILDELLDEVL

KGN47177.2 hypothetical protein Csa_020828 [Cucumis sativus]1.6e-28356.84Show/hide
Query:  VHLMKTCPEVEVMEFGPGEETRAGTKRTRIHQFWVRWVSTEPPPSEPAITCKNLLTSQRKPHGPHLFLSSCYCYLRELGRLLIMFEMEKHAQRCQDSSLQ
        + +++  PEVE MEFGPGEET AGTK TRI QFWV WVST                                   RE GRL++MF+MEKH QR QDS+LQ
Subjt:  VHLMKTCPEVEVMEFGPGEETRAGTKRTRIHQFWVRWVSTEPPPSEPAITCKNLLTSQRKPHGPHLFLSSCYCYLRELGRLLIMFEMEKHAQRCQDSSLQ

Query:  FNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGEVNFSE-----------------
        FN+NVPGCFWSIFHTIDYH WH+VKKMLP+RKHSRSK  PKSTLN+HH +E+P+QI+D NK   CT ESCS DR+    + +E                 
Subjt:  FNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGEVNFSE-----------------

Query:  --------------------CHGY------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNITSRSFKEDTHIQEIFR
                              GY                        +DY  QRK A+R TS  EKS+GVKK LETN++NRNI++RSFKED+HIQEIF+
Subjt:  --------------------CHGY------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNITSRSFKEDTHIQEIFR

Query:  ANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKNFHEDMIPCDSDSTSLD
        AN+KLFAELLQGAH  +T Q  +N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQ E+FPK QKSV + QPSKLV S SPKNFHEDM+PCDS ST+  
Subjt:  ANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKNFHEDMIPCDSDSTSLD

Query:  NIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHESSGITAREDGSGIRGYSET------
        NI+ Q T SSLG NRG ++GGWNQLV+KRFNFI+QKIRHSFKERKKGNNQKT+KGI   DPSGHEL L  EE+HES G    EDGSGIRGYS T      
Subjt:  NIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHESSGITAREDGSGIRGYSET------

Query:  -------------------------------------DSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQ
                                             D SENEN+ + VQT+T TA L+ASLE+YSQLS    +KNREAK YHS+SLRL SE+KI  +E 
Subjt:  -------------------------------------DSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQ

Query:  PRKSFGRNLSLPDIDLFCTLFTDPPH-----------------------------------------------------------GSLNGVEKNEGAAWA
        P+K FGRNLS PDIDLFCTLFTD PH                                                           GSLN +  +EG AWA
Subjt:  PRKSFGRNLSLPDIDLFCTLFTDPPH-----------------------------------------------------------GSLNGVEKNEGAAWA

Query:  DELEEQILQLDISDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFES
        D LEE+I  LDISD KH QVL G+E  V+ V +TV+  ++L    QVL+LE CF+DDET K S+SEGAI+NP C+   E EP DDQ  + S EALPAFE+
Subjt:  DELEEQILQLDISDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFES

Query:  IVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEK
         V H+IIDD EKISNYL LHSE GR++NA+FNYMR+ILQLSSFIE+G TID PL+SSIF+GEEAHFYKKLECYWEKVD+DSDHQLLLDLVYETLH+ +EK
Subjt:  IVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEK

Query:  SLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL
        S   FLKTFS + Q+RP+PLGQYLLE+VREK+  YL LGPELDQSLDDVV +DL KG++WMNLQSETE I+LE EDLILDELLDEV+
Subjt:  SLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL

XP_022943817.1 uncharacterized protein LOC111448443 isoform X4 [Cucurbita moschata]7.0e-28463.47Show/hide
Query:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------
        MF+MEKH +R QDS++QFNR+VPGCFWSIFHTIDYHPWH+VKKMLPYR+HS+SKG PKST+NNHHVAE  +QINDENKPL CTAESC    +PG      
Subjt:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------

Query:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI
            E++  E   Y                                                  E+YF +RKI  + TSFTE+SNGV+K LE+ Q++RNI
Subjt:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI

Query:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN
         S SFKED HIQEIF+AN+KLFAELLQGA    TLQ P+N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQNESFPK +KS   PQPSKLVESESPKN
Subjt:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN

Query:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR
        FHED++PCDSDSTS  NI  QT+SSS GPNRGL++GGWNQLV+KRFNFIKQKIRHSFKERKKGNNQKT+KG P  T DPSGHELPL REE+ ES G    
Subjt:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR

Query:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP
         DG GIRGYSET +SEN+NL +GVQTKT TA L A LERYSQ  E      REAK +HSQSLRL  E+ I  IE+ RKS+GRNLSLPDIDLFCTLFTDP 
Subjt:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP

Query:  -----------------------------------------------------------HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSE
                                                                    GSLN V+ +EGAAWAD+LEE+I  LD SD +H QVL GSE
Subjt:  -----------------------------------------------------------HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSE

Query:  CRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSE
        C +E   V ETVD V  L   ++QV   ETC  DDET K S+SEG+ILN  CS ANELEP DDQP +AS EAL A E+IVNHEII D EKISNYL L+SE
Subjt:  CRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSE

Query:  SGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQ
         GR+NNADFNYMRYILQLSSFIE+GHTID PLSSSIF+GEEA FYKKLECYWEKVD+DSDHQLL DLVYET HN FEKS   FLKTFS + Q+RPMPLGQ
Subjt:  SGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQ

Query:  YLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL
        YLLE+VREK+  YLCLGPELDQSLDDVVG+DL KGDDWMNLQ+E EY +LE EDLILDELL+EV+
Subjt:  YLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL

XP_031743531.1 uncharacterized protein LOC105435809 isoform X1 [Cucumis sativus]7.3e-28157.13Show/hide
Query:  MEFGPGEETRAGTKRTRIHQFWVRWVSTEPPPSEPAITCKNLLTSQRKPHGPHLFLSSCYCYLRELGRLLIMFEMEKHAQRCQDSSLQFNRNVPGCFWSI
        MEFGPGEET AGTK TRI QFWV WVST                                   RE GRL++MF+MEKH QR QDS+LQFN+NVPGCFWSI
Subjt:  MEFGPGEETRAGTKRTRIHQFWVRWVSTEPPPSEPAITCKNLLTSQRKPHGPHLFLSSCYCYLRELGRLLIMFEMEKHAQRCQDSSLQFNRNVPGCFWSI

Query:  FHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGEVNFSE-----------------------------
        FHTIDYH WH+VKKMLP+RKHSRSK  PKSTLN+HH +E+P+QI+D NK   CT ESCS DR+    + +E                             
Subjt:  FHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGEVNFSE-----------------------------

Query:  --------CHGY------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQG
                  GY                        +DY  QRK A+R TS  EKS+GVKK LETN++NRNI++RSFKED+HIQEIF+AN+KLFAELLQG
Subjt:  --------CHGY------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQG

Query:  AHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLG
        AH  +T Q  +N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQ E+FPK QKSV + QPSKLV S SPKNFHEDM+PCDS ST+  NI+ Q T SSLG
Subjt:  AHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLG

Query:  PNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHESSGITAREDGSGIRGYSET------------------
         NRG ++GGWNQLV+KRFNFI+QKIRHSFKERKKGNNQKT+KGI   DPSGHEL L  EE+HES G    EDGSGIRGYS T                  
Subjt:  PNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHESSGITAREDGSGIRGYSET------------------

Query:  -------------------------DSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLP
                                 D SENEN+ + VQT+T TA L+ASLE+YSQLS    +KNREAK YHS+SLRL SE+KI  +E P+K FGRNLS P
Subjt:  -------------------------DSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLP

Query:  DIDLFCTLFTDPPH-----------------------------------------------------------GSLNGVEKNEGAAWADELEEQILQLDI
        DIDLFCTLFTD PH                                                           GSLN +  +EG AWAD LEE+I  LDI
Subjt:  DIDLFCTLFTDPPH-----------------------------------------------------------GSLNGVEKNEGAAWADELEEQILQLDI

Query:  SDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEK
        SD KH QVL G+E  V+ V +TV+  ++L    QVL+LE CF+DDET K S+SEGAI+NP C+   E EP DDQ  + S EALPAFE+ V H+IIDD EK
Subjt:  SDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEK

Query:  ISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMK
        ISNYL LHSE GR++NA+FNYMR+ILQLSSFIE+G TID PL+SSIF+GEEAHFYKKLECYWEKVD+DSDHQLLLDLVYETLH+ +EKS   FLKTFS +
Subjt:  ISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMK

Query:  IQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL
         Q+RP+PLGQYLLE+VREK+  YL LGPELDQSLDDVV +DL KG++WMNLQSETE I+LE EDLILDELLDEV+
Subjt:  IQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL

XP_038902824.1 uncharacterized protein LOC120089436 [Benincasa hispida]2.6e-29462.44Show/hide
Query:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGEVNFSE
        MF+MEKH QR QDS+LQFN+NVPGCFWSIF+TIDYH WH+VKKMLPYRKHSRSKGG KSTLN+HHVAEVP+ INDEN+ L+CTAESC  DR   E + +E
Subjt:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGEVNFSE

Query:  --------------------------------------------------------------------CHGYEDYFIQRKIAIRSTSFTEKSNGVKKNLE
                                                                                EDY IQR+IAIR TS TEKSNGVKK LE
Subjt:  --------------------------------------------------------------------CHGYEDYFIQRKIAIRSTSFTEKSNGVKKNLE

Query:  TNQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKL
        TN++ RNI+SRS+KEDTHIQEIF+AN+KLFAELLQGAH  +TLQ  +N KS ASLAKS SFPAPG+ RKGYKKLSSLQHKQ ESFPK QKSV  PQPSKL
Subjt:  TNQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKL

Query:  VESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHES
        VES SPKNFHEDM PCDSDSTS  +IRQQTTSSSLG NRGL++GGWNQLV+KRFNFIKQKI+HSFKERKKGNNQKT+KGI T DPSG EL L REE+HES
Subjt:  VESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHES

Query:  SGITAREDGSGIR-------------------------------------------GYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKN
         G T  EDGSG R                                           GYSETD +EN+NL + VQTKT TA L+ASLERYSQLSE G +KN
Subjt:  SGITAREDGSGIR-------------------------------------------GYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKN

Query:  REAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP---------------------------------------------------
        REAK YHS+SLRL SE+K+  +E+P+K FGRNLS PDIDLFCTLFTDP                                                    
Subjt:  REAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP---------------------------------------------------

Query:  -----HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANE
              GSLN +  +EG AWAD L+E+I  LDI D KH QV SG+E  VE   ET D V++L H+NQVLEL TCF+DDET KFS+S+GAILNPGCSIANE
Subjt:  -----HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANE

Query:  LEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQ
        LEP DDQ  +A  EALPAFE+IVNHEI+DD EKISN L LHSE GR+NNADFNYMR+ILQLSSFIENGHTID PL+SSIF+GEEAHFYKKLECYW+KVD+
Subjt:  LEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQ

Query:  DSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLIL
        DSDHQLLLDLV ETLHN +EKS I FLKTFS + Q+RP+ LGQYLLEEVRE++  YLCLGPELDQSLDDVVG+DL KGDDWMNLQSETE+I+LE EDLIL
Subjt:  DSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLIL

Query:  DELLDEVL
        DELLDEVL
Subjt:  DELLDEVL

TrEMBL top hitse value%identityAlignment
A0A1S3CMA3 uncharacterized protein LOC1035020863.0e-27259.3Show/hide
Query:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRP-----GE
        MF+MEKH QR QDS+LQFN+NVPGCFWSIFHTIDYH WH+VKKMLP+RKHSRSK  PKSTLN HH AE+P  +       +C+ ESC  DR+P      E
Subjt:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRP-----GE

Query:  V--NFSE-------------------------------CHGY-----------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLET
        V  N SE                                 GY                              DY  QRKIAI  TS TEKS+GVKK LET
Subjt:  V--NFSE-------------------------------CHGY-----------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLET

Query:  NQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLV
        N++NRN+++RSFKED+H+QEIF+AN+KLFAELLQGAH  +TLQ P+N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQ E+FPK QKSV + QPSKLV
Subjt:  NQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLV

Query:  ESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHESS
        ES SPKNFHEDM+PCDSDST+  NI+ Q TSSSLG NRG K+GGWNQLV+KRFNFIKQKIRHSFKERKKGNNQKT+KGI   DPSGHEL L  EE+HES 
Subjt:  ESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHESS

Query:  GITAREDGSGIRGYSE------------------------------------------TDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNRE
        G    EDGSGIRGYSE                                          TD SENENL + VQT+T TA L+ASLE+YSQLS    +KNRE
Subjt:  GITAREDGSGIRGYSE------------------------------------------TDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNRE

Query:  AKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPPH----------------------------------------------------
        AK YHSQS+RL SE+KI  +E P+K FGRNLS PDIDLFCTLFTD PH                                                    
Subjt:  AKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPPH----------------------------------------------------

Query:  -------GSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIAN
                SLN +  +EG  W D LEE+I  LDISD KH QVL G+E  VE V  TVD  ++L H  QVLEL+TCF+DDET K S+SEGAI+NP CS+AN
Subjt:  -------GSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIAN

Query:  ELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVD
        E E  DDQ  + + EALPAFE+ V+H IIDD EKISN+L LHSE GR+NNA+FNYMR+ILQLSSFIE G TID PL+ SIF+GEEAHFYKKLECYWEKVD
Subjt:  ELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVD

Query:  QDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLI
        +DSDHQLLLDLVYETLHN +EKS   FLKTFS + Q+RPMPLGQYLLE+VREK+  YL LGPELDQSLDDVV +DL KG++WMNLQSETE I+LE EDLI
Subjt:  QDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLI

Query:  LDELLDEVL
        LDELLDEV+
Subjt:  LDELLDEVL

A0A6J1FSR2 uncharacterized protein LOC111448443 isoform X32.1e-27860.6Show/hide
Query:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------
        MF+MEKH +R QDS++QFNR+VPGCFWSIFHTIDYHPWH+VKKMLPYR+HS+SKG PKST+NNHHVAE  +QINDENKPL CTAESC    +PG      
Subjt:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------

Query:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI
            E++  E   Y                                                  E+YF +RKI  + TSFTE+SNGV+K LE+ Q++RNI
Subjt:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI

Query:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN
         S SFKED HIQEIF+AN+KLFAELLQGA    TLQ P+N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQNESFPK +KS   PQPSKLVESESPKN
Subjt:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN

Query:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR
        FHED++PCDSDSTS  NI  QT+SSS GPNRGL++GGWNQLV+KRFNFIKQKIRHSFKERKKGNNQKT+KG P  T DPSGHELPL REE+ ES G    
Subjt:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR

Query:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGS-------------------------------------NKNREAKRYHSQSLR
         DG GIRGYSET +SEN+NL +GVQTKT T    ASLERYS+LS+  S                                        REAK +HSQSLR
Subjt:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGS-------------------------------------NKNREAKRYHSQSLR

Query:  LKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP-----------------------------------------------------------HGSL
        L  E+ I  IE+ RKS+GRNLSLPDIDLFCTLFTDP                                                             GSL
Subjt:  LKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP-----------------------------------------------------------HGSL

Query:  NGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVE--VVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQ
        N V+ +EGAAWAD+LEE+I  LD SD +H QVL GSEC +E   V ETVD V            ETC  DDET K S+SEG+ILN  CS ANELEP DDQ
Subjt:  NGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVE--VVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQ

Query:  PCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLL
        P +AS EAL A E+IVNHEII D EKISNYL L+SE GR+NNADFNYMRYILQLSSFIE+GHTID PLSSSIF+GEEA FYKKLECYWEKVD+DSDHQLL
Subjt:  PCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLL

Query:  LDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEV
         DLVYET HN FEKS   FLKTFS + Q+RPMPLGQYLLE+VREK+  YLCLGPELDQSLDDVVG+DL KGDDWMNLQ+E EY +LE EDLILDELL+EV
Subjt:  LDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEV

Query:  L
        +
Subjt:  L

A0A6J1FU32 uncharacterized protein LOC111448443 isoform X43.4e-28463.47Show/hide
Query:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------
        MF+MEKH +R QDS++QFNR+VPGCFWSIFHTIDYHPWH+VKKMLPYR+HS+SKG PKST+NNHHVAE  +QINDENKPL CTAESC    +PG      
Subjt:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------

Query:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI
            E++  E   Y                                                  E+YF +RKI  + TSFTE+SNGV+K LE+ Q++RNI
Subjt:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI

Query:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN
         S SFKED HIQEIF+AN+KLFAELLQGA    TLQ P+N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQNESFPK +KS   PQPSKLVESESPKN
Subjt:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN

Query:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR
        FHED++PCDSDSTS  NI  QT+SSS GPNRGL++GGWNQLV+KRFNFIKQKIRHSFKERKKGNNQKT+KG P  T DPSGHELPL REE+ ES G    
Subjt:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR

Query:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP
         DG GIRGYSET +SEN+NL +GVQTKT TA L A LERYSQ  E      REAK +HSQSLRL  E+ I  IE+ RKS+GRNLSLPDIDLFCTLFTDP 
Subjt:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP

Query:  -----------------------------------------------------------HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSE
                                                                    GSLN V+ +EGAAWAD+LEE+I  LD SD +H QVL GSE
Subjt:  -----------------------------------------------------------HGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSE

Query:  CRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSE
        C +E   V ETVD V  L   ++QV   ETC  DDET K S+SEG+ILN  CS ANELEP DDQP +AS EAL A E+IVNHEII D EKISNYL L+SE
Subjt:  CRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSE

Query:  SGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQ
         GR+NNADFNYMRYILQLSSFIE+GHTID PLSSSIF+GEEA FYKKLECYWEKVD+DSDHQLL DLVYET HN FEKS   FLKTFS + Q+RPMPLGQ
Subjt:  SGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQ

Query:  YLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL
        YLLE+VREK+  YLCLGPELDQSLDDVVG+DL KGDDWMNLQ+E EY +LE EDLILDELL+EV+
Subjt:  YLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL

A0A6J1FYA4 uncharacterized protein LOC111448443 isoform X13.3e-27960.86Show/hide
Query:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------
        MF+MEKH +R QDS++QFNR+VPGCFWSIFHTIDYHPWH+VKKMLPYR+HS+SKG PKST+NNHHVAE  +QINDENKPL CTAESC    +PG      
Subjt:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPG------

Query:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI
            E++  E   Y                                                  E+YF +RKI  + TSFTE+SNGV+K LE+ Q++RNI
Subjt:  ----EVNFSECHGY--------------------------------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI

Query:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN
         S SFKED HIQEIF+AN+KLFAELLQGA    TLQ P+N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQNESFPK +KS   PQPSKLVESESPKN
Subjt:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN

Query:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR
        FHED++PCDSDSTS  NI  QT+SSS GPNRGL++GGWNQLV+KRFNFIKQKIRHSFKERKKGNNQKT+KG P  T DPSGHELPL REE+ ES G    
Subjt:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR

Query:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGS-------------------------------------NKNREAKRYHSQSLR
         DG GIRGYSET +SEN+NL +GVQTKT T    ASLERYS+LS+  S                                        REAK +HSQSLR
Subjt:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGS-------------------------------------NKNREAKRYHSQSLR

Query:  LKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP-----------------------------------------------------------HGSL
        L  E+ I  IE+ RKS+GRNLSLPDIDLFCTLFTDP                                                             GSL
Subjt:  LKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP-----------------------------------------------------------HGSL

Query:  NGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDD
        N V+ +EGAAWAD+LEE+I  LD SD +H QVL GSEC +E   V ETVD V  L   ++QV   ETC  DDET K S+SEG+ILN  CS ANELEP DD
Subjt:  NGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVE--VVGETVDDVTKLL-HINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDD

Query:  QPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQL
        QP +AS EAL A E+IVNHEII D EKISNYL L+SE GR+NNADFNYMRYILQLSSFIE+GHTID PLSSSIF+GEEA FYKKLECYWEKVD+DSDHQL
Subjt:  QPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQL

Query:  LLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDE
        L DLVYET HN FEKS   FLKTFS + Q+RPMPLGQYLLE+VREK+  YLCLGPELDQSLDDVVG+DL KGDDWMNLQ+E EY +LE EDLILDELL+E
Subjt:  LLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDE

Query:  VL
        V+
Subjt:  VL

A0A6J1J8H7 uncharacterized protein LOC111484426 isoform X26.9e-27762.5Show/hide
Query:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGE-----
        MF+MEKH +R QDS++QFNRNVPGCFWSIFHTIDYHPWH+VKKMLPYR+HS+SKG PKST+N+HHVAE  +QINDENKPL CTAESC    +PGE     
Subjt:  MFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGPKSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGE-----

Query:  ------------------------------VNFSECHGY-------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI
                                      ++  E   Y                         E+YF QRKI  + TSFTEKSNGV+K LE+ Q     
Subjt:  ------------------------------VNFSECHGY-------------------------EDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNI

Query:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN
         S SFKED HIQEIF+AN+KLFAELLQGA    TLQ P+N KSSASLAKSRSFPAPG+ RKGYKKLSSLQHKQNESFPK +KS   PQPSKL ESESPKN
Subjt:  TSRSFKEDTHIQEIFRANKKLFAELLQGAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKN

Query:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR
        FHED++PCDSDSTS  N+  QT+SSSLGPNRG+++GGWNQLV+KRFNFIKQKIRHSFKERKKGNNQKT+KG P  T DPSGHELPL REE+ ES G    
Subjt:  FHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGGWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIP--TADPSGHELPLDREESHESSGITAR

Query:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP
        +DG GIRGYSET +SEN+NL +GVQTKT TA L A LERYSQ  E      REAK +HSQSLRL  E+ I  IE+ RKS+GRNLSLPDIDLFCTLFTDP 
Subjt:  EDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSNKNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPP

Query:  --------------HGS---------------------------------------------LNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSE
                      H S                                             LN V+ +EGAAWAD LEE+I  LD SD +H QVL GSE
Subjt:  --------------HGS---------------------------------------------LNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSE

Query:  CRVE--VVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSES
        C +E   V ETVD V            ETCF DDET K S+SEG+ILN  CS+ANELEP DDQP +A  EAL A E+ VNHEII D EKISNYL L+S+ 
Subjt:  CRVE--VVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSES

Query:  GRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQY
        GR+NNADFNYMRYILQLSSFIE+GHTID PLSSSIF+GEEA FYKKLECYWEKVD+ SDHQLL DLVYET HN FEKS   FLKTFS + Q+RPMPLGQY
Subjt:  GRLNNADFNYMRYILQLSSFIENGHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQY

Query:  LLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL
        LLE+VREK+  YLCLGPELDQSLDDVVG+DL KGDDWMNLQ+E EY +LE EDLILDELL+EV+
Subjt:  LLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVL

SwissProt top hitse value%identityAlignment
F4HSD5 Protein TRM321.3e-1437.28Show/hide
Query:  NADFNYMRYILQLSSFIEN------GHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLG
        +A F Y++ +L++S F+EN       ++ + PL+ S+            E   ++ +  +D +LL DLV E +      S I+F KTF         P G
Subjt:  NADFNYMRYILQLSSFIEN------GHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLG

Query:  QYLLEEVREKIVRYLC-LGPE-LDQSLDDVVGQD-LSKGDDWMNLQSETEYISLESEDLILDELLDEVL
        +  L+EV  ++   L  LG E  D+SLDD+VG+D L+K D WMNLQ E+E+++LE EDLI D++LDE+L
Subjt:  QYLLEEVREKIVRYLC-LGPE-LDQSLDDVVGQD-LSKGDDWMNLQSETEYISLESEDLILDELLDEVL

Arabidopsis top hitse value%identityAlignment
AT1G07620.1 GTP-binding protein Obg/CgtA9.2e-1637.28Show/hide
Query:  NADFNYMRYILQLSSFIEN------GHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLG
        +A F Y++ +L++S F+EN       ++ + PL+ S+            E   ++ +  +D +LL DLV E +      S I+F KTF         P G
Subjt:  NADFNYMRYILQLSSFIEN------GHTIDWPLSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLG

Query:  QYLLEEVREKIVRYLC-LGPE-LDQSLDDVVGQD-LSKGDDWMNLQSETEYISLESEDLILDELLDEVL
        +  L+EV  ++   L  LG E  D+SLDD+VG+D L+K D WMNLQ E+E+++LE EDLI D++LDE+L
Subjt:  QYLLEEVREKIVRYLC-LGPE-LDQSLDDVVGQD-LSKGDDWMNLQSETEYISLESEDLILDELLDEVL

AT5G02390.1 Protein of unknown function (DUF3741)9.8e-2628.45Show/hide
Query:  GWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLD-REESHESSGITAREDGSGIRGYSETDSSENENLVNGVQTK--TETALLNA
        G NQ+VIKRF  ++QKI+H   E K   ++ T   +    P  +    D R++  E            ++G S   +++N N   G + K  T  + L  
Subjt:  GWNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLD-REESHESSGITAREDGSGIRGYSETDSSENENLVNGVQTK--TETALLNA

Query:  SLERYSQLSENGSNKNREAKR-YHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPPHGSLNGVEKNEGAAWADELEEQILQLDISDNKHP
        SL+RY QL E   +  REAK   +S+  +++ E+     ++  K  GR LSLP +        D    S+   E+ +G     E+ E   Q    +    
Subjt:  SLERYSQLSENGSNKNREAKR-YHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPPHGSLNGVEKNEGAAWADELEEQILQLDISDNKHP

Query:  QVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLL
        Q +  SE         VD  T+     Q  E+ T   + ET     S                  ++ P   S   +    S+ +     D E +S    
Subjt:  QVLSGSECRVEVVGETVDDVTKLLHINQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLL

Query:  LHSESGRLNNAD---FNYMRYILQLSSFIENGHTIDWPLSSSIFDG---EEAH------FYKKLECYW-EKVDQDSDHQLLLDLVYETLHNAFEKSLIHF
        L  E   ++  D   FNY+R IL++S F        W L     D    EE         ++  EC   E+   + +H LL DL+ E L   +E+S  ++
Subjt:  LHSESGRLNNAD---FNYMRYILQLSSFIENGHTIDWPLSSSIFDG---EEAH------FYKKLECYW-EKVDQDSDHQLLLDLVYETLHNAFEKSLIHF

Query:  LKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGP-ELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVLMS
         K  S   ++ PMP+G  +L+EV  +I  YL   P +  QS D V+ +DLS+ D WM+LQ E+E + +E EDLI +ELL+E+L+S
Subjt:  LKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGP-ELDQSLDDVVGQDLSKGDDWMNLQSETEYISLESEDLILDELLDEVLMS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAATATACACTTGTTTGGTTGCCAAGAAACCGAAGGAAAGCTTTGAAGCTGAAGGTTTCAGGCTCTGAATACCAAACAAACGACAACCAACACAAAATCAACTG
CAAAAGTAAACAAAATACAGACAAATTTCTGTTGGGTCAACCAAAAGGGGATTGGATTTGGAACCCATTTCAGTTTCGACGACTTGGCGAGGAAATTGGAAGTTCTCAGA
ATCGGTCAGCTAACCCATGTGTTCATCTTATGAAGACCTGCCCTGAGGTCGAGGTTATGGAGTTTGGACCAGGCGAAGAGACTCGTGCTGGGACCAAGCGGACCAGGATT
CACCAATTTTGGGTGCGGTGGGTCTCCACGGAGCCGCCTCCTTCTGAACCTGCCATAACCTGTAAAAATCTGTTAACCTCTCAAAGAAAGCCCCACGGCCCCCACCTTTT
CCTTTCTTCTTGCTATTGCTATTTAAGAGAGCTTGGAAGACTTCTCATCATGTTTGAAATGGAAAAGCACGCCCAGCGGTGCCAGGACTCCAGTCTGCAGTTTAACAGAA
ACGTTCCAGGCTGCTTCTGGAGCATATTCCATACTATTGACTACCATCCCTGGCATAGTGTTAAAAAGATGCTTCCTTACAGAAAGCATTCAAGAAGCAAGGGAGGTCCA
AAATCGACTCTGAACAACCACCACGTTGCCGAAGTGCCAAAACAAATTAATGATGAAAACAAGCCTCTACTGTGTACTGCAGAGAGTTGCTCTTTTGACCGAAGACCTGG
AGAAGTTAACTTCTCTGAATGCCATGGATACGAGGACTACTTCATCCAGAGAAAGATTGCTATCCGGTCGACATCTTTTACAGAAAAATCTAATGGAGTAAAGAAAAACT
TAGAAACTAACCAGGTCAACAGAAACATCACCAGTCGCTCGTTTAAGGAAGATACTCACATCCAAGAGATATTTAGGGCAAACAAAAAACTATTTGCTGAATTGTTGCAG
GGTGCACATGGTAACCACACTCTCCAACCCCCAAAAAATACGAAGTCCTCAGCAAGTCTAGCGAAATCAAGATCCTTTCCTGCTCCTGGTGTAACGCGAAAAGGATATAA
AAAGCTTAGCTCACTCCAACACAAGCAGAATGAGTCCTTTCCGAAGGGACAAAAATCTGTTTTTGTCCCCCAACCATCAAAGCTGGTTGAATCTGAATCTCCTAAGAATT
TTCATGAAGATATGATACCTTGTGACTCTGATAGTACTTCATTGGATAACATAAGACAACAAACAACCTCTTCTTCTTTGGGCCCAAACCGTGGACTAAAGTATGGGGGG
TGGAATCAATTGGTTATCAAGCGTTTCAATTTTATTAAGCAGAAAATAAGGCACTCTTTCAAGGAGAGGAAGAAGGGAAATAACCAGAAAACAGCTAAAGGAATACCAAC
TGCAGATCCTTCTGGACATGAACTACCCCTTGACAGAGAAGAGTCACATGAGAGTTCAGGAATTACTGCTCGTGAAGATGGCTCAGGCATTAGAGGTTATAGTGAGACTG
ATAGTTCTGAGAATGAAAATCTCGTTAATGGAGTTCAAACCAAGACAGAGACAGCTTTATTAAATGCTTCTCTGGAAAGATATTCTCAACTGTCTGAGAACGGTTCCAAT
AAAAACAGAGAGGCCAAGCGCTACCACTCACAAAGCTTAAGGCTGAAAAGTGAAGATAAAATTTCGATTATAGAGCAGCCTCGAAAATCTTTTGGAAGGAATCTTTCTCT
GCCTGATATTGACCTCTTTTGTACATTATTTACTGACCCTCCTCATGGTAGTTTGAACGGGGTGGAAAAGAATGAGGGAGCTGCTTGGGCAGATGAACTTGAGGAGCAAA
TACTTCAACTAGATATTTCAGACAATAAACACCCGCAAGTATTGTCAGGTAGCGAATGCAGAGTTGAAGTTGTTGGTGAGACTGTTGATGATGTCACCAAGCTTTTACAC
ATTAATCAAGTCTTAGAACTTGAAACTTGTTTCGAAGATGATGAAACTTTAAAGTTCTCCAACTCAGAAGGTGCAATACTAAATCCTGGGTGCAGTATTGCAAATGAACT
TGAACCTTTCGATGACCAGCCTTGCAAGGCCAGCATAGAAGCTTTACCAGCTTTTGAAAGCATTGTCAATCATGAGATAATTGATGATGTCGAAAAGATTTCTAACTATC
TCCTTCTACATTCTGAATCTGGTAGACTCAATAATGCCGACTTCAACTATATGAGGTATATTCTTCAGCTCTCCAGCTTTATCGAAAATGGTCACACGATAGACTGGCCA
CTTAGCTCTTCGATATTTGATGGAGAGGAGGCCCACTTTTACAAAAAACTTGAATGCTATTGGGAGAAGGTTGACCAAGATTCTGATCATCAGCTTCTGCTTGATTTAGT
TTATGAGACATTACATAATGCATTTGAAAAATCATTAATTCATTTCCTCAAAACCTTCTCCATGAAGATCCAACTCCGTCCAATGCCACTTGGGCAATATCTTCTCGAGG
AGGTTCGAGAAAAAATTGTCAGGTACTTGTGCTTGGGACCTGAACTAGACCAGTCTTTAGACGATGTCGTGGGACAAGATTTAAGTAAAGGTGATGATTGGATGAACCTC
CAATCTGAAACAGAGTACATATCACTTGAGTCGGAGGATTTGATTCTTGATGAGCTTTTAGATGAAGTATTAATGAGTTTTAGGATTTTTTGGATAGTTCTTGTTTATAC
TGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAATATACACTTGTTTGGTTGCCAAGAAACCGAAGGAAAGCTTTGAAGCTGAAGGTTTCAGGCTCTGAATACCAAACAAACGACAACCAACACAAAATCAACTG
CAAAAGTAAACAAAATACAGACAAATTTCTGTTGGGTCAACCAAAAGGGGATTGGATTTGGAACCCATTTCAGTTTCGACGACTTGGCGAGGAAATTGGAAGTTCTCAGA
ATCGGTCAGCTAACCCATGTGTTCATCTTATGAAGACCTGCCCTGAGGTCGAGGTTATGGAGTTTGGACCAGGCGAAGAGACTCGTGCTGGGACCAAGCGGACCAGGATT
CACCAATTTTGGGTGCGGTGGGTCTCCACGGAGCCGCCTCCTTCTGAACCTGCCATAACCTGTAAAAATCTGTTAACCTCTCAAAGAAAGCCCCACGGCCCCCACCTTTT
CCTTTCTTCTTGCTATTGCTATTTAAGAGAGCTTGGAAGACTTCTCATCATGTTTGAAATGGAAAAGCACGCCCAGCGGTGCCAGGACTCCAGTCTGCAGTTTAACAGAA
ACGTTCCAGGCTGCTTCTGGAGCATATTCCATACTATTGACTACCATCCCTGGCATAGTGTTAAAAAGATGCTTCCTTACAGAAAGCATTCAAGAAGCAAGGGAGGTCCA
AAATCGACTCTGAACAACCACCACGTTGCCGAAGTGCCAAAACAAATTAATGATGAAAACAAGCCTCTACTGTGTACTGCAGAGAGTTGCTCTTTTGACCGAAGACCTGG
AGAAGTTAACTTCTCTGAATGCCATGGATACGAGGACTACTTCATCCAGAGAAAGATTGCTATCCGGTCGACATCTTTTACAGAAAAATCTAATGGAGTAAAGAAAAACT
TAGAAACTAACCAGGTCAACAGAAACATCACCAGTCGCTCGTTTAAGGAAGATACTCACATCCAAGAGATATTTAGGGCAAACAAAAAACTATTTGCTGAATTGTTGCAG
GGTGCACATGGTAACCACACTCTCCAACCCCCAAAAAATACGAAGTCCTCAGCAAGTCTAGCGAAATCAAGATCCTTTCCTGCTCCTGGTGTAACGCGAAAAGGATATAA
AAAGCTTAGCTCACTCCAACACAAGCAGAATGAGTCCTTTCCGAAGGGACAAAAATCTGTTTTTGTCCCCCAACCATCAAAGCTGGTTGAATCTGAATCTCCTAAGAATT
TTCATGAAGATATGATACCTTGTGACTCTGATAGTACTTCATTGGATAACATAAGACAACAAACAACCTCTTCTTCTTTGGGCCCAAACCGTGGACTAAAGTATGGGGGG
TGGAATCAATTGGTTATCAAGCGTTTCAATTTTATTAAGCAGAAAATAAGGCACTCTTTCAAGGAGAGGAAGAAGGGAAATAACCAGAAAACAGCTAAAGGAATACCAAC
TGCAGATCCTTCTGGACATGAACTACCCCTTGACAGAGAAGAGTCACATGAGAGTTCAGGAATTACTGCTCGTGAAGATGGCTCAGGCATTAGAGGTTATAGTGAGACTG
ATAGTTCTGAGAATGAAAATCTCGTTAATGGAGTTCAAACCAAGACAGAGACAGCTTTATTAAATGCTTCTCTGGAAAGATATTCTCAACTGTCTGAGAACGGTTCCAAT
AAAAACAGAGAGGCCAAGCGCTACCACTCACAAAGCTTAAGGCTGAAAAGTGAAGATAAAATTTCGATTATAGAGCAGCCTCGAAAATCTTTTGGAAGGAATCTTTCTCT
GCCTGATATTGACCTCTTTTGTACATTATTTACTGACCCTCCTCATGGTAGTTTGAACGGGGTGGAAAAGAATGAGGGAGCTGCTTGGGCAGATGAACTTGAGGAGCAAA
TACTTCAACTAGATATTTCAGACAATAAACACCCGCAAGTATTGTCAGGTAGCGAATGCAGAGTTGAAGTTGTTGGTGAGACTGTTGATGATGTCACCAAGCTTTTACAC
ATTAATCAAGTCTTAGAACTTGAAACTTGTTTCGAAGATGATGAAACTTTAAAGTTCTCCAACTCAGAAGGTGCAATACTAAATCCTGGGTGCAGTATTGCAAATGAACT
TGAACCTTTCGATGACCAGCCTTGCAAGGCCAGCATAGAAGCTTTACCAGCTTTTGAAAGCATTGTCAATCATGAGATAATTGATGATGTCGAAAAGATTTCTAACTATC
TCCTTCTACATTCTGAATCTGGTAGACTCAATAATGCCGACTTCAACTATATGAGGTATATTCTTCAGCTCTCCAGCTTTATCGAAAATGGTCACACGATAGACTGGCCA
CTTAGCTCTTCGATATTTGATGGAGAGGAGGCCCACTTTTACAAAAAACTTGAATGCTATTGGGAGAAGGTTGACCAAGATTCTGATCATCAGCTTCTGCTTGATTTAGT
TTATGAGACATTACATAATGCATTTGAAAAATCATTAATTCATTTCCTCAAAACCTTCTCCATGAAGATCCAACTCCGTCCAATGCCACTTGGGCAATATCTTCTCGAGG
AGGTTCGAGAAAAAATTGTCAGGTACTTGTGCTTGGGACCTGAACTAGACCAGTCTTTAGACGATGTCGTGGGACAAGATTTAAGTAAAGGTGATGATTGGATGAACCTC
CAATCTGAAACAGAGTACATATCACTTGAGTCGGAGGATTTGATTCTTGATGAGCTTTTAGATGAAGTATTAATGAGTTTTAGGATTTTTTGGATAGTTCTTGTTTATAC
TGTATAG
Protein sequenceShow/hide protein sequence
MEEYTLVWLPRNRRKALKLKVSGSEYQTNDNQHKINCKSKQNTDKFLLGQPKGDWIWNPFQFRRLGEEIGSSQNRSANPCVHLMKTCPEVEVMEFGPGEETRAGTKRTRI
HQFWVRWVSTEPPPSEPAITCKNLLTSQRKPHGPHLFLSSCYCYLRELGRLLIMFEMEKHAQRCQDSSLQFNRNVPGCFWSIFHTIDYHPWHSVKKMLPYRKHSRSKGGP
KSTLNNHHVAEVPKQINDENKPLLCTAESCSFDRRPGEVNFSECHGYEDYFIQRKIAIRSTSFTEKSNGVKKNLETNQVNRNITSRSFKEDTHIQEIFRANKKLFAELLQ
GAHGNHTLQPPKNTKSSASLAKSRSFPAPGVTRKGYKKLSSLQHKQNESFPKGQKSVFVPQPSKLVESESPKNFHEDMIPCDSDSTSLDNIRQQTTSSSLGPNRGLKYGG
WNQLVIKRFNFIKQKIRHSFKERKKGNNQKTAKGIPTADPSGHELPLDREESHESSGITAREDGSGIRGYSETDSSENENLVNGVQTKTETALLNASLERYSQLSENGSN
KNREAKRYHSQSLRLKSEDKISIIEQPRKSFGRNLSLPDIDLFCTLFTDPPHGSLNGVEKNEGAAWADELEEQILQLDISDNKHPQVLSGSECRVEVVGETVDDVTKLLH
INQVLELETCFEDDETLKFSNSEGAILNPGCSIANELEPFDDQPCKASIEALPAFESIVNHEIIDDVEKISNYLLLHSESGRLNNADFNYMRYILQLSSFIENGHTIDWP
LSSSIFDGEEAHFYKKLECYWEKVDQDSDHQLLLDLVYETLHNAFEKSLIHFLKTFSMKIQLRPMPLGQYLLEEVREKIVRYLCLGPELDQSLDDVVGQDLSKGDDWMNL
QSETEYISLESEDLILDELLDEVLMSFRIFWIVLVYTV