CuGenDBv2

Lag0030066 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030066
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF1666)
Genome location: chr8:44296694..44299042
RNA-Seq Expression: Lag0030066
Synteny: Lag0030066
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR012870 - Protein of unknown function DUF1666


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7033168.1 hypothetical protein SDJN02_07222, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.4e-136; % identity: 61.18)
Query:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN
        + FS N N  ++ +LLLK +L++FCALF LL+ FFKYY     K+ LF        Q+ VD D   +TPFS+LSF+FPTYE+FLR TT+NVDPP++ETSN
Subjt:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN

Query:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTK-
        ENDF +++ S   +   D DL   E+PE F  NGL+ +N ++I+D         NE  +  EAISGSFD             EDFI+EL+ E  KAK K 
Subjt:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTK-

Query:  ---PGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGE
             LPSIPEESEYP AMEED K  K+EE  ++K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+G+AKTGE
Subjt:  ---PGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGE

Query:  NRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDN
        NRGIDGELE+VYVVQ+WV WEFIVWQYKKALE+ G EGYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ RRKLLQVPLLKEDE KDN
Subjt:  NRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDN

Query:  KKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
        KK G  +++ + VK+DR+IEILQ  IR+ WQFIRADK AHI ST K H IE +QE  SPA+S IL QILLDL K
Subjt:  KKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

XP_022959969.1 uncharacterized protein LOC111460861 [Cucurbita moschata] (e value: 9.7e-136; % identity: 60.55)
Query:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN
        + FS N N  ++ +LLLK +L++FCALFPLL+ FFKYY     K+ LF        Q+ VD D   +TPFS+LSF+FPTYE+FLR TT+NVDPP++ETSN
Subjt:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN

Query:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIE----TMKA
        ENDF +++ S   +   D DL   E+PE F  NGL+ +N + I+D         NE  +  EAISGSFD             +DFI+EL+ E      KA
Subjt:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIE----TMKA

Query:  KTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGE
        K++ GLPSIPEESEYP AMEED K  K+EE  ++K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+G+AKTGE
Subjt:  KTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGE

Query:  NRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDN
        NRG++GELE+VYVVQ+WV WEFIVWQYKKALE+ G EGYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ +RKLLQVPLLKEDE KDN
Subjt:  NRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDN

Query:  KKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
        KK G  +++ + VK+DR+IEILQ  IR+ WQFIRADK AHI ST K H IE +QE  SPA+S IL QILLDL K
Subjt:  KKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

XP_023544885.1 uncharacterized protein LOC111804322 [Cucurbita pepo subsp. pepo] (e value: 7.9e-138; % identity: 61.47)
Query:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN
        + FS N N  ++ +LLLK +L++FCALFPLL+ FFKYY     ++ LF        QE VDSD   +TPFS+LSF+FPTYE+FLR TT+NVDPP++ETSN
Subjt:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN

Query:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTK-
        ENDF +++ S   +   D DL   E+PE F  NGL+C+N I+I+D         NE  D  EAISG FD             +DFI+EL+ E  KAK K 
Subjt:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTK-

Query:  ---PGLPSIPEESEYPTAMEEDVKPWKR-EESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTG
            GLPSIPEESEYP AMEED K  K+ EE  + K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+G+AKTG
Subjt:  ---PGLPSIPEESEYPTAMEEDVKPWKR-EESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTG

Query:  ENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKD
        ENRG++GELE+VYVVQ+WV WEFIVWQYKK LE+ GREGYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ RRKLLQVPLLKEDE KD
Subjt:  ENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKD

Query:  NKKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
        NKK G  +++ + VK+DR+IEILQ  IR+ WQFIRADK AHI ST K H +E +QE  SPA+S IL QILLDLQK
Subjt:  NKKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

XP_038889168.1 uncharacterized protein LOC120079052 isoform X1 [Benincasa hispida] (e value: 4.5e-165; % identity: 66.53)
Query:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP
        MK L  TQ SF  NE+LYNI ++L KA+LV+FCALFPLL+G+F+Y+K    K+ LFE++  EALQES + +E +TPFSALSFRFPTYEEFL   ENVD  
Subjt:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETM
          +TSNE+DF+E +CSIPS+P  +F LH  E  E    NGL CTN ++IED      + F+E  DS EA       KIEDS EIESE+++FIK LQI+ M
Subjt:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETM

Query:  K----AKTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG
        K    AK K  LPSIPEESEYP   E D+KPWK+EESF+ +D  KELH FHKLY EKMRKYDTLNHQ TYAKEL+ MQSK+SV+SVLS+G CGC+PDK  
Subjt:  K----AKTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG

Query:  DAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLK-
          KTGE RGID ELELVYV+QLWV WEFIVWQYKK LE+ GREGYG C ++EVAEKFEHF+VMIQRFMENE FDEGSRVECY R RLARRKLLQVPL+K 
Subjt:  DAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLK-

Query:  ---EDEVKDNKKKGGM-DDDNDN-VKIDRLIEILQGSIRILWQFIRADKFAHISTLKCHIEAEQECRSPAHSKILKQILLDLQK
           EDEVK+NK KGG  +DD +N VKIDRLI+I Q SIRILWQFI ADK  HISTLKC +EA+QEC SP+HSKI  QILL LQK
Subjt:  ---EDEVKDNKKKGGM-DDDNDN-VKIDRLIEILQGSIRILWQFIRADKFAHISTLKCHIEAEQECRSPAHSKILKQILLDLQK

XP_038889169.1 uncharacterized protein LOC120079052 isoform X2 [Benincasa hispida] (e value: 7.4e-136; % identity: 62.12)
Query:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP
        MK L  TQ SF  NE+LYNI ++L KA+LV+FCALFPLL+G+F+Y+K    K+ LFE++  EALQES + +E +TPFSALSFRFPTYEEFL   ENVD  
Subjt:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETM
          +TSNE+DF+E +CSIPS+P  +F LH  E  E    NGL CTN ++IED      + F+E  DS EA       KIEDS EIESE+++FIK LQI+ M
Subjt:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETM

Query:  K----AKTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG
        K    AK K  LPSIPEESEYP   E D+KPWK+EESF+ +D  KELH FHKLY EKMRKYDTLNHQ TYAKEL+ MQSK+SV+SVLS+G CGC+PDK  
Subjt:  K----AKTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG

Query:  DAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKE
          KTGE RGID ELELVYV+QLWV WEFIVWQYKK LE+ GREGYG C ++EVAEKFEHF+VMIQRFMENE FDEGSRVECY R RLARRKLLQVPL+K 
Subjt:  DAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKE

Query:  DEVKDNKKKGGMDDDNDNVKIDRLIEILQGSIR
           +++ ++  +++D+   K+   +E LQ   R
Subjt:  DEVKDNKKKGGMDDDNDNVKIDRLIEILQGSIR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KS72 Uncharacterized protein (e value: 2.3e-130; % identity: 51.67)
Query:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP
        MK L  TQ  F +NE+  N+F+++ K +L+  C LFP  + +FKY+K                   + D  + + PFSAL+FRFPTYEEFL+T ENVD  
Subjt:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIED-----------------------------------------------PS
            SNE DF+E +CS+PSS   DF LH  E  E    N L CTN + IED                                               PS
Subjt:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIED-----------------------------------------------PS

Query:  IEYGDDFNEIGDSAEAIS--------GSFDPKIEDS---DEIESEEEDFIKELQIETMKAKTKPGLPSIPEESEYP-TAMEEDVKPW-KREESFDKKDVT
            + F    + +E +S        G F   I+ +    E +SE++DFIK LQI+ MKAK K  LPSIPEE++Y  T  E D+KPW K++ESF+ +D+T
Subjt:  IEYGDDFNEIGDSAEAIS--------GSFDPKIEDS---DEIESEEEDFIKELQIETMKAKTKPGLPSIPEESEYP-TAMEEDVKPW-KREESFDKKDVT

Query:  KELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREG
        KELH+FHK YTEKMRKYD LN QKTYAKELKMMQSK+SV+SV +KG C C    +G+ KT E++GIDGE+E+VYVVQLWV WEFIVW+YKKALE+ GRE 
Subjt:  KELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREG

Query:  YGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDN--DNVKIDRLIEILQGSIRILWQFIRAD
        YGSCR++EVAEKFEHFKVMIQRFMENE  +EGSRVECY ++RL RRK LQVPLLKEDEVK+    GG  +DN  + V IDRLI+ILQ SIRILWQFIR D
Subjt:  YGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDN--DNVKIDRLIEILQGSIRILWQFIRAD

Query:  KFAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQKV
        K  HIST L CH+E +QE  SP+HS +  Q+L+DLQKV
Subjt:  KFAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQKV

A0A1S4E2E7 uncharacterized protein LOC103498480 (e value: 2.2e-133; % identity: 53.25)
Query:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP
        MK L   Q  F DNE  +N+F+ +LK +L+  C LFP  + +FKY+K    K+ LFE                D PFS L+FRFPTYEEFL+T ENVD  
Subjt:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIED-----------------------------------------------PS
            SNE D +E +CSIPSSP  DF LH  E  E F  N L CTN + IED                                               PS
Subjt:  IFETSNENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIED-----------------------------------------------PS

Query:  IEYGDDFNEIGDSAEAISGSFDPKIEDSDEIE---------SEEEDFIKELQIETMK----AKTKPGLPSIPEESEYP-TAMEEDVKPW-KREESFDKKD
        I   D F    + +E +S +      D D IE         SE++D IK LQI+ MK    AK K  LPSIPEE++Y  T  E D+KPW K++ESF+ +D
Subjt:  IEYGDDFNEIGDSAEAISGSFDPKIEDSDEIE---------SEEEDFIKELQIETMK----AKTKPGLPSIPEESEYP-TAMEEDVKPW-KREESFDKKD

Query:  VTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGR
        +TKELHKFHK YTEKMRKYD LN QKTYAKELKMMQSK+SV+SVL+KG C C+ +K+     G  R IDGE+E+VYVVQLWV WEFIVW+YKKALE+ GR
Subjt:  VTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGR

Query:  EGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDN--DNVKIDRLIEILQGSIRILWQFIR
        EGYGSCR++ VAEKFEHFKVMI+RFMENE  +EGSRVECY R+RL RRKLLQVPLLKEDEVK+     G ++DN  + V IDRLI ILQ SIRILWQFIR
Subjt:  EGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDN--DNVKIDRLIEILQGSIRILWQFIR

Query:  ADKFAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQK
         DK  HIST LKCH+EA+QE  SP+H  I  Q+LLDLQK
Subjt:  ADKFAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQK

A0A6J1BWI3 uncharacterized protein LOC111006162 (e value: 2.1e-112; % identity: 55.8)
Query:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRA-------------GKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTY
        M+FLF+ QL                  +LV+F ALF   + F +Y+K +              GKE + E+D  E LQ+S D +E +TPFSALSFRFPT+
Subjt:  MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRA-------------GKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTY

Query:  EEFLRTTENVDPPIFETSNENDFVEANCSIPSSPVLDFDLHH-SENPEKFPQNGLKCTNGIKIEDPSIEYGDDF-NEIGDSAEAISGSFDPKIEDSDEIE
        EEF+ T ENVD  I +TSNEN       S+PS P  DF+LHH  E+PE FPQN L CT             DDF NE  D  E      D KIEDSD+  
Subjt:  EEFLRTTENVDPPIFETSNENDFVEANCSIPSSPVLDFDLHH-SENPEKFPQNGLKCTNGIKIEDPSIEYGDDF-NEIGDSAEAISGSFDPKIEDSDEIE

Query:  SEEEDFIKELQIETMKAKTK-PGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLS
        SEEE+ I+ LQ    K K   PGLP IPEESEY   MEED KPW+ +E+FD ++ TKELHK HK+Y E+M+KYDTLNHQK YAK+LKMMQSKD + SVLS
Subjt:  SEEEDFIKELQIETMKAKTK-PGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLS

Query:  KGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLA
        K LC CRP K+GD   G+ R IDG+LE+VYV QLW  WEF+V QYKKALE+C R   GSCR++EVA KF+HF+V+IQRFMENE F+EGSRVECYAR RL 
Subjt:  KGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLA

Query:  RRKLLQVPLLKEDEVKDNKKK----GGMDDDNDN-VKIDRLIEILQGS
        RRKLLQVP++KEDEVK+N KK     G DDDN++ +KIDR IEILQ S
Subjt:  RRKLLQVPLLKEDEVKDNKKK----GGMDDDNDN-VKIDRLIEILQGS

A0A6J1H9K8 uncharacterized protein LOC111460861 (e value: 4.7e-136; % identity: 60.55)
Query:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN
        + FS N N  ++ +LLLK +L++FCALFPLL+ FFKYY     K+ LF        Q+ VD D   +TPFS+LSF+FPTYE+FLR TT+NVDPP++ETSN
Subjt:  LSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDE-TDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN

Query:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIE----TMKA
        ENDF +++ S   +   D DL   E+PE F  NGL+ +N + I+D         NE  +  EAISGSFD             +DFI+EL+ E      KA
Subjt:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIE----TMKA

Query:  KTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGE
        K++ GLPSIPEESEYP AMEED K  K+EE  ++K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+G+AKTGE
Subjt:  KTKPGLPSIPEESEYPTAMEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGE

Query:  NRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDN
        NRG++GELE+VYVVQ+WV WEFIVWQYKKALE+ G EGYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ +RKLLQVPLLKEDE KDN
Subjt:  NRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDN

Query:  KKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
        KK G  +++ + VK+DR+IEILQ  IR+ WQFIRADK AHI ST K H IE +QE  SPA+S IL QILLDL K
Subjt:  KKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

A0A6J1JR14 uncharacterized protein LOC111487563 (e value: 6.8e-135; % identity: 60.63)
Query:  QLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN
        ++ FS N N  ++  LLLK +L++FCALFPLL+ FFKYY     K+ LF     E            TPFS+LSF+FPTYE+FLR TT+NV+PP + TSN
Subjt:  QLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLR-TTENVDPPIFETSN

Query:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTK-
        ENDF +++ S   +   D DL   E+PE F  NGL+C+N I+I+D         NE  D  EAISGSFD             +DFI+EL+ E MKAK K 
Subjt:  ENDFVEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTK-

Query:  ---PGLPSIPEESEYPTAMEEDVKPWKREES-FDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTG
            GLPSIPEESEYP AMEED K  K+EE   ++K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRP K+G+AKT 
Subjt:  ---PGLPSIPEESEYPTAMEEDVKPWKREES-FDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTG

Query:  ENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKD
        ENRGI+GELE+VYVVQ+WV WEFIVWQYKKALE+ GREGYGS R++EVAEKFEHFKV IQRFME E  +EGSRVE YAR+R+ RRKLLQVPLL+EDE KD
Subjt:  ENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKD

Query:  NKKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
         KK G ++++ D VK+DR+IEILQ  IR+ WQFIRADK AHI ST K H IE +QE  SPA+S IL QILLDLQK
Subjt:  NKKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G69610.1 Protein of unknown function (DUF1666) (e value: 1.4e-28; % identity: 33.13)
Query:  DSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTKPGLPSIPEESEYPTAMEEDVKPWKREESFDK-KDVTKELHKFHKLYTEKMRKYDTLNHQ
        D  E +   FD    DSD+ E E  D I++L+ E   A+T  GL +I EESE P    +++KP K E   D+ KD   E+HK +K Y  KMRK D ++ Q
Subjt:  DSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTKPGLPSIPEESEYPTAMEEDVKPWKREESFDK-KDVTKELHKFHKLYTEKMRKYDTLNHQ

Query:  KTYAKELKMMQ--SKDSVDS-------------VLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDE
          ++  L  ++  SK S ++                K    C P +R   +         + E VYV Q+ + WE + WQY K LE   +    + +Y+ 
Subjt:  KTYAKELKMMQ--SKDSVDS-------------VLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDE

Query:  VAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHISTLKC
        VA +F+ F+V++QRF+ENEPF   SRVE Y + R   +  LQ+PL+++D  + +KKK   + +   VK + L EI++ S+ + W+F+ ADK    S +K 
Subjt:  VAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKFAHISTLKC

Query:  HIEAEQECRSPAHSKILKQILLDLQK
          + +   +     ++L  I   LQK
Subjt:  HIEAEQECRSPAHSKILKQILLDLQK

AT1G73850.1 Protein of unknown function (DUF1666) (e value: 2.7e-11; % identity: 23.9)
Query:  SENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEED-----FIKELQIETMKAKTKPGLPSIPEESEYPTAMEED
        S N  +F  NG     GIK         +D +EI  S+      FD +   ++E+E EEE+     F +     +    +     S+  +  + T+    
Subjt:  SENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEED-----FIKELQIETMKAKTKPGLPSIPEESEYPTAMEED

Query:  VKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEF
           W+    F K D  +E+    ++  +K+ + ++L       + +    S+  V  + S G    +  K+     G       ELE  YV Q+ + WE 
Subjt:  VKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEF

Query:  IVWQYK--KALEMCGREGYGSCRYDE-VAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDNDNVKIDRLI
        + W YK  +      +  +        +A++F  F +++QR++ENEP++ G R E YAR R    KLL VP  ++ E ++ K+    +     +     +
Subjt:  IVWQYK--KALEMCGREGYGSCRYDE-VAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDNDNVKIDRLI

Query:  EILQGSIRILWQFIRADK
         I++  IR    F++ADK
Subjt:  EILQGSIRILWQFIRADK

AT3G20260.1 Protein of unknown function (DUF1666) (e value: 1.3e-16; % identity: 27.08)
Query:  LKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDS---DEIESEEEDFIKELQIETMKAKTKPG----LPSIPEESEYPTAMEEDVKPWKREES
        LK     K   P++E   +   +    +A S + +  + DS   DEIE +++DFI       +K   +      +P   EE E  + ++ED    + + S
Subjt:  LKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDS---DEIESEEEDFIKELQIETMKAKTKPG----LPSIPEESEYPTAMEEDVKPWKREES

Query:  FDKKDVTKE-------LHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVD-SVLSKGLCG---CRPDKRGDAKTGENRGID--------GELELVY
         + +DV  E           ++ Y E+M  +D L+ Q+     + +  S  +      SK L     C   K+ D    +   +          +LE  Y
Subjt:  FDKKDVTKE-------LHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVD-SVLSKGLCG---CRPDKRGDAKTGENRGID--------GELELVY

Query:  VVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDNDN
        V QL + WE +  QY +   +   +      Y+  A+ F+ F V++QR++ENEPF++GSR E YAR R A  KLLQ P ++  + K+ +K  G       
Subjt:  VVQLWVCWEFIVWQYKKALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDNDN

Query:  VKIDRLIEILQGSIRILWQFIRADK
        V  D LI++++ SI     F++ DK
Subjt:  VKIDRLIEILQGSIRILWQFIRADK

AT5G39785.1 Protein of unknown function (DUF1666) (e value: 6.4e-45; % identity: 38.79)
Query:  NEIGDSAEAISGSFDPKIEDSDEIES--EEEDFIKELQIETMKAKTKPGLPSIPEESE----YPTAMEEDVKPWKREE--SFDKKDVTKELHKFHKLYTE
        N   D++ + S S + + ED++  ES  E +D I++L++E  K K   GL +I EE E     P  M ED+KPW+ EE   F   D   E+HKFH+ Y E
Subjt:  NEIGDSAEAISGSFDPKIEDSDEIES--EEEDFIKELQIETMKAKTKPGLPSIPEESE----YPTAMEEDVKPWKREE--SFDKKDVTKELHKFHKLYTE

Query:  KMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG------------DAKTGE-------NRGIDGELELVYVVQLWVCWEFIVWQYKKAL
        +MRK D L+ QK+YA  L   +S     S L     G  P +               AK  E        + I GELE VYV Q+ + WE + WQY+KA+
Subjt:  KMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG------------DAKTGE-------NRGIDGELELVYVVQLWVCWEFIVWQYKKAL

Query:  EMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNK--KKGGMDDDNDNV-KIDRLIEILQGSIRI
        E+   + YGS RY+EVA +F+ F+V++QRF+ENEPF+E  RV+ Y + R   R LLQ+P+++ED  KD K  ++   +++ND V K D+L+EI++ +IR+
Subjt:  EMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNK--KKGGMDDDNDNV-KIDRLIEILQGSIRI

Query:  LWQFIRADKFA-----HISTLKCHIEAEQE
         W+F+R DK         S  K  IE + E
Subjt:  LWQFIRADKFA-----HISTLKCHIEAEQE

AT5G39785.2 Protein of unknown function (DUF1666) (e value: 6.4e-45; % identity: 38.79)
Query:  NEIGDSAEAISGSFDPKIEDSDEIES--EEEDFIKELQIETMKAKTKPGLPSIPEESE----YPTAMEEDVKPWKREE--SFDKKDVTKELHKFHKLYTE
        N   D++ + S S + + ED++  ES  E +D I++L++E  K K   GL +I EE E     P  M ED+KPW+ EE   F   D   E+HKFH+ Y E
Subjt:  NEIGDSAEAISGSFDPKIEDSDEIES--EEEDFIKELQIETMKAKTKPGLPSIPEESE----YPTAMEEDVKPWKREE--SFDKKDVTKELHKFHKLYTE

Query:  KMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG------------DAKTGE-------NRGIDGELELVYVVQLWVCWEFIVWQYKKAL
        +MRK D L+ QK+YA  L   +S     S L     G  P +               AK  E        + I GELE VYV Q+ + WE + WQY+KA+
Subjt:  KMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRG------------DAKTGE-------NRGIDGELELVYVVQLWVCWEFIVWQYKKAL

Query:  EMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNK--KKGGMDDDNDNV-KIDRLIEILQGSIRI
        E+   + YGS RY+EVA +F+ F+V++QRF+ENEPF+E  RV+ Y + R   R LLQ+P+++ED  KD K  ++   +++ND V K D+L+EI++ +IR+
Subjt:  EMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNK--KKGGMDDDNDNV-KIDRLIEILQGSIRI

Query:  LWQFIRADKFA-----HISTLKCHIEAEQE
         W+F+R DK         S  K  IE + E
Subjt:  LWQFIRADKFA-----HISTLKCHIEAEQE


Sequences
CDS sequence
ATGAAGTTTTTGTTCGAAACCCAGTTATCTTTCTCGGATAATGAAAATTTGTACAACATTTTTATCCTTCTCTTAAAAGCCATTTTGGTCGTGTTCTGTGCTCTGTTCCC
ACTCCTCATCGGATTCTTCAAATACTATAAACCTCGTGCCGGCAAAGAATCTCTGTTTGAACAAGATTTTCATGAAGCTTTGCAAGAAAGTGTTGATTCTGATGAAACTG
ATACACCCTTTTCAGCTCTAAGCTTCCGCTTTCCAACTTACGAGGAATTTTTGAGAACTACAGAGAATGTTGACCCACCCATATTTGAAACTTCAAATGAGAATGATTTT
GTTGAAGCGAATTGCTCGATTCCTTCAAGTCCTGTTCTGGATTTCGATCTTCATCACAGTGAAAACCCAGAGAAATTTCCCCAAAATGGTTTGAAATGCACTAACGGTAT
CAAAATCGAAGATCCAAGCATTGAATATGGTGATGATTTCAATGAAATTGGTGATTCTGCCGAAGCCATATCCGGGAGTTTTGATCCGAAAATTGAAGATTCAGATGAAA
TCGAATCGGAAGAAGAGGACTTCATAAAAGAACTGCAAATTGAGACTATGAAAGCCAAAACAAAACCCGGCCTGCCCTCGATTCCTGAAGAATCCGAGTACCCAACAGCC
ATGGAAGAAGACGTAAAGCCATGGAAGAGAGAAGAAAGCTTCGATAAAAAAGACGTAACGAAAGAACTCCACAAATTCCACAAGCTATACACAGAAAAGATGCGAAAATA
CGACACTTTAAATCACCAGAAAACATACGCAAAAGAGTTGAAGATGATGCAATCGAAGGACTCAGTGGACTCTGTTTTATCGAAAGGGTTGTGTGGGTGCAGGCCTGACA
AGAGAGGTGATGCTAAAACAGGGGAAAACAGAGGAATCGATGGGGAATTGGAGCTGGTTTATGTGGTGCAATTGTGGGTTTGTTGGGAGTTTATTGTGTGGCAGTACAAG
AAAGCGTTGGAGATGTGTGGCAGAGAAGGCTATGGAAGTTGCAGATACGATGAAGTTGCAGAGAAATTTGAGCATTTTAAAGTGATGATACAGAGATTTATGGAGAATGA
ACCGTTTGATGAAGGGTCGAGAGTTGAATGTTATGCTAGAACTCGACTTGCTAGGAGAAAGCTTCTGCAAGTTCCTCTGCTCAAAGAAGATGAAGTGAAAGACAACAAGA
AGAAAGGAGGAATGGACGATGACAACGACAACGTTAAAATTGATAGGCTAATTGAGATCTTGCAGGGATCCATAAGAATTTTATGGCAATTCATTCGAGCTGATAAATTT
GCCCACATATCAACTCTAAAGTGTCATATAGAAGCAGAACAAGAATGTAGAAGTCCAGCACATTCCAAGATTCTAAAGCAAATTCTTTTAGATCTCCAAAAAGTTGGTTC
TGTTTAA
mRNA sequence
ATGAAGTTTTTGTTCGAAACCCAGTTATCTTTCTCGGATAATGAAAATTTGTACAACATTTTTATCCTTCTCTTAAAAGCCATTTTGGTCGTGTTCTGTGCTCTGTTCCC
ACTCCTCATCGGATTCTTCAAATACTATAAACCTCGTGCCGGCAAAGAATCTCTGTTTGAACAAGATTTTCATGAAGCTTTGCAAGAAAGTGTTGATTCTGATGAAACTG
ATACACCCTTTTCAGCTCTAAGCTTCCGCTTTCCAACTTACGAGGAATTTTTGAGAACTACAGAGAATGTTGACCCACCCATATTTGAAACTTCAAATGAGAATGATTTT
GTTGAAGCGAATTGCTCGATTCCTTCAAGTCCTGTTCTGGATTTCGATCTTCATCACAGTGAAAACCCAGAGAAATTTCCCCAAAATGGTTTGAAATGCACTAACGGTAT
CAAAATCGAAGATCCAAGCATTGAATATGGTGATGATTTCAATGAAATTGGTGATTCTGCCGAAGCCATATCCGGGAGTTTTGATCCGAAAATTGAAGATTCAGATGAAA
TCGAATCGGAAGAAGAGGACTTCATAAAAGAACTGCAAATTGAGACTATGAAAGCCAAAACAAAACCCGGCCTGCCCTCGATTCCTGAAGAATCCGAGTACCCAACAGCC
ATGGAAGAAGACGTAAAGCCATGGAAGAGAGAAGAAAGCTTCGATAAAAAAGACGTAACGAAAGAACTCCACAAATTCCACAAGCTATACACAGAAAAGATGCGAAAATA
CGACACTTTAAATCACCAGAAAACATACGCAAAAGAGTTGAAGATGATGCAATCGAAGGACTCAGTGGACTCTGTTTTATCGAAAGGGTTGTGTGGGTGCAGGCCTGACA
AGAGAGGTGATGCTAAAACAGGGGAAAACAGAGGAATCGATGGGGAATTGGAGCTGGTTTATGTGGTGCAATTGTGGGTTTGTTGGGAGTTTATTGTGTGGCAGTACAAG
AAAGCGTTGGAGATGTGTGGCAGAGAAGGCTATGGAAGTTGCAGATACGATGAAGTTGCAGAGAAATTTGAGCATTTTAAAGTGATGATACAGAGATTTATGGAGAATGA
ACCGTTTGATGAAGGGTCGAGAGTTGAATGTTATGCTAGAACTCGACTTGCTAGGAGAAAGCTTCTGCAAGTTCCTCTGCTCAAAGAAGATGAAGTGAAAGACAACAAGA
AGAAAGGAGGAATGGACGATGACAACGACAACGTTAAAATTGATAGGCTAATTGAGATCTTGCAGGGATCCATAAGAATTTTATGGCAATTCATTCGAGCTGATAAATTT
GCCCACATATCAACTCTAAAGTGTCATATAGAAGCAGAACAAGAATGTAGAAGTCCAGCACATTCCAAGATTCTAAAGCAAATTCTTTTAGATCTCCAAAAAGTTGGTTC
TGTTTAA
Protein sequence
MKFLFETQLSFSDNENLYNIFILLLKAILVVFCALFPLLIGFFKYYKPRAGKESLFEQDFHEALQESVDSDETDTPFSALSFRFPTYEEFLRTTENVDPPIFETSNENDF
VEANCSIPSSPVLDFDLHHSENPEKFPQNGLKCTNGIKIEDPSIEYGDDFNEIGDSAEAISGSFDPKIEDSDEIESEEEDFIKELQIETMKAKTKPGLPSIPEESEYPTA
MEEDVKPWKREESFDKKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRGDAKTGENRGIDGELELVYVVQLWVCWEFIVWQYK
KALEMCGREGYGSCRYDEVAEKFEHFKVMIQRFMENEPFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGMDDDNDNVKIDRLIEILQGSIRILWQFIRADKF
AHISTLKCHIEAEQECRSPAHSKILKQILLDLQKVGSV