; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039622 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039622
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationscaffold10:44144052..44146135
RNA-Seq ExpressionSpg039622
SyntenySpg039622
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033168.1 hypothetical protein SDJN02_07222, partial [Cucurbita argyrosperma subsp. argyrosperma]9.8e-13661.64Show/hide
Query:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN
        FS N N  ++ +LLLK +L++FCALF LL+ FFKYY     K+ LF        Q++VD D   +TPFS+LSF FPTYE+FLR TT+NVDPP++ETSNEN
Subjt:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN

Query:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---
        DF +++ S   +   D DL P E+PE F  NGL  +ND++     I+D ++FDE     EAISGSFD             EDF++ELK E  KAKAK   
Subjt:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---

Query:  -SGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENR
         S LPSIPEESEYPIAMEED K  K+EE  ++K + KEL +FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+ +AKTGENR
Subjt:  -SGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENR

Query:  GIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKK
        GIDGELE+VYVVQ+WVSWEF+ WQYKKALE+ G +GYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ RRKLLQVPLLKEDE KDNKK
Subjt:  GIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKK

Query:  KGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
         G       +++E  VK+DR+IE+LQE IR+ WQFIRADKLAHI ST K H IE +QE  SPA+S IL QILLDL K
Subjt:  KGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

XP_022959969.1 uncharacterized protein LOC111460861 [Cucurbita moschata]5.7e-13661.22Show/hide
Query:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN
        FS N N  ++L+LLLK +L++FCALFPLL+ FFKYY     K+ LF        Q++VD D   +TPFS+LSF FPTYE+FLR TT+NVDPP++ETSNEN
Subjt:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN

Query:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---
        DF +++ S   +   D DL P E+PE F  NGL  +ND+      I+D ++FDE     EAISGSFD             +DF++ELK E  K KAK   
Subjt:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---

Query:  -SGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENR
         SGLPSIPEESEYPIAMEED K  K+EE  ++K + KEL +FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+ +AKTGENR
Subjt:  -SGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENR

Query:  GIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKK
        G++GELE+VYVVQ+WVSWEF+ WQYKKALE+ G +GYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ +RKLLQVPLLKEDE KDNKK
Subjt:  GIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKK

Query:  KGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
         G       +++E  VK+DR+IE+LQE IR+ WQFIRADKLAHI ST K H IE +QE  SPA+S IL QILLDL K
Subjt:  KGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

XP_023544885.1 uncharacterized protein LOC111804322 [Cucurbita pepo subsp. pepo]1.4e-13761.51Show/hide
Query:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN
        FS N N  ++ +LLLK +L++FCALFPLL+ FFKYY     ++ LF        QE+VDSD   +TPFS+LSF FPTYE+FLR TT+NVDPP++ETSNEN
Subjt:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN

Query:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---
        DF +++ S   +   D DL P E+PE F  NGL C+NDI+I+D         +E  D  EAISG FD             +DF++ELK E  KAKAK   
Subjt:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---

Query:  -SGLPSIPEESEYPIAMEEDVKPWKR-EESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGEN
         SGLPSIPEESEYPIAMEED K  K+ EE  + K + KEL +FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+ +AKTGEN
Subjt:  -SGLPSIPEESEYPIAMEEDVKPWKR-EESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGEN

Query:  RGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNK
        RG++GELE+VYVVQ+WVSWEF+ WQYKK LE+ GR+GYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ RRKLLQVPLLKEDE KDNK
Subjt:  RGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNK

Query:  KKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
        K G  ++      E  VK+DR+IE+LQE IR+ WQFIRADKLAHI ST K H +E +QE  SPA+S IL QILLDLQK
Subjt:  KKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

XP_038889168.1 uncharacterized protein LOC120079052 isoform X1 [Benincasa hispida]3.0e-16967.15Show/hide
Query:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP
        MK L   QFSF  NE+LYNIL++L KA+LV+FCALFPLL+G+F+Y+K    K+ LFEE+  EALQE+ + +E +TPFSALSF FPTYEEFL   ENVD  
Subjt:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETM
          +TSNE+DF+E +CS+PS+P  +F LHP E  E    NGL+CTND++     IED + FDE  DS EA       KIEDS EIESE+++F+K L+I+ M
Subjt:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETM

Query:  K----AKAKSGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA
        K    AKAKS LPSIPEESEYPI  E D+KPWK+EESF+ +D  KEL  FHKLY EKMRKYDTLNHQ TYAKEL+ MQSK+SV+SVLS+G CGC+PDK  
Subjt:  K----AKAKSGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA

Query:  DAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLK-
          KTGE RGID ELELVYV+QLWVSWEF+ WQYKK LE+ GR+GYG C ++EVAEKFEHF+VMIQRFMENESFDEGSRVECY R RLARRKLLQVPL+K 
Subjt:  DAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLK-

Query:  ---EDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHISTLKCHIEAEQECRSPAHSKILKQILLDLQK
           EDEVK+NK KGG ++D   DKENAVKIDRLI++ QESIRILWQFI ADKL HISTLKC +EA+QEC SP+HSKI  QILL LQK
Subjt:  ---EDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHISTLKCHIEAEQECRSPAHSKILKQILLDLQK

XP_038889169.1 uncharacterized protein LOC120079052 isoform X2 [Benincasa hispida]1.5e-13662.62Show/hide
Query:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP
        MK L   QFSF  NE+LYNIL++L KA+LV+FCALFPLL+G+F+Y+K    K+ LFEE+  EALQE+ + +E +TPFSALSF FPTYEEFL   ENVD  
Subjt:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETM
          +TSNE+DF+E +CS+PS+P  +F LHP E  E    NGL+CTND++     IED + FDE  DS EA       KIEDS EIESE+++F+K L+I+ M
Subjt:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETM

Query:  K----AKAKSGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA
        K    AKAKS LPSIPEESEYPI  E D+KPWK+EESF+ +D  KEL  FHKLY EKMRKYDTLNHQ TYAKEL+ MQSK+SV+SVLS+G CGC+PDK  
Subjt:  K----AKAKSGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA

Query:  DAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKE
          KTGE RGID ELELVYV+QLWVSWEF+ WQYKK LE+ GR+GYG C ++EVAEKFEHF+VMIQRFMENESFDEGSRVECY R RLARRKLLQVPL+K 
Subjt:  DAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKE

Query:  DEVKDNKKKGGIDDDDDKDKENAVKIDR
           +++ ++  I++D  + K +  ++ R
Subjt:  DEVKDNKKKGGIDDDDDKDKENAVKIDR

TrEMBL top hitse value%identityAlignment
A0A0A0KS72 Uncharacterized protein8.7e-13051.39Show/hide
Query:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP
        MK L   QF F +NE+  N+ +++ K +L+  C LFP  + +FKY+K                     D  + + PFSAL+F FPTYEEFL+T ENVD  
Subjt:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSI----------EDGDDFDEI---------------------------
            SNE DF+E +CS+PSS   DF LH  E  E    N L+CTND+ IED  +          E+  + DE+                           
Subjt:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSI----------EDGDDFDEI---------------------------

Query:  ----------GDSAEAIS--------GSFDPKIEDS---DEIESEEEDFLKELKIETMKAKAKSGLPSIPEESEYPIAM-EEDVKPW-KREESFDQKDVT
                   + +E +S        G F   I+ +    E +SE++DF+K L+I+ MKAKAKS LPSIPEE++Y I   E D+KPW K++ESF+ +D+T
Subjt:  ----------GDSAEAIS--------GSFDPKIEDS---DEIESEEEDFLKELKIETMKAKAKSGLPSIPEESEYPIAM-EEDVKPW-KREESFDQKDVT

Query:  KELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDG
        KEL +FHK YTEKMRKYD LN QKTYAKELKMMQSK+SV+SV +KG C C+ +K    KT E++GIDGE+E+VYVVQLWVSWEF+ W+YKKALE+ GR+ 
Subjt:  KELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDG

Query:  YGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFI
        YGSCR++EVAEKFEHFKVMIQRFMENE  +EGSRVECY ++RL RRK LQVPLLKEDEVK+    GG  +D   +KENAV IDRLI++LQESIRILWQFI
Subjt:  YGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFI

Query:  RADKLAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQKV
        R DKL HIST L CH+E +QE  SP+HS +  Q+L+DLQKV
Subjt:  RADKLAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQKV

A0A1S4E2E7 uncharacterized protein LOC1034984804.4e-13453.14Show/hide
Query:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP
        MK L  IQF F DNE  +N+ + +LK +L+  C LFP  + +FKY+K    K+ LFE+               D PFS L+F FPTYEEFL+T ENVD  
Subjt:  MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPP

Query:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIED-----------------------------------------------PS
            SNE D +E +CS+PSSP  DF LH  E  E F  N L+CTND+ IED                                               PS
Subjt:  IFETSNENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIED-----------------------------------------------PS

Query:  IEDGDDFDEIGDSAEAISGSFDPKIEDSDEIE---------SEEEDFLKELKIETMK----AKAKSGLPSIPEESEYPIAM-EEDVKPW-KREESFDQKD
        I   D F    + +E +S +      D D IE         SE++D +K L+I+ MK    AKAKS LPSIPEE++Y I   E D+KPW K++ESF+ +D
Subjt:  IEDGDDFDEIGDSAEAISGSFDPKIEDSDEIE---------SEEEDFLKELKIETMK----AKAKSGLPSIPEESEYPIAM-EEDVKPW-KREESFDQKD

Query:  VTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGR
        +TKEL KFHK YTEKMRKYD LN QKTYAKELKMMQSK+SV+SVL+KG C C+ +K++    G  R IDGE+E+VYVVQLWVSWEF+ W+YKKALE+ GR
Subjt:  VTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGR

Query:  DGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQ
        +GYGSCR++ VAEKFEHFKVMI+RFMENE  +EGSRVECY R+RL RRKLLQVPLLKEDEVK+     G ++D   +KENAV IDRLI +LQESIRILWQ
Subjt:  DGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQ

Query:  FIRADKLAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQK
        FIR DKL HIST LKCH+EA+QE  SP+H  I  Q+LLDLQK
Subjt:  FIRADKLAHIST-LKCHIEAEQECRSPAHSKILKQILLDLQK

A0A6J1BWI3 uncharacterized protein LOC1110061621.5e-11056.67Show/hide
Query:  ILVVFCALFPLLIGFFKYYK-------------PHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPPIFETSNENDFVEAN
        +LV+F ALF   + F +Y+K                GKE + EED  E LQ++ D +E +TPFSALSF FPT+EEF+ T ENVD  I +TSNEN      
Subjt:  ILVVFCALFPLLIGFFKYYK-------------PHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPPIFETSNENDFVEAN

Query:  CSVPSSPVLDFDL-HPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAKS-GLPSIP
         SVPS P  DF+L HP E+PE FP+N LNCT+D   E+  I+            E      D KIEDSD+  SEEE+ ++ L+    K K    GLP IP
Subjt:  CSVPSSPVLDFDL-HPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAKS-GLPSIP

Query:  EESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELEL
        EESEY I MEED KPW+ +E+FD ++ TKEL K HK+Y E+M+KYDTLNHQK YAK+LKMMQSKD + SVLSK LC CRP K+ D   G+ R IDG+LE+
Subjt:  EESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELEL

Query:  VYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDD
        VYV QLW SWEF+  QYKKALE+C R   GSCR++EVA KF+HF+V+IQRFMENE+F+EGSRVECYAR RL RRKLLQVP++KEDEVK+N KK  I D  
Subjt:  VYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDD

Query:  DKDKENAVKIDRLIEMLQES
        D D E+A+KIDR IE+LQES
Subjt:  DKDKENAVKIDRLIEMLQES

A0A6J1H9K8 uncharacterized protein LOC1114608612.8e-13661.22Show/hide
Query:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN
        FS N N  ++L+LLLK +L++FCALFPLL+ FFKYY     K+ LF        Q++VD D   +TPFS+LSF FPTYE+FLR TT+NVDPP++ETSNEN
Subjt:  FSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDE-TDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSNEN

Query:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---
        DF +++ S   +   D DL P E+PE F  NGL  +ND+      I+D ++FDE     EAISGSFD             +DF++ELK E  K KAK   
Subjt:  DFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK---

Query:  -SGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENR
         SGLPSIPEESEYPIAMEED K  K+EE  ++K + KEL +FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRPDK+ +AKTGENR
Subjt:  -SGLPSIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENR

Query:  GIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKK
        G++GELE+VYVVQ+WVSWEF+ WQYKKALE+ G +GYGS R++EVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ +RKLLQVPLLKEDE KDNKK
Subjt:  GIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKK

Query:  KGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
         G       +++E  VK+DR+IE+LQE IR+ WQFIRADKLAHI ST K H IE +QE  SPA+S IL QILLDL K
Subjt:  KGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

A0A6J1JR14 uncharacterized protein LOC1114875632.0e-13460.42Show/hide
Query:  QFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSN
        +  FS N N  ++  LLLK +L++FCALFPLL+ FFKYY     K+ LF +   E            TPFS+LSF FPTYE+FLR TT+NV+PP + TSN
Subjt:  QFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLR-TTENVDPPIFETSN

Query:  ENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK-
        ENDF +++ S   +   D DL P E+PE F  NGL C+NDI+I+D         +E  D  EAISGSFD             +DF++ELK E MKAKAK 
Subjt:  ENDFVEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAK-

Query:  ---SGLPSIPEESEYPIAMEEDVKPWKREES-FDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTG
           SGLPSIPEESEYPIAMEED K  K+EE   ++K + KEL +FHK YTEKMRKYDTLNHQ   AKE KM QSK  V+SV SKGLCGCRP K+ +AKT 
Subjt:  ---SGLPSIPEESEYPIAMEEDVKPWKREES-FDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTG

Query:  ENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKD
        ENRGI+GELE+VYVVQ+WVSWEF+ WQYKKALE+ GR+GYGS R++EVAEKFEHFKV IQRFME E  +EGSRVE YAR+R+ RRKLLQVPLL+EDE KD
Subjt:  ENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKD

Query:  NKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK
         KK G I+++D       VK+DR+IE+LQE IR+ WQFIRADKLAHI ST K H IE +QE  SPA+S IL QILLDLQK
Subjt:  NKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI-STLKCH-IEAEQECRSPAHSKILKQILLDLQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)8.5e-2932.93Show/hide
Query:  DSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAKSGLPSIPEESEYPIAMEEDVKPWKREESFDQ-KDVTKELLKFHKLYTEKMRKYDTLNHQ
        D  E +   FD    DSD+ E E  D +++LK E   A+   GL +I EESE P+   +++KP K E   DQ KD   E+ K +K Y  KMRK D ++ Q
Subjt:  DSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAKSGLPSIPEESEYPIAMEEDVKPWKREESFDQ-KDVTKELLKFHKLYTEKMRKYDTLNHQ

Query:  KTYAKELKMMQ--SKDSVDS-------------VLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDE
          ++  L  ++  SK S ++                K    C P +R   +         + E VYV Q+ +SWE + WQY K LE   +    + +Y+ 
Subjt:  KTYAKELKMMQ--SKDSVDS-------------VLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYKKALEMCGRDGYGSCRYDE

Query:  VAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI
        VA +F+ F+V++QRF+ENE F   SRVE Y + R   +  LQ+PL+++D  + +KKK   +       E AVK + L E+++ES+ + W+F+ ADK    
Subjt:  VAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADKLAHI

Query:  STLKCHIEAEQECRSPAHSKILKQILLDLQK
        S +K   + +   +     ++L  I   LQK
Subjt:  STLKCHIEAEQECRSPAHSKILKQILLDLQK

AT1G73850.1 Protein of unknown function (DUF1666)1.2e-0924.07Show/hide
Query:  DDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIE--TMKAKAKSGLP---SIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTE
        +D DEI  S+      FD +   ++E+E EEE+   ++  E  T  + +KS      S+  +  +  +       W+    F + D  +E+    ++  +
Subjt:  DDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIE--TMKAKAKSGLP---SIPEESEYPIAMEEDVKPWKREESFDQKDVTKELLKFHKLYTE

Query:  KMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYK---KALEMCGRDGYGSCRYDEV
        K+ + ++L       + +    S+  V  + S G    +  K+     G       ELE  YV Q+ ++WE + W YK   +      R          +
Subjt:  KMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYK---KALEMCGRDGYGSCRYDEV

Query:  AEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADK
        A++F  F +++QR++ENE ++ G R E YAR R    KLL VP  ++ E ++ K     +D++++   + +     + +++E IR    F++ADK
Subjt:  AEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFIRADK

AT3G20260.1 Protein of unknown function (DUF1666)3.1e-1527.24Show/hide
Query:  KIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDS---DEIESEEEDFL-KELKIETMKAKAKSGLPSIP---EESEYPIAMEEDVKPWKREESFDQKDVT
        K   P++E   +   +    +A S + +  + DS   DEIE +++DF+  E+K    + +  S +  IP   EE E    ++ED    + + S + +DV 
Subjt:  KIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDS---DEIESEEEDFL-KELKIETMKAKAKSGLPSIP---EESEYPIAMEEDVKPWKREESFDQKDVT

Query:  KELLKF-------HKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVD-SVLSKGLCG---CRPDKRADAKTGENRGID--------GELELVYVVQLWVS
         E L++       ++ Y E+M  +D L+ Q+     + +  S  +      SK L     C   K+ D    +   +          +LE  YV QL ++
Subjt:  KELLKF-------HKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVD-SVLSKGLCG---CRPDKRADAKTGENRGID--------GELELVYVVQLWVS

Query:  WEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVK
        WE +  QY +   +          Y+  A+ F+ F V++QR++ENE F++GSR E YAR R A  KLLQ P ++  + K+ +K  G            V 
Subjt:  WEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVK

Query:  IDRLIEMLQESIRILWQFIRADK
         D LI++++ SI     F++ DK
Subjt:  IDRLIEMLQESIRILWQFIRADK

AT5G39785.1 Protein of unknown function (DUF1666)1.4e-4437.93Show/hide
Query:  NDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIES--EEEDFLKELKIETMKAKAKSGLPSIPEESE----YPIAMEEDVKPWKREE--SFDQ
        +D    + S++ G   +   D++ + S S + + ED++  ES  E +D +++LK+E  K KA  GL +I EE E     P  M ED+KPW+ EE   F  
Subjt:  NDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIES--EEEDFLKELKIETMKAKAKSGLPSIPEESE----YPIAMEEDVKPWKREE--SFDQ

Query:  KDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA------------DAKTGE-------NRGIDGELELVYVVQ
         D   E+ KFH+ Y E+MRK D L+ QK+YA  L   +S     S L     G  P + +             AK  E        + I GELE VYV Q
Subjt:  KDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA------------DAKTGE-------NRGIDGELELVYVVQ

Query:  LWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKE
        + +SWE + WQY+KA+E+   D YGS RY+EVA +F+ F+V++QRF+ENE F+E  RV+ Y + R   R LLQ+P+++ED  KD  KK G   D +++ +
Subjt:  LWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKE

Query:  NAVKIDRLIEMLQESIRILWQFIRADKLA-----HISTLKCHIEAEQE
          +K D+L+E+++E+IR+ W+F+R DKL        S  K  IE + E
Subjt:  NAVKIDRLIEMLQESIRILWQFIRADKLA-----HISTLKCHIEAEQE

AT5G39785.2 Protein of unknown function (DUF1666)1.4e-4437.93Show/hide
Query:  NDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIES--EEEDFLKELKIETMKAKAKSGLPSIPEESE----YPIAMEEDVKPWKREE--SFDQ
        +D    + S++ G   +   D++ + S S + + ED++  ES  E +D +++LK+E  K KA  GL +I EE E     P  M ED+KPW+ EE   F  
Subjt:  NDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIES--EEEDFLKELKIETMKAKAKSGLPSIPEESE----YPIAMEEDVKPWKREE--SFDQ

Query:  KDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA------------DAKTGE-------NRGIDGELELVYVVQ
         D   E+ KFH+ Y E+MRK D L+ QK+YA  L   +S     S L     G  P + +             AK  E        + I GELE VYV Q
Subjt:  KDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRA------------DAKTGE-------NRGIDGELELVYVVQ

Query:  LWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKE
        + +SWE + WQY+KA+E+   D YGS RY+EVA +F+ F+V++QRF+ENE F+E  RV+ Y + R   R LLQ+P+++ED  KD  KK G   D +++ +
Subjt:  LWVSWEFVGWQYKKALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKE

Query:  NAVKIDRLIEMLQESIRILWQFIRADKLA-----HISTLKCHIEAEQE
          +K D+L+E+++E+IR+ W+F+R DKL        S  K  IE + E
Subjt:  NAVKIDRLIEMLQESIRILWQFIRADKLA-----HISTLKCHIEAEQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTTTTGTTCGAAATCCAGTTTTCTTTCTCGGATAATGAAAATTTGTACAACATTTTGATCCTTCTCTTAAAAGCCATCTTGGTCGTGTTCTGTGCTCTGTTCCC
ACTCCTCATCGGATTCTTCAAATACTATAAACCTCATGCCGGCAAAGAATCTCTGTTTGAAGAAGATTTTCATGAAGCTTTGCAAGAAAATGTTGATTCTGATGAAACTG
ATACACCCTTTTCAGCTCTAAGCTTCCACTTTCCAACTTACGAGGAATTCTTGAGAACTACAGAGAATGTTGACCCACCCATATTTGAAACTTCAAATGAGAATGATTTT
GTTGAAGCGAATTGCTCGGTTCCTTCAAGTCCTGTTCTGGATTTCGATCTTCATCCCAGTGAAAACCCAGAGAAATTTCCCCGAAATGGTTTGAATTGCACTAATGATAT
AAAAATCGAAGATCCAAGCATTGAAGATGGTGATGATTTTGATGAAATTGGTGATTCTGCCGAAGCCATATCCGGGAGTTTTGATCCGAAAATTGAAGATTCAGATGAAA
TCGAATCGGAAGAAGAGGACTTCCTAAAAGAACTGAAAATTGAGACTATGAAAGCCAAAGCAAAATCCGGCCTGCCCTCGATTCCTGAAGAATCCGAGTACCCAATAGCC
ATGGAAGAAGACGTAAAGCCATGGAAGAGAGAAGAAAGCTTCGATCAAAAAGACGTAACAAAAGAACTCCTCAAATTCCACAAGCTATACACAGAAAAGATGAGAAAATA
CGACACTTTAAATCACCAGAAAACATACGCAAAAGAGTTAAAGATGATGCAATCGAAGGACTCAGTGGACTCTGTTTTATCGAAAGGGTTGTGTGGGTGCAGGCCTGACA
AGAGAGCTGATGCTAAAACAGGGGAAAACAGAGGAATCGATGGGGAATTGGAGCTGGTTTATGTGGTGCAATTGTGGGTTTCTTGGGAGTTTGTTGGGTGGCAGTACAAG
AAAGCGTTGGAGATGTGTGGCAGAGATGGCTATGGAAGTTGCAGATACGATGAAGTTGCAGAGAAATTTGAGCATTTTAAAGTGATGATACAGAGATTTATGGAGAATGA
ATCGTTTGATGAAGGGTCGAGAGTGGAATGTTATGCTAGAACTCGACTTGCTAGGAGAAAGCTTCTGCAAGTTCCTCTTCTCAAAGAAGATGAAGTGAAAGACAACAAGA
AGAAAGGAGGAATTGACGATGACGATGACAAAGACAAGGAAAATGCCGTTAAAATTGATAGGCTAATTGAGATGTTGCAGGAATCCATAAGAATTTTATGGCAATTCATT
CGAGCTGATAAATTAGCCCATATATCAACTCTAAAGTGTCATATAGAAGCAGAACAAGAATGTAGAAGTCCAGCACATTCCAAGATTCTAAAGCAAATTCTTTTAGATCT
CCAAAAAGTTGGTTCTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTTTTGTTCGAAATCCAGTTTTCTTTCTCGGATAATGAAAATTTGTACAACATTTTGATCCTTCTCTTAAAAGCCATCTTGGTCGTGTTCTGTGCTCTGTTCCC
ACTCCTCATCGGATTCTTCAAATACTATAAACCTCATGCCGGCAAAGAATCTCTGTTTGAAGAAGATTTTCATGAAGCTTTGCAAGAAAATGTTGATTCTGATGAAACTG
ATACACCCTTTTCAGCTCTAAGCTTCCACTTTCCAACTTACGAGGAATTCTTGAGAACTACAGAGAATGTTGACCCACCCATATTTGAAACTTCAAATGAGAATGATTTT
GTTGAAGCGAATTGCTCGGTTCCTTCAAGTCCTGTTCTGGATTTCGATCTTCATCCCAGTGAAAACCCAGAGAAATTTCCCCGAAATGGTTTGAATTGCACTAATGATAT
AAAAATCGAAGATCCAAGCATTGAAGATGGTGATGATTTTGATGAAATTGGTGATTCTGCCGAAGCCATATCCGGGAGTTTTGATCCGAAAATTGAAGATTCAGATGAAA
TCGAATCGGAAGAAGAGGACTTCCTAAAAGAACTGAAAATTGAGACTATGAAAGCCAAAGCAAAATCCGGCCTGCCCTCGATTCCTGAAGAATCCGAGTACCCAATAGCC
ATGGAAGAAGACGTAAAGCCATGGAAGAGAGAAGAAAGCTTCGATCAAAAAGACGTAACAAAAGAACTCCTCAAATTCCACAAGCTATACACAGAAAAGATGAGAAAATA
CGACACTTTAAATCACCAGAAAACATACGCAAAAGAGTTAAAGATGATGCAATCGAAGGACTCAGTGGACTCTGTTTTATCGAAAGGGTTGTGTGGGTGCAGGCCTGACA
AGAGAGCTGATGCTAAAACAGGGGAAAACAGAGGAATCGATGGGGAATTGGAGCTGGTTTATGTGGTGCAATTGTGGGTTTCTTGGGAGTTTGTTGGGTGGCAGTACAAG
AAAGCGTTGGAGATGTGTGGCAGAGATGGCTATGGAAGTTGCAGATACGATGAAGTTGCAGAGAAATTTGAGCATTTTAAAGTGATGATACAGAGATTTATGGAGAATGA
ATCGTTTGATGAAGGGTCGAGAGTGGAATGTTATGCTAGAACTCGACTTGCTAGGAGAAAGCTTCTGCAAGTTCCTCTTCTCAAAGAAGATGAAGTGAAAGACAACAAGA
AGAAAGGAGGAATTGACGATGACGATGACAAAGACAAGGAAAATGCCGTTAAAATTGATAGGCTAATTGAGATGTTGCAGGAATCCATAAGAATTTTATGGCAATTCATT
CGAGCTGATAAATTAGCCCATATATCAACTCTAAAGTGTCATATAGAAGCAGAACAAGAATGTAGAAGTCCAGCACATTCCAAGATTCTAAAGCAAATTCTTTTAGATCT
CCAAAAAGTTGGTTCTGTTTAA
Protein sequenceShow/hide protein sequence
MKFLFEIQFSFSDNENLYNILILLLKAILVVFCALFPLLIGFFKYYKPHAGKESLFEEDFHEALQENVDSDETDTPFSALSFHFPTYEEFLRTTENVDPPIFETSNENDF
VEANCSVPSSPVLDFDLHPSENPEKFPRNGLNCTNDIKIEDPSIEDGDDFDEIGDSAEAISGSFDPKIEDSDEIESEEEDFLKELKIETMKAKAKSGLPSIPEESEYPIA
MEEDVKPWKREESFDQKDVTKELLKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKDSVDSVLSKGLCGCRPDKRADAKTGENRGIDGELELVYVVQLWVSWEFVGWQYK
KALEMCGRDGYGSCRYDEVAEKFEHFKVMIQRFMENESFDEGSRVECYARTRLARRKLLQVPLLKEDEVKDNKKKGGIDDDDDKDKENAVKIDRLIEMLQESIRILWQFI
RADKLAHISTLKCHIEAEQECRSPAHSKILKQILLDLQKVGSV