; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018087 (gene) of Snake gourd v1 genome

Gene IDTan0018087
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationLG11:4424387..4432403
RNA-Seq ExpressionTan0018087
SyntenyTan0018087
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033168.1 hypothetical protein SDJN02_07222, partial [Cucurbita argyrosperma subsp. argyrosperma]4.9e-14364.68Show/hide
Query:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV
        +G  NDL VLL K LL+LFCAL  LL+ FFKYY    NK+PLF        Q+ +D   IEETPFSSLSF+FPTY +FLR TT+NVD  + +TSNENDF 
Subjt:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV

Query:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSK--SWLPS
        +   S  ST D D DL PREDPEN LPNGL  +ND+R     I+DD+EFD+  EAISGSFD             +DFI+EL+NEI +AKAK+K  S LPS
Subjt:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSK--SWLPS

Query:  IPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEF
        IPEESEYPIAMEED K  KKEE  N+K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSKG V+S  SKGLCGCRPDK+ + KTGE R I+GE 
Subjt:  IPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEF

Query:  ELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDD
        E+VYVVQM VSWEFIVWQYKKALE+ G EGYGS RFNEVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ RRKLLQVPLLKED  KDNKKTG    
Subjt:  ELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDD

Query:  HNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK
           ++E+ +K+DR+IE LQE IRVFWQFIR DKLAHIS          +QER+SPA S IL QILLDL K
Subjt:  HNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK

XP_022959969.1 uncharacterized protein LOC111460861 [Cucurbita moschata]4.2e-14264.26Show/hide
Query:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV
        +G  ND+ VLL K LL+LFCAL PLL+ FFKYY    NK+PLF        Q+ +D   IEETPFSSLSF+FPTY +FLR TT+NVD  + +TSNENDF 
Subjt:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV

Query:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEI--MEAKAKSKSWLPS
        +   S  ST D D DL PREDPE  LPNGL  +ND+      I+DD+EFD+  EAISGSFDQ             DFI+EL+NEI  ++AKAKS+S LPS
Subjt:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEI--MEAKAKSKSWLPS

Query:  IPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEF
        IPEESEYPIAMEED K  KKEE  N+K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSKG V+S  SKGLCGCRPDK+ + KTGE R +EGE 
Subjt:  IPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEF

Query:  ELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDD
        E+VYVVQM VSWEFIVWQYKKALE+ G EGYGS RFNEVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ +RKLLQVPLLKED  KDNKKTG    
Subjt:  ELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDD

Query:  HNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK
           ++E+ +K+DR+IE LQE IRVFWQFIR DKLAHIS          +QER+SPA S IL QILLDL K
Subjt:  HNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK

XP_023544885.1 uncharacterized protein LOC111804322 [Cucurbita pepo subsp. pepo]1.1e-14565.82Show/hide
Query:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV
        +G  NDLSVLL K LL+LFCAL PLL+ FFKYY    N++PLF        QE +D   IEETPFSSLSF+FPTY +FLR TT+NVD  + +TSNENDF 
Subjt:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV

Query:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSK--SWLPS
        +   S  ST D D DL PREDPE  LPNGL C+NDIR     I+DD+EFDD  EAISG FDQ             DFI+EL+NEI +AKAK+K  S LPS
Subjt:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSK--SWLPS

Query:  IPEESEYPIAMEEDLKPWKK-EESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGE
        IPEESEYPIAMEED K  KK EE  N K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSKG V+S  SKGLCGCRPDK+ + KTGE R +EGE
Subjt:  IPEESEYPIAMEEDLKPWKK-EESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGE

Query:  FELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMD
         E+VYVVQM VSWEFIVWQYKK LE+ GREGYGS RFNEVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ RRKLLQVPLLKED  KDNKKTG   
Subjt:  FELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMD

Query:  DHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK
            ++E+ +K+DR+IE LQE IRVFWQFIR DKLAHIS          +QERTSPA S IL QILLDLQK
Subjt:  DHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK

XP_038889168.1 uncharacterized protein LOC120079052 isoform X1 [Benincasa hispida]7.1e-16668.8Show/hide
Query:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS
        MK L  TQFSF  N  L ++ V+L KALLVLFCAL PLL+G+F+Y+KTS +K+PLFEE+C+EALQES +  EI ETPFS+LSFRFPTY EFL   ENVDS
Subjt:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS

Query:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEA
        V SDTSNE+DF+EE+CSIPS PDP+F L P+E  EN  PNGL CTND+R ED      DEFDDS EA       KIEDS E ESE+D+FIK LQ +IM+A
Subjt:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEA

Query:  K--AKSKSWLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDG
        K  AK+KS LPSIPEESEYPI  E DLKPWKKEESFN +D  KELH FHKLY EKMRKYDTLNHQ TYAKEL+ MQSK SV+S LS+G CGC+PDK    
Subjt:  K--AKSKSWLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDG

Query:  KTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLK---
        KTGETR I+ E ELVYV+Q+ VSWEFIVWQYKK LE+ GREGYG C FNEVAEKFEHF+VMIQRFMENESFDEGSRVECY RNRLARRKLLQVPL+K   
Subjt:  KTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLK---

Query:  -EDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS--------EQERTSPARSKILKQILLDLQK
         ED VK+NK  GG ++  +DKE+A+KIDRLI+  QESIR+ WQFI  DKL HIS        +QE TSP+ SKI  QILL LQK
Subjt:  -EDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS--------EQERTSPARSKILKQILLDLQK

XP_038889169.1 uncharacterized protein LOC120079052 isoform X2 [Benincasa hispida]3.4e-14471.43Show/hide
Query:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS
        MK L  TQFSF  N  L ++ V+L KALLVLFCAL PLL+G+F+Y+KTS +K+PLFEE+C+EALQES +  EI ETPFS+LSFRFPTY EFL   ENVDS
Subjt:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS

Query:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEA
        V SDTSNE+DF+EE+CSIPS PDP+F L P+E  EN  PNGL CTND+R ED      DEFDDS EA       KIEDS E ESE+D+FIK LQ +IM+A
Subjt:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEA

Query:  K--AKSKSWLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDG
        K  AK+KS LPSIPEESEYPI  E DLKPWKKEESFN +D  KELH FHKLY EKMRKYDTLNHQ TYAKEL+ MQSK SV+S LS+G CGC+PDK    
Subjt:  K--AKSKSWLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDG

Query:  KTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKED
        KTGETR I+ E ELVYV+Q+ VSWEFIVWQYKK LE+ GREGYG C FNEVAEKFEHF+VMIQRFMENESFDEGSRVECY RNRLARRKLLQVPL+K +
Subjt:  KTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKED

TrEMBL top hitse value%identityAlignment
A0A0A0KS72 Uncharacterized protein9.8e-12150.46Show/hide
Query:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS
        MK L  TQF F +N +  ++ V++ K LL+  C L P  + +FKY+K S  K+P                    E PFS+L+FRFPTY EFL+T ENVD 
Subjt:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS

Query:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSI----------KDDDEFDDSAEAI----------------------
             SNE DF+E++CS+PS+  PDF L  RE  EN  PN L CTND+  ED  +          ++D E D+  + +                      
Subjt:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSI----------KDDDEFDDSAEAI----------------------

Query:  --------------------------SGSFDQKIEDS---DEFESEEDDFIKELQNEIMEAKAKSKSWLPSIPEESEYPIAM-EEDLKPW-KKEESFNQK
                                   G F   I+ +    E +SE+DDFIK LQ +IM  KAK+KS LPSIPEE++Y I   E DLKPW KK+ESFN +
Subjt:  --------------------------SGSFDQKIEDS---DEFESEEDDFIKELQNEIMEAKAKSKSWLPSIPEESEYPIAM-EEDLKPW-KKEESFNQK

Query:  DVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCG
        D+TKELH+FHK YTEKMRKYD LN QKTYAKELKMMQSK SV+S  +KG C C+ +K    KT E + I+GE E+VYVVQ+ VSWEFIVW+YKKALE+ G
Subjt:  DVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCG

Query:  REGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQ
        RE YGSCRFNEVAEKFEHFKVMIQRFMENE  +EGSRVECY ++RL RRK LQVPLLKED VK+    GG  + N  KE+A+ IDRLI+ LQESIR+ WQ
Subjt:  REGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQ

Query:  FIRVDKLAHIS---------EQERTSPARSKILKQILLDLQKV
        FIR DKL HIS         +QE  SP+ S +  Q+L+DLQKV
Subjt:  FIRVDKLAHIS---------EQERTSPARSKILKQILLDLQKV

A0A1S4E2E7 uncharacterized protein LOC1034984802.8e-12352.03Show/hide
Query:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS
        MK L   QF F DN    ++ V + K LL+  C L P  + +FKY+K S  K+PLFE+D                 PFS L+FRFPTY EFL+T ENVD 
Subjt:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDS

Query:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTED----------LSIKDDDEFDDSAEAI----------------------
             SNE D +E++CSIPS+P  DF L  RE  EN  PN L CTND+  ED             ++D E D   +A+                      
Subjt:  VISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTED----------LSIKDDDEFDDSAEAI----------------------

Query:  ------------SGSFDQKIEDSD---------------EFESEEDDFIKELQNEIMEAK--AKSKSWLPSIPEESEYPIAM-EEDLKPW-KKEESFNQK
                     GS +    DSD               E +SE+DD IK LQ +IM+AK  AK+KS LPSIPEE++Y I   E DLKPW KK+ESFN +
Subjt:  ------------SGSFDQKIEDSD---------------EFESEEDDFIKELQNEIMEAK--AKSKSWLPSIPEESEYPIAM-EEDLKPW-KKEESFNQK

Query:  DVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCG
        D+TKELHKFHK YTEKMRKYD LN QKTYAKELKMMQSK SV+S L+KG C C+ +K+     G  R I+GE E+VYVVQ+ VSWEFIVW+YKKALE+ G
Subjt:  DVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCG

Query:  REGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQ
        REGYGSCRFN VAEKFEHFKVMI+RFMENE  +EGSRVECY R+RL RRKLLQVPLLKED VK+     G ++ N  KE+A+ IDRLI  LQESIR+ WQ
Subjt:  REGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQ

Query:  FIRVDKLAHIS---------EQERTSPARSKILKQILLDLQK
        FIR DKL HIS         +QE TSP+   I  Q+LLDLQK
Subjt:  FIRVDKLAHIS---------EQERTSPARSKILKQILLDLQK

A0A6J1BWI3 uncharacterized protein LOC1110061622.0e-11056.15Show/hide
Query:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYK-------------TSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPT
        M+FLFK Q                   +LVLF AL    + F +Y+K             T   KEP+ EEDC+E LQ+S DF E +ETPFS+LSFRFPT
Subjt:  MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYK-------------TSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPT

Query:  YVEFLRTTENVDSVISDTSNENDFVEENCSIPSTPDPDFDL-DPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEE
        + EF+ T ENVDS ISDTSNEN       S+PS PDPDF+L  PREDPEN   N L+CT+D   E+L IK++   ++         D+KIEDSD+F SEE
Subjt:  YVEFLRTTENVDSVISDTSNENDFVEENCSIPSTPDPDFDL-DPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEE

Query:  DDFIKELQNEIMEAKAKSKS-WLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSK
        ++ I+ LQ      K K     LP IPEESEY I MEED KPW+ +E+F+ ++ TKELHK HK+Y E+M+KYDTLNHQK YAK+LKMMQSK  + S LSK
Subjt:  DDFIKELQNEIMEAKAKSKS-WLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSK

Query:  GLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLAR
         LC CRP K D G  G+ REI+G+ E+VYV Q+  SWEF+V QYKKALE+C R   GSCRFNEVA KF+HF+V+IQRFMENE+F+EGSRVECYARNRL R
Subjt:  GLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLAR

Query:  RKLLQVPLLKEDRVKDN-KKTGGMDDHNNDKEDAIKIDRLIETLQES
        RKLLQVP++KED VK+N KK    D  ++D E AIKIDR IE LQES
Subjt:  RKLLQVPLLKEDRVKDN-KKTGGMDDHNNDKEDAIKIDRLIETLQES

A0A6J1H9K8 uncharacterized protein LOC1114608612.0e-14264.26Show/hide
Query:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV
        +G  ND+ VLL K LL+LFCAL PLL+ FFKYY    NK+PLF        Q+ +D   IEETPFSSLSF+FPTY +FLR TT+NVD  + +TSNENDF 
Subjt:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV

Query:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEI--MEAKAKSKSWLPS
        +   S  ST D D DL PREDPE  LPNGL  +ND+      I+DD+EFD+  EAISGSFDQ             DFI+EL+NEI  ++AKAKS+S LPS
Subjt:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEI--MEAKAKSKSWLPS

Query:  IPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEF
        IPEESEYPIAMEED K  KKEE  N+K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSKG V+S  SKGLCGCRPDK+ + KTGE R +EGE 
Subjt:  IPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEF

Query:  ELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDD
        E+VYVVQM VSWEFIVWQYKKALE+ G EGYGS RFNEVAEKFEHFKV+IQRFMENE  +EGSRVE YA +R+ +RKLLQVPLLKED  KDNKKTG    
Subjt:  ELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDD

Query:  HNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK
           ++E+ +K+DR+IE LQE IRVFWQFIR DKLAHIS          +QER+SPA S IL QILLDL K
Subjt:  HNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK

A0A6J1JR14 uncharacterized protein LOC1114875633.5e-14265.18Show/hide
Query:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV
        +G  NDLS LL K LL+LFCAL PLL+ FFKYY    NK+PLF +              IEETPFSSLSF+FPTY +FLR TT+NV+     TSNENDF 
Subjt:  NGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLR-TTENVDSVISDTSNENDFV

Query:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSK--SWLPS
        +   S  ST D D DL PREDPE  LPNGL C+NDIR     I+DD+EFDD  EAISGSFDQ             DFI+EL+NEIM+AKAK+K  S LPS
Subjt:  EENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSK--SWLPS

Query:  IPEESEYPIAMEEDLKPWKKEES-FNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGE
        IPEESEYPIAMEED K  KKEE   N+K + KELH+FHK YTEKMRKYDTLNHQ   AKE KM QSKG V+S  SKGLCGCRP K+ + KT E R IEGE
Subjt:  IPEESEYPIAMEEDLKPWKKEES-FNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGE

Query:  FELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMD
         E+VYVVQM VSWEFIVWQYKKALE+ GREGYGS RFNEVAEKFEHFKV IQRFME E  +EGSRVE YAR+R+ RRKLLQVPLL+ED  KD KKTG ++
Subjt:  FELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMD

Query:  DHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK
        +     ED +K+DR+IE LQE IRVFWQFIR DKLAHIS          +QERTSPA S IL QILLDLQK
Subjt:  DHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHIS----------EQERTSPARSKILKQILLDLQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)1.2e-3033.05Show/hide
Query:  KDDDEFDDSAEAISGSFDQKIE---------DSDEFESEEDDFIKELQNEIMEAKAKSKSWLPSIPEESEYPIAMEEDLKPWKKEESFNQ-KDVTKELHK
        ++++EF    E +   FD++ E         DSD+ E E  D I++L+ E+  A+      L +I EESE P+   ++LKP K E   +Q KD   E+HK
Subjt:  KDDDEFDDSAEAISGSFDQKIE---------DSDEFESEEDDFIKELQNEIMEAKAKSKSWLPSIPEESEYPIAMEEDLKPWKKEESFNQ-KDVTKELHK

Query:  FHKLYTEKMRKYDTLNHQKTYAKELKMMQS------------KGSVDSGL---SKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYK
         +K Y  KMRK D ++ Q  ++  L  ++             K S+   +    K    C P +R        +E   +FE VYV Q+ +SWE + WQY 
Subjt:  FHKLYTEKMRKYDTLNHQKTYAKELKMMQS------------KGSVDSGL---SKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYK

Query:  KALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQE
        K LE   +    + ++N VA +F+ F+V++QRF+ENE F   SRVE Y +NR   +  LQ+PL+++DR    K          + E A+K + L E ++E
Subjt:  KALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQE

Query:  SIRVFWQFIRVDK-----LAHISEQERTSPARS---KILKQILLDLQK
        S+ VFW+F+  DK     +  +S Q + SP  S   ++L  I   LQK
Subjt:  SIRVFWQFIRVDK-----LAHISEQERTSPARS---KILKQILLDLQK

AT1G73850.1 Protein of unknown function (DUF1666)7.7e-0922.74Show/hide
Query:  TTENVDSV-ISDTSNENDFVEENCSIPSTPDPDFDLDPR-------------EDPENSLPNGLSCTNDIRTEDLSIKDDDEFD--------------DSA
        T E+  S  +S   +E+   EE+  +   PD  +D D               ED     PN  S     R +   + +D  ++              D  
Subjt:  TTENVDSV-ISDTSNENDFVEENCSIPSTPDPDFDLDPR-------------EDPENSLPNGLSCTNDIRTEDLSIKDDDEFD--------------DSA

Query:  EAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSKS---WLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNH
        +  SG FD +   ++E E EE++   ++  E     + SKS   W  S+  +  +  +       W+    F + D  +E+    ++  +K+ + ++L  
Subjt:  EAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSKS---WLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNH

Query:  QKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEV------AEKFEHF
             + +    S+  V    S G    +  K+  G  G       E E  YV Q+ ++WE + W YK       +       FN+V      A++F  F
Subjt:  QKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYKKALEMCGREGYGSCRFNEV------AEKFEHF

Query:  KVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHISEQERTSPA
         +++QR++ENE ++ G R E YAR R    KLL VP  ++   ++ K+    D++       I     +  ++E IR F  F++ DK     +  +    
Subjt:  KVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQFIRVDKLAHISEQERTSPA

Query:  RSK--ILKQILLDLQKVVGVESAKYAKDHKR
        RSK   +   L+ L K V  +     K+ +R
Subjt:  RSK--ILKQILLDLQKVVGVESAKYAKDHKR

AT3G20260.1 Protein of unknown function (DUF1666)3.5e-1426.46Show/hide
Query:  KDDDEFDDSAEAISGSFDQKIEDS---DEFESEEDDFI-KELQNEIMEAKAKS-KSWLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKE-------L
        K+D       +A S + +  + DS   DE E ++DDFI  E++  + E +  S    +P   EE E    ++ED    + + S   +DV  E        
Subjt:  KDDDEFDDSAEAISGSFDQKIEDS---DEFESEEDDFI-KELQNEIMEAKAKS-KSWLPSIPEESEYPIAMEEDLKPWKKEESFNQKDVTKE-------L

Query:  HKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVD-SGLSKGLCG---CRPDKRDDGKTGETREIE--------GEFELVYVVQMLVSWEFIVWQYKK
           ++ Y E+M  +D L+ Q+     + +  S  +      SK L     C   K+ D    +   ++         + E  YV Q+ ++WE +  QY +
Subjt:  HKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVD-SGLSKGLCG---CRPDKRDDGKTGETREIE--------GEFELVYVVQMLVSWEFIVWQYKK

Query:  ALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLK-EDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQE
           +   +      +N  A+ F+ F V++QR++ENE F++GSR E YAR R A  KLLQ P ++  D+ +  K TG M          +  D LI+ ++ 
Subjt:  ALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLK-EDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQE

Query:  SIRVFWQFIRVDKL-----AHISEQERTSPARSK---ILKQILLDLQKVVGVESAKYAK
        SI  F  F+++DK       H+      +   S    +L Q  +D ++V   E +K  K
Subjt:  SIRVFWQFIRVDKL-----AHISEQERTSPARSK---ILKQILLDLQKVVGVESAKYAK

AT5G39785.1 Protein of unknown function (DUF1666)1.5e-4434.79Show/hide
Query:  EDCNEALQE-------SIDFGE--IEETPFSSLSFRFPTYVEFLRTTENVDSVISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDI
        EDC +  QE       S+  GE  ++   +S  SF+    + FL   + ++S       ++DFV+ + +  S  D D  L   +  E SL  G       
Subjt:  EDCNEALQE-------SIDFGE--IEETPFSSLSFRFPTYVEFLRTTENVDSVISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDI

Query:  RTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFES--EEDDFIKELQNEIMEAKAKS--KSWLPSIPEESEYPIAMEEDLKPWK--KEESFNQKDVTKE
                  +   D++ + S S +++ ED++ FES  E  D I++L+ E+ + KA     + L    E+ + P  M EDLKPW+  +E+ F   D   E
Subjt:  RTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFES--EEDDFIKELQNEIMEAKAKS--KSWLPSIPEESEYPIAMEEDLKPWK--KEESFNQKDVTKE

Query:  LHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKG---------------SVDSGLSKGLCGCRPDKRD-DGKTGETREIEGEFELVYVVQMLVSWEFIV
        +HKFH+ Y E+MRK D L+ QK+YA  L ++QSK                S  S  S  +   +  K + +      +EI+GE E VYV QM +SWE + 
Subjt:  LHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKG---------------SVDSGLSKGLCGCRPDKRD-DGKTGETREIEGEFELVYVVQMLVSWEFIV

Query:  WQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIE
        WQY+KA+E+   + YGS R+NEVA +F+ F+V++QRF+ENE F+E  RV+ Y + R   R LLQ+P+++ED  KD KK G   D+  + +  IK D+L+E
Subjt:  WQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIE

Query:  TLQESIRVFWQFIRVDKLAHISEQERTSPARSKI
         ++E+IR+FW+F+R DKL   S  ++ S  +S+I
Subjt:  TLQESIRVFWQFIRVDKLAHISEQERTSPARSKI

AT5G39785.2 Protein of unknown function (DUF1666)1.5e-4434.79Show/hide
Query:  EDCNEALQE-------SIDFGE--IEETPFSSLSFRFPTYVEFLRTTENVDSVISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDI
        EDC +  QE       S+  GE  ++   +S  SF+    + FL   + ++S       ++DFV+ + +  S  D D  L   +  E SL  G       
Subjt:  EDCNEALQE-------SIDFGE--IEETPFSSLSFRFPTYVEFLRTTENVDSVISDTSNENDFVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDI

Query:  RTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFES--EEDDFIKELQNEIMEAKAKS--KSWLPSIPEESEYPIAMEEDLKPWK--KEESFNQKDVTKE
                  +   D++ + S S +++ ED++ FES  E  D I++L+ E+ + KA     + L    E+ + P  M EDLKPW+  +E+ F   D   E
Subjt:  RTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFES--EEDDFIKELQNEIMEAKAKS--KSWLPSIPEESEYPIAMEEDLKPWK--KEESFNQKDVTKE

Query:  LHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKG---------------SVDSGLSKGLCGCRPDKRD-DGKTGETREIEGEFELVYVVQMLVSWEFIV
        +HKFH+ Y E+MRK D L+ QK+YA  L ++QSK                S  S  S  +   +  K + +      +EI+GE E VYV QM +SWE + 
Subjt:  LHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKG---------------SVDSGLSKGLCGCRPDKRD-DGKTGETREIEGEFELVYVVQMLVSWEFIV

Query:  WQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIE
        WQY+KA+E+   + YGS R+NEVA +F+ F+V++QRF+ENE F+E  RV+ Y + R   R LLQ+P+++ED  KD KK G   D+  + +  IK D+L+E
Subjt:  WQYKKALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIE

Query:  TLQESIRVFWQFIRVDKLAHISEQERTSPARSKI
         ++E+IR+FW+F+R DKL   S  ++ S  +S+I
Subjt:  TLQESIRVFWQFIRVDKLAHISEQERTSPARSKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTTTTGTTCAAAACCCAGTTTTCTTTTTCCGATAATGGAGAATTGAACGACCTTTCGGTCCTTCTTTCAAAAGCCCTCTTGGTCTTGTTCTGTGCTTTGATCCC
ACTCCTCATCGGATTCTTCAAATACTATAAAACCTCTGCCAACAAAGAACCTCTGTTTGAAGAAGATTGTAACGAAGCTTTACAAGAAAGTATTGATTTTGGTGAAATTG
AAGAAACGCCCTTCTCATCTCTAAGCTTTCGATTTCCAACTTACGTGGAGTTTTTGAGAACTACAGAGAATGTTGATTCAGTCATATCTGATACTTCAAATGAGAATGAT
TTTGTTGAAGAAAACTGCTCGATTCCTTCAACTCCTGATCCGGATTTCGATCTTGATCCCAGAGAAGACCCAGAAAATTCTCTCCCAAATGGTTTGAGTTGCACCAATGA
TATCAGAACCGAAGATTTAAGCATTAAAGACGACGATGAATTTGATGATTCAGCCGAAGCCATATCCGGGAGTTTTGATCAGAAAATTGAAGATTCAGATGAATTTGAAT
CTGAAGAAGACGACTTTATAAAAGAACTGCAAAATGAGATTATGGAAGCCAAAGCCAAATCCAAATCCTGGCTGCCCTCAATTCCTGAAGAATCCGAGTACCCAATAGCC
ATGGAAGAAGATTTAAAGCCATGGAAGAAGGAAGAAAGCTTCAATCAGAAAGACGTAACAAAAGAACTCCACAAATTCCACAAGCTTTACACAGAAAAGATGCGAAAATA
CGACACTTTGAATCACCAGAAAACATACGCAAAAGAGTTGAAGATGATGCAATCGAAGGGCTCAGTGGACTCTGGTTTATCAAAAGGGCTGTGTGGGTGCAGGCCTGATA
AGAGAGATGATGGTAAAACAGGGGAAACCAGAGAAATCGAAGGGGAATTTGAGTTGGTTTATGTGGTGCAAATGTTGGTTTCTTGGGAGTTTATTGTGTGGCAGTACAAG
AAAGCTTTGGAGATGTGTGGGAGAGAAGGTTATGGAAGTTGTAGATTCAATGAAGTTGCAGAGAAATTTGAGCATTTTAAAGTGATGATACAGAGATTTATGGAGAATGA
ATCGTTTGATGAAGGGTCGAGAGTTGAGTGTTATGCTAGAAATCGACTGGCAAGGAGAAAGCTTCTGCAAGTTCCTCTTCTCAAAGAAGATAGAGTCAAAGATAATAAAA
AGACAGGAGGAATGGATGATCACAACAACGACAAGGAAGATGCAATTAAAATTGATAGACTTATTGAAACTTTGCAAGAATCCATAAGAGTCTTCTGGCAATTTATTCGA
GTTGATAAACTTGCCCATATATCAGAACAAGAACGTACAAGTCCTGCACGTTCTAAGATTCTAAAACAAATTCTTTTAGATCTTCAAAAAGTTGTTGGTGTCGAGAGTGC
TAAATATGCAAAGGATCACAAGAGACCAATTAAGTTGGTGTCACCATAA
mRNA sequenceShow/hide mRNA sequence
CACGAATGAAGTTTTTGTTCAAAACCCAGTTTTCTTTTTCCGATAATGGAGAATTGAACGACCTTTCGGTCCTTCTTTCAAAAGCCCTCTTGGTCTTGTTCTGTGCTTTG
ATCCCACTCCTCATCGGATTCTTCAAATACTATAAAACCTCTGCCAACAAAGAACCTCTGTTTGAAGAAGATTGTAACGAAGCTTTACAAGAAAGTATTGATTTTGGTGA
AATTGAAGAAACGCCCTTCTCATCTCTAAGCTTTCGATTTCCAACTTACGTGGAGTTTTTGAGAACTACAGAGAATGTTGATTCAGTCATATCTGATACTTCAAATGAGA
ATGATTTTGTTGAAGAAAACTGCTCGATTCCTTCAACTCCTGATCCGGATTTCGATCTTGATCCCAGAGAAGACCCAGAAAATTCTCTCCCAAATGGTTTGAGTTGCACC
AATGATATCAGAACCGAAGATTTAAGCATTAAAGACGACGATGAATTTGATGATTCAGCCGAAGCCATATCCGGGAGTTTTGATCAGAAAATTGAAGATTCAGATGAATT
TGAATCTGAAGAAGACGACTTTATAAAAGAACTGCAAAATGAGATTATGGAAGCCAAAGCCAAATCCAAATCCTGGCTGCCCTCAATTCCTGAAGAATCCGAGTACCCAA
TAGCCATGGAAGAAGATTTAAAGCCATGGAAGAAGGAAGAAAGCTTCAATCAGAAAGACGTAACAAAAGAACTCCACAAATTCCACAAGCTTTACACAGAAAAGATGCGA
AAATACGACACTTTGAATCACCAGAAAACATACGCAAAAGAGTTGAAGATGATGCAATCGAAGGGCTCAGTGGACTCTGGTTTATCAAAAGGGCTGTGTGGGTGCAGGCC
TGATAAGAGAGATGATGGTAAAACAGGGGAAACCAGAGAAATCGAAGGGGAATTTGAGTTGGTTTATGTGGTGCAAATGTTGGTTTCTTGGGAGTTTATTGTGTGGCAGT
ACAAGAAAGCTTTGGAGATGTGTGGGAGAGAAGGTTATGGAAGTTGTAGATTCAATGAAGTTGCAGAGAAATTTGAGCATTTTAAAGTGATGATACAGAGATTTATGGAG
AATGAATCGTTTGATGAAGGGTCGAGAGTTGAGTGTTATGCTAGAAATCGACTGGCAAGGAGAAAGCTTCTGCAAGTTCCTCTTCTCAAAGAAGATAGAGTCAAAGATAA
TAAAAAGACAGGAGGAATGGATGATCACAACAACGACAAGGAAGATGCAATTAAAATTGATAGACTTATTGAAACTTTGCAAGAATCCATAAGAGTCTTCTGGCAATTTA
TTCGAGTTGATAAACTTGCCCATATATCAGAACAAGAACGTACAAGTCCTGCACGTTCTAAGATTCTAAAACAAATTCTTTTAGATCTTCAAAAAGTTGTTGGTGTCGAG
AGTGCTAAATATGCAAAGGATCACAAGAGACCAATTAAGTTGGTGTCACCATAAATTGAGTTGTATTAGTTTTCCAAATGGGAAGATTAAAACAGAGTCTTCATCATTTT
TCTTTCCTTCTTGATGAGTTTGCTTTGTATTTTTATCATAAATCTAGATTAGTAAATTTGAATTTATGTAATTAGGTTTGAATTTTGAAAAAAAAGAGGGAACAAATTAA
AAGAAACATAGGAAAGTTAGTACAAGTACAAAAGGAGG
Protein sequenceShow/hide protein sequence
MKFLFKTQFSFSDNGELNDLSVLLSKALLVLFCALIPLLIGFFKYYKTSANKEPLFEEDCNEALQESIDFGEIEETPFSSLSFRFPTYVEFLRTTENVDSVISDTSNEND
FVEENCSIPSTPDPDFDLDPREDPENSLPNGLSCTNDIRTEDLSIKDDDEFDDSAEAISGSFDQKIEDSDEFESEEDDFIKELQNEIMEAKAKSKSWLPSIPEESEYPIA
MEEDLKPWKKEESFNQKDVTKELHKFHKLYTEKMRKYDTLNHQKTYAKELKMMQSKGSVDSGLSKGLCGCRPDKRDDGKTGETREIEGEFELVYVVQMLVSWEFIVWQYK
KALEMCGREGYGSCRFNEVAEKFEHFKVMIQRFMENESFDEGSRVECYARNRLARRKLLQVPLLKEDRVKDNKKTGGMDDHNNDKEDAIKIDRLIETLQESIRVFWQFIR
VDKLAHISEQERTSPARSKILKQILLDLQKVVGVESAKYAKDHKRPIKLVSP