CuGenDBv2

Moc09g34190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g34190
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF674)
Genome location: chr9:26094500..26105155
RNA-Seq Expression: Moc09g34190
Synteny: Moc09g34190
Gene Ontology terms: NA
InterPro domains: IPR007750 - Protein of unknown function DUF674
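
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based, inclusive coordinates, which is a common annotation convention but is not stated on this page.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:26094500..26105155")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 10656 bp on chr9.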


Homology
GenBank top hits (e value, % identity, alignment)
KAF9674789.1 hypothetical protein SADUNF_Sadunf10G0163400 [Salix dunnii] (e value: 5.5e-96; identity: 38.92%)
Query:  SKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQ
        S +SLKL+ID K  +++ AEA K F+DFL  +L+LPLG V++L++   P  T  I N+Y + + L+ +Y    +NKD +LN   P+ T      + L   
Subjt:  SKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQ

Query:  SFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRC---------GQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQL
        +   P      H   +     S S T   VCS C          Q +    T    + E    + GGYVK G+VT+ V DDL+V P+ S +S + +L++L
Subjt:  SFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRC---------GQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQL

Query:  HVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKI----DFPTT----------PVLAQNQT---------------AVRLKLLIDTKGNRVL
        +++D G +EEK +   I+EG++LL+ASL +   LT VFL K+    D P            PV+ +N+T                + LKLLID+K N+V+
Subjt:  HVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKI----DFPTT----------PVLAQNQT---------------AVRLKLLIDTKGNRVL

Query:  FGEADKNLIDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSS--CGSTMLLPSVEASTAATTFYGCSYAGYGNCR
        F EA K+ +DFL NLL+LPLGTVI+L+TK  M GC+ NLY S+E L+D+YLQ  Q K+++L P +++     + LLP+ +       +Y   + G     
Subjt:  FGEADKNLIDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSS--CGSTMLLPSVEASTAATTFYGCSYAGYGNCR

Query:  VYVSDGPNATCPQCKQK-------MAQVNTYVQPPSGSIQ-GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGV
          VS+  N+ C QC+ +         +V   V   S S      DQGGYVK VVTY V DDL+V P+S +S + LLNK NIK+ G L+EK++    ++G+
Subjt:  VYVSDGPNATCPQCKQK-------MAQVNTYVQPPSGSIQ-GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGV

Query:  KLLKASLQSKTVLTDVFLKKD------------NVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAY
        +LLKASL SK  LT VFL KD             + LKLLID K  +++F EA K+ VDFL +LL+L LG V +LL  +  + GC+ NLY S+E ++E+Y
Subjt:  KLLKASLQSKTVLTDVFLKKD------------NVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAY

Query:  LLPNHNRDTLLKPKVVS-FFNSSLLLP
        L PN N+D++LKP + +   N + LLP
Subjt:  LLPNHNRDTLLKPKVVS-FFNSSLLLP

KVI09725.1 Protein of unknown function DUF674, partial [Cynara cardunculus var. scolymus] (e value: 1.2e-95; identity: 39.36%)
Query:  LYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQSFHP-PTTYYTCHVST
        L+AEA+K+F+DFLF ILSLP+G V++LL     +   S+ N+Y + + LN  Y    RNKD +LNP +  A   D +  LL      P    +Y C    
Subjt:  LYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQSFHP-PTTYYTCHVST

Query:  YNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQIEEKLIYL---------
         N   +  +    AVC  C   M T   +V GA      E GG+VK G+VT+MVMDDL V P+ S++S+I++    +V++VG + EK++ L         
Subjt:  YNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQIEEKLIYL---------

Query:  --DINEGVKLLR----------------------ASLCTSTV---LTDVFLHKIDFPTTPVLAQ---------NQTAVRLKLLIDTKGNRVLFGEADKNL
          D+ EGV  ++                        + TST     T   L      +  VL Q           + V +KLLID K  +VLF EA+K  
Subjt:  --DINEGVKLLR----------------------ASLCTSTV---LTDVFLHKIDFPTTPVLAQ---------NQTAVRLKLLIDTKGNRVLFGEADKNL

Query:  IDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGP
        +DFLF++ SLP+GTVIRLL K +MVG LGNLYDS+E LNDTY+Q  + K+++LNPK+++ G     +L    + +  A TFY C      N   YV++ P
Subjt:  IDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGP

Query:  NATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITM-----------DFNQGVKLLK
         A CPQC  +M  V ++V   +G+ +   + GG+VK VVTYMVMDDL V PMSTISSIT+LNKFN+KEVG LEEK++++           D  +GV  ++
Subjt:  NATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITM-----------DFNQGVKLLK

Query:  ------ASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTL
               S  + T  +        V LKLLID K ++++FAEA+K  VDFLFH+LSL +G V RLLKN   +VG L NLY+S+E +++ Y+ PN ++D +
Subjt:  ------ASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTL

Query:  LKPKVVSFFNSSLLLPYIVDDSPPS
        L PK+    N    LP ++ +   S
Subjt:  LKPKVVSFFNSSLLLPYIVDDSPPS

KVI09806.1 Protein of unknown function DUF674 [Cynara cardunculus var. scolymus] (e value: 4.8e-100; identity: 40.7%)
Query:  AQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDE
        A + S T S++SLKL++D+K++++++AEADK+F+DFLF ILSLP+G V+KLL+    +   SI N+Y + + L+  Y    ++K+ +LNP + +      
Subjt:  AQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDE

Query:  LQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGA-KEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQL
        L  L    +      Y     S Y+   Y  +    AVC  C   +    +YV G+  E    E GG+VK G+VT+M+MDDL VKP+ S++S+I++L+ L
Subjt:  LQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGA-KEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQL

Query:  HVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTPVLAQNQTAVRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTK
        +++++G +EEKL+            AS  TS                       + + LKLL+D K  +VLF EA+K  +DFLF++LSLP+GTVI+LLTK
Subjt:  HVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTPVLAQNQTAVRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTK

Query:  QTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPP
         +MVG LGNLY S+E L+DTY+Q  Q KN++LNPK++  G+    +LLP+ +A   A   Y CS   Y     YV+D P   CP C + + +   YV   
Subjt:  QTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPP

Query:  SGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRI
        SG+ +   + GG+VK VVTYM+MDDL VKPMSTISSIT+LN FN+KE+G LEEK++     +     K   ++KT+L +       + LKLLI  K Q++
Subjt:  SGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRI

Query:  LFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSF
        LFAEA K  VDFLFH LSL +  V RLLK +  +VG + N+Y+S++ +++ Y+ PN +++ +L PK+ ++
Subjt:  LFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSF

XP_022139200.1 uncharacterized protein LOC111010169 [Momordica charantia] (e value: 3.8e-137; identity: 100%)
Query:  MTTQPQTEAKAQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNP
        MTTQPQTEAKAQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNP
Subjt:  MTTQPQTEAKAQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNP

Query:  NLPSATPSDELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSM
        NLPSATPSDELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSM
Subjt:  NLPSATPSDELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSM

Query:  STISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTP
        STISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTP
Subjt:  STISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTP

XP_027102561.1 uncharacterized protein LOC113723788 [Coffea arabica] (e value: 2.7e-119; identity: 45.93%)
Query:  SKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQ
        S VSLKL+ID K  ++L+AEA+K  +DFLF +LSLP+G V++LL  G       + N+Y + ++LN  Y    ++KD LL P   ++ P      LL + 
Subjt:  SKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQ

Query:  SFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVY--GAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQ
               +Y C    YN C Y  S    A C +C   MT + TYV     KEA   + GG+VK G+VT+MVMDDL VKP+ S++S+I++L++ +V++VG 
Subjt:  SFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVY--GAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQ

Query:  IEEKLIYLDINEGVKLLRASLCTSTVLTDVFLH---KIDFPTTPVLAQNQTA--VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTKQT
        +EEK + L +NE + LL+AS  + TVLT+VFL    K+        +++  A  V LKLLIDTK  +VLF EA+K+ +DFLF++LSLP+GTVIRLL KQ 
Subjt:  IEEKLIYLDINEGVKLLRASLCTSTVLTDVFLH---KIDFPTTPVLAQNQTA--VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTKQT

Query:  MVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQ
        MVGC+ NLY+S+E+LN+TY+Q KQ K+TLL P+  +  S  LL   +  T A  FY C + GY NC  YVSD   A CPQCK  M     YV PP+    
Subjt:  MVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQ

Query:  GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKK---------------------
          GD+GG+VK VVTYMVMDDL VKPMSTISSI LLN+FN+KE+ ALEEK + +  N+ +KLLK SL+SK VLT+VF+K                      
Subjt:  GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKK---------------------

Query:  -------------DNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSFF
                       + LKLLID    ++LFAEA KN VDFL H+LSL L  V RLL++Q   +G L NLY S+E ++EAY+ PN N++ LL PK  +  
Subjt:  -------------DNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSFF

Query:  NSSLLLPYIVDDSP
         ++LL    ++DSP
Subjt:  NSSLLLPYIVDDSP

TrEMBL top hits (e value, % identity, alignment)
A0A103YIG3 Uncharacterized protein (Fragment) (e value: 5.9e-96; identity: 39.36%)
Query:  LYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQSFHP-PTTYYTCHVST
        L+AEA+K+F+DFLF ILSLP+G V++LL     +   S+ N+Y + + LN  Y    RNKD +LNP +  A   D +  LL      P    +Y C    
Subjt:  LYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQSFHP-PTTYYTCHVST

Query:  YNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQIEEKLIYL---------
         N   +  +    AVC  C   M T   +V GA      E GG+VK G+VT+MVMDDL V P+ S++S+I++    +V++VG + EK++ L         
Subjt:  YNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQIEEKLIYL---------

Query:  --DINEGVKLLR----------------------ASLCTSTV---LTDVFLHKIDFPTTPVLAQ---------NQTAVRLKLLIDTKGNRVLFGEADKNL
          D+ EGV  ++                        + TST     T   L      +  VL Q           + V +KLLID K  +VLF EA+K  
Subjt:  --DINEGVKLLR----------------------ASLCTSTV---LTDVFLHKIDFPTTPVLAQ---------NQTAVRLKLLIDTKGNRVLFGEADKNL

Query:  IDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGP
        +DFLF++ SLP+GTVIRLL K +MVG LGNLYDS+E LNDTY+Q  + K+++LNPK+++ G     +L    + +  A TFY C      N   YV++ P
Subjt:  IDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGP

Query:  NATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITM-----------DFNQGVKLLK
         A CPQC  +M  V ++V   +G+ +   + GG+VK VVTYMVMDDL V PMSTISSIT+LNKFN+KEVG LEEK++++           D  +GV  ++
Subjt:  NATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITM-----------DFNQGVKLLK

Query:  ------ASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTL
               S  + T  +        V LKLLID K ++++FAEA+K  VDFLFH+LSL +G V RLLKN   +VG L NLY+S+E +++ Y+ PN ++D +
Subjt:  ------ASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTL

Query:  LKPKVVSFFNSSLLLPYIVDDSPPS
        L PK+    N    LP ++ +   S
Subjt:  LKPKVVSFFNSSLLLPYIVDDSPPS

A0A103YIL4 Uncharacterized protein (e value: 2.3e-100; identity: 40.7%)
Query:  AQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDE
        A + S T S++SLKL++D+K++++++AEADK+F+DFLF ILSLP+G V+KLL+    +   SI N+Y + + L+  Y    ++K+ +LNP + +      
Subjt:  AQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDE

Query:  LQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGA-KEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQL
        L  L    +      Y     S Y+   Y  +    AVC  C   +    +YV G+  E    E GG+VK G+VT+M+MDDL VKP+ S++S+I++L+ L
Subjt:  LQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGA-KEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQL

Query:  HVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTPVLAQNQTAVRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTK
        +++++G +EEKL+            AS  TS                       + + LKLL+D K  +VLF EA+K  +DFLF++LSLP+GTVI+LLTK
Subjt:  HVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTPVLAQNQTAVRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTK

Query:  QTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPP
         +MVG LGNLY S+E L+DTY+Q  Q KN++LNPK++  G+    +LLP+ +A   A   Y CS   Y     YV+D P   CP C + + +   YV   
Subjt:  QTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGS---TMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPP

Query:  SGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRI
        SG+ +   + GG+VK VVTYM+MDDL VKPMSTISSIT+LN FN+KE+G LEEK++     +     K   ++KT+L +       + LKLLI  K Q++
Subjt:  SGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRI

Query:  LFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSF
        LFAEA K  VDFLFH LSL +  V RLLK +  +VG + N+Y+S++ +++ Y+ PN +++ +L PK+ ++
Subjt:  LFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSF

A0A6J1CBN0 uncharacterized protein LOC111010169 (e value: 1.8e-137; identity: 100%)
Query:  MTTQPQTEAKAQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNP
        MTTQPQTEAKAQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNP
Subjt:  MTTQPQTEAKAQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNP

Query:  NLPSATPSDELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSM
        NLPSATPSDELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSM
Subjt:  NLPSATPSDELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSM

Query:  STISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTP
        STISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTP
Subjt:  STISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTP

A0A6J1CC90 uncharacterized protein LOC111010168 (e value: 9.5e-94; identity: 100%)
Query:  MVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQ
        MVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQ
Subjt:  MVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQ

Query:  GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKK
        GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKK
Subjt:  GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKK

A0A6P6VIM5 uncharacterized protein LOC113723788 (e value: 1.3e-119; identity: 45.93%)
Query:  SKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQ
        S VSLKL+ID K  ++L+AEA+K  +DFLF +LSLP+G V++LL  G       + N+Y + ++LN  Y    ++KD LL P   ++ P      LL + 
Subjt:  SKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQIQ

Query:  SFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVY--GAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQ
               +Y C    YN C Y  S    A C +C   MT + TYV     KEA   + GG+VK G+VT+MVMDDL VKP+ S++S+I++L++ +V++VG 
Subjt:  SFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVY--GAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQ

Query:  IEEKLIYLDINEGVKLLRASLCTSTVLTDVFLH---KIDFPTTPVLAQNQTA--VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTKQT
        +EEK + L +NE + LL+AS  + TVLT+VFL    K+        +++  A  V LKLLIDTK  +VLF EA+K+ +DFLF++LSLP+GTVIRLL KQ 
Subjt:  IEEKLIYLDINEGVKLLRASLCTSTVLTDVFLH---KIDFPTTPVLAQNQTA--VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTKQT

Query:  MVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQ
        MVGC+ NLY+S+E+LN+TY+Q KQ K+TLL P+  +  S  LL   +  T A  FY C + GY NC  YVSD   A CPQCK  M     YV PP+    
Subjt:  MVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQ

Query:  GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKK---------------------
          GD+GG+VK VVTYMVMDDL VKPMSTISSI LLN+FN+KE+ ALEEK + +  N+ +KLLK SL+SK VLT+VF+K                      
Subjt:  GVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKK---------------------

Query:  -------------DNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSFF
                       + LKLLID    ++LFAEA KN VDFL H+LSL L  V RLL++Q   +G L NLY S+E ++EAY+ PN N++ LL PK  +  
Subjt:  -------------DNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYESVETMNEAYLLPNHNRDTLLKPKVVSFF

Query:  NSSLLLPYIVDDSP
         ++LL    ++DSP
Subjt:  NSSLLLPYIVDDSP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G09110.1 Protein of unknown function (DUF674) (e value: 6.0e-24; identity: 29.87%)
Query:  LKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTK-----QTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTA
        L+LLID + NRV+  EA K+ +D L +LL+LP+GT++RLL K      ++VGCL NLY SV  ++    +S+  K+ LL+P+ +       L      T 
Subjt:  LKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTK-----QTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTA

Query:  ATTFYGC-SYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKI
        AT F+ C ++     CR   S+     C +C   M     + + P    Q     G +     ++++ DDL V   S    + +LN F       L+E +
Subjt:  ATTFYGC-SYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKI

Query:  ITMDFNQGVKLLKASLQSKTVLTDVFLKK-----------------------DN-VRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQ
        I + F + + LL     S+  LTD FL+K                       DN + LK+ +    + +L+AE  +  VDFLF  L++ +      L   
Subjt:  ITMDFNQGVKLLKASLQSKTVLTDVFLKK-----------------------DN-VRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQ

Query:  IGLVGCLRNLYESVETMN
        IG +GC+ NL  SV+ ++
Subjt:  IGLVGCLRNLYESVETMN

AT3G09140.2 Protein of unknown function (DUF674) (e value: 1.2e-24; identity: 24.7%)
Query:  QSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLL-----STGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPS
        +S    K+S++L+ID+ + +++ AE+ K F+D LF+ L+LP+G +V+LL     S  V +  ++  N+Y++   ++L  F +   K +LL P   +    
Subjt:  QSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLL-----STGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPS

Query:  DELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEM-------------GGYVKGGMVTFMVMDDLTVKP
          ++  L+I        Y+ C       CR+ +S +    C  CG+ + +        +E K LE              G +      +F++ DDL V+ 
Subjt:  DELQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEM-------------GGYVKGGMVTFMVMDDLTVKP

Query:  ISSSMSTISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKID--------FPTTPVL------AQNQTAVRLKLLIDTKGNRVLF
         SS  + ++ L  L   D  ++ E L+++ ++E + LL     +   LTD FL K           P +P L      A     +  K  +     ++L 
Subjt:  ISSSMSTISVLHQLHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHKID--------FPTTPVL------AQNQTAVRLKLLIDTKGNRVLF

Query:  GEADKNLIDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYV
         E   + ID LF  L+LPL +V  +      +GC+GNL+ S + ++     S   K  L  P   SC   +L    +  T     Y CS+          
Subjt:  GEADKNLIDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYV

Query:  SDGPNATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKT
                P     + + N  +   + +        G++K    ++V DDL +KP + +S+I+LL      +   +EE +IT+   + + LL+ASL + +
Subjt:  SDGPNATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKT

Query:  VLTDVF
         LT  F
Subjt:  VLTDVF

AT5G01150.1 Protein of unknown function (DUF674) (e value: 1.4e-25; identity: 24.08%)
Query:  QSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLST---GVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDE
        +S  E K SL+L++D+++ +++ AEA + F+D LF++L+LP+G +V+LL       P+      N+YR+   +  + F +   K +L+ P          
Subjt:  QSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLST---GVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDE

Query:  LQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPL--EMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQ
        L+  L I     PT    C   +     YS    +     RCG+ M          ++      +  G    G  +F++ DDL V  + S+   ++ L  
Subjt:  LQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPL--EMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQ

Query:  LHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLH--------KIDFPTTPVL------AQNQTAVRLKLLIDTKGNRVLFGEADKNLIDFLFN
        L   DVG++ E+L+ + + E + LL     ++  L D+FL+        K     +P L      A+ +  V LK  +  +  ++LF E  +  ++ LF+
Subjt:  LHVEDVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLH--------KIDFPTTPVL------AQNQTAVRLKLLIDTKGNRVLFGEADKNLIDFLFN

Query:  LLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQ
         +++PL +   +      +GC+GN+  +   LN+   +++   +T   P  S     M LP +  +     +Y      Y      ++   N T    K 
Subjt:  LLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQ

Query:  KMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLT
        ++ +V+       GS +      G++K      V+DDLT+  M++ S++ LL K        LE ++I++   + + LL+ASL + + L+
Subjt:  KMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLT

AT5G43240.1 Protein of unknown function (DUF674) (e value: 2.4e-25; identity: 25.46%)
Query:  VSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSI---VNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQI
        + LKL+ID+++ ++++ EA K F+D LF+  +LP+G +V+LL      +  +I    N+Y +  ++ + +F +   K +LL    P +   ++ ++L   
Subjt:  VSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSI---VNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQI

Query:  QSFHPPTTYYTC-HVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYG----AKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVE
              T Y+ C        C  S+S    + CS CG  M    T + G    A     +E G +V+    +FM+ DDL V+ I+S   T++VL  L   
Subjt:  QSFHPPTTYYTC-HVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYG----AKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVE

Query:  DVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHK--------IDFPTTPVLAQNQTA----VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPL
        D  +++EK+  +++ E   LL     +   LTD FL K        I    +P L +++        + L +  K   +LF E   + +D LF  L++PL
Subjt:  DVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHK--------IDFPTTPVLAQNQTA----VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPL

Query:  GTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVY--VSDGPNATCPQCKQKMAQ
         +   +     ++GC+GNL  S + L+      ++ K  L  P    C    LL +V      T +   S     + R Y    D          + M  
Subjt:  GTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVY--VSDGPNATCPQCKQKMAQ

Query:  VNTYVQPPSGSIQGVGDQ-----GGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLT
        ++  + P S       D+     GG++K    +M+ DDL + P+++ S+I L+ +  I+ +  +E + I +   + ++LL+ASL + + L+
Subjt:  VNTYVQPPSGSIQGVGDQ-----GGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLT

AT5G43240.3 Protein of unknown function (DUF674) (e value: 2.4e-25; identity: 25.46%)
Query:  VSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSI---VNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQI
        + LKL+ID+++ ++++ EA K F+D LF+  +LP+G +V+LL      +  +I    N+Y +  ++ + +F +   K +LL    P +   ++ ++L   
Subjt:  VSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSI---VNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDELQSLLQI

Query:  QSFHPPTTYYTC-HVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYG----AKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVE
              T Y+ C        C  S+S    + CS CG  M    T + G    A     +E G +V+    +FM+ DDL V+ I+S   T++VL  L   
Subjt:  QSFHPPTTYYTC-HVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYG----AKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVE

Query:  DVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHK--------IDFPTTPVLAQNQTA----VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPL
        D  +++EK+  +++ E   LL     +   LTD FL K        I    +P L +++        + L +  K   +LF E   + +D LF  L++PL
Subjt:  DVGQIEEKLIYLDINEGVKLLRASLCTSTVLTDVFLHK--------IDFPTTPVLAQNQTA----VRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPL

Query:  GTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVY--VSDGPNATCPQCKQKMAQ
         +   +     ++GC+GNL  S + L+      ++ K  L  P    C    LL +V      T +   S     + R Y    D          + M  
Subjt:  GTVIRLLTKQTMVGCLGNLYDSVETLNDTYLQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVY--VSDGPNATCPQCKQKMAQ

Query:  VNTYVQPPSGSIQGVGDQ-----GGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLT
        ++  + P S       D+     GG++K    +M+ DDL + P+++ S+I L+ +  I+ +  +E + I +   + ++LL+ASL + + L+
Subjt:  VNTYVQPPSGSIQGVGDQ-----GGYVKDVVTYMVMDDLTVKPMSTISSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLT


Sequences
CDS sequence
ATGACCACCCAACCTCAAACTGAAGCCAAAGCTCAAACCCAAAGCGAAACCGAAAGCAAGGTGAGTTTGAAGCTTGTAATAGACCAAAAAGAGAAGAGAATTCTATACGC
TGAAGCCGATAAGAAATTCATAGACTTCCTCTTCACCATACTCTCCCTCCCTCTGGGAGCTGTCGTTAAGTTGCTTTCCACCGGCGTGCCACTGGAAACTTGGTCCATTG
TAAATGTCTACCGCACCCACCAAACCTTAAACCTCAACTATTTTGCATCAACCCGCAACAAAGACATTCTCCTCAACCCCAACCTTCCATCCGCCACTCCGTCTGATGAA
CTCCAAAGCCTCTTGCAAATCCAATCTTTTCATCCACCAACCACCTACTATACATGTCATGTTAGTACTTATAACGTCTGTCGTTATAGTTTTAGTGGTACATATGGTGC
AGTATGTAGCAGATGCGGCCAATCCATGACCACGAATGCAACTTATGTTTACGGAGCTAAAGAAGCAAAACCCCTTGAGATGGGAGGTTATGTGAAAGGGGGGATGGTTA
CGTTCATGGTGATGGATGATCTTACTGTTAAACCAATCTCCTCCTCCATGTCCACCATTTCGGTGCTTCATCAGTTACATGTGGAGGACGTTGGCCAGATTGAAGAGAAG
CTTATATATTTGGACATCAATGAGGGCGTGAAGTTGCTAAGGGCTTCTCTGTGCACATCAACTGTACTCACCGATGTGTTTCTTCACAAGATTGACTTCCCAACAACACC
TGTATTGGCGCAAAATCAAACCGCTGTGAGATTGAAGCTTCTGATAGACACGAAAGGGAACAGAGTTCTATTCGGCGAGGCCGACAAGAACCTGATAGATTTCCTTTTCA
ATCTACTTTCCCTCCCACTGGGGACTGTCATCAGGCTACTTACAAAGCAGACCATGGTGGGTTGCTTGGGGAATCTGTACGACAGTGTTGAAACCTTGAACGACACTTAT
TTGCAGTCAAAGCAGATAAAAAACACACTCTTGAATCCCAAGGTCTCGTCCTGTGGCTCAACCATGCTTTTGCCTAGCGTTGAAGCCTCGACCGCCGCAACTACATTTTA
TGGATGCAGTTACGCTGGTTATGGCAACTGTCGTGTTTACGTTTCTGATGGCCCCAATGCGACTTGTCCACAGTGCAAGCAGAAAATGGCTCAAGTGAATACTTATGTGC
AGCCACCAAGTGGAAGCATTCAGGGAGTGGGAGATCAGGGAGGGTATGTGAAGGATGTGGTGACTTATATGGTGATGGATGACTTGACTGTCAAGCCAATGTCCACCATC
TCTAGTATTACTCTTTTGAACAAGTTTAACATCAAGGAAGTGGGGGCTTTGGAGGAGAAAATCATCACTATGGATTTTAATCAGGGTGTGAAATTGTTGAAGGCATCTCT
GCAGTCAAAGACTGTTCTCACTGATGTTTTCTTGAAGAAGGATAACGTGAGATTGAAACTTCTGATAGACCCAAAAGGACAGAGAATTCTTTTCGCTGAAGCAGACAAGA
ACGTGGTGGACTTCCTTTTCCATCTACTTTCCCTTCAACTTGGAATTGTGTGTAGGCTGCTCAAAAACCAAATTGGGTTGGTGGGTTGTCTGAGAAATCTGTACGAAAGC
GTGGAAACCATGAACGAGGCGTATTTGCTGCCAAACCACAACAGAGACACCCTTTTGAAACCCAAAGTCGTCTCCTTTTTCAATTCCTCGTTGCTTTTGCCTTATATTGT
TGATGATTCCCCTCCTTCGGCCAGTATTTAG
mRNA sequence
ATGACCACCCAACCTCAAACTGAAGCCAAAGCTCAAACCCAAAGCGAAACCGAAAGCAAGGTGAGTTTGAAGCTTGTAATAGACCAAAAAGAGAAGAGAATTCTATACGC
TGAAGCCGATAAGAAATTCATAGACTTCCTCTTCACCATACTCTCCCTCCCTCTGGGAGCTGTCGTTAAGTTGCTTTCCACCGGCGTGCCACTGGAAACTTGGTCCATTG
TAAATGTCTACCGCACCCACCAAACCTTAAACCTCAACTATTTTGCATCAACCCGCAACAAAGACATTCTCCTCAACCCCAACCTTCCATCCGCCACTCCGTCTGATGAA
CTCCAAAGCCTCTTGCAAATCCAATCTTTTCATCCACCAACCACCTACTATACATGTCATGTTAGTACTTATAACGTCTGTCGTTATAGTTTTAGTGGTACATATGGTGC
AGTATGTAGCAGATGCGGCCAATCCATGACCACGAATGCAACTTATGTTTACGGAGCTAAAGAAGCAAAACCCCTTGAGATGGGAGGTTATGTGAAAGGGGGGATGGTTA
CGTTCATGGTGATGGATGATCTTACTGTTAAACCAATCTCCTCCTCCATGTCCACCATTTCGGTGCTTCATCAGTTACATGTGGAGGACGTTGGCCAGATTGAAGAGAAG
CTTATATATTTGGACATCAATGAGGGCGTGAAGTTGCTAAGGGCTTCTCTGTGCACATCAACTGTACTCACCGATGTGTTTCTTCACAAGATTGACTTCCCAACAACACC
TGTATTGGCGCAAAATCAAACCGCTGTGAGATTGAAGCTTCTGATAGACACGAAAGGGAACAGAGTTCTATTCGGCGAGGCCGACAAGAACCTGATAGATTTCCTTTTCA
ATCTACTTTCCCTCCCACTGGGGACTGTCATCAGGCTACTTACAAAGCAGACCATGGTGGGTTGCTTGGGGAATCTGTACGACAGTGTTGAAACCTTGAACGACACTTAT
TTGCAGTCAAAGCAGATAAAAAACACACTCTTGAATCCCAAGGTCTCGTCCTGTGGCTCAACCATGCTTTTGCCTAGCGTTGAAGCCTCGACCGCCGCAACTACATTTTA
TGGATGCAGTTACGCTGGTTATGGCAACTGTCGTGTTTACGTTTCTGATGGCCCCAATGCGACTTGTCCACAGTGCAAGCAGAAAATGGCTCAAGTGAATACTTATGTGC
AGCCACCAAGTGGAAGCATTCAGGGAGTGGGAGATCAGGGAGGGTATGTGAAGGATGTGGTGACTTATATGGTGATGGATGACTTGACTGTCAAGCCAATGTCCACCATC
TCTAGTATTACTCTTTTGAACAAGTTTAACATCAAGGAAGTGGGGGCTTTGGAGGAGAAAATCATCACTATGGATTTTAATCAGGGTGTGAAATTGTTGAAGGCATCTCT
GCAGTCAAAGACTGTTCTCACTGATGTTTTCTTGAAGAAGGATAACGTGAGATTGAAACTTCTGATAGACCCAAAAGGACAGAGAATTCTTTTCGCTGAAGCAGACAAGA
ACGTGGTGGACTTCCTTTTCCATCTACTTTCCCTTCAACTTGGAATTGTGTGTAGGCTGCTCAAAAACCAAATTGGGTTGGTGGGTTGTCTGAGAAATCTGTACGAAAGC
GTGGAAACCATGAACGAGGCGTATTTGCTGCCAAACCACAACAGAGACACCCTTTTGAAACCCAAAGTCGTCTCCTTTTTCAATTCCTCGTTGCTTTTGCCTTATATTGT
TGATGATTCCCCTCCTTCGGCCAGTATTTAG
Protein sequence
MTTQPQTEAKAQTQSETESKVSLKLVIDQKEKRILYAEADKKFIDFLFTILSLPLGAVVKLLSTGVPLETWSIVNVYRTHQTLNLNYFASTRNKDILLNPNLPSATPSDE
LQSLLQIQSFHPPTTYYTCHVSTYNVCRYSFSGTYGAVCSRCGQSMTTNATYVYGAKEAKPLEMGGYVKGGMVTFMVMDDLTVKPISSSMSTISVLHQLHVEDVGQIEEK
LIYLDINEGVKLLRASLCTSTVLTDVFLHKIDFPTTPVLAQNQTAVRLKLLIDTKGNRVLFGEADKNLIDFLFNLLSLPLGTVIRLLTKQTMVGCLGNLYDSVETLNDTY
LQSKQIKNTLLNPKVSSCGSTMLLPSVEASTAATTFYGCSYAGYGNCRVYVSDGPNATCPQCKQKMAQVNTYVQPPSGSIQGVGDQGGYVKDVVTYMVMDDLTVKPMSTI
SSITLLNKFNIKEVGALEEKIITMDFNQGVKLLKASLQSKTVLTDVFLKKDNVRLKLLIDPKGQRILFAEADKNVVDFLFHLLSLQLGIVCRLLKNQIGLVGCLRNLYES
VETMNEAYLLPNHNRDTLLKPKVVSFFNSSLLLPYIVDDSPPSASI
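
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code. The `translate` helper and the constant names are hypothetical and are not part of CuGenDB. The two inputs in the usage lines are the first 27 and the last 12 nucleotides of the CDS listed above.

```python
import itertools

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Start and end of the CDS listed above.
print(translate("ATGACCACCCAACCTCAAACTGAAGCC"))
print(translate("GCCAGTATTTAG"))
```

The first call yields the leading residues `MTTQPQTEA` of the protein sequence. The second yields `ASI*`, that is, the final residues followed by the stop codon, which the protein listing omits.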