; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G01760 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G01760
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPlant protein of unknown function (DUF247)
Genome locationClcChr06:1633780..1646864
RNA-Seq ExpressionClc06G01760
SyntenyClc06G01760
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039868.1 UPF0481 protein [Cucumis melo var. makuwa]7.1e-21952.24Show/hide
Query:  EVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILEHLLGS
        EV E LSKL+ESIGMEE LS Q   KPSIYKIP+F+REV+PKAYEP+LVSLGPYH+GK  LLPMEREK K FL FK++NNL  ESIV+ V SILE LLGS
Subjt:  EVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILEHLLGS

Query:  Y-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN----------------------------
        Y D L DIWK   A FL++MIVD CF+LD I+E   +SL   MIWDIKRDM+LLENQLP+ LLKQLHA+ T                             
Subjt:  Y-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN----------------------------

Query:  --------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINI
                            +Q +  E M  ILE    + D   CQSI  ATQLR+AGIKFEKSST++LMDVSFD ++R+LRLPSLAIDDILTEPTLINI
Subjt:  --------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINI

Query:  MAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKL--PQTQ
        MAFEKLH+GV  Q  SFV+LMKNLI +DKDV LLASKGIL SN IHD +GVVQLFN LAKGQ K LE+ M +LFK+LN   +N+W+IW  NLKL  PQT+
Subjt:  MAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKL--PQTQ

Query:  NPWPVI------YG---------EQNESR----------NLGSSLRRNTKMTQPQ-----LPISISRN-------DSAQLVIEVQENLNKL------GKS
        NPW +I      +G           N  R          NLGS + +   M + +     LPI    +       D   +VI V++   +L        +
Subjt:  NPWPVI------YG---------EQNESR----------NLGSSLRRNTKMTQPQ-----LPISISRN-------DSAQLVIEVQENLNKL------GKS

Query:  VLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRP-
           + E T   SIY+IP FM+++ PKA+EP LVSFGPYHHG    A ME EK K F+ L     N    ESI   V++ L++L  AYD L   +W K   
Subjt:  VLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRP-

Query:  --AKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSMCFASGEAVSMGGNLHILDMYRMSL
          AKF+E++I+D CF+L F    K   SL ++  DIKRD+LLLENQLP  LL  LY ILP  D    L   ICK + F   + +++ G  HIL+MY+M L
Subjt:  --AKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSMCFASGEAVSMGGNLHILDMYRMSL

Query:  LG---TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEA
        L     T+L   D S ++       KK   E     Q+I  AT L DAGI F++S T+SL D+ F  K GVL++P L VDDD+E  LLNVMAFEKLH   
Subjt:  LG---TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEA

Query:  GSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAIL
         S VTSFVVLMNNLID+D+DV LLS   I+ NALG+D+ AA+LF  LGKG A+DLES+I +VH  VN HC    N WCA+L+HNYF NPWAIISL+ AI 
Subjt:  GSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAIL

Query:  GFAILIVQAVYQIVDYY
        GF ILIVQAVYQI+DY+
Subjt:  GFAILIVQAVYQIVDYY

KAG6783030.1 hypothetical protein POTOM_012460 [Populus tomentosa]6.6e-12434.29Show/hide
Query:  VALGIEV-KERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILE
        V L I+V KE  +KL  S  M+ +   K    SIYK+P  +  ++  AY PQ VS GPYH+   +L  ME  K +    F  ++    E  ++ ++ +++
Subjt:  VALGIEV-KERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILE

Query:  HLLGSYDKLQDIWKLYPAKFLEVMIVDACFLLDFITECPQSLSSMIWD---------------IKRDMLLLENQLPMNLLKQLHAIRTNEQKD--VIEKM
         L  SYD+L   WK   +KFL++MI+D CF+++ +    Q  +    D               IKRDML++ENQLPM +L +L A+ +N  KD   + K+
Subjt:  HLLGSYDKLQDIWKLYPAKFLEVMIVDACFLLDFITECPQSLSSMIWD---------------IKRDMLLLENQLPMNLLKQLHAIRTNEQKD--VIEKM

Query:  --------------------------KTILELNP------------ADMDANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDIL
                                  K++++ +P               D N   +  A ++  AGI+F+KS T SL D+SF     +L LP + +DD+ 
Subjt:  --------------------------KTILELNP------------ADMDANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDIL

Query:  TEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLK
        TE  L+N+MAFE+ H+G G++  S+V  M NLI  ++DV LL S+GI+ NAI     V +LFN L+K      +S +  + + +N  C   W  W ANL 
Subjt:  TEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLK

Query:  LPQTQNPW---------------------------PVIYGEQNE------SRNLGSSL--------------RRNTKMTQPQLPISI------SRNDSAQ
            +NPW                           P++     E        N+GS+                RN    +     S+      S N ++ 
Subjt:  LPQTQNPW---------------------------PVIYGEQNE------SRNLGSSL--------------RRNTKMTQPQLPISI------SRNDSAQ

Query:  LVIE-VQENLNKLGKSVLATDENTFN----------RSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFR-RLKTNPEMLESIVQTV
        L +E ++E    L K V    +N  N          RSIYRIP  +   +  A+ PQ VSFGPYHH       ME  KQ+A    L+     L+S+V+++
Subjt:  LVIE-VQENLNKLGKSVLATDENTFN----------RSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFR-RLKTNPEMLESIVQTV

Query:  TDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWD---------------IKRDMLLLENQLPMLLLDDLYSILPN--MDG
         D +Q L  +YD L E  W K   KFL+++I+DGCFML  ++   P+    S D               I+RDMLLLENQLPML+L  L ++  +   D 
Subjt:  TDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWD---------------IKRDMLLLENQLPMLLLDDLYSILPN--MDG

Query:  RLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLV
             + ++       A S+G  LH+LD+YR SLL    ++ K    E K    G +  +IR AT++ +AGI F+KS +KS+ ++SF   +GVL++P + 
Subjt:  RLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLV

Query:  VDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWC
        VDD +EA+ LN+MAFE+ H  AG+EVTS+V  M+++ID + DVALL+S+ I+ NA+G D++ A+LF SL K  A++  S++  V   VN +C++PWN W 
Subjt:  VDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWC

Query:  ASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN
        A+L H YF NPWAI+S++AA++ FA+ I Q +Y I+ YY  N
Subjt:  ASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN

KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-16845.91Show/hide
Query:  KQSAIGSKLVDRNDIIIKDDMVALGIEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHP----KAYEPQLVSLGPYHHGKHRLLPMEREKQKG
        KQ  +  KL+  +++   DDMV   I+V E L++L    G++  +     K SIYK+P FMR+ H     KA++PQ+VS GPYHHGK  L PMER+K K 
Subjt:  KQSAIGSKLVDRNDIIIKDDMVALGIEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHP----KAYEPQLVSLGPYHHGKHRLLPMEREKQKG

Query:  FLRFKSKNNLDFESIVERVSSILEHLLGSYDKLQDIWKLYPAKFLEVMIVDACFLLDFITECPQSLSSMI-W-DIKRDMLLLENQLPMNLLK------QL
        F   K++  L  E+IV  VS+IL+ LL SYD+L++ W   P KFL++MIVD CF+L    +CP SL  +  W  I +  LL E    +++ K       +
Subjt:  FLRFKSKNNLDFESIVERVSSILEHLLGSYDKLQDIWKLYPAKFLEVMIVDACFLLDFITECPQSLSSMI-W-DIKRDMLLLENQLPMNLLK------QL

Query:  HAIRTNEQKDVIEKMKTILELNPADMDANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINIMAFEKLHI-GVGSQA
           +  E+     K K   EL+ A+     Q IPPAT+L   GIKF+ S T SL DV FD    +L LP L +DD  TE T +N+MAFEKLH+ G   +A
Subjt:  HAIRTNEQKDVIEKMKTILELNPADMDANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINIMAFEKLHI-GVGSQA

Query:  ----ISFVMLMKNLIRLDKDVWLLASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKLPQTQNPWPVIYGEQNES
             SFV+LM NLI  ++DV LL+SKG L+NA+ +     +LF+RL KG        M  + K+LN  C   W    A LK    QNPW     +  +S
Subjt:  ----ISFVMLMKNLIRLDKDVWLLASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKLPQTQNPWPVIYGEQNES

Query:  RNLGSSLRRNTKMTQPQLPISISRNDSAQLVIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAF
          +                +++    S++L+                        SIY+IP FM + HPKA+EPQ+VS GPY+HGK   + MELEK K F
Subjt:  RNLGSSLRRNTKMTQPQLPISISRNDSAQLVIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAF

Query:  RRLKTNPEM-LESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMD
           KT   + +ESIV+ V+  L  L+ +YDKL E +W + PAKFL+++IVDGCFML FL  CP SL ++S DIK+DMLLLENQLPMLLL  LYSI    D
Subjt:  RRLKTNPEM-LESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMD

Query:  GRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQL
          L   +   +     E   M   LHIL+MY+ SLL   I +  D S+E     S PE QVI  AT+LR+AGI F++S T SLTDV FD+K GVL +P+L
Subjt:  GRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQL

Query:  VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEW
        +VDD++E++LLNVMAFEKLH EAG +VTSFV+LM+NLID + DVA+L+ + +LANA+G+D+ AA LF  LG GAAM L++H+  VH +VN HC +PWNE 
Subjt:  VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEW

Query:  CASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN
        CA+LKH YF +PW IISL AAI GF ILI+QA+YQ +DYY  N
Subjt:  CASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN

XP_016902479.1 PREDICTED: uncharacterized protein LOC103499077 [Cucumis melo]1.9e-21651.17Show/hide
Query:  KDDMVALGIEVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVS
        +++M  +  EV E LSKL+ESIGMEE LS Q   KPSIYKIP+F+REV+PKAYEP+LVSLGPYH+GK  LLPMEREK K FL FK++NNL  ESIV+ V 
Subjt:  KDDMVALGIEVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVS

Query:  SILEHLLGSY-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN-------------------
        SILE LLGSY D L DIWK   A FL++MIVD CF+LD I+E   +SL   MIWDIKRDM+LLENQLP+ LLKQLHA+ T                    
Subjt:  SILEHLLGSY-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN-------------------

Query:  -----------------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDI
                                     +Q +  E M  ILE    + D   CQSI  ATQLR+AGIKFEKSST++LMDVSFD ++R+LRLPSLAIDDI
Subjt:  -----------------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDI

Query:  LTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWAN
        LTEPTLINIMAFEKLH+GV  Q  SFV+LMKNLI +DK+V LLASKGIL SN IHD +GVVQLFN LAKGQ K LE+ M +LFK+LN   +N+W+IW  N
Subjt:  LTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWAN

Query:  LKL--PQTQNPW--------------PVIYGEQNESR--------------------------NLGSSLRRNTKMTQPQ-----LPISISRN-------D
        LKL  PQT+NPW              P++    +  R                          NLGS + +   M + +     LPIS   +       D
Subjt:  LKL--PQTQNPW--------------PVIYGEQNESR--------------------------NLGSSLRRNTKMTQPQ-----LPISISRN-------D

Query:  SAQLVIEVQENLNKL------GKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTV
           +VI V++ L +L        +   + E T   SIY+IP FM+++  KA+EP LVSFGPYHHG    A ME EKQK F+ L     N    ESI   V
Subjt:  SAQLVIEVQENLNKL------GKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTV

Query:  TDNLQNLLGAYDKLIEGEWAK---RPAKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSM
        ++ L++L  AYD L E +W K     AKF+E++I+D CF+L F    K   SL ++  DIKRD+LLLENQLP  LL  LY ILP  D    L   ICK +
Subjt:  TDNLQNLLGAYDKLIEGEWAK---RPAKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSM

Query:  CFASGEAVSMGGNLHILDMYRMSLLG--TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQL
         F   + +++ G  HIL+MYRM LL     +L  +D S ++       KK   E     Q+I  AT L DAGI F++S T+SL D+ F  K GVL++P L
Subjt:  CFASGEAVSMGGNLHILDMYRMSLLG--TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQL

Query:  VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEW
         VDDD+E  LLNVMAFEKLH    S VTSFVVLMNNLID+D+DV LLS   I+ NALG+D+ AA+LF  LGKG A+DLES+I +VH  VN HC    N W
Subjt:  VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEW

Query:  CASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY
        CA+LKHNYF NPWAIISL+ AI GF ILIVQAVYQI+DY+
Subjt:  CASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida]2.1e-13862.84Show/hide
Query:  QPQLPISISRNDSA-----QLVIEVQENLNKLGKSVLATDE-----NTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLK
        QP LPI   RND+       +VI+V+ NLNKL  S  A DE      +   SIY+IP FMR++  KAFEPQLVS GPYHHGK    SME EKQKAFRR  
Subjt:  QPQLPISISRNDSA-----QLVIEVQENLNKLGKSVLATDE-----NTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLK

Query:  TNPEM-LESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNM---DG
         +  + LESIV+++++NL+NL GAYDKL E +W +  AKFLEV+IVDGCFML F K+CP SLS+M WDIKRDMLLLENQLPM LL +LY  +PN    D 
Subjt:  TNPEM-LESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNM---DG

Query:  RLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKT--KKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQ
         L   IC+ MC    E   MG  LHILDMYR SLL   + + KD S + K   ++S PE Q+I  ATQL DAGI F+ S TK+LTDVSF+ K+GVL++P 
Subjt:  RLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKT--KKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQ

Query:  LVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNE
        +VVDDD+E +LLNVMAFEKL+ EAGS+VTSFV+LMNNLIDVD DVALL+S  ILANALG+D+SAA+LF  LGKGAAMDL+SHIT+VH++VN HC   WN+
Subjt:  LVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNE

Query:  WCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN
        WCASLKH+YF NPWAIISL AA+ GFAILIVQA+YQIVDY+RGN
Subjt:  WCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN

TrEMBL top hitse value%identityAlignment
A0A0A0KAA3 Uncharacterized protein2.6e-14242.59Show/hide
Query:  DIIIKDDMVALGIEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVE
        +I+ +++M  +  EV   LSKL+ESIGMEE    ++ KPSIYKIP+F+REV+PKAYEP+LVSLGPYH+GK  LLPMEREK K FL FK++NNL  ESIV+
Subjt:  DIIIKDDMVALGIEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVE

Query:  RVSSILEHLLGSY-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN----------------
         V SILE LLGSY D L D WK   A FL++MIVD CF+LD I+E   +SL   MIWDIKRDM+LLENQLP+ LLKQLHA+ T                 
Subjt:  RVSSILEHLLGSY-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN----------------

Query:  -------------------------------EQKDVIEKMKTILE--LNPADMDANCQS-IPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLA
                                       +Q +  E   TILE  L   +    CQS I  ATQLR+AGIKFEKSST++LMDVSFD ++R+LRLPSLA
Subjt:  -------------------------------EQKDVIEKMKTILE--LNPADMDANCQS-IPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLA

Query:  IDDILTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTI
        IDDILTEPTLINIMAFEKLH+GV  Q  SFV+LMKNLI ++KDV LLASKGIL SN IHD +GVVQLFN LAKGQ K LES +                I
Subjt:  IDDILTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTI

Query:  WWANLKL--PQTQNPWPVIYGEQNESRNLGSSLRRNTKMTQPQLPISISRNDSAQLVIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEP
        W  NLKL  PQT+N W +I           S        T P     +      Q++      +N  G    + D+NT                      
Subjt:  WWANLKL--PQTQNPWPVIYGEQNESRNLGSSLRRNTKMTQPQLPISISRNDSAQLVIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEP

Query:  QLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKR
                                                 T+T  + NLL                 F+E                           K 
Subjt:  QLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKR

Query:  DMLLLENQLPMLLLDDLYSILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILE---NKDGSMERKTKKSGPEYQVIRHATQLRDAG
        ++ ++E +                                          HIL MYR SLL    L      +     K  + G + Q+I  AT LR+ G
Subjt:  DMLLLENQLPMLLLDDLYSILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILE---NKDGSMERKTKKSGPEYQVIRHATQLRDAG

Query:  IDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSK--NILANALGDDQSAAELFGSL
        I FQKS  KSL +VSF+  KGVL +P L+VDD+++ +LLNVMAFEKLH+E GS+VTSFVVLM+NLI +D+DV LLS+   NI+ANA  +D+ A  LF +L
Subjt:  IDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSK--NILANALGDDQSAAELFGSL

Query:  GKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLV-AAILGFAILIVQAVYQIVDYY
        GKG  M+  +++T+++  VN +C + WN+WCA LK +YF NPW+I+SL  A + GFAILIVQAVYQIVD++
Subjt:  GKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLV-AAILGFAILIVQAVYQIVDYY

A0A1S4E2L7 uncharacterized protein LOC1034990779.4e-21751.17Show/hide
Query:  KDDMVALGIEVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVS
        +++M  +  EV E LSKL+ESIGMEE LS Q   KPSIYKIP+F+REV+PKAYEP+LVSLGPYH+GK  LLPMEREK K FL FK++NNL  ESIV+ V 
Subjt:  KDDMVALGIEVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVS

Query:  SILEHLLGSY-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN-------------------
        SILE LLGSY D L DIWK   A FL++MIVD CF+LD I+E   +SL   MIWDIKRDM+LLENQLP+ LLKQLHA+ T                    
Subjt:  SILEHLLGSY-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN-------------------

Query:  -----------------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDI
                                     +Q +  E M  ILE    + D   CQSI  ATQLR+AGIKFEKSST++LMDVSFD ++R+LRLPSLAIDDI
Subjt:  -----------------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDI

Query:  LTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWAN
        LTEPTLINIMAFEKLH+GV  Q  SFV+LMKNLI +DK+V LLASKGIL SN IHD +GVVQLFN LAKGQ K LE+ M +LFK+LN   +N+W+IW  N
Subjt:  LTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWAN

Query:  LKL--PQTQNPW--------------PVIYGEQNESR--------------------------NLGSSLRRNTKMTQPQ-----LPISISRN-------D
        LKL  PQT+NPW              P++    +  R                          NLGS + +   M + +     LPIS   +       D
Subjt:  LKL--PQTQNPW--------------PVIYGEQNESR--------------------------NLGSSLRRNTKMTQPQ-----LPISISRN-------D

Query:  SAQLVIEVQENLNKL------GKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTV
           +VI V++ L +L        +   + E T   SIY+IP FM+++  KA+EP LVSFGPYHHG    A ME EKQK F+ L     N    ESI   V
Subjt:  SAQLVIEVQENLNKL------GKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTV

Query:  TDNLQNLLGAYDKLIEGEWAK---RPAKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSM
        ++ L++L  AYD L E +W K     AKF+E++I+D CF+L F    K   SL ++  DIKRD+LLLENQLP  LL  LY ILP  D    L   ICK +
Subjt:  TDNLQNLLGAYDKLIEGEWAK---RPAKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSM

Query:  CFASGEAVSMGGNLHILDMYRMSLLG--TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQL
         F   + +++ G  HIL+MYRM LL     +L  +D S ++       KK   E     Q+I  AT L DAGI F++S T+SL D+ F  K GVL++P L
Subjt:  CFASGEAVSMGGNLHILDMYRMSLLG--TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQL

Query:  VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEW
         VDDD+E  LLNVMAFEKLH    S VTSFVVLMNNLID+D+DV LLS   I+ NALG+D+ AA+LF  LGKG A+DLES+I +VH  VN HC    N W
Subjt:  VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEW

Query:  CASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY
        CA+LKHNYF NPWAIISL+ AI GF ILIVQAVYQI+DY+
Subjt:  CASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY

A0A5A7TA99 UPF0481 protein3.5e-21952.24Show/hide
Query:  EVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILEHLLGS
        EV E LSKL+ESIGMEE LS Q   KPSIYKIP+F+REV+PKAYEP+LVSLGPYH+GK  LLPMEREK K FL FK++NNL  ESIV+ V SILE LLGS
Subjt:  EVKERLSKLQESIGMEE-LSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILEHLLGS

Query:  Y-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN----------------------------
        Y D L DIWK   A FL++MIVD CF+LD I+E   +SL   MIWDIKRDM+LLENQLP+ LLKQLHA+ T                             
Subjt:  Y-DKLQDIWKLYPAKFLEVMIVDACFLLDFITE-CPQSLS-SMIWDIKRDMLLLENQLPMNLLKQLHAIRTN----------------------------

Query:  --------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINI
                            +Q +  E M  ILE    + D   CQSI  ATQLR+AGIKFEKSST++LMDVSFD ++R+LRLPSLAIDDILTEPTLINI
Subjt:  --------------------EQKDVIEKMKTILELNPADMD-ANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINI

Query:  MAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKL--PQTQ
        MAFEKLH+GV  Q  SFV+LMKNLI +DKDV LLASKGIL SN IHD +GVVQLFN LAKGQ K LE+ M +LFK+LN   +N+W+IW  NLKL  PQT+
Subjt:  MAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGIL-SNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKL--PQTQ

Query:  NPWPVI------YG---------EQNESR----------NLGSSLRRNTKMTQPQ-----LPISISRN-------DSAQLVIEVQENLNKL------GKS
        NPW +I      +G           N  R          NLGS + +   M + +     LPI    +       D   +VI V++   +L        +
Subjt:  NPWPVI------YG---------EQNESR----------NLGSSLRRNTKMTQPQ-----LPISISRN-------DSAQLVIEVQENLNKL------GKS

Query:  VLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRP-
           + E T   SIY+IP FM+++ PKA+EP LVSFGPYHHG    A ME EK K F+ L     N    ESI   V++ L++L  AYD L   +W K   
Subjt:  VLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRP-

Query:  --AKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSMCFASGEAVSMGGNLHILDMYRMSL
          AKF+E++I+D CF+L F    K   SL ++  DIKRD+LLLENQLP  LL  LY ILP  D    L   ICK + F   + +++ G  HIL+MY+M L
Subjt:  --AKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGR--LAWFICKSMCFASGEAVSMGGNLHILDMYRMSL

Query:  LG---TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEA
        L     T+L   D S ++       KK   E     Q+I  AT L DAGI F++S T+SL D+ F  K GVL++P L VDDD+E  LLNVMAFEKLH   
Subjt:  LG---TTILENKDGSMERK-----TKKSGPE----YQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEA

Query:  GSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAIL
         S VTSFVVLMNNLID+D+DV LLS   I+ NALG+D+ AA+LF  LGKG A+DLES+I +VH  VN HC    N WCA+L+HNYF NPWAIISL+ AI 
Subjt:  GSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAIL

Query:  GFAILIVQAVYQIVDYY
        GF ILIVQAVYQI+DY+
Subjt:  GFAILIVQAVYQIVDYY

A0A6A4L8Y3 Uncharacterized protein (Fragment)1.4e-11132.79Show/hide
Query:  IEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILEHLLGS
        + VKE L  +  S  +         K SIY+IPE + + + KAY+P  VS GPYHH  H L PME  K +  L F  ++    ES +  +  + + L  S
Subjt:  IEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVSSILEHLLGS

Query:  YDKLQDIWKLYPAKFLEVMIVDACFLLD------------------FITECPQSLSSMIWDIKRDMLLLENQLPMNLLKQLHAIRTNEQKDVIEKMKTIL
        Y+ L   W     +FL++MIVD CF+++                  F  E        + D+ R +LL E+    +  K       N QKD  +K KT  
Subjt:  YDKLQDIWKLYPAKFLEVMIVDACFLLD------------------FITECPQSLSSMIWDIKRDMLLLENQLPMNLLKQLHAIRTNEQKDVIEKMKTIL

Query:  ELNPADMDANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLL
          +    +   + I  A +L  AGI F +S T SL D+SF  E   LRLP + +DD  +E   +N++AFE+ H+GVG+   S+V  M ++I   KDV LL
Subjt:  ELNPADMDANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLL

Query:  ASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKLPQTQNPW------PVIYGEQNESRNLGSSLRRNTKMTQPQL
           GI+ NA+     V +LFN ++K       S +  + + ++  C  +   W A L     +NPW        +  E+ +    G S      M    +
Subjt:  ASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKLPQTQNPW------PVIYGEQNESRNLGSSLRRNTKMTQPQL

Query:  PISISRNDSAQL------------VIEVQENLNKLGKSVLATDENTFNRSIYRIP-TFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR-LK
           ++  D   +            V+ +++ L ++        ++   RSIYR+P + + +++ KA++PQ VSFGPYHHG+     ME  K +A    LK
Subjt:  PISISRNDSAQL------------VIEVQENLNKLGKSVLATDENTFNRSIYRIP-TFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR-LK

Query:  TNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWD-----------------------IKRDMLLLEN
         +   LES    + +  Q+L  +Y+ L   +W +   +FL+++I+DGCFML  L     +    +                         IKRDML+LEN
Subjt:  TNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWD-----------------------IKRDMLLLEN

Query:  QLPMLLLDDLYSILPNMDGR-----LAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQK
        QLPMLLL  L       D       +  F C +  FA     SMG  LH+LD++R  LL       K     R   ++G   ++IR A ++ +AGI F+K
Subjt:  QLPMLLLDDLYSILPNMDGR-----LAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQK

Query:  SGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMD
        S ++SL D+SF+   G L++P +VVDD +E+  LN++AFE+ H   G+EVTS++  M+++ID  +DV+LL  K I+ NALG D++ AE F ++ K A +D
Subjt:  SGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMD

Query:  LESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY
          S +  VH  V  +C++  N+W A+ KH YF NPWAIIS++AA + FA+ I+Q +Y ++ YY
Subjt:  LESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY

A0A6N2MH00 Uncharacterized protein6.7e-11432.88Show/hide
Query:  IKDDMVALGIEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVS
        ++DD   L  E+ E L    ++     L      K SIYKIP  +   +  AY PQ VS GPYHHG++ L PME  K +    + ++       +VE ++
Subjt:  IKDDMVALGIEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNLDFESIVERVS

Query:  SILEHLLGSYDKLQDIWKLYPAKFLEVMIVDACFLLDFITECPQSLSSMIWD---------------IKRDMLLLENQLPMNLLKQLHAIRTN--EQKDV
          ++ L  SYD L + W+    KFL++MI+D CF+L+ I      L     +               I+RDMLLLENQLPM +L +L A+ ++  + ++ 
Subjt:  SILEHLLGSYDKLQDIWKLYPAKFLEVMIVDACFLLDFITECPQSLSSMIWD---------------IKRDMLLLENQLPMNLLKQLHAIRTN--EQKDV

Query:  IEKM-------------------------KTILELNPADM------------DANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAID
        + K+                         K++L+ +PA              D N   +P AT+L  AGI+F+K +T SL D+SF     +L LP + +D
Subjt:  IEKM-------------------------KTILELNPADM------------DANCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAID

Query:  DILTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWA
        D   E T +N++AFE+LH G GS+   +V  M ++I  ++D+ LL S+GI+ NAI     V  LFN L++       S +  +   +N  C      W A
Subjt:  DILTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGILSNAIHDRDGVVQLFNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWA

Query:  NLKLPQTQNPWPVI----------------------YGEQNESRNLGSSL-------------------------------------RRNTKMTQPQLPI
        NL     +NP  +                       Y  ++    L S +                                     R+ +    P+   
Subjt:  NLKLPQTQNPWPVI----------------------YGEQNESRNLGSSL-------------------------------------RRNTKMTQPQLPI

Query:  SISRNDSAQLVIEVQENLNKLGKSVLATDENTF--NRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFR-RLKTNPEMLESIVQTV
          +  D  QL+++V E L  L  + L   +N     RSIY+IP      +  A+ PQ VSFGPYHH +     ME  KQ+A    LK   + L+++VQ++
Subjt:  SISRNDSAQLVIEVQENLNKLGKSVLATDENTF--NRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFR-RLKTNPEMLESIVQTV

Query:  TDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCP--PSLSSMSWD---------------IKRDMLLLENQLPMLLLDDLYSILPNMDG
           +Q L  +YD +++G W     KFL+++I+DGCFML  ++      SL     D               I+RDMLLLENQLPM++L  L ++    DG
Subjt:  TDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCP--PSLSSMSWD---------------IKRDMLLLENQLPMLLLDDLYSILPNMDG

Query:  ----RLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSG---PEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGV
                 +  + C+ +     M   LH+LD+YR SLL     E+    M R     G    E  ++R AT+L +AGI F+K+ TKSL D+SF  + GV
Subjt:  ----RLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSG---PEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGV

Query:  LKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCR
        L++P +VVDD +EA+ LN++AFE+LH ++G EVTS+V  M  LI  + DVALL S+ I+ NA+G D +  ELF SL K  A++  S++  V  QVN + +
Subjt:  LKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCR

Query:  RPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY
        +PWNEW A+L H YF NPWA ++L+A  + FA+   + VY I+ YY
Subjt:  RPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYY

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.7e-1327.01Show/hide
Query:  EYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANAL
        E   I   + L  AG+ F+ +   +++ V+FDS  G   +P + +D ++E  L N++A+E  +       T +  L+N +ID +EDV LL  + +L + L
Subjt:  EYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANAL

Query:  GDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQ
          DQ AAE++  + K   +     + +    VN +    W      L   Y +  W I++ +AA+L   ++ +Q
Subjt:  GDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQ

Q9SD53 UPF0481 protein At3g472001.4e-3126.49Show/hide
Query:  IYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFR----RLKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGC
        I+R+P     ++PKA++P++VS GPYH+G+     ++  K +  +      K        +V+ V D    +  +Y      E  K     + ++++DGC
Subjt:  IYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFR----RLKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGC

Query:  FMLSF-------LKKCPPSLSSMSW---DIKRDMLLLENQLPMLLLDDLY---SILPNMD-GRLAWFICKSMCFASGEAVSMGGNL---HILDMYRMSLL
        F+L         ++     + S+ W    I+ D+LLLENQ+P  +L  LY    I  + D  R+A+   K+     G       N    H+LD+ R + L
Subjt:  FMLSF-------LKKCPPSLSSMSW---DIKRDMLLLENQLPMLLLDDLY---SILPNMD-GRLAWFICKSMCFASGEAVSMGGNL---HILDMYRMSLL

Query:  GTT--------------ILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEE
          T              + E K G++     K+ P   +I  A +LR  GI F+   +K  + ++   KK  L+IPQL  D    +  LN +AFE+ + +
Subjt:  GTT--------------ILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEE

Query:  AGSEVTSFVVLMNNLIDVDEDVALL-SSKNILANALGDDQSAAELFGSLGKGAAMDLE-SHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVA
        + +E+T+++V M  L++ +EDV  L + K I+ N  G +   +E F ++ K    +++ S++  V   VN + ++ +N   A  +H +F +PW  +S  A
Subjt:  AGSEVTSFVVLMNNLIDVDEDVALL-SSKNILANALGDDQSAAELFGSLGKGAAMDLE-SHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVA

Query:  AILGFAILIVQAVYQIVDY
         +    + ++Q+   I+ Y
Subjt:  AILGFAILIVQAVYQIVDY

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)4.2e-5230.47Show/hide
Query:  IYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR-LKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFML
        IYR+P +++E   K++ PQ VS GPYHHGK +  SM+  K +A  R LK   + ++  +  + +  +     Y    EG  +    +F+E+L++DGCF+L
Subjt:  IYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR-LKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFML

Query:  SFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLLLDDLYSI---LPNMDGRLAWFICK---------SMCFASGEA------------
           +      + + +               I+RDM++LENQLP+ +L+ L  +     N  G +A    +              SG++            
Subjt:  SFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLLLDDLYSI---LPNMDGRLAWFICK---------SMCFASGEA------------

Query:  --VSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKS----GPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLN
           +  G LH LD++R SLL ++       + +R ++ +        Q+I   T+L++AGI F++  T    D+ F  K G L+IP+L++ D +++  LN
Subjt:  --VSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKS----GPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLN

Query:  VMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLE-SHITEVHHQVNLHCRRPWNEWCASLKHNYFHN
        ++AFE+ H ++ +++TS+++ M+NLID  EDV+ L    I+ + LG D   A+LF  L +    D E S+++ +  +VN +    WN W A+LKH YF+N
Subjt:  VMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLE-SHITEVHHQVNLHCRRPWNEWCASLKHNYFHN

Query:  PWAIISLVAAILGFAILIVQAVYQIVDYYR
        PWAI+S  AA++   +   Q+ Y +  YY+
Subjt:  PWAIISLVAAILGFAILIVQAVYQIVDYYR

AT3G50140.1 Plant protein of unknown function (DUF247)1.6e-4627.53Show/hide
Query:  VIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR-LKTNPEMLESIVQTVTDNLQNLLGAYD
        VI +++ + ++ +    T  +     IYR+P  +++    ++ PQ VS GPYHHG      M+  K +A    +K   + +E  +  + +  +     Y 
Subjt:  VIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR-LKTNPEMLESIVQTVTDNLQNLLGAYD

Query:  KLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWD--------------IKRDMLLLENQLPMLLLDDLYSI---LPNMDGRLAWFICK----
           EG       KF ++L++DGCF+L   +      S + +D              I+RDML+LENQLP+ +L+ L  +        G +A    +    
Subjt:  KLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWD--------------IKRDMLLLENQLPMLLLDDLYSI---LPNMDGRLAWFICK----

Query:  -----------------SMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKK----SGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSF
                         +  F +  A      LH LD++R SLL  ++  +   S  R ++K       + Q++   T+LR+AGI F++  +    D+ F
Subjt:  -----------------SMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKK----SGPEYQVIRHATQLRDAGIDFQKSGTKSLTDVSF

Query:  DSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLE-SHITEVHH
          K G L+IP+L++ D +++   N++A+E+ H ++ +++TS+++ M+NLID  ED+  L   +I+ + LG+D   A++F  L +  A DLE ++++E+ +
Subjt:  DSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLE-SHITEVHH

Query:  QVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYR
        +V+ +  R WN   A+LKH YF NPWA  S  AA++   + + Q+ +    Y++
Subjt:  QVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYR

AT3G50150.1 Plant protein of unknown function (DUF247)2.3e-5028.22Show/hide
Query:  RNLGSSLRRNTKMTQPQLPISIS----RNDSAQLVIEVQENLNKLGKSVLATDENTFNR-SIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELE
        R+LG +L  + +       I +     R    + VI +++   K+ K++     N++++  IYR+P +++E   K++ PQ VS GPYHHGK     ME  
Subjt:  RNLGSSLRRNTKMTQPQLPISIS----RNDSAQLVIEVQENLNKLGKSVLATDENTFNR-SIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELE

Query:  KQKAFRRLKTNPE-MLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLE
        K +A   +    +  +E  +  + +  +     Y   I+    K   +F E+L++DGCF+L   K        + +               I+RDM++LE
Subjt:  KQKAFRRLKTNPE-MLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLE

Query:  NQLPMLLLDDLYSI---LPNMDG---RLAWFICKSMCFAS---------------GEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQ
        NQLP+ +LD L  +    PN  G    +A    K++   S                + +   G LH LD++  SL+ ++   N+ G+          + Q
Subjt:  NQLPMLLLDDLYSI---LPNMDG---RLAWFICKSMCFAS---------------GEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQ

Query:  VIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDD
        +I   T+LR AG++F +  T  L D+ F  K G LKIP+L++ D +++   N++AFE+ H ++ + +TS+++ M+NLI+  +DV+ L    I+ + LG D
Subjt:  VIRHATQLRDAGIDFQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDD

Query:  QSAAELFGSLGKGAAMD-LESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYR
           A+LF  L K    D  + +++++  +VN +  R WN   A+L+  YF+NPWA  S  AA++   +   Q+ + +  YY+
Subjt:  QSAAELFGSLGKGAAMD-LESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYR

AT3G50160.1 Plant protein of unknown function (DUF247)1.1e-5229.04Show/hide
Query:  LRRNTKMTQPQLPISISRNDSAQL----VIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR
        +    + TQ +  +SI   +  +L    VI + + +  LG +   + +N     IYR+P +++E   K++ PQ+VS GPYHHG      ME  K +A   
Subjt:  LRRNTKMTQPQLPISISRNDSAQL----VIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR

Query:  LKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLLL
        +       +  ++   D ++ L        +G       +F+E+L++DG F++   K        + +               I+RDM++LENQLP  +L
Subjt:  LKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLLL

Query:  DDLY-----SILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLT
          L       +L  ++ +L     + +   + E ++  G LH LD+ R  LL ++   ++D SM  K  +     Q+I   T+LR+AG++F +  T    
Subjt:  DDLY-----SILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQKSGTKSLT

Query:  DVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMD-LESHIT
        D+ F  K G LKIP+L++ D +++  LN++AFE+ H ++  ++TS+++ M+NLI+  EDV+ L    I+ N LG D   ++LF  LGK    D  + +++
Subjt:  DVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMD-LESHIT

Query:  EVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYR
         +  +VN++ RR WN   A+L+H YF+NPWA  S +AA+        Q+ + +  Y++
Subjt:  EVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYR

AT3G50170.1 Plant protein of unknown function (DUF247)7.7e-5430.2Show/hide
Query:  VIEVQENLNKLGKSVLATDENTF--NRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAY
        VI +++ L +  +     D+ T      IYR+P +++E   K++ PQ VS GPYHHGK +   ME  K +A  ++    + L+  ++  T+ ++ L    
Subjt:  VIEVQENLNKLGKSVLATDENTF--NRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAY

Query:  DKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLLLDDLYSI---LPNMDGRLAWFICK--S
            EG  +    +F E+L++DGCF+L   +      + + +               I+RDM++LENQLP+ +LD L  +     N  G +A    K   
Subjt:  DKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLLLDDLYSI---LPNMDGRLAWFICK--S

Query:  MCFASGEAVSM-------------------GGNLHILDMYRMSLLGTTILENKDGSMERKTKKS----GPEYQVIRHATQLRDAGIDFQKSGTKSLTDVS
            +GEA++                     G LH LD++R SLL ++   N    ++R T+ +      + Q++   T+LR+AG+ F+K  T    D+ 
Subjt:  MCFASGEAVSM-------------------GGNLHILDMYRMSLLGTTILENKDGSMERKTKKS----GPEYQVIRHATQLRDAGIDFQKSGTKSLTDVS

Query:  FDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMD-LESHITEVH
        F  K G L+IP+L++ D +++   N++AFE+ H E+ + +TS+++ M+NLI+  EDV+ L    I+ + LG D   A+LF  L +    D  +SH++ + 
Subjt:  FDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMD-LESHITEVH

Query:  HQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN
          VN +  R WN   A+L H YF+NPWA  S  AA++   + + Q+ Y +  YY+ N
Subjt:  HQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATATGAAGCAGTCAGCAATAGGCAGCAAATTAGTTGATAGAAATGATATTATTATTAAAGATGATATGGTCGCGTTGGGAATTGAGGTGAAGGAACGTCTAAG
TAAACTGCAGGAAAGTATTGGAATGGAGGAGTTATCAATGCAAAAAAACACAAAACCTTCAATCTACAAAATACCTGAATTCATGAGAGAAGTTCATCCCAAAGCTTACG
AGCCACAGTTGGTGTCTCTTGGGCCATACCATCATGGGAAACACCGTTTGCTTCCCATGGAGCGGGAGAAACAAAAAGGGTTTCTCCGTTTTAAATCTAAAAACAATCTG
GACTTTGAATCCATTGTGGAAAGGGTGTCCAGCATTTTGGAACATCTTTTGGGATCCTACGATAAGCTCCAGGATATATGGAAGCTATATCCTGCCAAGTTCTTGGAGGT
GATGATTGTGGATGCCTGTTTCCTGCTGGATTTCATTACTGAATGTCCACAATCCCTAAGCTCTATGATTTGGGATATAAAGCGAGACATGCTGTTGCTTGAGAATCAAC
TACCCATGAACCTTCTTAAACAACTGCATGCCATAAGAACCAATGAACAAAAGGATGTGATTGAGAAGATGAAGACAATATTGGAGTTGAACCCAGCGGACATGGACGCA
AATTGTCAAAGCATCCCACCAGCAACACAACTTCGTAGAGCGGGGATAAAATTCGAGAAAAGCAGTACAAGTAGCCTTATGGACGTATCCTTCGACTGCGAACGGCGCAT
GCTGAGGCTGCCGTCTCTGGCGATCGACGACATCCTCACTGAACCCACCTTAATAAACATCATGGCATTTGAGAAACTGCATATTGGGGTTGGAAGCCAAGCCATTTCTT
TCGTCATGTTGATGAAAAATCTAATAAGATTGGACAAAGATGTGTGGTTGTTGGCCTCCAAAGGAATACTTTCCAATGCAATTCATGACCGTGATGGTGTGGTACAGTTG
TTCAATCGACTGGCCAAAGGACAGAACAAAAGTTTGGAGAGCCGCATGCATGACTTATTCAAGATATTGAATTGGTGTTGTAACAATCAGTGGACAATATGGTGGGCAAA
TCTCAAACTACCCCAAACTCAAAATCCATGGCCTGTCATCTACGGTGAGCAAAATGAAAGTAGAAATTTGGGAAGCAGTTTAAGAAGAAATACGAAGATGACGCAGCCAC
AGCTGCCCATCAGCATTAGTAGAAACGATAGTGCTCAGTTGGTGATTGAGGTGCAGGAAAATCTCAATAAACTGGGGAAATCAGTACTTGCTACGGACGAGAATACCTTC
AATCGTTCGATCTACAGAATACCAACATTCATGAGAGAAGTTCATCCGAAAGCGTTCGAGCCACAGTTGGTGTCGTTTGGGCCATACCACCATGGGAAACCACAATATGC
TTCAATGGAACTGGAAAAACAGAAAGCGTTTCGCCGTTTGAAAACCAACCCAGAGATGTTAGAGTCGATTGTGCAGACTGTGACTGACAATTTGCAAAACCTGTTGGGAG
CATACGATAAGCTAATTGAGGGTGAATGGGCGAAACGTCCTGCCAAGTTCTTGGAGGTGTTGATTGTGGATGGCTGTTTCATGCTAAGTTTCCTCAAGAAATGTCCCCCC
TCCCTAAGTTCTATGAGTTGGGATATAAAGCGGGACATGCTTCTGCTTGAGAATCAGCTACCCATGCTGCTTCTTGACGATTTGTATTCCATTTTACCAAACATGGATGG
AAGACTGGCATGGTTTATATGCAAGTCAATGTGCTTCGCATCAGGGGAAGCAGTATCAATGGGAGGAAACTTGCACATTTTGGATATGTATAGAATGTCACTATTGGGTA
CTACGATATTAGAGAATAAGGATGGTAGTATGGAGAGAAAAACGAAGAAATCCGGGCCAGAATATCAAGTGATTCGACATGCAACACAGCTTCGTGACGCGGGGATAGAT
TTCCAGAAAAGCGGCACAAAGAGCCTTACTGACGTGAGTTTTGACTCCAAAAAAGGGGTGCTGAAAATTCCACAATTAGTGGTGGACGATGATAGTGAAGCCAGTTTATT
AAATGTGATGGCATTTGAGAAACTACATGAGGAGGCTGGAAGCGAAGTCACATCTTTCGTAGTATTGATGAACAATTTGATAGACGTGGATGAAGATGTGGCTTTGCTGT
CTTCTAAAAATATATTAGCTAACGCGCTTGGGGATGATCAAAGTGCGGCGGAGTTGTTTGGTTCGCTTGGGAAAGGGGCGGCTATGGATTTGGAAAGCCACATTACAGAG
GTGCATCACCAGGTGAATCTGCATTGCCGCCGGCCATGGAATGAATGGTGTGCAAGTCTCAAACATAATTACTTTCATAACCCATGGGCCATTATCTCACTCGTTGCCGC
TATTCTCGGTTTCGCCATCCTTATTGTTCAAGCCGTCTACCAAATTGTTGATTATTACCGAGGGAATTAA
mRNA sequenceShow/hide mRNA sequence
CCAGGGTTGGGTGGAGAATGAAAAATTCAAGATTGAGTATTTGCATGTATATATATTTTGGAAGCTAAATTAAGGCATTATGGAGAATATGAAGCAGTCAGCAATAGGCA
GCAAATTAGTTGATAGAAATGATATTATTATTAAAGATGATATGGTCGCGTTGGGAATTGAGGTGAAGGAACGTCTAAGTAAACTGCAGGAAAGTATTGGAATGGAGGAG
TTATCAATGCAAAAAAACACAAAACCTTCAATCTACAAAATACCTGAATTCATGAGAGAAGTTCATCCCAAAGCTTACGAGCCACAGTTGGTGTCTCTTGGGCCATACCA
TCATGGGAAACACCGTTTGCTTCCCATGGAGCGGGAGAAACAAAAAGGGTTTCTCCGTTTTAAATCTAAAAACAATCTGGACTTTGAATCCATTGTGGAAAGGGTGTCCA
GCATTTTGGAACATCTTTTGGGATCCTACGATAAGCTCCAGGATATATGGAAGCTATATCCTGCCAAGTTCTTGGAGGTGATGATTGTGGATGCCTGTTTCCTGCTGGAT
TTCATTACTGAATGTCCACAATCCCTAAGCTCTATGATTTGGGATATAAAGCGAGACATGCTGTTGCTTGAGAATCAACTACCCATGAACCTTCTTAAACAACTGCATGC
CATAAGAACCAATGAACAAAAGGATGTGATTGAGAAGATGAAGACAATATTGGAGTTGAACCCAGCGGACATGGACGCAAATTGTCAAAGCATCCCACCAGCAACACAAC
TTCGTAGAGCGGGGATAAAATTCGAGAAAAGCAGTACAAGTAGCCTTATGGACGTATCCTTCGACTGCGAACGGCGCATGCTGAGGCTGCCGTCTCTGGCGATCGACGAC
ATCCTCACTGAACCCACCTTAATAAACATCATGGCATTTGAGAAACTGCATATTGGGGTTGGAAGCCAAGCCATTTCTTTCGTCATGTTGATGAAAAATCTAATAAGATT
GGACAAAGATGTGTGGTTGTTGGCCTCCAAAGGAATACTTTCCAATGCAATTCATGACCGTGATGGTGTGGTACAGTTGTTCAATCGACTGGCCAAAGGACAGAACAAAA
GTTTGGAGAGCCGCATGCATGACTTATTCAAGATATTGAATTGGTGTTGTAACAATCAGTGGACAATATGGTGGGCAAATCTCAAACTACCCCAAACTCAAAATCCATGG
CCTGTCATCTACGGTGAGCAAAATGAAAGTAGAAATTTGGGAAGCAGTTTAAGAAGAAATACGAAGATGACGCAGCCACAGCTGCCCATCAGCATTAGTAGAAACGATAG
TGCTCAGTTGGTGATTGAGGTGCAGGAAAATCTCAATAAACTGGGGAAATCAGTACTTGCTACGGACGAGAATACCTTCAATCGTTCGATCTACAGAATACCAACATTCA
TGAGAGAAGTTCATCCGAAAGCGTTCGAGCCACAGTTGGTGTCGTTTGGGCCATACCACCATGGGAAACCACAATATGCTTCAATGGAACTGGAAAAACAGAAAGCGTTT
CGCCGTTTGAAAACCAACCCAGAGATGTTAGAGTCGATTGTGCAGACTGTGACTGACAATTTGCAAAACCTGTTGGGAGCATACGATAAGCTAATTGAGGGTGAATGGGC
GAAACGTCCTGCCAAGTTCTTGGAGGTGTTGATTGTGGATGGCTGTTTCATGCTAAGTTTCCTCAAGAAATGTCCCCCCTCCCTAAGTTCTATGAGTTGGGATATAAAGC
GGGACATGCTTCTGCTTGAGAATCAGCTACCCATGCTGCTTCTTGACGATTTGTATTCCATTTTACCAAACATGGATGGAAGACTGGCATGGTTTATATGCAAGTCAATG
TGCTTCGCATCAGGGGAAGCAGTATCAATGGGAGGAAACTTGCACATTTTGGATATGTATAGAATGTCACTATTGGGTACTACGATATTAGAGAATAAGGATGGTAGTAT
GGAGAGAAAAACGAAGAAATCCGGGCCAGAATATCAAGTGATTCGACATGCAACACAGCTTCGTGACGCGGGGATAGATTTCCAGAAAAGCGGCACAAAGAGCCTTACTG
ACGTGAGTTTTGACTCCAAAAAAGGGGTGCTGAAAATTCCACAATTAGTGGTGGACGATGATAGTGAAGCCAGTTTATTAAATGTGATGGCATTTGAGAAACTACATGAG
GAGGCTGGAAGCGAAGTCACATCTTTCGTAGTATTGATGAACAATTTGATAGACGTGGATGAAGATGTGGCTTTGCTGTCTTCTAAAAATATATTAGCTAACGCGCTTGG
GGATGATCAAAGTGCGGCGGAGTTGTTTGGTTCGCTTGGGAAAGGGGCGGCTATGGATTTGGAAAGCCACATTACAGAGGTGCATCACCAGGTGAATCTGCATTGCCGCC
GGCCATGGAATGAATGGTGTGCAAGTCTCAAACATAATTACTTTCATAACCCATGGGCCATTATCTCACTCGTTGCCGCTATTCTCGGTTTCGCCATCCTTATTGTTCAA
GCCGTCTACCAAATTGTTGATTATTACCGAGGGAATTAAATGGATCTAAAATAACAATTCCCTCCTCGTCTATTTACCCATCTTTGGGTGTTGCTCACCTCAATCCCTAA
TAATTTTCCTCTCTTTTTTGTTTTGTTTGTGATAGGAGGGGATGCATGCATCAAACGGCGGCTATCAGTATTCATGTATTAAGTACTACTCTAAAG
Protein sequenceShow/hide protein sequence
MENMKQSAIGSKLVDRNDIIIKDDMVALGIEVKERLSKLQESIGMEELSMQKNTKPSIYKIPEFMREVHPKAYEPQLVSLGPYHHGKHRLLPMEREKQKGFLRFKSKNNL
DFESIVERVSSILEHLLGSYDKLQDIWKLYPAKFLEVMIVDACFLLDFITECPQSLSSMIWDIKRDMLLLENQLPMNLLKQLHAIRTNEQKDVIEKMKTILELNPADMDA
NCQSIPPATQLRRAGIKFEKSSTSSLMDVSFDCERRMLRLPSLAIDDILTEPTLINIMAFEKLHIGVGSQAISFVMLMKNLIRLDKDVWLLASKGILSNAIHDRDGVVQL
FNRLAKGQNKSLESRMHDLFKILNWCCNNQWTIWWANLKLPQTQNPWPVIYGEQNESRNLGSSLRRNTKMTQPQLPISISRNDSAQLVIEVQENLNKLGKSVLATDENTF
NRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPP
SLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGID
FQKSGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITE
VHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN