; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014736 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014736
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr12:4240542..4242223
RNA-Seq ExpressionLag0014736
SyntenyLag0014736
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592831.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. sororia]5.8e-6439.76Show/hide
Query:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF
        S  +IP  +      A++PQ+VS GPY+HGK HL+  E E LKL  FH F  RC  D   +   +  +L+ L E YD+L+ D   D   FL+LM VDGCF
Subjt:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF

Query:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR
        ML  L++  D    L N    IK+DMLLLENQLPMLLL +L+               P+   K LV  +L +P     ++  +D  HILEMY+E LLHP 
Subjt:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR

Query:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV
          +D    + E D  D E          +I P  +L+ AGI F+ +NT SL +V FD++ GVL +P L +++ T++ L NV+A E L+   G +VTSF +
Subjt:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV

Query:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL
        LM++LI+ ++DV +L  +++L +    D++ A  FN LG GAA+          V+K ++ +C++PW+E   +L +  F+SPWTIISL AA  G ++L L
Subjt:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL

Query:  QTFYQVYGYH
        Q  YQ   Y+
Subjt:  QTFYQVYGYH

KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.6e-6740.24Show/hide
Query:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF
        S  +IP  +    P A++PQ+VS GPY+HGK HL+  E E LKL  FH F  RC  D   +   +  +L+ L E YD+L+ D   D A FL+LM VDGCF
Subjt:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF

Query:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR
        ML  L++  D    L N    IK+DMLLLENQLPMLLL +L+                R+  K LV  +L +P     ++  +D  HILEMY+E LLHP 
Subjt:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR

Query:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV
          +D    + E D  D E          +I P  +L+ AGI F+ +NT SL +V FD++ GVL +P L +++ T++ L NV+A E L+   G +VTSF +
Subjt:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV

Query:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL
        LM++LI+ ++DV +L  +++L +    D++ A  FN LG GAA+          V+K ++ +C++PW+E   +L +  F+SPWTIISL AA  G ++L L
Subjt:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL

Query:  QTFYQVYGYH
        Q  YQ   Y+
Subjt:  QTFYQVYGYH

XP_022148888.1 UPF0481 protein At3g47200-like [Momordica charantia]2.1e-6941.35Show/hide
Query:  SCQRIPHHIRNVQPNA-FDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDD--ALFLKLMAVDG
        S  +IPH +R VQP A F+PQLVSFGPYHHG+ HL + E   K + F  F  R       +   +  MLE ++  YD+L+ +   +D  A FL+LM +DG
Subjt:  SCQRIPHHIRNVQPNA-FDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDD--ALFLKLMAVDG

Query:  CFMLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELF----YNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYRE
        CFMLEVLL+  D   WL N  E I RDMLLLENQLPM LL EL     ++ L              + K LVC+F+ +P   E D+   +Y HILEMY +
Subjt:  CFMLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELF----YNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYRE

Query:  KLLHPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNE
         LL P+     R       G   E      +   +I P  RL  AGI F  + + S+ +V FD +RGVLK+P + +++ T++   NV+A E L+   G++
Subjt:  KLLHPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNE

Query:  VTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLG
        VT F +LMN+LI+VD+DV LL   KI+L+    D+D AE F  L +GAAL+  +      V + +  +C K  H+W  SL +  F+ PW I+SLIAA+LG
Subjt:  VTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLG

Query:  AVLLFLQTFYQVYGYH
         V+L LQ  YQ+  Y+
Subjt:  AVLLFLQTFYQVYGYH

XP_023004238.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita maxima]1.1e-6540.73Show/hide
Query:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF
        S  +IP  +    P A++PQ+VS GPY+HGK HL+  E E LKL  FH F  RC  D   +   +  +L+ L E YD+L+ +   D   FL+LM VDGCF
Subjt:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF

Query:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR
        ML  L+S  +    L N    IK+DMLLLENQLPMLLL +L Y++       P   P     K LVC++L++P     ++  +D  HILEMY+E LL+P 
Subjt:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR

Query:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV
          +D R  + E D  D E          +I P  +L  AGI F+ + T SLR+V FD++RGVL +P L +++ T++ + NV+A E L+   G +VTSF +
Subjt:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV

Query:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL
        LM++LI+ ++DV +L  +KIL +    D++ A  F+ LG GAA+   +      V+K ++ +C++PW+E   +L +  F+SPWTIISL AA  G ++L L
Subjt:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL

Query:  QTFYQVYGYH
        Q  YQ   Y+
Subjt:  QTFYQVYGYH

XP_023004239.1 UPF0481 protein At3g47200-like isoform X2 [Cucurbita maxima]6.2e-6640.73Show/hide
Query:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF
        S  +IP  +    P A++PQ+VS GPY+HGK HL+  E E LKL  FH F  RC  D   +   +  +L+ L E YD+L+ +   D   FL+LM VDGCF
Subjt:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF

Query:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR
        ML  L+S  +    L N    IK+DMLLLENQLPMLLL +L Y++       P  P      K LVC++L++P     ++  +D  HILEMY+E LL+P 
Subjt:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR

Query:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV
          +D R  + E D  D E          +I P  +L  AGI F+ + T SLR+V FD++RGVL +P L +++ T++ + NV+A E L+   G +VTSF +
Subjt:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV

Query:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL
        LM++LI+ ++DV +L  +KIL +    D++ A  F+ LG GAA+   +      V+K ++ +C++PW+E   +L +  F+SPWTIISL AA  G ++L L
Subjt:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL

Query:  QTFYQVYGYH
        Q  YQ   Y+
Subjt:  QTFYQVYGYH

TrEMBL top hitse value%identityAlignment
A0A6J1D5C0 UPF0481 protein At3g47200-like9.9e-7041.35Show/hide
Query:  SCQRIPHHIRNVQPNA-FDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDD--ALFLKLMAVDG
        S  +IPH +R VQP A F+PQLVSFGPYHHG+ HL + E   K + F  F  R       +   +  MLE ++  YD+L+ +   +D  A FL+LM +DG
Subjt:  SCQRIPHHIRNVQPNA-FDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDD--ALFLKLMAVDG

Query:  CFMLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELF----YNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYRE
        CFMLEVLL+  D   WL N  E I RDMLLLENQLPM LL EL     ++ L              + K LVC+F+ +P   E D+   +Y HILEMY +
Subjt:  CFMLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELF----YNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYRE

Query:  KLLHPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNE
         LL P+     R       G   E      +   +I P  RL  AGI F  + + S+ +V FD +RGVLK+P + +++ T++   NV+A E L+   G++
Subjt:  KLLHPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNE

Query:  VTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLG
        VT F +LMN+LI+VD+DV LL   KI+L+    D+D AE F  L +GAAL+  +      V + +  +C K  H+W  SL +  F+ PW I+SLIAA+LG
Subjt:  VTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLG

Query:  AVLLFLQTFYQVYGYH
         V+L LQ  YQ+  Y+
Subjt:  AVLLFLQTFYQVYGYH

A0A6J1EC69 UPF0481 protein At3g47200-like3.5e-5937.97Show/hide
Query:  SSCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKW-GDDALFLKLMAVDGC
        +S  RIP HI+ V PNAF PQL+SFGPYHHG+LHL  TE+ +K   F  F KRC      M  E+  MLE L+  YD+LD DKW  + A FL++M +DGC
Subjt:  SSCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKW-GDDALFLKLMAVDGC

Query:  FMLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKL---
        F+++VLL +    LW    +E + RD+LLLENQLPM LL +L   L+              + + LV E   +P+  +      DY HIL+MYR +L   
Subjt:  FMLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKL---

Query:  ------LHPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPN
               H     + R  +FE          + +     I   RR   AGI  +     +LR+V FD  +GVL +P +++    ++ L N +A E L   
Subjt:  ------LHPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPN

Query:  IGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIA
        IGN V SF +LM +L+                     ++D  + FN L +G  L       +  VYKS++N+C +PW  WWT+L + NF+SPWTIIS   
Subjt:  IGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIA

Query:  ASLGAVLLFLQTFYQVYGYHHPSP
        A +G  LL +QT Y VYGY+ P P
Subjt:  ASLGAVLLFLQTFYQVYGYHHPSP

A0A6J1HB25 UPF0481 protein At3g47200-like1.1e-6339.51Show/hide
Query:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF
        S  +IP  +    P A++PQ+VS GPY+HGK HL+  E E LKL  FH F  RC  D   +   +  +L+ L E YD+L+ D   D   FL+LM VDGCF
Subjt:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF

Query:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR
        ML  L++  D    L N    IK+DMLLLENQLPMLLL +L+               P+   K LV  +L +P     ++  +D  HILEMY+E LLHP 
Subjt:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR

Query:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV
          +D    + E    D E          +I P  +L+ AGI F+ + T SL +V FD++ GVL +P L +++ T++ L NV+A E L+   G +VTSF +
Subjt:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV

Query:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL
        LM++LI+ ++DV +L  +++L +    D++ A  FN LG GAA+       +  V+K ++ +C++PW+E   +L +  F+SPWTIISL AA  G ++L L
Subjt:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL

Query:  QTFYQVYGYH
        Q  YQ   Y+
Subjt:  QTFYQVYGYH

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X23.0e-6640.73Show/hide
Query:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF
        S  +IP  +    P A++PQ+VS GPY+HGK HL+  E E LKL  FH F  RC  D   +   +  +L+ L E YD+L+ +   D   FL+LM VDGCF
Subjt:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF

Query:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR
        ML  L+S  +    L N    IK+DMLLLENQLPMLLL +L Y++       P  P      K LVC++L++P     ++  +D  HILEMY+E LL+P 
Subjt:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR

Query:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV
          +D R  + E D  D E          +I P  +L  AGI F+ + T SLR+V FD++RGVL +P L +++ T++ + NV+A E L+   G +VTSF +
Subjt:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV

Query:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL
        LM++LI+ ++DV +L  +KIL +    D++ A  F+ LG GAA+   +      V+K ++ +C++PW+E   +L +  F+SPWTIISL AA  G ++L L
Subjt:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL

Query:  QTFYQVYGYH
        Q  YQ   Y+
Subjt:  QTFYQVYGYH

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X15.1e-6640.73Show/hide
Query:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF
        S  +IP  +    P A++PQ+VS GPY+HGK HL+  E E LKL  FH F  RC  D   +   +  +L+ L E YD+L+ +   D   FL+LM VDGCF
Subjt:  SCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTE-EGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCF

Query:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR
        ML  L+S  +    L N    IK+DMLLLENQLPMLLL +L Y++       P   P     K LVC++L++P     ++  +D  HILEMY+E LL+P 
Subjt:  MLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPR

Query:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV
          +D R  + E D  D E          +I P  +L  AGI F+ + T SLR+V FD++RGVL +P L +++ T++ + NV+A E L+   G +VTSF +
Subjt:  RSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAV

Query:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL
        LM++LI+ ++DV +L  +KIL +    D++ A  F+ LG GAA+   +      V+K ++ +C++PW+E   +L +  F+SPWTIISL AA  G ++L L
Subjt:  LMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFL

Query:  QTFYQVYGYH
        Q  YQ   Y+
Subjt:  QTFYQVYGYH

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026457.5e-1429.07Show/hide
Query:  LKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFF
        L  AG+ F+P    ++  V+FDS  G   +P + L+  T+  L N++A E  N +     T +  L+N +I+ ++DV LL E+ +L+S    D++ AE +
Subjt:  LKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFF

Query:  NVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVY
        N + +   L +    F D   + ++ Y +  W      LV       W I++ +AA L  +L+ LQ F  V+
Subjt:  NVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVY

Q9SD53 UPF0481 protein At3g472006.5e-3425.56Show/hide
Query:  RNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHL--AQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFY
        RN++ S  +   +   S G  SC   R+P     + P A+ P++VS GPYH+G+ HL   Q  +   L+ F    K+ + + + +   +  + + +++ Y
Subjt:  RNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHL--AQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFY

Query:  DQLDHDKWGDDALFLKLMAVDGCFMLEVLL-------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEF
           +  K G D +F  +M +DGCF+L V L         ED    +   + +I+ D+LLLENQ+P  +L  L+                      +   F
Subjt:  DQLDHDKWGDDALFLKLMAVDGCFMLEVLL-------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEF

Query:  LALPSPTELDQY--HRDY--FHILEMYREKLLHPRRSLDNRVINFEYDGDDHEDWHNSMAT-----RPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRR
           P   E   +  HR+Y   H+L++ RE  L P  S  ++  +       HE    ++ +      P+IL  +RL+  GI F    +     ++   ++
Subjt:  LALPSPTELDQY--HRDY--FHILEMYREKLLHPRRSLDNRVINFEYDGDDHEDWHNSMAT-----RPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRR

Query:  GVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGF-EDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSI
          L+IP L+ +    +   N +A E    +  NE+T++ V M  L+N ++DVT L   K+++ + F  + +V+EFF  + +       +  + + V+K +
Subjt:  GVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGF-EDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSI

Query:  HNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGY
        + Y  K ++  W    +T+F+SPWT +S  A     +L  LQ+   +  Y
Subjt:  HNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGY

Arabidopsis top hitse value%identityAlignment
AT3G47200.1 Plant protein of unknown function (DUF247)4.6e-3525.56Show/hide
Query:  RNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHL--AQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFY
        RN++ S  +   +   S G  SC   R+P     + P A+ P++VS GPYH+G+ HL   Q  +   L+ F    K+ + + + +   +  + + +++ Y
Subjt:  RNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHL--AQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFY

Query:  DQLDHDKWGDDALFLKLMAVDGCFMLEVLL-------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEF
           +  K G D +F  +M +DGCF+L V L         ED    +   + +I+ D+LLLENQ+P  +L  L+                      +   F
Subjt:  DQLDHDKWGDDALFLKLMAVDGCFMLEVLL-------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEF

Query:  LALPSPTELDQY--HRDY--FHILEMYREKLLHPRRSLDNRVINFEYDGDDHEDWHNSMAT-----RPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRR
           P   E   +  HR+Y   H+L++ RE  L P  S  ++  +       HE    ++ +      P+IL  +RL+  GI F    +     ++   ++
Subjt:  LALPSPTELDQY--HRDY--FHILEMYREKLLHPRRSLDNRVINFEYDGDDHEDWHNSMAT-----RPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRR

Query:  GVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGF-EDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSI
          L+IP L+ +    +   N +A E    +  NE+T++ V M  L+N ++DVT L   K+++ + F  + +V+EFF  + +       +  + + V+K +
Subjt:  GVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGF-EDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSI

Query:  HNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGY
        + Y  K ++  W    +T+F+SPWT +S  A     +L  LQ+   +  Y
Subjt:  HNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGY

AT3G47200.2 Plant protein of unknown function (DUF247)4.6e-3525.56Show/hide
Query:  RNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHL--AQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFY
        RN++ S  +   +   S G  SC   R+P     + P A+ P++VS GPYH+G+ HL   Q  +   L+ F    K+ + + + +   +  + + +++ Y
Subjt:  RNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHL--AQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLKEFY

Query:  DQLDHDKWGDDALFLKLMAVDGCFMLEVLL-------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEF
           +  K G D +F  +M +DGCF+L V L         ED    +   + +I+ D+LLLENQ+P  +L  L+                      +   F
Subjt:  DQLDHDKWGDDALFLKLMAVDGCFMLEVLL-------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVCEF

Query:  LALPSPTELDQY--HRDY--FHILEMYREKLLHPRRSLDNRVINFEYDGDDHEDWHNSMAT-----RPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRR
           P   E   +  HR+Y   H+L++ RE  L P  S  ++  +       HE    ++ +      P+IL  +RL+  GI F    +     ++   ++
Subjt:  LALPSPTELDQY--HRDY--FHILEMYREKLLHPRRSLDNRVINFEYDGDDHEDWHNSMAT-----RPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRR

Query:  GVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGF-EDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSI
          L+IP L+ +    +   N +A E    +  NE+T++ V M  L+N ++DVT L   K+++ + F  + +V+EFF  + +       +  + + V+K +
Subjt:  GVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGF-EDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSI

Query:  HNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGY
        + Y  K ++  W    +T+F+SPWT +S  A     +L  LQ+   +  Y
Subjt:  HNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGY

AT3G47250.1 Plant protein of unknown function (DUF247)7.9e-3526.02Show/hide
Query:  KQTLEECNRNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCY---
        +  +    RN   S K +  I   S G +SC   RIP  +  V P A+ P++VS GPYH+G+ HL   ++  K      F+ R  +   GM   + Y   
Subjt:  KQTLEECNRNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCY---

Query:  --MLEPLKEFY-DQLDHDKWGDDALFLKLMAVDGCFMLEVLL---------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPP
          +   ++  Y ++L  +K    +  + +M +DGCF+L +LL           +D    +   + +I+ D+LLLENQ+P  +L  +F             
Subjt:  --MLEPLKEFY-DQLDHDKWGDDALFLKLMAVDGCFMLEVLL---------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPP

Query:  PPPRDHFKLLVCEF-LALPSPTELDQYHRD--YFHILEMYREKLLHPRRSLDNRVI----NFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFE-PNN
          P D  ++    F L++  P      HRD    H+L++ R+  +   RS+          F++      +  +S +T P+IL  +RL+  GI F   ++
Subjt:  PPPRDHFKLLVCEF-LALPSPTELDQYHRD--YFHILEMYREKLLHPRRSLDNRVI----NFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFE-PNN

Query:  TWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLL-TEKKILLSHGFEDRDVAEFFNVLGRGAALNR
          S+ ++    ++  L+IP L+L+    +   N +A E       N++TS+ V M  L+N  +D T L  +K+I+ ++   + +V++FF  + +    + 
Subjt:  TWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLL-TEKKILLSHGFEDRDVAEFFNVLGRGAALNR

Query:  SNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGYHH
        + R +   V++ ++ Y SK ++  W    +T+F+SPWT +S  A     +L   Q  Y +  Y+H
Subjt:  SNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGYHH

AT3G47250.2 Plant protein of unknown function (DUF247)7.9e-3526.02Show/hide
Query:  KQTLEECNRNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCY---
        +  +    RN   S K +  I   S G +SC   RIP  +  V P A+ P++VS GPYH+G+ HL   ++  K      F+ R  +   GM   + Y   
Subjt:  KQTLEECNRNFVKSQKEIQKIRFVSVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCY---

Query:  --MLEPLKEFY-DQLDHDKWGDDALFLKLMAVDGCFMLEVLL---------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPP
          +   ++  Y ++L  +K    +  + +M +DGCF+L +LL           +D    +   + +I+ D+LLLENQ+P  +L  +F             
Subjt:  --MLEPLKEFY-DQLDHDKWGDDALFLKLMAVDGCFMLEVLL---------SKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPP

Query:  PPPRDHFKLLVCEF-LALPSPTELDQYHRD--YFHILEMYREKLLHPRRSLDNRVI----NFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFE-PNN
          P D  ++    F L++  P      HRD    H+L++ R+  +   RS+          F++      +  +S +T P+IL  +RL+  GI F   ++
Subjt:  PPPRDHFKLLVCEF-LALPSPTELDQYHRD--YFHILEMYREKLLHPRRSLDNRVI----NFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFE-PNN

Query:  TWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLL-TEKKILLSHGFEDRDVAEFFNVLGRGAALNR
          S+ ++    ++  L+IP L+L+    +   N +A E       N++TS+ V M  L+N  +D T L  +K+I+ ++   + +V++FF  + +    + 
Subjt:  TWSLRNVSFDSRRGVLKIPNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLL-TEKKILLSHGFEDRDVAEFFNVLGRGAALNR

Query:  SNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGYHH
        + R +   V++ ++ Y SK ++  W    +T+F+SPWT +S  A     +L   Q  Y +  Y+H
Subjt:  SNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGYHH

AT5G22540.1 Plant protein of unknown function (DUF247)3.4e-3827.96Show/hide
Query:  SVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLK-EFYDQLDHDKWGDDALFLKL
        S G+  C   RIP  +  +   A++P++VS GPYHHGK HL  T++  +   F +F     E+   +  EL   +  L+         D   D    +++
Subjt:  SVGASSC--QRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRFIKRCNEDYSGMAMELCYMLEPLK-EFYDQLDHDKWGDDALFLKL

Query:  MAVDGCFML--------EVLLSKEDDRL----WLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVC--------EF----
        M +DGCF+L        +V  +  DD +    W+   + +I+ D+LLLENQ+P +LL  LF                 +  KL+ C        EF    
Subjt:  MAVDGCFML--------EVLLSKEDDRL----WLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRDHFKLLVC--------EF----

Query:  LALPSPTELDQYHRDYFHILEMYREKLL---HPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEP-NNTWSLRNVSFDSRRGVLKI
        L  P       Y  +  H+L++ R+  +     RR  D+   + +   +DHE           +L  ++L   GI F+P  NT S+ ++S+ +  GVL I
Subjt:  LALPSPTELDQYHRDYFHILEMYREKLL---HPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEP-NNTWSLRNVSFDSRRGVLKI

Query:  PNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSK
        P + +++ T +   N +A E L  +  N +TS+   M  LIN + D + L+E++IL ++   + +V+ F+  +G+  AL+   + +   V++ ++ Y S+
Subjt:  PNLKLENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSK

Query:  PWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGYHHP
         +H      ++T+F SPWT  S  AA L  +   LQ F+  Y Y  P
Subjt:  PWHEWWTSLVNTNFKSPWTIISLIAASLGAVLLFLQTFYQVYGYHHP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCACAATGGTGGTCGGTGGCAGAGAAATCGATGACAGAGAAATAATTGGATCAATTATTGATTCAAAGGAGAAAAAAATACTAGAAGGTCTGAAACAGACGTTAGA
AGAATGCAATAGGAACTTCGTCAAGTCACAGAAGGAAATACAAAAGATTAGATTTGTATCAGTGGGAGCTTCTTCATGCCAGAGAATACCACACCACATCAGAAACGTTC
AGCCGAATGCTTTCGATCCTCAATTGGTTTCGTTTGGGCCATACCACCATGGCAAACTGCATTTGGCTCAAACCGAAGAAGGCCTTAAACTCGAAGGCTTTCATAGATTT
ATCAAGCGTTGCAATGAGGACTATAGCGGAATGGCGATGGAGTTGTGCTACATGTTGGAACCTCTCAAGGAATTCTACGATCAGCTTGATCATGATAAATGGGGAGATGA
TGCATTATTCTTGAAGCTCATGGCCGTGGATGGTTGTTTCATGCTGGAAGTGCTGTTGAGCAAAGAAGATGATCGCCTATGGCTCGGAAATGAGGTTGAGACTATAAAGC
GGGATATGCTGCTGCTTGAGAATCAGTTGCCCATGTTGCTTCTTCATGAGCTATTTTATAATTTGTTGCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCGGGAT
CATTTCAAATTGCTTGTTTGCGAATTCTTGGCTTTGCCCTCCCCAACAGAACTCGACCAGTATCATCGAGACTACTTTCACATTTTGGAAATGTATAGGGAGAAGCTACT
GCATCCTCGTCGGTCGTTGGATAATCGAGTTATTAATTTTGAATATGATGGGGATGATCATGAGGATTGGCATAACTCGATGGCCACGAGACCCATGATTTTGCCCACAA
GACGGCTTAAAACGGCGGGGATCACATTTGAACCAAACAATACTTGGAGCCTTAGGAATGTGTCTTTCGACTCGAGACGAGGTGTGTTGAAGATCCCAAATTTGAAGCTG
GAGAATGTCACCAAAGCAGCCTTGTTTAATGTGATAGCACTTGAGACCCTAAACCCCAATATTGGCAATGAAGTGACCTCTTTCGCTGTCCTAATGAATGATCTGATCAA
TGTGGACAAAGATGTGACGCTACTGACCGAGAAAAAGATATTATTGAGCCATGGTTTTGAAGATAGAGATGTGGCGGAGTTTTTCAATGTGCTGGGAAGAGGGGCGGCTT
TGAACCGATCGAACCGCCACTTCTTTGATCCAGTCTACAAGTCCATTCACAACTACTGCAGCAAGCCATGGCATGAATGGTGGACAAGTCTTGTAAACACCAATTTCAAA
AGCCCATGGACCATCATCTCCCTCATTGCCGCTTCTTTGGGTGCTGTGCTTCTCTTCCTTCAAACTTTCTACCAAGTATATGGGTATCACCACCCATCACCACGTCCATA
A
mRNA sequenceShow/hide mRNA sequence
ATGAGCACAATGGTGGTCGGTGGCAGAGAAATCGATGACAGAGAAATAATTGGATCAATTATTGATTCAAAGGAGAAAAAAATACTAGAAGGTCTGAAACAGACGTTAGA
AGAATGCAATAGGAACTTCGTCAAGTCACAGAAGGAAATACAAAAGATTAGATTTGTATCAGTGGGAGCTTCTTCATGCCAGAGAATACCACACCACATCAGAAACGTTC
AGCCGAATGCTTTCGATCCTCAATTGGTTTCGTTTGGGCCATACCACCATGGCAAACTGCATTTGGCTCAAACCGAAGAAGGCCTTAAACTCGAAGGCTTTCATAGATTT
ATCAAGCGTTGCAATGAGGACTATAGCGGAATGGCGATGGAGTTGTGCTACATGTTGGAACCTCTCAAGGAATTCTACGATCAGCTTGATCATGATAAATGGGGAGATGA
TGCATTATTCTTGAAGCTCATGGCCGTGGATGGTTGTTTCATGCTGGAAGTGCTGTTGAGCAAAGAAGATGATCGCCTATGGCTCGGAAATGAGGTTGAGACTATAAAGC
GGGATATGCTGCTGCTTGAGAATCAGTTGCCCATGTTGCTTCTTCATGAGCTATTTTATAATTTGTTGCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCGGGAT
CATTTCAAATTGCTTGTTTGCGAATTCTTGGCTTTGCCCTCCCCAACAGAACTCGACCAGTATCATCGAGACTACTTTCACATTTTGGAAATGTATAGGGAGAAGCTACT
GCATCCTCGTCGGTCGTTGGATAATCGAGTTATTAATTTTGAATATGATGGGGATGATCATGAGGATTGGCATAACTCGATGGCCACGAGACCCATGATTTTGCCCACAA
GACGGCTTAAAACGGCGGGGATCACATTTGAACCAAACAATACTTGGAGCCTTAGGAATGTGTCTTTCGACTCGAGACGAGGTGTGTTGAAGATCCCAAATTTGAAGCTG
GAGAATGTCACCAAAGCAGCCTTGTTTAATGTGATAGCACTTGAGACCCTAAACCCCAATATTGGCAATGAAGTGACCTCTTTCGCTGTCCTAATGAATGATCTGATCAA
TGTGGACAAAGATGTGACGCTACTGACCGAGAAAAAGATATTATTGAGCCATGGTTTTGAAGATAGAGATGTGGCGGAGTTTTTCAATGTGCTGGGAAGAGGGGCGGCTT
TGAACCGATCGAACCGCCACTTCTTTGATCCAGTCTACAAGTCCATTCACAACTACTGCAGCAAGCCATGGCATGAATGGTGGACAAGTCTTGTAAACACCAATTTCAAA
AGCCCATGGACCATCATCTCCCTCATTGCCGCTTCTTTGGGTGCTGTGCTTCTCTTCCTTCAAACTTTCTACCAAGTATATGGGTATCACCACCCATCACCACGTCCATA
A
Protein sequenceShow/hide protein sequence
MSTMVVGGREIDDREIIGSIIDSKEKKILEGLKQTLEECNRNFVKSQKEIQKIRFVSVGASSCQRIPHHIRNVQPNAFDPQLVSFGPYHHGKLHLAQTEEGLKLEGFHRF
IKRCNEDYSGMAMELCYMLEPLKEFYDQLDHDKWGDDALFLKLMAVDGCFMLEVLLSKEDDRLWLGNEVETIKRDMLLLENQLPMLLLHELFYNLLPPPPPPPPPPPPRD
HFKLLVCEFLALPSPTELDQYHRDYFHILEMYREKLLHPRRSLDNRVINFEYDGDDHEDWHNSMATRPMILPTRRLKTAGITFEPNNTWSLRNVSFDSRRGVLKIPNLKL
ENVTKAALFNVIALETLNPNIGNEVTSFAVLMNDLINVDKDVTLLTEKKILLSHGFEDRDVAEFFNVLGRGAALNRSNRHFFDPVYKSIHNYCSKPWHEWWTSLVNTNFK
SPWTIISLIAASLGAVLLFLQTFYQVYGYHHPSPRP