CuGenDBv2

Tan0005590 (gene) of Snake gourd v1 genome

Gene ID: Tan0005590
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UPF0481 protein At3g47200-like
Genome location: LG02:95374374..95380659
RNA-Seq Expression: Tan0005590
Synteny: Tan0005590
Gene Ontology terms: NA
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
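
The genome location in this record uses the `seqid:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the genomic span of the gene could be computed as shown here. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (not confirmed by the page).
    return abs(end - start) + 1

seqid, start, end = parse_location("LG02:95374374..95380659")
print(seqid, span_length(start, end))  # LG02 6286
```

Under that assumption the locus covers 6,286 bp on LG02; with 0-based, half-open coordinates the span would be one base shorter.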


Homology
GenBank top hits | e value | %identity | Alignment
XP_022131631.1 UPF0481 protein At3g47200-like [Momordica charantia] | 7.7e-111 | 54.5
Query:  ITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCF
        I+ E SIYR+S  L +IN  AYTPQAISIGPFHHG+Q LMAME+LKL FL  YL ++ M  ++AFE+A+ WE RAR  YAE INM+S +FVK++LVD  F
Subjt:  ITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCF

Query:  TVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMGQLFDLPPIAFTTKLNHLVDLFSFNYA
         V  MI+          + G +   AI +D+YRDLI+LENQLPFF+L  LF + +  I FV  A  F  RW+MG    +P      K NHLVD  SF YA
Subjt:  TVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMGQLFDLPPIAFTTKLNHLVDLFSFNYA

Query:  IP---SDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISF--HDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELIN
        +P    +NDH  + N +  N P+ATELWEAGV+FQKA E K IMDI F   +GVL IPH EI D FET MRNL+A+EHYH G ++R  IQY +F+D+LI+
Subjt:  IP---SDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISF--HDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELIN

Query:  TDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTIV
        T++DVSLLV+A IITN IGG+ EE+SK+FN+LCKD S+  DFYYY +IS  LR++    W++W+  LKRDYFN+PW  ISF AA + IL TV+Q +++++
Subjt:  TDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTIV

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia] | 2.9e-118 | 55.29
Query:  DEVEQHRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETR
        DEVE  +  +TI    ++ M++ L PI+ E SIYRVS RL +IN +AYTPQAISIGPFHHG++  MAMEQLKLRFL  YL R+ MG+E AFE+AQ WETR
Subjt:  DEVEQHRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETR

Query:  ARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMG
        AR  YAE I+M SD FVKMMLVDG F VEF+ + + + +  +  L  +   AI VD+YRDLI+LENQLPFF+L+ L ++ S    FV     F  RW+ G
Subjt:  ARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMG

Query:  QLFDLPPIAFTTKLNHLVDLFSFNYAIPS---DNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKH-IMDISFHDGVLEIPHFEISDVFETTMRNLMA
            +     T K NHLVD  SF YA+P+    ND +     +S   P+ATELWEAGV+FQKA E K  IMDI F DGVL IPH EI D FET +RNL+A
Subjt:  QLFDLPPIAFTTKLNHLVDLFSFNYAIPS---DNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKH-IMDISFHDGVLEIPHFEISDVFETTMRNLMA

Query:  FEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTP
        +EHYH G ++R  IQY  F+DELI+T++DVSLLV+A IITNNIGG++E++SKLFNDLCKD +++ DFYYY  IS DL ++    W++    L+RDYFNTP
Subjt:  FEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTP

Query:  WAYISFFAAVILILFTVLQFIFTIV
        WA+ISF AA  L+L T +Q I++ +
Subjt:  WAYISFFAAVILILFTVLQFIFTIV

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia] | 9.8e-106 | 52.71
Query:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY
        H  ++  + + I++M++ LPP+  E +I+RV  RL   N  AY PQ ISIGPFHHG+Q LM MEQ KLRFL  YL R N G+E    + + WET ARN Y
Subjt:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY

Query:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ
        AE INMDSDEFVKMMLVDGCF VE M++     S  E +       A+  DLY DLIMLENQLPFFVLQ LF+Q SL+  +SF+QL H F  R      +
Subjt:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ

Query:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL
          +LP   +  T K+NHLVD  SF Y    A  S   H  + + +     P+ TELWEAG+ F+KA   KHIMDISF D VL+IP  EI DVFET +RNL
Subjt:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL

Query:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN
        MAFE YH  ++ +YAIQY +F++ LI+ ++DVSLLV+A IITN IGG+++E+S LFNDLCKD  +  D   +  I++ L EH   RWNK    L+RDYFN
Subjt:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN

Query:  TPWAYISFFAAVILILFTVLQFIFT
        TPWA+ISF AA  LIL T LQ +F+
Subjt:  TPWAYISFFAAVILILFTVLQFIFT

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia] | 9.8e-106 | 52.71
Query:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY
        H  ++  + + I++M++ LPP+  E +I+RV  RL   N  AY PQ ISIGPFHHG+Q LM MEQ KLRFL  YL R N G+E    + + WET ARN Y
Subjt:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY

Query:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ
        AE INMDSDEFVKMMLVDGCF VE M++     S  E +       A+  DLY DLIMLENQLPFFVLQ LF+Q SL+  +SF+QL H F  R      +
Subjt:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ

Query:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL
          +LP   +  T K+NHLVD  SF Y    A  S   H  + + +     P+ TELWEAG+ F+KA   KHIMDISF D VL+IP  EI DVFET +RNL
Subjt:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL

Query:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN
        MAFE YH  ++ +YAIQY +F++ LI+ ++DVSLLV+A IITN IGG+++E+S LFNDLCKD  +  D   +  I++ L EH   RWNK    L+RDYFN
Subjt:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN

Query:  TPWAYISFFAAVILILFTVLQFIFT
        TPWA+ISF AA  LIL T LQ +F+
Subjt:  TPWAYISFFAAVILILFTVLQFIFT

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia] | 9.8e-106 | 52.71
Query:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY
        H  ++  + + I++M++ LPP+  E +I+RV  RL   N  AY PQ ISIGPFHHG+Q LM MEQ KLRFL  YL R N G+E    + + WET ARN Y
Subjt:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY

Query:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ
        AE INMDSDEFVKMMLVDGCF VE M++     S  E +       A+  DLY DLIMLENQLPFFVLQ LF+Q SL+  +SF+QL H F  R      +
Subjt:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ

Query:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL
          +LP   +  T K+NHLVD  SF Y    A  S   H  + + +     P+ TELWEAG+ F+KA   KHIMDISF D VL+IP  EI DVFET +RNL
Subjt:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL

Query:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN
        MAFE YH  ++ +YAIQY +F++ LI+ ++DVSLLV+A IITN IGG+++E+S LFNDLCKD  +  D   +  I++ L EH   RWNK    L+RDYFN
Subjt:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN

Query:  TPWAYISFFAAVILILFTVLQFIFT
        TPWA+ISF AA  LIL T LQ +F+
Subjt:  TPWAYISFFAAVILILFTVLQFIFT

TrEMBL top hits | e value | %identity | Alignment
A0A6J1BQ17 UPF0481 protein At3g47200-like | 3.7e-111 | 54.5
Query:  ITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCF
        I+ E SIYR+S  L +IN  AYTPQAISIGPFHHG+Q LMAME+LKL FL  YL ++ M  ++AFE+A+ WE RAR  YAE INM+S +FVK++LVD  F
Subjt:  ITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCF

Query:  TVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMGQLFDLPPIAFTTKLNHLVDLFSFNYA
         V  MI+          + G +   AI +D+YRDLI+LENQLPFF+L  LF + +  I FV  A  F  RW+MG    +P      K NHLVD  SF YA
Subjt:  TVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMGQLFDLPPIAFTTKLNHLVDLFSFNYA

Query:  IP---SDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISF--HDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELIN
        +P    +NDH  + N +  N P+ATELWEAGV+FQKA E K IMDI F   +GVL IPH EI D FET MRNL+A+EHYH G ++R  IQY +F+D+LI+
Subjt:  IP---SDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISF--HDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELIN

Query:  TDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTIV
        T++DVSLLV+A IITN IGG+ EE+SK+FN+LCKD S+  DFYYY +IS  LR++    W++W+  LKRDYFN+PW  ISF AA + IL TV+Q +++++
Subjt:  TDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTIV

A0A6J1BR71 UPF0481 protein At3g47200-like | 1.4e-118 | 55.29
Query:  DEVEQHRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETR
        DEVE  +  +TI    ++ M++ L PI+ E SIYRVS RL +IN +AYTPQAISIGPFHHG++  MAMEQLKLRFL  YL R+ MG+E AFE+AQ WETR
Subjt:  DEVEQHRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETR

Query:  ARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMG
        AR  YAE I+M SD FVKMMLVDG F VEF+ + + + +  +  L  +   AI VD+YRDLI+LENQLPFF+L+ L ++ S    FV     F  RW+ G
Subjt:  ARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMG

Query:  QLFDLPPIAFTTKLNHLVDLFSFNYAIPS---DNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKH-IMDISFHDGVLEIPHFEISDVFETTMRNLMA
            +     T K NHLVD  SF YA+P+    ND +     +S   P+ATELWEAGV+FQKA E K  IMDI F DGVL IPH EI D FET +RNL+A
Subjt:  QLFDLPPIAFTTKLNHLVDLFSFNYAIPS---DNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKH-IMDISFHDGVLEIPHFEISDVFETTMRNLMA

Query:  FEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTP
        +EHYH G ++R  IQY  F+DELI+T++DVSLLV+A IITNNIGG++E++SKLFNDLCKD +++ DFYYY  IS DL ++    W++    L+RDYFNTP
Subjt:  FEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTP

Query:  WAYISFFAAVILILFTVLQFIFTIV
        WA+ISF AA  L+L T +Q I++ +
Subjt:  WAYISFFAAVILILFTVLQFIFTIV

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X2 | 4.7e-106 | 52.71
Query:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY
        H  ++  + + I++M++ LPP+  E +I+RV  RL   N  AY PQ ISIGPFHHG+Q LM MEQ KLRFL  YL R N G+E    + + WET ARN Y
Subjt:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY

Query:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ
        AE INMDSDEFVKMMLVDGCF VE M++     S  E +       A+  DLY DLIMLENQLPFFVLQ LF+Q SL+  +SF+QL H F  R      +
Subjt:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ

Query:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL
          +LP   +  T K+NHLVD  SF Y    A  S   H  + + +     P+ TELWEAG+ F+KA   KHIMDISF D VL+IP  EI DVFET +RNL
Subjt:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL

Query:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN
        MAFE YH  ++ +YAIQY +F++ LI+ ++DVSLLV+A IITN IGG+++E+S LFNDLCKD  +  D   +  I++ L EH   RWNK    L+RDYFN
Subjt:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN

Query:  TPWAYISFFAAVILILFTVLQFIFT
        TPWA+ISF AA  LIL T LQ +F+
Subjt:  TPWAYISFFAAVILILFTVLQFIFT

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X3 | 4.7e-106 | 52.71
Query:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY
        H  ++  + + I++M++ LPP+  E +I+RV  RL   N  AY PQ ISIGPFHHG+Q LM MEQ KLRFL  YL R N G+E    + + WET ARN Y
Subjt:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY

Query:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ
        AE INMDSDEFVKMMLVDGCF VE M++     S  E +       A+  DLY DLIMLENQLPFFVLQ LF+Q SL+  +SF+QL H F  R      +
Subjt:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ

Query:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL
          +LP   +  T K+NHLVD  SF Y    A  S   H  + + +     P+ TELWEAG+ F+KA   KHIMDISF D VL+IP  EI DVFET +RNL
Subjt:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL

Query:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN
        MAFE YH  ++ +YAIQY +F++ LI+ ++DVSLLV+A IITN IGG+++E+S LFNDLCKD  +  D   +  I++ L EH   RWNK    L+RDYFN
Subjt:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN

Query:  TPWAYISFFAAVILILFTVLQFIFT
        TPWA+ISF AA  LIL T LQ +F+
Subjt:  TPWAYISFFAAVILILFTVLQFIFT

A0A6J1E120 UPF0481 protein At3g47200-like isoform X1 | 4.7e-106 | 52.71
Query:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY
        H  ++  + + I++M++ LPP+  E +I+RV  RL   N  AY PQ ISIGPFHHG+Q LM MEQ KLRFL  YL R N G+E    + + WET ARN Y
Subjt:  HRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHY

Query:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ
        AE INMDSDEFVKMMLVDGCF VE M++     S  E +       A+  DLY DLIMLENQLPFFVLQ LF+Q SL+  +SF+QL H F  R      +
Subjt:  AETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD--ISFVQLAHWFLLR--WFMGQ

Query:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL
          +LP   +  T K+NHLVD  SF Y    A  S   H  + + +     P+ TELWEAG+ F+KA   KHIMDISF D VL+IP  EI DVFET +RNL
Subjt:  LFDLP--PIAFTTKLNHLVDLFSFNY----AIPSDNDHVTSFNHQSFNH-PSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNL

Query:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN
        MAFE YH  ++ +YAIQY +F++ LI+ ++DVSLLV+A IITN IGG+++E+S LFNDLCKD  +  D   +  I++ L EH   RWNK    L+RDYFN
Subjt:  MAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFN

Query:  TPWAYISFFAAVILILFTVLQFIFT
        TPWA+ISF AA  LIL T LQ +F+
Subjt:  TPWAYISFFAAVILILFTVLQFIFT

SwissProt top hits | e value | %identity | Alignment
P0C897 Putative UPF0481 protein At3g02645 | 5.9e-13 | 28.11
Query:  EQHRRDLTIIVAPIEEMI------KNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRIN-MGVESAFEMAQK
        EQHR D T  V  +++ +       +L  +T   SI+ V   L   +P +YTP  +SIGP+H  K  L  ME+ KL   +    + N        E  Q 
Subjt:  EQHRRDLTIIVAPIEEMI------KNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRIN-MGVESAFEMAQK

Query:  WETRARNHYAETINMDSDEFVKMMLVDGCFTVEFM-IISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLL
         E + R  Y + I  + +  + +M VD  F +EF+ I SF       N++G +       ++ RD++M+ENQ+P FVL+     L   +   + A   LL
Subjt:  WETRARNHYAETINMDSDEFVKMMLVDGCFTVEFM-IISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLL

Query:  RWFMGQLFDLPPIAFTTKLNHLVDLFSFNYAIPSDNDHVTSFNHQSFNH
            G   DL P+                  I  D+D +     Q  NH
Subjt:  RWFMGQLFDLPPIAFTTKLNHLVDLFSFNYAIPSDNDHVTSFNHQSFNH

Q9SD53 UPF0481 protein At3g47200 | 1.4e-30 | 27.08
Query:  IYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYL-------CRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGC
        I+RV     ++NP AY P+ +SIGP+H+G++HL  ++Q K R LQ +L          N+ V++  ++    E + R  Y+E +    D  + MM++DGC
Subjt:  IYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYL-------CRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGC

Query:  FTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLF--NQLSLDISFVQLAHWFLLRWF--MGQLFDLPPIAFTTKLNHLVDLF
        F +   +I        E+ +       +   +  DL++LENQ+PFFVLQ L+  +++ +     ++A  F        G  ++        K  HL+DL 
Subjt:  FTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLF--NQLSLDISFVQLAHWFLLRWF--MGQLFDLPPIAFTTKLNHLVDLF

Query:  SFNYAIPSDNDHVTSFNH--------QSFNHP-----------SATELWEAGVKF--QKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNLMAFEHY
           +   +      S  H        +S N P           SA  L   G+KF  +++KE   I+++      L+IP         +   N +AFE +
Subjt:  SFNYAIPSDNDHVTSFNH--------QSFNHP-----------SATELWEAGVKF--QKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNLMAFEHY

Query:  HAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYI
        +  S+      Y +FM  L+N ++DV+ L   K+I  N  GS+ E+S+ F  + KD     D  Y   + K + E++   +N      +  +F +PW ++
Subjt:  HAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYI

Query:  SFFAAVILILFTVLQFIFTIV
        S  A + +IL T+LQ    I+
Subjt:  SFFAAVILILFTVLQFIFTIV

Arabidopsis top hits | e value | %identity | Alignment
AT3G50130.1 Plant protein of unknown function (DUF247) | 1.1e-49 | 33.18
Query:  IYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMI
        IYRV   L+  N  +Y PQ +S+GPFHHG +HL+ M++ K R +   + R    +E   +  ++ E RAR  Y   I++ S++F +M+++DGCF +E   
Subjt:  IYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMI

Query:  ISFPFTSPFENQLGRSFCDAIAV------DLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMGQLFD---------------------L
          F       ++LG    D +         + RD++MLENQLP FVL  L     L+I   +     L+     + FD                      
Subjt:  ISFPFTSPFENQLGRSFCDAIAV------DLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMGQLFD---------------------L

Query:  PPIAFTTKLN-HLVDLFSFNYAIPSDN-------------DHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTM
         PIA   K   H +D+F  N   P  N               V     Q   H   TEL EAG+KF+  K  +   DI F +G LEIP   I D  ++  
Subjt:  PPIAFTTKLN-HLVDLFSFNYAIPSDN-------------DHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTM

Query:  RNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRD
         NL+AFE  H  S+      Y IFMD LI++ +DV  L    II + +G  + E++ LFN LC++ +      Y  ++S  +  + S +WN  K +LK  
Subjt:  RNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRD

Query:  YFNTPWAYISFFAAVILILFTVLQFIFTIVP
        YFN PWAY SFFAA++L++ T+ Q  FT  P
Subjt:  YFNTPWAYISFFAAVILILFTVLQFIFTIVP

AT3G50140.1 Plant protein of unknown function (DUF247) | 1.3e-47 | 33.33
Query:  IYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMI
        IYRV   L+  +  +Y PQA+S+GP+HHG +HL  M+  K R +   + R   G+E   +  ++ E RAR  Y   I + S++F +M+++DGCF ++   
Subjt:  IYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMI

Query:  ISFPFTSPFE--NQLGRSFCDAIAV------DLYRDLIMLENQLPFFVLQHLFNQLSLDISF-----VQLAHWF---LLRWFM------------GQLFD
            F   +E  ++LG    D +         + RD++MLENQLP FVL  L  +L L   +      QLA  F   L+  +M             + F+
Subjt:  ISFPFTSPFE--NQLGRSFCDAIAV------DLYRDLIMLENQLPFFVLQHLFNQLSLDISF-----VQLAHWF---LLRWFM------------GQLFD

Query:  LPPIAFTTKLN-HLVDLFSFNYAIP-------------SDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETT
          PIA   K   H +D+F  +   P             S    V     Q   H   TEL EAG+KF++ K  +   DI F +G LEIP   I D  ++ 
Subjt:  LPPIAFTTKLN-HLVDLFSFNYAIP-------------SDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETT

Query:  MRNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKR
          NL+A+E  H  S       Y IFMD LI++ +D+  L    II + + G+  E++ +FN LC++ +   +  Y  ++S  +  + + +WN  K  LK 
Subjt:  MRNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKR

Query:  DYFNTPWAYISFFAAVILILFTVLQFIFTIVP
         YF+ PWAY SFFAAVIL+L T+ Q  FT  P
Subjt:  DYFNTPWAYISFFAAVILILFTVLQFIFTIVP

AT3G50160.1 Plant protein of unknown function (DUF247) | 5.3e-49 | 33.33
Query:  QEDVVTQVDDEVEQHRRDLTIIVA--PIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVES
        Q + V  ++D+ EQ  R++ +I     ++ +  N         IYRV   L+  +  +Y PQ +SIGP+HHG +HLM ME+ K R +   + R    +E 
Subjt:  QEDVVTQVDDEVEQHRRDLTIIVA--PIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVES

Query:  AFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCD------AIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD
          +  ++ E +AR  Y   INM+ +EF++M+++DG F +E     F  TS    ++G +  D       +   + RD++MLENQLP+ VL+ L      D
Subjt:  AFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCD------AIAVDLYRDLIMLENQLPFFVLQHLFNQLSLD

Query:  ISFVQLAHWFLLRWFMGQLFDLPPIAFTTKLNHLVDLFS---FNYAIPSDND-HVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIP
        +  +   +  L + F   L     +       H +D+        +  SD D  + +   Q   H   TEL  AGV+F + KE  H  DI F +G L+IP
Subjt:  ISFVQLAHWFLLRWFMGQLFDLPPIAFTTKLNHLVDLFS---FNYAIPSDND-HVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIP

Query:  HFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSG
           I D  ++   NL+AFE  H  S+K+    Y IFMD LIN+ +DVS L    II N + GS  E+S LFN L K+     +  Y   ++ ++  +   
Subjt:  HFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSG

Query:  RWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTI
        +WN  K  L+  YFN PWAY SF AAV L++FT  Q  F +
Subjt:  RWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTI

AT4G31980.1 unknown protein | 8.9e-65 | 37.11
Query:  IVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMD
        +V  I+  +  L  ++ +  IY+V N+LR +NP AYTP+ +S GP H GK+ L AME  K R+L +++ R N  +E    +A+ WE  AR+ YAE + + 
Subjt:  IVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMD

Query:  SDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAV-DLYRDLIMLENQLPFFVLQHLFNQL-----SLDISFVQLAHWFLLRWFMGQLFDLP
        SDEFV+M++VDG F VE ++ S       EN   R F +++ + D+ RD+I++ENQLPFFV++ +F  L         S +QLA     R F   L  + 
Subjt:  SDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAV-DLYRDLIMLENQLPFFVLQHLFNQL-----SLDISFVQLAHWFLLRWFMGQLFDLP

Query:  PIAFTTKLNHLVDLFSFNYAIPSDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNK
           F T+  H VDL    Y +P     +     +  N P ATEL  AGV+F+ A+    ++DISF DGVL+IP   + D+ E+  +N++ FE     SNK
Subjt:  PIAFTTKLNHLVDLFSFNYAIPSDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNK

Query:  RYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAV
         + + Y + +   I +  D  LL+ + II N +G S  ++S LFN + K+      F Y+  +S++L+ + +  WN+WK +L+RDYF+ PWA  S FAA+
Subjt:  RYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAV

Query:  ILILFTVLQFIFTIV
        +L+L T +Q + +I+
Subjt:  ILILFTVLQFIFTIV

AT5G11290.1 Plant protein of unknown function (DUF247) | 1.3e-47 | 34.88
Query:  MEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSF-CDAIAVDLYRDLIMLEN
        ME  KLR+LQ+++ R  + +E    +A+ WE RAR  Y E + + SDE+VKM++VD  F VE ++ S      +   L R +    + VD+  D+++LEN
Subjt:  MEQLKLRFLQTYLCRINMGVESAFEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSF-CDAIAVDLYRDLIMLEN

Query:  QLPFFVLQHLFNQLSLDI-----SFVQLAHWFLLRWFMGQLFDLPPIAFT---TKLNHLVDLFSFNYAIPSDNDHVTSFNHQSFNHP----SATELWEAG
        QLP+FV++ +F  L +D         ++ H    +++M     +P  + +   +K+ H VDL   +  +P     V SF   S        SA E+  AG
Subjt:  QLPFFVLQHLFNQLSLDI-----SFVQLAHWFLLRWFMGQLFDLPPIAFT---TKLNHLVDLFSFNYAIPSDNDHVTSFNHQSFNHP----SATELWEAG

Query:  VKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLC
        VK Q A      +DISF +GVL IP  +I+D+ E+  RN++ FE  H      Y I Y  F+   I +  D  L ++  II N  G + E++S+LFN + 
Subjt:  VKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNKRYAIQYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLC

Query:  KDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTIV
        K+TS  +  +YY  +  +L+ H +  WNKWK  L+RDYF+ PW+  S  AA +L+L T +Q I +I+
Subjt:  KDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTIV


Sequences
CDS sequence
ATGGAAAACTCCATTGATGTTGAAATAATGCAGCAAGAAGATGTTGTCACACAAGTTGATGATGAGGTTGAACAACACCGACGTGATTTAACAATTATTGTGGCACCCAT
TGAAGAAATGATAAAGAATCTTCCTCCCATCACTCCAGAACGCAGTATTTATCGAGTTTCAAATCGATTACGCAGCATCAATCCTGTAGCCTATACCCCTCAAGCCATTT
CCATTGGCCCTTTTCACCATGGTAAACAGCATTTGATGGCCATGGAACAACTTAAACTCCGATTTCTCCAAACTTATCTCTGCCGTATAAATATGGGAGTTGAGTCCGCC
TTTGAAATGGCTCAAAAGTGGGAGACTAGAGCCCGTAATCACTACGCAGAAACCATAAACATGGATAGTGATGAGTTTGTGAAAATGATGCTTGTGGATGGTTGCTTCAC
AGTGGAGTTTATGATCATAAGTTTTCCCTTCACCTCTCCATTTGAAAACCAGTTAGGACGTTCATTCTGTGACGCTATAGCTGTTGATTTATATCGCGACCTTATCATGC
TTGAAAATCAACTTCCTTTCTTTGTTCTTCAACACCTATTCAACCAACTTTCACTAGACATCTCCTTTGTACAACTTGCACACTGGTTTTTACTTAGATGGTTTATGGGC
CAACTATTTGATCTTCCTCCTATTGCATTCACCACAAAATTAAACCACTTGGTCGATTTATTCAGCTTTAACTATGCCATCCCCTCGGACAATGATCATGTGACGAGTTT
CAATCATCAATCTTTCAATCATCCAAGTGCAACCGAGCTTTGGGAGGCTGGTGTCAAGTTCCAAAAAGCGAAAGAAGGCAAACACATTATGGACATAAGCTTCCACGACG
GAGTTCTAGAAATCCCACATTTCGAAATTAGCGACGTCTTCGAAACCACTATGCGAAACCTTATGGCATTCGAGCATTACCATGCAGGGAGTAATAAGAGGTATGCAATC
CAATATGCTATATTTATGGATGAGTTGATAAACACAGACAAAGATGTGAGTTTACTTGTCGAAGCAAAAATCATAACCAACAATATTGGTGGCAGTCATGAAGAAATTTC
AAAACTTTTTAACGACCTTTGTAAGGATACCTCCTTGACATTTGATTTTTACTACTACGGAAAAATAAGCAAAGATTTACGGGAGCATAGCAGTGGACGATGGAACAAGT
GGAAGACTTTACTGAAACGTGACTATTTCAATACACCATGGGCTTATATCTCTTTCTTTGCTGCTGTCATCCTCATTCTCTTCACTGTCTTACAATTCATATTCACTATT
GTACCGGTCGGTTCCAAGTAA
mRNA sequence
AATAAGCACTCTTACTCCATCTTCTTCCCATTGCTCATTCTCTTTCTAATTCCCTCTTTTCAGCTAGCAGCAACCGACATCTCCTTCCTTCCATTTGCAGCTCAAATGGA
AAACTCCATTGATGTTGAAATAATGCAGCAAGAAGATGTTGTCACACAAGTTGATGATGAGGTTGAACAACACCGACGTGATTTAACAATTATTGTGGCACCCATTGAAG
AAATGATAAAGAATCTTCCTCCCATCACTCCAGAACGCAGTATTTATCGAGTTTCAAATCGATTACGCAGCATCAATCCTGTAGCCTATACCCCTCAAGCCATTTCCATT
GGCCCTTTTCACCATGGTAAACAGCATTTGATGGCCATGGAACAACTTAAACTCCGATTTCTCCAAACTTATCTCTGCCGTATAAATATGGGAGTTGAGTCCGCCTTTGA
AATGGCTCAAAAGTGGGAGACTAGAGCCCGTAATCACTACGCAGAAACCATAAACATGGATAGTGATGAGTTTGTGAAAATGATGCTTGTGGATGGTTGCTTCACAGTGG
AGTTTATGATCATAAGTTTTCCCTTCACCTCTCCATTTGAAAACCAGTTAGGACGTTCATTCTGTGACGCTATAGCTGTTGATTTATATCGCGACCTTATCATGCTTGAA
AATCAACTTCCTTTCTTTGTTCTTCAACACCTATTCAACCAACTTTCACTAGACATCTCCTTTGTACAACTTGCACACTGGTTTTTACTTAGATGGTTTATGGGCCAACT
ATTTGATCTTCCTCCTATTGCATTCACCACAAAATTAAACCACTTGGTCGATTTATTCAGCTTTAACTATGCCATCCCCTCGGACAATGATCATGTGACGAGTTTCAATC
ATCAATCTTTCAATCATCCAAGTGCAACCGAGCTTTGGGAGGCTGGTGTCAAGTTCCAAAAAGCGAAAGAAGGCAAACACATTATGGACATAAGCTTCCACGACGGAGTT
CTAGAAATCCCACATTTCGAAATTAGCGACGTCTTCGAAACCACTATGCGAAACCTTATGGCATTCGAGCATTACCATGCAGGGAGTAATAAGAGGTATGCAATCCAATA
TGCTATATTTATGGATGAGTTGATAAACACAGACAAAGATGTGAGTTTACTTGTCGAAGCAAAAATCATAACCAACAATATTGGTGGCAGTCATGAAGAAATTTCAAAAC
TTTTTAACGACCTTTGTAAGGATACCTCCTTGACATTTGATTTTTACTACTACGGAAAAATAAGCAAAGATTTACGGGAGCATAGCAGTGGACGATGGAACAAGTGGAAG
ACTTTACTGAAACGTGACTATTTCAATACACCATGGGCTTATATCTCTTTCTTTGCTGCTGTCATCCTCATTCTCTTCACTGTCTTACAATTCATATTCACTATTGTACC
GGTCGGTTCCAAGTAACAATTAGTTCATTTGCTAGGCTTCTCCTTTTAAATAATGTTAAACCATGTATATATTTAATTACC
Protein sequence
MENSIDVEIMQQEDVVTQVDDEVEQHRRDLTIIVAPIEEMIKNLPPITPERSIYRVSNRLRSINPVAYTPQAISIGPFHHGKQHLMAMEQLKLRFLQTYLCRINMGVESA
FEMAQKWETRARNHYAETINMDSDEFVKMMLVDGCFTVEFMIISFPFTSPFENQLGRSFCDAIAVDLYRDLIMLENQLPFFVLQHLFNQLSLDISFVQLAHWFLLRWFMG
QLFDLPPIAFTTKLNHLVDLFSFNYAIPSDNDHVTSFNHQSFNHPSATELWEAGVKFQKAKEGKHIMDISFHDGVLEIPHFEISDVFETTMRNLMAFEHYHAGSNKRYAI
QYAIFMDELINTDKDVSLLVEAKIITNNIGGSHEEISKLFNDLCKDTSLTFDFYYYGKISKDLREHSSGRWNKWKTLLKRDYFNTPWAYISFFAAVILILFTVLQFIFTI
VPVGSK
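
The CDS and protein sequences in this record can be cross-checked for length consistency. The sketch below assumes the CDS is written 5' to 3', begins with an ATG start codon, and ends with exactly one standard stop codon. The helper name `cds_matches_protein_length` is hypothetical, and it checks only lengths and the start and stop codons, not a full translation.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)

def cds_matches_protein_length(cds, protein):
    """Return True if the CDS length is consistent with the protein length."""
    # Remove line breaks and other whitespace from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    if len(cds) % 3 != 0:
        return False
    if not cds.startswith("ATG") or cds[-3:] not in STOP_CODONS:
        return False
    # One codon per residue, plus the terminal stop codon.
    return len(cds) // 3 - 1 == len(protein)

# Toy input: the first four codons of the CDS followed by a stop codon.
print(cds_matches_protein_length("ATGGAAAAC\nTCCTAA", "MENS"))  # True
```

To apply it to this gene, pass the full wrapped CDS and protein text from the record; the function strips line breaks before comparing.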