; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002303 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002303
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUPF0481 protein At3g47200-like
Genome locationscaffold1:30972141..30973379
RNA-Seq ExpressionSpg002303
SyntenySpg002303
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]4.2e-11957.21Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        ML++L PI+EECSIYRV KRL NIN +AYTPQ ISIGPFH  QK+ MA E+LKLR LD+YLRR + + +EDA + A+ WE++AR+ YAE I M SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF-----SIGLIKDNYELP
        M+LVDG F+VE + ++Y+    TQ  L+ + F+A+ +D+YRDL +LENQLPFF+LE L        +K ++   F+  T  F      +  LI D     
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF-----SIGLIKDNYELP

Query:  HDVSTQKINHLVDFLRFYY----VPSKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRL-MDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV
          + T+K NHLVDFL FYY    V  K+ K    KRE   PPT TEL EAGV  +KAT+ KRL MDI F+DGVL IPH E+HD FETYVRNL+A+EHY +
Subjt:  HDVSTQKINHLVDFLRFYY----VPSKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRL-MDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV

Query:  GNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYY-DTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSF
        G+DER +I YV FLD LISTE+D SLLVKA IITN+IGG++E++SKLFN+LCKD  I  DFYYY D S  LH++C T  HR MASLRRDYFNTPWA +SF
Subjt:  GNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYY-DTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSF

Query:  TAATFLILLTFLQTLSSFLSLSK
         AATFL+LLT +Q + S +S  K
Subjt:  TAATFLILLTFLQTLSSFLSLSK

XP_022132118.1 UPF0481 protein At3g47200-like [Momordica charantia]1.3e-12559.32Show/hide
Query:  MLQQLPPI-AEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFV
        +L++ PPI A E  IYRVPKRL ++   AYTP+VI+IGPFH  + DL+AT++ KL C  +YL R I+ +V+  V  A+ WE KARR YAEPI M SDDFV
Subjt:  MLQQLPPI-AEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFV

Query:  KMLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDV-
        +M+L+D CFIVE+MII    CF+T+ G D  F K +  DLY++LTMLENQLPFFVL+ LF++ P    KN   +SFIQLT KF S GLI+  Y LP  V 
Subjt:  KMLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDV-

Query:  STQKINHLVDFLRFYYVPSKSVKESGEKRE-----FLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV--G
        ST+++NHLVD L FYYVPS   +E  + +E     FLLPPTIT+LCEAGV VKKA   + L+DISF+ GVL+IP FE+HD FE YVRNL+AFE Y V   
Subjt:  STQKINHLVDFLRFYYVPSKSVKESGEKRE-----FLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV--G

Query:  NDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTA
        +D+RYVIHY+EFLDGLIST +D +LLVK  II NHIGGS++E+S+LFNNLCK+T IP  FY+Y TSK LH+HC     R  A+LRRDYF++PWA +S  A
Subjt:  NDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTA

Query:  ATFLILLTFLQTL
        ATFLILL  LQT+
Subjt:  ATFLILLTFLQTL

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia]3.9e-11757.41Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        MLQ+LPP+AEEC+I+RVP+RL   N  AY PQ+ISIGPFH  ++DLM  E+ KLR LD YLRR     +E  V   +SWE+ AR  YAEPI M+SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV
        M+LVDGCFIVE+M++  R   +T+   DP  F A+  DLY DL MLENQLPFFVL+ LF+    +A      +SF+QLTH F++ G LIK    ELPH V
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV

Query:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY
          ST K+NHLVDFL FYY P        S S+  S +K  F  PPT+TEL EAG++ KKA + K +MDISF+D VL+IP  E+ D FETYVRNL+AFE Y
Subjt:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY

Query:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV
           ND +Y I Y  FL+GLIS E+D SLLVKA IITN IGG+++E+S LFN+LCKD  +  D   ++  ++ALHEHC  R ++ MASLRRDYFNTPWA +
Subjt:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV

Query:  SFTAATFLILLTFLQTLSSFLSLSK
        SF AA FLILLTFLQTL S +SLSK
Subjt:  SFTAATFLILLTFLQTLSSFLSLSK

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]3.9e-11757.41Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        MLQ+LPP+AEEC+I+RVP+RL   N  AY PQ+ISIGPFH  ++DLM  E+ KLR LD YLRR     +E  V   +SWE+ AR  YAEPI M+SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV
        M+LVDGCFIVE+M++  R   +T+   DP  F A+  DLY DL MLENQLPFFVL+ LF+    +A      +SF+QLTH F++ G LIK    ELPH V
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV

Query:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY
          ST K+NHLVDFL FYY P        S S+  S +K  F  PPT+TEL EAG++ KKA + K +MDISF+D VL+IP  E+ D FETYVRNL+AFE Y
Subjt:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY

Query:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV
           ND +Y I Y  FL+GLIS E+D SLLVKA IITN IGG+++E+S LFN+LCKD  +  D   ++  ++ALHEHC  R ++ MASLRRDYFNTPWA +
Subjt:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV

Query:  SFTAATFLILLTFLQTLSSFLSLSK
        SF AA FLILLTFLQTL S +SLSK
Subjt:  SFTAATFLILLTFLQTLSSFLSLSK

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]3.9e-11757.41Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        MLQ+LPP+AEEC+I+RVP+RL   N  AY PQ+ISIGPFH  ++DLM  E+ KLR LD YLRR     +E  V   +SWE+ AR  YAEPI M+SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV
        M+LVDGCFIVE+M++  R   +T+   DP  F A+  DLY DL MLENQLPFFVL+ LF+    +A      +SF+QLTH F++ G LIK    ELPH V
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV

Query:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY
          ST K+NHLVDFL FYY P        S S+  S +K  F  PPT+TEL EAG++ KKA + K +MDISF+D VL+IP  E+ D FETYVRNL+AFE Y
Subjt:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY

Query:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV
           ND +Y I Y  FL+GLIS E+D SLLVKA IITN IGG+++E+S LFN+LCKD  +  D   ++  ++ALHEHC  R ++ MASLRRDYFNTPWA +
Subjt:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV

Query:  SFTAATFLILLTFLQTLSSFLSLSK
        SF AA FLILLTFLQTL S +SLSK
Subjt:  SFTAATFLILLTFLQTLSSFLSLSK

TrEMBL top hitse value%identityAlignment
A0A6J1BR71 UPF0481 protein At3g47200-like2.0e-11957.21Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        ML++L PI+EECSIYRV KRL NIN +AYTPQ ISIGPFH  QK+ MA E+LKLR LD+YLRR + + +EDA + A+ WE++AR+ YAE I M SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF-----SIGLIKDNYELP
        M+LVDG F+VE + ++Y+    TQ  L+ + F+A+ +D+YRDL +LENQLPFF+LE L        +K ++   F+  T  F      +  LI D     
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF-----SIGLIKDNYELP

Query:  HDVSTQKINHLVDFLRFYY----VPSKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRL-MDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV
          + T+K NHLVDFL FYY    V  K+ K    KRE   PPT TEL EAGV  +KAT+ KRL MDI F+DGVL IPH E+HD FETYVRNL+A+EHY +
Subjt:  HDVSTQKINHLVDFLRFYY----VPSKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRL-MDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV

Query:  GNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYY-DTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSF
        G+DER +I YV FLD LISTE+D SLLVKA IITN+IGG++E++SKLFN+LCKD  I  DFYYY D S  LH++C T  HR MASLRRDYFNTPWA +SF
Subjt:  GNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYY-DTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSF

Query:  TAATFLILLTFLQTLSSFLSLSK
         AATFL+LLT +Q + S +S  K
Subjt:  TAATFLILLTFLQTLSSFLSLSK

A0A6J1BVD4 UPF0481 protein At3g47200-like6.5e-12659.32Show/hide
Query:  MLQQLPPI-AEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFV
        +L++ PPI A E  IYRVPKRL ++   AYTP+VI+IGPFH  + DL+AT++ KL C  +YL R I+ +V+  V  A+ WE KARR YAEPI M SDDFV
Subjt:  MLQQLPPI-AEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFV

Query:  KMLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDV-
        +M+L+D CFIVE+MII    CF+T+ G D  F K +  DLY++LTMLENQLPFFVL+ LF++ P    KN   +SFIQLT KF S GLI+  Y LP  V 
Subjt:  KMLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDV-

Query:  STQKINHLVDFLRFYYVPSKSVKESGEKRE-----FLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV--G
        ST+++NHLVD L FYYVPS   +E  + +E     FLLPPTIT+LCEAGV VKKA   + L+DISF+ GVL+IP FE+HD FE YVRNL+AFE Y V   
Subjt:  STQKINHLVDFLRFYYVPSKSVKESGEKRE-----FLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPV--G

Query:  NDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTA
        +D+RYVIHY+EFLDGLIST +D +LLVK  II NHIGGS++E+S+LFNNLCK+T IP  FY+Y TSK LH+HC     R  A+LRRDYF++PWA +S  A
Subjt:  NDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTA

Query:  ATFLILLTFLQTL
        ATFLILL  LQT+
Subjt:  ATFLILLTFLQTL

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X21.9e-11757.41Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        MLQ+LPP+AEEC+I+RVP+RL   N  AY PQ+ISIGPFH  ++DLM  E+ KLR LD YLRR     +E  V   +SWE+ AR  YAEPI M+SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV
        M+LVDGCFIVE+M++  R   +T+   DP  F A+  DLY DL MLENQLPFFVL+ LF+    +A      +SF+QLTH F++ G LIK    ELPH V
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV

Query:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY
          ST K+NHLVDFL FYY P        S S+  S +K  F  PPT+TEL EAG++ KKA + K +MDISF+D VL+IP  E+ D FETYVRNL+AFE Y
Subjt:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY

Query:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV
           ND +Y I Y  FL+GLIS E+D SLLVKA IITN IGG+++E+S LFN+LCKD  +  D   ++  ++ALHEHC  R ++ MASLRRDYFNTPWA +
Subjt:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV

Query:  SFTAATFLILLTFLQTLSSFLSLSK
        SF AA FLILLTFLQTL S +SLSK
Subjt:  SFTAATFLILLTFLQTLSSFLSLSK

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X31.9e-11757.41Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        MLQ+LPP+AEEC+I+RVP+RL   N  AY PQ+ISIGPFH  ++DLM  E+ KLR LD YLRR     +E  V   +SWE+ AR  YAEPI M+SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV
        M+LVDGCFIVE+M++  R   +T+   DP  F A+  DLY DL MLENQLPFFVL+ LF+    +A      +SF+QLTH F++ G LIK    ELPH V
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV

Query:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY
          ST K+NHLVDFL FYY P        S S+  S +K  F  PPT+TEL EAG++ KKA + K +MDISF+D VL+IP  E+ D FETYVRNL+AFE Y
Subjt:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY

Query:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV
           ND +Y I Y  FL+GLIS E+D SLLVKA IITN IGG+++E+S LFN+LCKD  +  D   ++  ++ALHEHC  R ++ MASLRRDYFNTPWA +
Subjt:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV

Query:  SFTAATFLILLTFLQTLSSFLSLSK
        SF AA FLILLTFLQTL S +SLSK
Subjt:  SFTAATFLILLTFLQTLSSFLSLSK

A0A6J1E120 UPF0481 protein At3g47200-like isoform X11.9e-11757.41Show/hide
Query:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK
        MLQ+LPP+AEEC+I+RVP+RL   N  AY PQ+ISIGPFH  ++DLM  E+ KLR LD YLRR     +E  V   +SWE+ AR  YAEPI M+SD+FVK
Subjt:  MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVK

Query:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV
        M+LVDGCFIVE+M++  R   +T+   DP  F A+  DLY DL MLENQLPFFVL+ LF+    +A      +SF+QLTH F++ G LIK    ELPH V
Subjt:  MLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIG-LIKD-NYELPHDV

Query:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY
          ST K+NHLVDFL FYY P        S S+  S +K  F  PPT+TEL EAG++ KKA + K +MDISF+D VL+IP  E+ D FETYVRNL+AFE Y
Subjt:  --STQKINHLVDFLRFYYVP--------SKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHY

Query:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV
           ND +Y I Y  FL+GLIS E+D SLLVKA IITN IGG+++E+S LFN+LCKD  +  D   ++  ++ALHEHC  R ++ MASLRRDYFNTPWA +
Subjt:  PVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYD-TSKALHEHCLTRRHRWMASLRRDYFNTPWASV

Query:  SFTAATFLILLTFLQTLSSFLSLSK
        SF AA FLILLTFLQTL S +SLSK
Subjt:  SFTAATFLILLTFLQTLSSFLSLSK

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026454.5e-1527.65Show/hide
Query:  SIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIVEV
        SI+ VPK L   +  +YTP  +SIGP+H  + +L   E  KL        ++      D V+  +S E K R  Y + I  N +  + ++ VD  F++E 
Subjt:  SIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIVEV

Query:  MIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNN-RVSFIQLTHKFFSIGLIKDNYELPHDVSTQKINHLVDFLR
        + IY      +   ++    +    ++ RD+ M+ENQ+P FVL +      +  E  ++  +S +    K  S  +IK + +       Q+ NH++DFL 
Subjt:  MIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNN-RVSFIQLTHKFFSIGLIKDNYELPHDVSTQKINHLVDFLR

Query:  FYYVPSKSVKESGEKRE
           VP    +E  E  E
Subjt:  FYYVPSKSVKESGEKRE

Q9SD53 UPF0481 protein At3g472001.0e-3529.34Show/hide
Query:  EECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVED--AVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGC
        E C I+RVP+    +N  AY P+V+SIGP+H  +K L   ++ K R L  +L    K +VE+   VK     E K R+ Y+E +     D + M+++DGC
Subjt:  EECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVED--AVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGC

Query:  FIVEVMIIYYRGCFQTQNGLDPSF-FKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKD-NYELPHDVSTQKINH
        FI+ V +I       ++   DP F    L   +  DL +LENQ+PFFVL+ L+         + NR++F      FF   + K+ +Y   H     K  H
Subjt:  FIVEVMIIYYRGCFQTQNGLDPSF-FKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKD-NYELPHDVSTQKINH

Query:  LVDFLRFYYVP--SKSVKESGEKREFLL----PPTITELCEAGVIVKKATKGKRLMDISF------QDGV---------LEIPHFEVHDHFETYVRNLIA
        L+D +R  ++P  S+S K S    +  L       +  +    V +  + K  RL  I F      +D +         L+IP         ++  N +A
Subjt:  LVDFLRFYYVP--SKSVKESGEKREFLL----PPTITELCEAGVIVKKATKGKRLMDISF------QDGV---------LEIPHFEVHDHFETYVRNLIA

Query:  FEHYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSD-FYYYDTSKALHEHCLTRRHRWMASLRRDYFNTP
        FE +   +    +  Y+ F+  L++ E+D + L   K+I  +  GS+ E+S+ F  + KD     D  Y  +  K ++E+     +   A  R  +F +P
Subjt:  FEHYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSD-FYYYDTSKALHEHCLTRRHRWMASLRRDYFNTP

Query:  WASVSFTAATFLILLTFLQTLSSFLS
        W  +S  A  F+ILLT LQ+  + LS
Subjt:  WASVSFTAATFLILLTFLQTLSSFLS

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)4.0e-5132.68Show/hide
Query:  CSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIVE
        CSI+RVP+ + + N   Y P+V+SIGP+HR Q  L   EE K R L+  L R   L +ED +K+ K+ E  AR  Y+E I M+S++F +M+++DGCF++E
Subjt:  CSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIVE

Query:  VMIIYYRGCFQTQNGL------DPSFFKALRID-LYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDVSTQKIN
        +        F+  N L      DP    A  +   YRD   LENQ+PFFVLE LFN+   D E N    S   L   FF+  + +   +L       +  
Subjt:  VMIIYYRGCFQTQNGL------DPSFFKALRID-LYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDVSTQKIN

Query:  HLVDFLRFYYVPSK------SVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPVGNDERYVI
        HL+D LR  ++P        +     EK    +  +I++L  AG+ +++    +  + + F+ G +E+P   V D   +++ N +A+E   V     +  
Subjt:  HLVDFLRFYYVPSK------SVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPVGNDERYVI

Query:  HYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIP-SDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTAATFLIL
         Y   LD L +T KD   L    II N+  G+D E++K  N+L +D     +  Y  D  + ++E+  +  H   A+ +  YFN+PW+ VS  AA  L++
Subjt:  HYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIP-SDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTAATFLIL

Query:  LTFLQTL
        L+ +QT+
Subjt:  LTFLQTL

AT3G50120.1 Plant protein of unknown function (DUF247)3.5e-4733.41Show/hide
Query:  IYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRR---HIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIV
        IYRVP  L+  ++ +Y PQ +S+GP+H  +K L + +  K R ++  L+R    IK+ + DA++     E KAR  Y  P++++S++F++ML++DGCF++
Subjt:  IYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRR---HIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIV

Query:  EVMIIYYRGCFQ--TQNGL---DPSF-FKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF-----------SIGLIKDNYE
        E+    +RG  +  T+ G    DP F  +     + RD+ MLENQLP FVL +L  +        N      QL  +FF             G  K    
Subjt:  EVMIIYYRGCFQ--TQNGL---DPSF-FKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF-----------SIGLIKDNYE

Query:  LPHDVSTQKIN-----HLVDFLRFYYVPSKSVKES-------------GEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHF
        L  D S          H +D  R   + S    E               +KR   L   +TEL EAG+  ++  K  R  D+ F++G LEIP   +HD  
Subjt:  LPHDVSTQKIN-----HLVDFLRFYYVPSKSVKES-------------GEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHF

Query:  ETYVRNLIAFE--HYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPS-DFYYYDTSKALHEHCLTRRHRWM
        ++   NLIAFE  H    ND   +  Y+ F+D LI + +D S L    II  H  GSD E++ LFN LC++    + D Y    S  ++ +   + + W 
Subjt:  ETYVRNLIAFE--HYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPS-DFYYYDTSKALHEHCLTRRHRWM

Query:  ASLRRDYFNTPWASVSFTAATFLILLTFLQT
        A+L+  YFN PWA VSF AA  L++LTF Q+
Subjt:  ASLRRDYFNTPWASVSFTAATFLILLTFLQT

AT3G50150.1 Plant protein of unknown function (DUF247)5.9e-4733.18Show/hide
Query:  EECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITM-NSDDFVKMLLVDGCF
        ++  IYRVP  L+  +  +Y PQ +SIGP+H  +  L   E  K R ++  + R  K  +E  +   K  E +AR  Y  PI M NS++F +ML++DGCF
Subjt:  EECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITM-NSDDFVKMLLVDGCF

Query:  IVEVMIIYYRGCFQTQNGL-----DPSFFK-ALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF------SIGLIKDNYELPH
        ++E+    ++G  Q    +     DP F K  L   + RD+ MLENQLP FVL++L  +  Q    N   +   ++  +FF      S  L K    L  
Subjt:  IVEVMIIYYRGCFQTQNGL-----DPSFFK-ALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFF------SIGLIKDNYELPH

Query:  DVSTQKIN-----HLVDFLRFYYVPSKSVKESG---------EKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNL
           + ++      H +D      + S      G         EK++ L+   +TEL  AGV   +   G +L DI F++G L+IP   +HD  ++   NL
Subjt:  DVSTQKIN-----HLVDFLRFYYVPSKSVKESG---------EKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNL

Query:  IAFEHYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTI-PSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFN
        IAFE     +    +  Y+ F+D LI++ +D S L    II  H  GSD E++ LFN LCK+    P D Y    S+ ++ +   + +   A+LR+ YFN
Subjt:  IAFEHYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTI-PSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFN

Query:  TPWASVSFTAATFLILLTFLQT
         PWA  SF+AA  L+ LTF Q+
Subjt:  TPWASVSFTAATFLILLTFLQT

AT4G31980.1 unknown protein4.7e-6838.33Show/hide
Query:  LPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLV
        L  ++ +C IY+VP +LR +N  AYTP+++S GP HR +++L A E+ K R L S++ R     +ED V+ A++WE  AR  YAE + ++SD+FV+ML+V
Subjt:  LPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLV

Query:  DGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRI-DLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDVSTQKI
        DG F+VE+++  +    + +N  D  F  ++ I D+ RD+ ++ENQLPFFV++++F +L    ++     S IQL  + FS  L +    +  +    + 
Subjt:  DGCFIVEVMIIYYRGCFQTQNGLDPSFFKALRI-DLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDVSTQKI

Query:  NHLVDFLRFYYVPSKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPVGNDERYVIHYVEF
         H VD LR  Y+P   +K      +    P  TEL  AGV  K A     L+DISF DGVL+IP   V D  E+  +N+I FE     N  +  + Y+  
Subjt:  NHLVDFLRFYYVPSKSVKESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPVGNDERYVIHYVEF

Query:  LDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTAATFLILLTFLQT
        L   I +  DA LL+ + II N++G S  ++S LFN++ K+      FY+   S+ L  +C T  +RW A LRRDYF+ PWA  S  AA  L+LLTF+Q+
Subjt:  LDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTTIPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTAATFLILLTFLQT

Query:  LSSFLSL
        + S L+L
Subjt:  LSSFLSL

AT5G11290.1 Plant protein of unknown function (DUF247)1.1e-5336.26Show/hide
Query:  EELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFF--KALRIDLYRDLTMLE
        E+ KLR L S++ R   L +ED V+ A++WE +AR  Y E + ++SD++VKML+VD  F+VE+++   R  F    G+    +  + + +D+  D+ +LE
Subjt:  EELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIVEVMIIYYRGCFQTQNGLDPSFF--KALRIDLYRDLTMLE

Query:  NQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTH-KFFSIGLIKDNYELPHDVSTQKINHLVDFLRFYYVPSKSVKESGEKREFLLPPTITELCEAGVIVK
        NQLP+FV+E +F +L  D  +    ++ I   H K F + +          +S  KI H VD LR  ++P       G  R      +  E+  AGV ++
Subjt:  NQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTH-KFFSIGLIKDNYELPHDVSTQKINHLVDFLRFYYVPSKSVKESGEKREFLLPPTITELCEAGVIVK

Query:  KATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTT
         A      +DISF +GVL IP  +++D  E+  RN+I FE       + Y IHY+ FL   I +  DA L +   II N  G + E++S+LFN++ K+T+
Subjt:  KATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGSDEEISKLFNNLCKDTT

Query:  IPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTAATFLILLTFLQTLSSFLSL
          S FYY      L  HC    ++W A+LRRDYF+ PW++ S  AA  L+LLTF+Q + S L+L
Subjt:  IPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTAATFLILLTFLQTLSSFLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCAGCAACTACCTCCTATCGCTGAAGAATGTAGCATCTATCGAGTTCCTAAACGGTTACGCAACATTAATCATATCGCCTATACTCCTCAAGTCATTTCCATTGG
CCCTTTCCACCGTCGGCAAAAGGATTTGATGGCCACTGAAGAGCTTAAACTTCGATGTTTGGATAGTTACCTACGTCGTCATATAAAATTGGAAGTTGAGGACGCCGTGA
AGAATGCCAAGAGTTGGGAGAGTAAAGCCCGTCGTTTCTATGCAGAACCTATAACCATGAATAGCGACGACTTTGTCAAAATGTTGCTTGTAGATGGATGTTTCATAGTG
GAGGTCATGATAATATATTACCGTGGATGTTTCCAAACTCAAAACGGGTTAGATCCTTCATTCTTCAAAGCTCTAAGAATCGACTTATACCGTGACTTGACAATGCTCGA
GAATCAACTCCCCTTCTTTGTTCTTGAACAGCTATTCAACATGCTTCCTCAGGATGCAGAGAAAAATAATAATCGTGTCTCGTTTATACAACTTACCCACAAATTTTTCA
GTATTGGATTGATAAAAGATAATTATGAGCTTCCTCATGATGTCTCCACACAAAAAATAAACCACTTGGTCGATTTCTTAAGGTTCTACTACGTCCCATCGAAATCTGTG
AAGGAAAGTGGGGAAAAGAGGGAGTTTCTGCTTCCTCCCACTATAACTGAGCTTTGTGAGGCTGGTGTCATCGTTAAAAAGGCAACAAAAGGTAAACGCTTGATGGACAT
AAGCTTCCAAGATGGGGTCCTAGAAATCCCACATTTTGAAGTTCATGATCATTTTGAAACCTATGTGCGAAACTTGATTGCATTTGAGCATTACCCCGTGGGGAATGATG
AGAGGTATGTAATCCATTATGTTGAGTTCTTAGATGGCTTGATAAGCACTGAGAAAGACGCAAGTTTACTTGTGAAGGCGAAAATCATAACCAACCATATTGGTGGCAGT
GATGAAGAAATTTCAAAACTTTTCAACAATCTTTGTAAAGATACCACCATTCCAAGTGATTTTTACTACTATGATACGAGCAAAGCTTTACATGAGCACTGCTTGACACG
ACGACACCGGTGGATGGCTTCATTAAGACGAGATTATTTCAATACACCATGGGCTTCTGTCTCTTTCACTGCCGCTACCTTCCTCATTTTACTCACTTTCCTTCAAACCT
TATCATCTTTTCTATCACTTTCAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCAGCAACTACCTCCTATCGCTGAAGAATGTAGCATCTATCGAGTTCCTAAACGGTTACGCAACATTAATCATATCGCCTATACTCCTCAAGTCATTTCCATTGG
CCCTTTCCACCGTCGGCAAAAGGATTTGATGGCCACTGAAGAGCTTAAACTTCGATGTTTGGATAGTTACCTACGTCGTCATATAAAATTGGAAGTTGAGGACGCCGTGA
AGAATGCCAAGAGTTGGGAGAGTAAAGCCCGTCGTTTCTATGCAGAACCTATAACCATGAATAGCGACGACTTTGTCAAAATGTTGCTTGTAGATGGATGTTTCATAGTG
GAGGTCATGATAATATATTACCGTGGATGTTTCCAAACTCAAAACGGGTTAGATCCTTCATTCTTCAAAGCTCTAAGAATCGACTTATACCGTGACTTGACAATGCTCGA
GAATCAACTCCCCTTCTTTGTTCTTGAACAGCTATTCAACATGCTTCCTCAGGATGCAGAGAAAAATAATAATCGTGTCTCGTTTATACAACTTACCCACAAATTTTTCA
GTATTGGATTGATAAAAGATAATTATGAGCTTCCTCATGATGTCTCCACACAAAAAATAAACCACTTGGTCGATTTCTTAAGGTTCTACTACGTCCCATCGAAATCTGTG
AAGGAAAGTGGGGAAAAGAGGGAGTTTCTGCTTCCTCCCACTATAACTGAGCTTTGTGAGGCTGGTGTCATCGTTAAAAAGGCAACAAAAGGTAAACGCTTGATGGACAT
AAGCTTCCAAGATGGGGTCCTAGAAATCCCACATTTTGAAGTTCATGATCATTTTGAAACCTATGTGCGAAACTTGATTGCATTTGAGCATTACCCCGTGGGGAATGATG
AGAGGTATGTAATCCATTATGTTGAGTTCTTAGATGGCTTGATAAGCACTGAGAAAGACGCAAGTTTACTTGTGAAGGCGAAAATCATAACCAACCATATTGGTGGCAGT
GATGAAGAAATTTCAAAACTTTTCAACAATCTTTGTAAAGATACCACCATTCCAAGTGATTTTTACTACTATGATACGAGCAAAGCTTTACATGAGCACTGCTTGACACG
ACGACACCGGTGGATGGCTTCATTAAGACGAGATTATTTCAATACACCATGGGCTTCTGTCTCTTTCACTGCCGCTACCTTCCTCATTTTACTCACTTTCCTTCAAACCT
TATCATCTTTTCTATCACTTTCAAAGTAA
Protein sequenceShow/hide protein sequence
MLQQLPPIAEECSIYRVPKRLRNINHIAYTPQVISIGPFHRRQKDLMATEELKLRCLDSYLRRHIKLEVEDAVKNAKSWESKARRFYAEPITMNSDDFVKMLLVDGCFIV
EVMIIYYRGCFQTQNGLDPSFFKALRIDLYRDLTMLENQLPFFVLEQLFNMLPQDAEKNNNRVSFIQLTHKFFSIGLIKDNYELPHDVSTQKINHLVDFLRFYYVPSKSV
KESGEKREFLLPPTITELCEAGVIVKKATKGKRLMDISFQDGVLEIPHFEVHDHFETYVRNLIAFEHYPVGNDERYVIHYVEFLDGLISTEKDASLLVKAKIITNHIGGS
DEEISKLFNNLCKDTTIPSDFYYYDTSKALHEHCLTRRHRWMASLRRDYFNTPWASVSFTAATFLILLTFLQTLSSFLSLSK