; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012452 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012452
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationscaffold63:787288..788586
RNA-Seq ExpressionMS012452
SyntenyMS012452
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.8e-20585.02Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
        MDS  LAAL+SSLISQLLL LFLLFPSSNPHSL SN   DS FYAN F   THFLFSQ+ A+SL FLSVSRKRKRT+ P+ LEL       G   GRVH 
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL

Query:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L+ TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFPC
Subjt:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        PNELE TSSAFE LAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPVYLHG+AVN+
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FG GEYPLLPWL+VPF+GAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+ EEFKTAVAYIGACSILHNALLMREDFSAMADEWE L+SLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+  GLN DST+EKASVIQRALA RARELHS
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]8.6e-20082.49Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH
        MDSR+LAAL+SSLISQLLL L LLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQ+IA+SLSFLSVSRKRKRTH  E LEL PS  GG  GGRGRVH
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH

Query:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LL TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        P+ELE TSS+FE +AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEEGRLL SPPVYLHGMAVNQ
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FGHGEYPLLPWLMVPF+GAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDF+AMADEWE LASLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+G GLNEDS DEKA +IQ+ALALRARELH+
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]1.4e-20585.02Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
        MDS  LAAL+SSLISQLLL LFLLFPSSNPHSL SN   DS FYAN F    HFLFSQ+ A+SL FLSVSRKRKRT+  + LEL       G   GRVH 
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL

Query:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L+ TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFPC
Subjt:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        PNELE TSSAFE LAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPVYLHG+AVN+
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FGHGEYPLLPWL+VPF+GAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+ EEFKTAVAYIGACSILHNALLMREDFSAMADEWE L+SLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+  GLN DST+EKASVIQRALALRARELHS
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

XP_022149126.1 protein ALP1-like [Momordica charantia]1.5e-22893.53Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
        MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL

Query:  LWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP
        LWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP
Subjt:  LWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP

Query:  NELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQY
        NELESTSSAFEALAGLPNCCGVVACT                            SIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQY
Subjt:  NELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQY

Query:  FFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDHG
        FFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDHG
Subjt:  FFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDHG

Query:  SQYIGEGLNEDSTDEKASVIQRALALRARELHS
        SQYIGEGLNEDSTDEKASVIQRALALRARELHS
Subjt:  SQYIGEGLNEDSTDEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]1.0e-20082.95Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH
        MDSR+LAAL+SSLISQLLL L LLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQ+IA+SLSFLSVSRKRKRTH  E LEL PS  GG  GGRGRVH
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH

Query:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LL TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        P+ELE TSSAFE +AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEEGRLL SPPVYLHGMAVNQ
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FGHGEYPLLPWLMVPF+GAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDF+AMADEWE LASLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+G GLNEDS DEKAS+IQ+ALALRARELH+
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein6.7e-20685.02Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
        MDS  LAAL+SSLISQLLL LFLLFPSSNPHSL SN   DS FYAN F    HFLFSQ+ A+SL FLSVSRKRKRT+  + LEL       G   GRVH 
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL

Query:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L+ TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFPC
Subjt:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        PNELE TSSAFE LAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPVYLHG+AVN+
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FGHGEYPLLPWL+VPF+GAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+ EEFKTAVAYIGACSILHNALLMREDFSAMADEWE L+SLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+  GLN DST+EKASVIQRALALRARELHS
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI18.7e-20685.02Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
        MDS  LAAL+SSLISQLLL LFLLFPSSNPHSL SN   DS FYAN F   THFLFSQ+ A+SL FLSVSRKRKRT+ P+ LEL       G   GRVH 
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL

Query:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L+ TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LS EIRLGVGL RLATGCDFSTIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFPC
Subjt:  LW-TRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        PNELE TSSAFE LAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIE+GRLL+SPPVYLHG+AVN+
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FG GEYPLLPWL+VPF+GAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+ EEFKTAVAYIGACSILHNALLMREDFSAMADEWE L+SLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+  GLN DST+EKASVIQRALA RARELHS
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

A0A6J1D7F1 protein ALP1-like7.3e-22993.53Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
        MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHL

Query:  LWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP
        LWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP
Subjt:  LWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP

Query:  NELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQY
        NELESTSSAFEALAGLPNCCGVVACT                            SIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQY
Subjt:  NELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQY

Query:  FFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDHG
        FFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDHG
Subjt:  FFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDHG

Query:  SQYIGEGLNEDSTDEKASVIQRALALRARELHS
        SQYIGEGLNEDSTDEKASVIQRALALRARELHS
Subjt:  SQYIGEGLNEDSTDEKASVIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like3.5e-19982.03Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH
        MDSR+LAAL+SSLISQLLL L LLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQ+IA+SLSFLSVSRKRKRTH  E LEL PS  GG  GGRGRVH
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH

Query:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LL TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        P+ELE TSS+FE +AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEE RLL SPPVYLHGMAVNQ
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FGHGEYPLLPWLMVPF+GAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDF+AMADEWE LASLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+G GLNEDS DEKA+++Q+ALALRARELH+
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like3.9e-19882.03Show/hide
Query:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH
        MDSR+LAAL+SSLISQLLL L LLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQ+IA+SLSFLSVSRKRKRTH  E LEL PS  GG  GGRGRVH
Subjt:  MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGG-GGGRGRVH

Query:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LL TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLWTRRPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        P+ELE TSSAFE +AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEE RLL SPPVYLHG+AVNQ
Subjt:  PNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH
        Y FGHG+YPLLPWLMVPF+GAVSGSTEESFN+AHRLM IPALKAI+SLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDF+AMADEWE LASLDH
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDH

Query:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS
         SQY+G GLNEDS DEKAS+IQ+ALALRARELH+
Subjt:  GSQYIGEGLNEDSTDEKASVIQRALALRARELHS

SwissProt top hitse value%identityAlignment
Q9M2U3 Protein ALP1-like3.3e-2929.74Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL+L+   R+ V L RL +G   S I E FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGR

Query:  LLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-W----GVLSQPMREEFKTAVAYIGACSILHNA
         L+   + L     + +Y  G   +PLLPWL+ P+ G  +   +  FNK H      A  A+  L++ W    GV+  P R         I  C +LHN 
Subjt:  LLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-W----GVLSQPMREEFKTAVAYIGACSILHNA

Query:  LLMRED
        ++  ED
Subjt:  LLMRED

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)2.0e-1326.41Show/hide
Query:  RLATGCDFSTISEQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRI
        RLA G  +  +  +FG  S S A      +C+++       ++ P P+   +          LPNC GVV   RF++       + SI  Q +VDS+ R 
Subjt:  RLATGCDFSTISEQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRI

Query:  LSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALK----AIVSLR
        + I AG+        +   + LF   EE  +L   P  L +G+ V +Y  G    PLLPWL+ P+      S EESF +    +    L     A   +R
Subjt:  LSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALK----AIVSLR

Query:  -NWGVLS---QPMREEFKTAVAYIGACSILHNALLMREDFSAMADEW-EGLASLDHGSQYIGEGLNEDSTDEKASVIQRALALR
          W +L    +P   EF   V   G   +LHN L+   D     +E   G  + D+G     +   E++   +    + +  +R
Subjt:  -NWGVLS---QPMREEFKTAVAYIGACSILHNALLMREDFSAMADEW-EGLASLDHGSQYIGEGLNEDSTDEKASVIQRALALR

AT1G72270.2 LOCATED IN: mitochondrion2.0e-1326.41Show/hide
Query:  RLATGCDFSTISEQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRI
        RLA G  +  +  +FG  S S A      +C+++       ++ P P+   +          LPNC GVV   RF++       + SI  Q +VDS+ R 
Subjt:  RLATGCDFSTISEQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYEDSIATQLVVDSSSRI

Query:  LSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALK----AIVSLR
        + I AG+        +   + LF   EE  +L   P  L +G+ V +Y  G    PLLPWL+ P+      S EESF +    +    L     A   +R
Subjt:  LSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALK----AIVSLR

Query:  -NWGVLS---QPMREEFKTAVAYIGACSILHNALLMREDFSAMADEW-EGLASLDHGSQYIGEGLNEDSTDEKASVIQRALALR
          W +L    +P   EF   V   G   +LHN L+   D     +E   G  + D+G     +   E++   +    + +  +R
Subjt:  -NWGVLS---QPMREEFKTAVAYIGACSILHNALLMREDFSAMADEW-EGLASLDHGSQYIGEGLNEDSTDEKASVIQRALALR

AT3G19120.1 PIF / Ping-Pong family of plant transposases8.5e-2025.59Show/hide
Query:  LFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSR----KRKRTHLPEQLELEPSGGGGGGGRGRVHL----LWT----RRPDSFRN
        LF S++  S  S   S     ++A PLL   L     AS LSFL+V+R        +  P      P   G         L    +W+     R   +R+
Subjt:  LFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSR----KRKRTHLPEQLELEPSGGGGGGGRGRVHL----LWT----RRPDSFRN

Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-PNELESTSSA
         + ++   F  +   L+P +       S L+L A+  + + LSRLA GC   T++ ++ +   +       + R+L T  +  +++ P     L  T+  
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-PNELESTSSA

Query:  FEALAGLPNCCGVVACTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ
        FE L  LPN CG +  T  K+ R +           +  D++  Q+V D       +     G +DDS+    S L+K +  G ++    + + G  V  
Subjt:  FEALAGLPNCCGVVACTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQ

Query:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSQPMREEFKTAVAYIGACSILHN
        Y  G   YPLL +LM PFS   SG+  E+      +     +   + L    W +L Q +      A   I AC +LHN
Subjt:  YFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSQPMREEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases2.4e-3029.74Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL+L+   R+ V L RL +G   S I E FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGR

Query:  LLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-W----GVLSQPMREEFKTAVAYIGACSILHNA
         L+   + L     + +Y  G   +PLLPWL+ P+ G  +   +  FNK H      A  A+  L++ W    GV+  P R         I  C +LHN 
Subjt:  LLDSPPVYL-HGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-W----GVLSQPMREEFKTAVAYIGACSILHNA

Query:  LLMRED
        ++  ED
Subjt:  LLMRED

AT4G29780.1 unknown protein6.5e-2026.95Show/hide
Query:  SGGGGGGGRGRVHL------LWTR--RP----DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQF
        SG G G    R+ +       W R  RP    D FR  FRM+ STF  +   L+  +       RD + +P       R+GV + RLATG     +SE+F
Subjt:  SGGGGGGGRGRVHL------LWTR--RP----DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQF

Query:  GVSESVARFCAKQLCR----VLCTNFRFWVEFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIR---------NSHFYED------SIATQLVVDSS
        G+  S       ++CR    VL   +  W   P  +E+ ST + FE++  +PN  G +  T   II          N    E       SI  Q VV++ 
Subjt:  GVSESVARFCAKQLCR----VLCTNFRFWVEFPCPNELESTSSAFEALAGLPNCCGVVACTRFKIIR---------NSHFYED------SIATQLVVDSS

Query:  SRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLR-N
             +  G  G   D  +L  S+L           S      GM  + +  G+  +PL  +L+VP++      T+ +FN++   +   A  A   L+  
Subjt:  SRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFNKAHRLMCIPALKAIVSLR-N

Query:  WGVLSQPMREEFKTAVAYIGACSILHNALLMRED
        W  L +    + +     +GAC +LHN   MR++
Subjt:  WGVLSQPMREEFKTAVAYIGACSILHNALLMRED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGGGAATTGGCAGCTTTAATCTCTTCTTTGATCTCCCAACTCCTCCTCCACCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTT
CGATTCCGATTCCGATTTCTACGCAAATGCTTTCCCTCTCCTCACCCACTTCCTCTTTTCGCAGGAAATTGCTTCCTCCCTTTCATTTCTCTCGGTTTCGCGTAAGAGGA
AGAGAACCCATTTGCCGGAGCAGCTCGAATTGGAGCCATCTGGTGGCGGCGGCGGCGGCGGCCGTGGACGAGTCCATTTGTTGTGGACTCGAAGGCCCGACTCGTTCAGA
AATCACTTCAGAATGACCTCCTCAACTTTCGAATGGCTCTCCGGTTTGCTCGAGCCCCTTCTGGAGTGCCGCGACCCGGTTGGTTCGCCCCTCAATCTCTCCGCCGAGAT
TCGACTCGGCGTCGGCCTCTCTCGGCTGGCCACCGGCTGCGATTTCTCGACAATCTCCGAACAATTCGGCGTCTCGGAATCGGTAGCTCGGTTCTGTGCTAAGCAGTTAT
GTCGGGTTCTCTGTACCAATTTTCGCTTCTGGGTCGAATTCCCCTGCCCCAATGAGCTCGAATCTACATCCTCAGCGTTCGAAGCCCTCGCGGGGCTCCCGAATTGCTGC
GGCGTGGTTGCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGT
TGCAGGATTTCGTGGCGATAAGGACGACTCGACGGTGCTTATGTCCTCGACGCTGTTCAAAGACATTGAAGAAGGAAGGCTTCTGGATTCTCCTCCGGTTTACCTTCATG
GGATGGCTGTGAATCAGTACTTTTTTGGACATGGCGAATACCCTCTGCTACCATGGTTAATGGTGCCTTTTTCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAAC
AAAGCCCACCGATTGATGTGCATTCCAGCTCTAAAAGCGATCGTTAGCCTGAGAAATTGGGGAGTTCTGAGCCAACCAATGCGTGAGGAGTTCAAAACTGCAGTTGCTTA
TATTGGGGCTTGTTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGGGAGGGCTTAGCTTCACTTGATCATGGCTCTCAGTATA
TTGGGGAGGGATTGAATGAGGATTCAACTGATGAGAAGGCTTCTGTTATTCAGAGGGCACTGGCTCTGAGAGCTAGAGAGCTTCATAGT
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCCGGGAATTGGCAGCTTTAATCTCTTCTTTGATCTCCCAACTCCTCCTCCACCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTT
CGATTCCGATTCCGATTTCTACGCAAATGCTTTCCCTCTCCTCACCCACTTCCTCTTTTCGCAGGAAATTGCTTCCTCCCTTTCATTTCTCTCGGTTTCGCGTAAGAGGA
AGAGAACCCATTTGCCGGAGCAGCTCGAATTGGAGCCATCTGGTGGCGGCGGCGGCGGCGGCCGTGGACGAGTCCATTTGTTGTGGACTCGAAGGCCCGACTCGTTCAGA
AATCACTTCAGAATGACCTCCTCAACTTTCGAATGGCTCTCCGGTTTGCTCGAGCCCCTTCTGGAGTGCCGCGACCCGGTTGGTTCGCCCCTCAATCTCTCCGCCGAGAT
TCGACTCGGCGTCGGCCTCTCTCGGCTGGCCACCGGCTGCGATTTCTCGACAATCTCCGAACAATTCGGCGTCTCGGAATCGGTAGCTCGGTTCTGTGCTAAGCAGTTAT
GTCGGGTTCTCTGTACCAATTTTCGCTTCTGGGTCGAATTCCCCTGCCCCAATGAGCTCGAATCTACATCCTCAGCGTTCGAAGCCCTCGCGGGGCTCCCGAATTGCTGC
GGCGTGGTTGCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGT
TGCAGGATTTCGTGGCGATAAGGACGACTCGACGGTGCTTATGTCCTCGACGCTGTTCAAAGACATTGAAGAAGGAAGGCTTCTGGATTCTCCTCCGGTTTACCTTCATG
GGATGGCTGTGAATCAGTACTTTTTTGGACATGGCGAATACCCTCTGCTACCATGGTTAATGGTGCCTTTTTCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAAC
AAAGCCCACCGATTGATGTGCATTCCAGCTCTAAAAGCGATCGTTAGCCTGAGAAATTGGGGAGTTCTGAGCCAACCAATGCGTGAGGAGTTCAAAACTGCAGTTGCTTA
TATTGGGGCTTGTTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGGGAGGGCTTAGCTTCACTTGATCATGGCTCTCAGTATA
TTGGGGAGGGATTGAATGAGGATTCAACTGATGAGAAGGCTTCTGTTATTCAGAGGGCACTGGCTCTGAGAGCTAGAGAGCTTCATAGT
Protein sequenceShow/hide protein sequence
MDSRELAALISSLISQLLLHLFLLFPSSNPHSLLSNFDSDSDFYANAFPLLTHFLFSQEIASSLSFLSVSRKRKRTHLPEQLELEPSGGGGGGGRGRVHLLWTRRPDSFR
NHFRMTSSTFEWLSGLLEPLLECRDPVGSPLNLSAEIRLGVGLSRLATGCDFSTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELESTSSAFEALAGLPNCC
GVVACTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEEGRLLDSPPVYLHGMAVNQYFFGHGEYPLLPWLMVPFSGAVSGSTEESFN
KAHRLMCIPALKAIVSLRNWGVLSQPMREEFKTAVAYIGACSILHNALLMREDFSAMADEWEGLASLDHGSQYIGEGLNEDSTDEKASVIQRALALRARELHS