CuGenDBv2

CSPI01G14660 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G14660
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Aspergillus nuclease S(1)
Genome location: Chr1:10374403..10378231
RNA-Seq Expression: CSPI01G14660
Synteny: CSPI01G14660
Gene Ontology terms:
GO:0006308 - DNA catabolic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000014 - single-stranded DNA endodeoxyribonuclease activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004521 - endoribonuclease activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR003154 - S1/P1 nuclease
IPR008947 - Phospholipase C/P1 nuclease domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139483.1 endonuclease 4 isoform X1 [Cucumis sativus] (e value: 9.3e-177; % identity: 99.67)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQSELCWTANAFLFLL LPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

XP_008461475.1 PREDICTED: endonuclease 4-like isoform X1 [Cucumis melo] (e value: 1.9e-158; % identity: 87.63)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQSELCWTAN F+FLLLLPGILGWGREGHY +CKIAE YLTEDALSMV+ELLP+ AEGDLAAVCSWADELR + DYHWS  LH+VDTPDFFCNY CSRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD++RHKGRCVTAAIYNYTMQLESAYKEITSE++YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGNLIKV WY+R TNLHHVWDTMII+SALKRFYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        L+LMIQAIQ+NISDEWHNEVSAWRNCT N T CPNPYASES+S+ACKYAYKNATPGS LEDSYFL+RLPV+EKRLAQ GIRLASTLNRIFASE KVAE+
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

XP_008461483.1 PREDICTED: endonuclease 4-like isoform X2 [Cucumis melo] (e value: 1.9e-158; % identity: 87.63)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQSELCWTAN F+FLLLLPGILGWGREGHY +CKIAE YLTEDALSMV+ELLP+ AEGDLAAVCSWADELR + DYHWS  LH+VDTPDFFCNY CSRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD++RHKGRCVTAAIYNYTMQLESAYKEITSE++YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGNLIKV WY+R TNLHHVWDTMII+SALKRFYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        L+LMIQAIQ+NISDEWHNEVSAWRNCT N T CPNPYASES+S+ACKYAYKNATPGS LEDSYFL+RLPV+EKRLAQ GIRLASTLNRIFASE KVAE+
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

XP_038883248.1 endonuclease 4-like isoform X1 [Benincasa hispida] (e value: 2.8e-149; % identity: 82.62)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSR-
        MGQS+LCWTANAF+FLLLLPGILGWG+EGHY+ICKIAE Y TED LS VKELLP+ AEGDLAAVCSW DE++    Y WS ALHYVDTPDFFCNY CS  
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSR-

Query:  -----DCHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALK
             DCHD + HKGRCVTAAIYNYTMQLESAYKE+TSEI+YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGN I V WYRR+TNLHHVWD MII+SALK
Subjt:  -----DCHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALK

Query:  RFYHSNLLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEG
         FYHSNL LMIQAIQNNISDEW+N+VSAWRNCT+N T CPNPYASES+S+ACKYAYKNATPGS LEDSYFLSRLPV+EKRLAQ GIRLASTLNRIFASE 
Subjt:  RFYHSNLLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEG

Query:  KVAEI
        KVAE+
Subjt:  KVAEI

XP_038883252.1 endonuclease 4-like isoform X2 [Benincasa hispida] (e value: 6.1e-152; % identity: 84.62)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQS+LCWTANAF+FLLLLPGILGWG+EGHY+ICKIAE Y TED LS VKELLP+ AEGDLAAVCSW DE++    Y WS ALHYVDTPDFFCNY CSRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD + HKGRCVTAAIYNYTMQLESAYKE+TSEI+YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGN I V WYRR+TNLHHVWD MII+SALK FYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        L LMIQAIQNNISDEW+N+VSAWRNCT+N T CPNPYASES+S+ACKYAYKNATPGS LEDSYFLSRLPV+EKRLAQ GIRLASTLNRIFASE KVAE+
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LY83 Aspergillus nuclease S(1) (e value: 4.5e-177; % identity: 99.67)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQSELCWTANAFLFLL LPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

A0A1S3CEU3 Aspergillus nuclease S(1) (e value: 9.4e-159; % identity: 87.63)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQSELCWTAN F+FLLLLPGILGWGREGHY +CKIAE YLTEDALSMV+ELLP+ AEGDLAAVCSWADELR + DYHWS  LH+VDTPDFFCNY CSRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD++RHKGRCVTAAIYNYTMQLESAYKEITSE++YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGNLIKV WY+R TNLHHVWDTMII+SALKRFYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        L+LMIQAIQ+NISDEWHNEVSAWRNCT N T CPNPYASES+S+ACKYAYKNATPGS LEDSYFL+RLPV+EKRLAQ GIRLASTLNRIFASE KVAE+
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

A0A1S3CF84 Aspergillus nuclease S(1) (e value: 9.4e-159; % identity: 87.63)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQSELCWTAN F+FLLLLPGILGWGREGHY +CKIAE YLTEDALSMV+ELLP+ AEGDLAAVCSWADELR + DYHWS  LH+VDTPDFFCNY CSRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD++RHKGRCVTAAIYNYTMQLESAYKEITSE++YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGNLIKV WY+R TNLHHVWDTMII+SALKRFYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        L+LMIQAIQ+NISDEWHNEVSAWRNCT N T CPNPYASES+S+ACKYAYKNATPGS LEDSYFL+RLPV+EKRLAQ GIRLASTLNRIFASE KVAE+
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

A0A6J1GEX6 Aspergillus nuclease S(1) (e value: 1.7e-144; % identity: 81.61)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQ ELC  A AF FLLL+PGILGWG+EGHY+ICKIAE Y TED LSMVKELLP+ AEGDLAAVCSW DE+R    Y WS ALHYVDTPDFFCNY  SRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD + HKGRCVTAAIYNYT QLESAYKE+TSEI+YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGN I V WYRR+TNLHHVWD MII+SALK FYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        L LMIQAIQ NIS EW+N+VSAWRNC +N T CPNPYASES+ +ACKYAY+NATPGS LEDSYFLSRLP++EKRLAQ GIRLASTLNRIFASE KVAEI
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

A0A6J1IVA4 Aspergillus nuclease S(1) (e value: 2.4e-146; % identity: 82.61)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MGQSELC  A AF FLLL+PGILGWG+EGHY+ICKIAE Y TED LSMVKELLP+ AEGDLAAVCSW DE+R    Y WS ALHYVDTPDFFCNY  SRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD + HKGRCVTAAIYNYT QLESAYKE+TSEI+YNLTEALMFLSHFIGDVHQPLHVGFVGD+GGN I V WYRR+TNLHHVWD MII+SALK FYHSN
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        L LMIQAIQ NIS EW+N+VSAWRNC +N T CPNPYASES+S+ACKYAYKNATPGS LEDSYFLSRLP++EKRLAQ GIRLASTLNRIFASE KVAEI
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

SwissProt top hits (e value, % identity, alignment)
F4JJL0 Endonuclease 4 (e value: 2.3e-114; % identity: 61.95)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        M  S   W A   +   L+ G L WG+EGHY +CKIAE Y  E+ ++ VK+LLP  A+GDLA+VCSW DE++ H  + W+  LHYVDTPD+ CNY+  RD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD ++++ RCVT AI+NYTMQL SA +   + + YNLTEALMFLSHFIGD+HQPLHVGF+GD GGN I V WYRR+TNLHHVWD MII+SALK +Y+ +
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
        L LMI+A+Q N++++W N+V  W +C +NQT CPNPYASES+++ACKYAY+NATPG+ L D YFLSRLP++EKRLAQGGIRLA+TLNRIF+S+ K A
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA

F4JJL3 Endonuclease 5 (e value: 1.1e-100; % identity: 56.01)
Query:  WTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAE-GDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYR
        W  +  +   L+ G L WG++GHY +CK+AE +  +D ++ VK+LLP   + G LA  CSW DE++    + W+  LHYV+TP++ CNY+  RDCHD ++
Subjt:  WTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAE-GDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYR

Query:  HKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQ
        HK  CVT AI+NYT QL SA +   + + YNLTEAL+FLSH++GDVHQPLH GF+GD+GGN I V+WY  ++NLHHVWD MIIDSAL+ +Y+S+L  MIQ
Subjt:  HKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQ

Query:  AIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
        A+Q  + + W N+V +W++C  +Q  CPN YASES+ +ACKYAY+NATPG+ L D YFLSRLPV+EKRLAQGGIRLA+TLNRIF+++ K+A
Subjt:  AIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA

Q8LDW6 Endonuclease 3 (e value: 3.9e-101; % identity: 56.9)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MG S   W  +  +   L+ G L WG  GHY +CKIA+ Y  ED +  VK+LLP  A G+LAAVCSW DE++  P + W+ ALH+ DTPD+ CNY+ SRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        C  ++     CVT AI+NYT QL S  +   S + YNLTEALMFLSH++GD+HQPLH GF+GD+GGN IKV WY + TNLH VWD MII+SAL+ +Y+S+
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
        L  MI  +Q  + + W N+V +W +C +NQT CPNPYASES+ +ACKYAY+NAT G+ L D YF+SRLPV+EKRLAQGGIRLA TLNRIF+++ K+A
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA

Q9C9G4 Endonuclease 2 (e value: 9.3e-87; % identity: 54.91)
Query:  LLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYRHKGRCVTAA
        L   P I GWG+EGH +ICKIA+  L E A   VKELLP  AEGDL+++C WAD ++    YHWS  LHY++TPD  C+Y+ +RDC D    KGRCV  A
Subjt:  LLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYRHKGRCVTAA

Query:  IYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQAIQNNISDE
        IYNYT QL S     +S+ +YNLTEAL+F+SHF+GD+HQPLHV +  D GGN I+V WY R+ NLHH+WD+ II++A    Y+S L  M+ A++ NI+ E
Subjt:  IYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQAIQNNISDE

Query:  WHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIF
        W ++V  W  CT  +T CP+ YASE +  AC +AYK  T G  LED YF SRLP++ +RLAQGG+RLA+TLNRIF
Subjt:  WHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIF

Q9SXA6 Endonuclease 1 (e value: 3.8e-80; % identity: 47.04)
Query:  LFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYRHKGRCVT
        L L  +  +  W +EGH + C+IA+  L      +V+ LLP Y +GDL+A+C W D++R    Y W+  LHY+DTPD  C+Y+ SRDCHD +  K  CV 
Subjt:  LFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYRHKGRCVT

Query:  AAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQAIQNNIS
         AI N+T QL+  Y E TS+ +YN+TEAL+FLSHF+GD+HQP+HVGF  D GGN I + WY+ ++NLHHVWD  II +ALK  Y  NL L+ + ++ NI+
Subjt:  AAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQAIQNNIS

Query:  DE-WHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
        +  WH+++S+W  C  +   CP+ YASES+ +ACK+ YK    G  L + YF +RLP++ KR+ QGG+RLA  LNR+F+ +  +A +
Subjt:  DE-WHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI

Arabidopsis top hits (e value, % identity, alignment)
AT1G68290.1 endonuclease 2 (e value: 6.6e-88; % identity: 54.91)
Query:  LLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYRHKGRCVTAA
        L   P I GWG+EGH +ICKIA+  L E A   VKELLP  AEGDL+++C WAD ++    YHWS  LHY++TPD  C+Y+ +RDC D    KGRCV  A
Subjt:  LLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYRHKGRCVTAA

Query:  IYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQAIQNNISDE
        IYNYT QL S     +S+ +YNLTEAL+F+SHF+GD+HQPLHV +  D GGN I+V WY R+ NLHH+WD+ II++A    Y+S L  M+ A++ NI+ E
Subjt:  IYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQAIQNNISDE

Query:  WHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIF
        W ++V  W  CT  +T CP+ YASE +  AC +AYK  T G  LED YF SRLP++ +RLAQGG+RLA+TLNRIF
Subjt:  WHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIF

AT4G21585.1 endonuclease 4 (e value: 1.7e-115; % identity: 61.95)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        M  S   W A   +   L+ G L WG+EGHY +CKIAE Y  E+ ++ VK+LLP  A+GDLA+VCSW DE++ H  + W+  LHYVDTPD+ CNY+  RD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        CHD ++++ RCVT AI+NYTMQL SA +   + + YNLTEALMFLSHFIGD+HQPLHVGF+GD GGN I V WYRR+TNLHHVWD MII+SALK +Y+ +
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
        L LMI+A+Q N++++W N+V  W +C +NQT CPNPYASES+++ACKYAY+NATPG+ L D YFLSRLP++EKRLAQGGIRLA+TLNRIF+S+ K A
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA

AT4G21590.1 endonuclease 3 (e value: 2.8e-102; % identity: 56.9)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MG S   W  +  +   L+ G L WG  GHY +CKIA+ Y  ED +  VK+LLP  A G+LAAVCSW DE++  P + W+ ALH+ DTPD+ CNY+ SRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        C  ++     CVT AI+NYT QL S  +   S + YNLTEALMFLSH++GD+HQPLH GF+GD+GGN IKV WY + TNLH VWD MII+SAL+ +Y+S+
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
        L  MI  +Q  + + W N+V +W +C +NQT CPNPYASES+ +ACKYAY+NAT G+ L D YF+SRLPV+EKRLAQGGIRLA TLNRIF+++ K+A
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA

AT4G21590.2 endonuclease 3 (e value: 2.8e-102; % identity: 56.9)
Query:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD
        MG S   W  +  +   L+ G L WG  GHY +CKIA+ Y  ED +  VK+LLP  A G+LAAVCSW DE++  P + W+ ALH+ DTPD+ CNY+ SRD
Subjt:  MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRD

Query:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN
        C  ++     CVT AI+NYT QL S  +   S + YNLTEALMFLSH++GD+HQPLH GF+GD+GGN IKV WY + TNLH VWD MII+SAL+ +Y+S+
Subjt:  CHDNYRHKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSN

Query:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
        L  MI  +Q  + + W N+V +W +C +NQT CPNPYASES+ +ACKYAY+NAT G+ L D YF+SRLPV+EKRLAQGGIRLA TLNRIF+++ K+A
Subjt:  LLLMIQAIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA

AT4G21600.1 endonuclease 5 (e value: 8.0e-102; % identity: 56.01)
Query:  WTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAE-GDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYR
        W  +  +   L+ G L WG++GHY +CK+AE +  +D ++ VK+LLP   + G LA  CSW DE++    + W+  LHYV+TP++ CNY+  RDCHD ++
Subjt:  WTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAE-GDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYR

Query:  HKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQ
        HK  CVT AI+NYT QL SA +   + + YNLTEAL+FLSH++GDVHQPLH GF+GD+GGN I V+WY  ++NLHHVWD MIIDSAL+ +Y+S+L  MIQ
Subjt:  HKGRCVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQ

Query:  AIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
        A+Q  + + W N+V +W++C  +Q  CPN YASES+ +ACKYAY+NATPG+ L D YFLSRLPV+EKRLAQGGIRLA+TLNRIF+++ K+A
Subjt:  AIQNNISDEWHNEVSAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVA
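
Note on the % identity column: in BLAST-style alignments such as those above, the unlabeled middle line carries a letter where query and subject residues are identical, a "+" where the substitution is scored as conservative, and a space for other mismatches or gaps. The Python sketch below counts those symbols for one midline string. The function name `midline_stats` is hypothetical and not part of CuGenDB, and the example string is only the first 20 columns of the XP_008461475.1 midline, so its result is not expected to match the whole-alignment figure reported for that hit.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for a BLAST-style midline.

    Assumption: letters mark identical residues, '+' marks a conservative
    substitution, and a space marks a mismatch or a gap column.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 20 alignment columns of the XP_008461475.1 hit (excerpt only).
excerpt = "MGQSELCWTAN F+FLLLLP"
ident, pos, cols = midline_stats(excerpt)
print(f"identities: {ident}/{cols} ({100 * ident / cols:.1f}%)")
print(f"positives:  {pos}/{cols} ({100 * pos / cols:.1f}%)")
```

Applied to a complete midline (all alignment blocks of a hit concatenated), the same count gives the numerator of a percent-identity figure; the denominator used by the database is assumed here to be the alignment length.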


Sequences
CDS sequence
ATGGGTCAGTCTGAGCTTTGTTGGACTGCCAATGCTTTCCTTTTTCTCTTACTTCTACCTGGAATCCTCGGCTGGGGAAGGGAAGGTCACTATATGATTTGCAAGATAGC
AGAGAAATATTTGACTGAAGATGCTCTATCAATGGTCAAAGAATTGCTTCCATCTTATGCTGAGGGCGATCTTGCAGCTGTATGCTCCTGGGCTGATGAACTTCGAGCAC
ATCCTGATTATCACTGGAGTGGCGCCTTACACTATGTTGACACGCCAGATTTCTTTTGTAATTATAAATGCTCGAGAGACTGTCATGACAATTATAGACACAAAGGTAGA
TGTGTGACAGCAGCAATTTACAACTACACTATGCAACTCGAATCAGCTTACAAGGAAATAACTTCAGAAATTAAATATAACTTAACAGAGGCTCTTATGTTCTTGTCCCA
TTTTATCGGAGACGTTCATCAGCCCCTTCATGTCGGTTTTGTCGGAGATATAGGAGGTAATTTAATAAAAGTCAGTTGGTACCGAAGAAGGACCAATCTCCATCATGTCT
GGGATACCATGATCATCGATTCTGCCTTGAAGAGATTCTACCATTCAAATCTTTTGCTCATGATTCAAGCCATTCAAAACAACATTTCGGATGAATGGCATAATGAGGTC
TCAGCTTGGAGAAATTGCACAGTGAACCAAACAACTTGTCCAAACCCGTATGCTTCTGAAAGTGTTAGCATGGCATGTAAATATGCATACAAGAACGCCACTCCAGGAAG
CGTACTAGAAGACAGTTATTTTCTTAGCCGGTTACCAGTCATAGAGAAGAGGTTAGCACAAGGTGGCATTAGATTGGCTTCTACCCTCAATCGTATCTTTGCTTCTGAAG
GGAAAGTGGCTGAAATTTGA
mRNA sequence
GTTAACTGTCCGTTTTTGTGATTCTAATTTTCGTAGCCTCTGATTTGGTGAGAGGCAAGCGTCGAAAAACGAGAAACAAGGCTGGTTTAAATTTGAATTTGGAGACAAGC
TTCTAAGCCAGAGGTCAAGCAGCTGAAGTCGGAGCTGCAATCGCTGCCAGCAACTGATTTCTGATTTCATTGTGTTTATCATTACCGTATTTTGTTTCTAGGGAGCAGAC
ATGGGTCAGTCTGAGCTTTGTTGGACTGCCAATGCTTTCCTTTTTCTCTTACTTCTACCTGGAATCCTCGGCTGGGGAAGGGAAGGTCACTATATGATTTGCAAGATAGC
AGAGAAATATTTGACTGAAGATGCTCTATCAATGGTCAAAGAATTGCTTCCATCTTATGCTGAGGGCGATCTTGCAGCTGTATGCTCCTGGGCTGATGAACTTCGAGCAC
ATCCTGATTATCACTGGAGTGGCGCCTTACACTATGTTGACACGCCAGATTTCTTTTGTAATTATAAATGCTCGAGAGACTGTCATGACAATTATAGACACAAAGGTAGA
TGTGTGACAGCAGCAATTTACAACTACACTATGCAACTCGAATCAGCTTACAAGGAAATAACTTCAGAAATTAAATATAACTTAACAGAGGCTCTTATGTTCTTGTCCCA
TTTTATCGGAGACGTTCATCAGCCCCTTCATGTCGGTTTTGTCGGAGATATAGGAGGTAATTTAATAAAAGTCAGTTGGTACCGAAGAAGGACCAATCTCCATCATGTCT
GGGATACCATGATCATCGATTCTGCCTTGAAGAGATTCTACCATTCAAATCTTTTGCTCATGATTCAAGCCATTCAAAACAACATTTCGGATGAATGGCATAATGAGGTC
TCAGCTTGGAGAAATTGCACAGTGAACCAAACAACTTGTCCAAACCCGTATGCTTCTGAAAGTGTTAGCATGGCATGTAAATATGCATACAAGAACGCCACTCCAGGAAG
CGTACTAGAAGACAGTTATTTTCTTAGCCGGTTACCAGTCATAGAGAAGAGGTTAGCACAAGGTGGCATTAGATTGGCTTCTACCCTCAATCGTATCTTTGCTTCTGAAG
GGAAAGTGGCTGAAATTTGAAGACAGTGACTTGGAGAGATCAGATCAATTTCCATAAGATAGATGTCAAAATGGTTTCTCACATGAATTGATTTGTGCTCTTGTCTTTTG
TCTTCGTATTAATATACTTGCTATTGCTCCAAAAGAACCCTAAATGTGGAAACTCATTTAGTAAAAGAACAAGATATTCATAAATAAACCAGGTCGTTTGATATAAAATC
CTAACATATAGTACTTATAGTATAAACTAAAGTGATTAGATCATTGAGCCATTCATTTTTACCTTGTCTTATGAAAGAAAAAAAACCTCGAATTGAGGTGGTTAATGGGG
Protein sequence
MGQSELCWTANAFLFLLLLPGILGWGREGHYMICKIAEKYLTEDALSMVKELLPSYAEGDLAAVCSWADELRAHPDYHWSGALHYVDTPDFFCNYKCSRDCHDNYRHKGR
CVTAAIYNYTMQLESAYKEITSEIKYNLTEALMFLSHFIGDVHQPLHVGFVGDIGGNLIKVSWYRRRTNLHHVWDTMIIDSALKRFYHSNLLLMIQAIQNNISDEWHNEV
SAWRNCTVNQTTCPNPYASESVSMACKYAYKNATPGSVLEDSYFLSRLPVIEKRLAQGGIRLASTLNRIFASEGKVAEI
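
Note on how the CDS and protein sequences relate: the protein sequence above corresponds to the CDS read codon by codon under the standard genetic code, stopping at the terminal TGA. The Python sketch below illustrates this with only the first 18 and last 12 nucleotides of the CDS rather than the full sequence. The names `CODON_TABLE` and `translate` are illustrative choices, and the use of the standard nuclear code is an assumption (it is the usual choice for a plant nuclear gene).

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS sequence of CSPI01G14660.
print(translate("ATGGGTCAGTCTGAGCTT"))  # first six codons
print(translate("GCTGAAATTTGA"))        # last three codons plus the stop codon
```

The two excerpts translate to the first six and last three residues of the protein sequence listed above (MGQSEL and AEI). Pasting the full CDS into `translate` should reproduce the whole protein, since the function ignores line breaks.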