; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G02480 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G02480
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionEndoribonuclease E-like protein
Genome locationClcChr04:8156968..8162309
RNA-Seq ExpressionClc04G02480
SyntenyClc04G02480
Gene Ontology termsNA
InterPro domainsIPR040320 - Uncharacterized protein At4g37920-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603165.1 hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sororia]8.2e-19475.9Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MA TNQL FQLSIS TK+FIFR FS   KPLPSI SA+PFKSS K SKS N  T                A V  P +  A    SARANDVATTE EEQ
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVAEGYTISQFCDKIIDIFMNEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDPIMKEKL+SL R+VKRIDDEMEIHSELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVAKRR+EFTE+FFKFLTL+SETHDSLED DAVARLAARCL+AVSAYDRTLE+VETLDSAQ KFDDILNSPSLDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMYHLYK TKS LRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV
        PEERFSALATAFAPGDGSE KDPNA+YTTP ELHKWIKIMLDSYHLNQE+TDIREAR M QP+VIQRLFILKDTIETEYLEQNES N QSKPNHVS +AV
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV

Query:  SI
        SI
Subjt:  SI

XP_004146379.1 uncharacterized protein At4g37920 isoform X1 [Cucumis sativus]6.7e-19675.4Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MAFTN LPFQ  +S TK FIF SFS  L PLPSIYSASPFK S KISKS N T+          + IT   Q+           +SAR NDVAT+EKEEQ
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVA+GY++SQFCDKIIDIF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVK+IDDEMEIHSELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVAKR KEFT+EFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETLDSAQ KFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED
        PEERFSALAT F+PGDGSEQKDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYLEQN+  NPQS+P  NH SED
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED

Query:  AVSI
        A+SI
Subjt:  AVSI

XP_008442081.1 PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis melo]2.9e-19976.79Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MAFTN LPFQ  IS TKSFIF +FS  LKPLPSIYSASPFK S K SKS N TT          + IT   Q+           +SAR NDVAT+EKEEQ
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVA+GY++SQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVK+IDDEMEIHSELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVA RRKEFT+EFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED
        PEERFSALATAF+PGDGSEQKDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYLEQN+  NPQS+P  NH SED
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED

Query:  AVSI
        A+SI
Subjt:  AVSI

XP_038883874.1 uncharacterized protein At4g37920 isoform X1 [Benincasa hispida]8.2e-20278.73Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQV-ILPKRRVALCASSARANDVATTEKEE
        MAFTN L FQLSIS TKSFIF SFS  LKPLPSIYSAS FK S +I KS NPT           + IT   Q  I  + R   C   A  NDVATTEKEE
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQV-ILPKRRVALCASSARANDVATTEKEE

Query:  QEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTD
        + E EVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIH ELLKELQ SPTD
Subjt:  QEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTD

Query:  INAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLI
        INAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDIL SPSLDVACEKIASL KAKELDSSLILLI
Subjt:  INAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLI

Query:  NSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIV
        NSAWA+AKESTTMKNE                                                  VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIV
Subjt:  NSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIV

Query:  DPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDA
        DPEERFSALATAFAPGDGSEQKDP ALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQPVVIQRLFILKDTIETEYLEQNE  NPQS PNHVSEDA
Subjt:  DPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDA

Query:  VSI
        VSI
Subjt:  VSI

XP_038883875.1 uncharacterized protein At4g37920 isoform X2 [Benincasa hispida]1.2e-20078.49Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MAFTN L FQLSIS TKSFIF SFS  LKPLPSIYSAS FK S +I KS NPT           + IT   Q            +SA  NDVATTEKEE+
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIH ELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDIL SPSLDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWA+AKESTTMKNE                                                  VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV
        PEERFSALATAFAPGDGSEQKDP ALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQPVVIQRLFILKDTIETEYLEQNE  NPQS PNHVSEDAV
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV

Query:  SI
        SI
Subjt:  SI

TrEMBL top hitse value%identityAlignment
A0A0A0L3X1 Uncharacterized protein3.2e-19675.4Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MAFTN LPFQ  +S TK FIF SFS  L PLPSIYSASPFK S KISKS N T+          + IT   Q+           +SAR NDVAT+EKEEQ
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVA+GY++SQFCDKIIDIF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVK+IDDEMEIHSELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVAKR KEFT+EFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETLDSAQ KFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED
        PEERFSALAT F+PGDGSEQKDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYLEQN+  NPQS+P  NH SED
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED

Query:  AVSI
        A+SI
Subjt:  AVSI

A0A1S3B4W5 uncharacterized protein At4g37920, chloroplastic isoform X11.4e-19976.79Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MAFTN LPFQ  IS TKSFIF +FS  LKPLPSIYSASPFK S K SKS N TT          + IT   Q+           +SAR NDVAT+EKEEQ
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVA+GY++SQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVK+IDDEMEIHSELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVA RRKEFT+EFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFD+ILNSPSLDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMYHLYK TKSSLRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED
        PEERFSALATAF+PGDGSEQKDPNALYTTP ELHKWIKIMLDSYHLNQE+TDIREARNMTQP+VIQRLFILKDTIETEYLEQN+  NPQS+P  NH SED
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKP--NHVSED

Query:  AVSI
        A+SI
Subjt:  AVSI

A0A6J1DBT6 uncharacterized protein At4g37920, chloroplastic isoform X11.2e-16970.87Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MA  N LPF LS S  K+ IF    P     P I SA     S K SKS +PTT          I IT   ++           +S  ANDVAT E E Q
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVAEGYTISQFCDKIIDIF+NEKPKTKEWRK LVFREEWKKYRESFYSHCQRR DWESDP MKE+LISLRRKVKRIDDEMEIHSEL KELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVAKRRK+FTEEFF FLTLISETHDSLEDRDAVARLAARCL+AVSAYDRTLE V+TLD AQAKFDDILNSPSLDVACEKI SL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMY LY+ TKSSLRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNE
        PEERFSALATAFAPGDGSE +DPNA+YTTP ELHKWIKIMLDSYHLNQE+T++REARNM QPVVIQRLFILKDTIETEYLEQ E
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNE

A0A6J1F3Z5 uncharacterized protein At4g379204.0e-19475.9Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MA TNQL FQLSIS TK+FIFR FS   KPLPSI SA+PFKSS K SKS N  T                A V  P +  A    SAR NDVATTE EEQ
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVAEGYTISQFCDKIIDIFMNEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDPIMKEKL+SL R+VKRIDDEMEIHSELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAARCL+AVSAYDRTLE+VETLDSAQ KFDDILNSPSLDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMYHLYK TKS LRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV
        PEERFSALATAFAPGDGSE KDPNA+YTTP ELHKWIKIMLDSYHLNQE+TDIREAR M QP+VIQRLFILKDTIETEYLEQNES N QSKPNHVS +AV
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV

Query:  SI
        SI
Subjt:  SI

A0A6J1HRT8 uncharacterized protein At4g379202.4e-19174.9Show/hide
Query:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ
        MA TNQL FQLSIS T++FIFR FS    PLPSI SA PFK + K SKS N  T                A V  P +  A    SARANDVATTE EEQ
Subjt:  MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQ

Query:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI
         E EVAEGYTISQFCDKIIDIFMNEKPKTKEWRK LVFREEWKKYRESFYSHCQRRADWESDPIMKEKL+SL R+VKRIDDEMEIHSELLKELQ SPTDI
Subjt:  EETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDI

Query:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN
        NAIVAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAARCL+AVSAYDRTLE+VETLDSAQ KFDDILNSP+LDVACEKIASL KAKELDSSLILLIN
Subjt:  NAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLIN

Query:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD
        SAWASAKESTTMKNE                                                  VKEIMY LYK TKS LRSMAPKEIKLLKHLLNIVD
Subjt:  SAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVD

Query:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV
        PEERFSALATAFAPGDGSE KDPNA+YTTP ELHKWIKIMLDSYHLNQE+TDIREAR M QP+VIQRLFILKDTIETEYLEQNE  NPQSKPNHVS +AV
Subjt:  PEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAV

Query:  SI
        SI
Subjt:  SI

SwissProt top hitse value%identityAlignment
Q84WN0 Uncharacterized protein At4g379205.8e-13464.68Show/hide
Query:  EEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSP
        E+  E EVAEGYT++QFCDKIID+F+NEKPK K+W+ +LV R+EW KY  +FY  C+ RAD E+DPI+K+KL+SL  KVK+ID EME H++LLKE+Q +P
Subjt:  EEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSP

Query:  TDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLIL
        TDINAI AKRR++FT EFF+++TL+SET D LEDRDAVARLA RCL+AVSAYD TLE+VETLD+AQAKF+DILNSPS+D ACEKI SL KAKELDSSLIL
Subjt:  TDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLIL

Query:  LINSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLN
        LINSA+A+AKES T+ NE                                                   K+IMYHLYK TKSSLRS+ PKEIKLLK+LLN
Subjt:  LINSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLN

Query:  IVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQ
        I DPEERFSALATAF+PGD  E KDP ALYTTP ELHKWIKIMLD+YHLN+EETDI+EA+ M+QP+VIQRLFILKDTIE EYL++
Subjt:  IVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQ

Arabidopsis top hitse value%identityAlignment
AT1G36320.1 unknown protein6.9e-7437.85Show/hide
Query:  VATTEKEEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLK
        VA  EK++  E  V +   + + CDK+I++FM +KP   +WR+ L F +EW   R  FY  CQ RAD E +P MK K+  L RK+K +D++++ H+ELL 
Subjt:  VATTEKEEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLK

Query:  ELQGS-PTDINAIVAKRRKEFTEEFFKFLTLISET-HDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAK
         ++ + P +I  +VA+RRK+FT EFF+ L  ++E+ +D+ ++++A+A L    +AAV AYD + E+++ L++A+ K  DI+NSPSLD AC KI SL +  
Subjt:  ELQGS-PTDINAIVAKRRKEFTEEFFKFLTLISET-HDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAK

Query:  ELDSSLILLINSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEI
        +LDS+L+L+I  AW++AKES  MK E                                                  VK+I+YHLY T + +L+ + PKE+
Subjt:  ELDSSLILLINSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEI

Query:  KLLKHLLNIVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL
        ++LK+LL+I DP+E+ SAL  AF PGD  E  D + LYTTP  L   +K +L++YH ++E + ++EA+++  P +I ++  LK  +E +Y+
Subjt:  KLLKHLLNIVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYL

AT4G37920.1 unknown protein4.1e-13564.68Show/hide
Query:  EEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSP
        E+  E EVAEGYT++QFCDKIID+F+NEKPK K+W+ +LV R+EW KY  +FY  C+ RAD E+DPI+K+KL+SL  KVK+ID EME H++LLKE+Q +P
Subjt:  EEQEETEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSP

Query:  TDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLIL
        TDINAI AKRR++FT EFF+++TL+SET D LEDRDAVARLA RCL+AVSAYD TLE+VETLD+AQAKF+DILNSPS+D ACEKI SL KAKELDSSLIL
Subjt:  TDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLIL

Query:  LINSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLN
        LINSA+A+AKES T+ NE                                                   K+IMYHLYK TKSSLRS+ PKEIKLLK+LLN
Subjt:  LINSAWASAKESTTMKNEPLYYCGLSVSLATILIAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLN

Query:  IVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQ
        I DPEERFSALATAF+PGD  E KDP ALYTTP ELHKWIKIMLD+YHLN+EETDI+EA+ M+QP+VIQRLFILKDTIE EYL++
Subjt:  IVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIMLDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTTCACAAATCAGCTTCCCTTTCAGCTCTCCATTTCCTTAACAAAGTCTTTCATCTTCCGCAGCTTTTCCCCAAATCTAAAACCACTCCCATCAATCTACTCTGC
TTCACCCTTTAAATCATCATCCAAAATTTCCAAATCCCAGAACCCAACAACAGAGTTCCAAGGATTCTGCTATTGGGATAGAATCAGAATTACTATTAGGGCTCAAGTGA
TATTACCCAAAAGGAGGGTGGCTCTATGTGCCTCAAGTGCACGAGCAAATGATGTAGCTACAACTGAAAAGGAAGAGCAAGAAGAGACGGAAGTTGCAGAGGGATATACC
ATCTCTCAATTTTGTGATAAAATAATTGATATTTTCATGAATGAGAAGCCAAAGACTAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATAGGGA
GAGCTTCTACAGTCATTGCCAAAGGCGGGCAGACTGGGAGAGTGATCCAATTATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAAGGATTGATGATGAAATGG
AAATCCACAGTGAACTTCTCAAGGAATTACAGGGCAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTTAAGTTCCTTACT
CTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCGGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCTTACGATCGAACATTAGAAAATGTGGA
GACATTGGATTCTGCACAGGCCAAATTTGATGATATACTGAATTCTCCCTCATTGGACGTGGCTTGTGAGAAGATTGCAAGTCTTGGAAAGGCAAAGGAACTTGACTCAT
CGTTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAATGAGCCATTATATTATTGTGGATTGTCAGTGTCCCTGGCCACAATATTA
ATTGCTGTTGTTAACTATGAAGTTCTTGAAAGTAACTCCCTGCAACATCCCCAGTTTTGCCCGAGTTGGAAACTTACTCTTATTTTACAATTCATTTCTAAGACGGTGAA
AGAAATAATGTATCATTTATACAAAACCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAACATTTGCTGAACATCGTAGATCCTGAAGAAC
GATTTTCTGCTTTAGCAACAGCCTTCGCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACTCCGACAGAGCTGCATAAGTGGATAAAGATAATG
CTTGATTCATACCATCTGAACCAGGAAGAAACAGACATAAGAGAAGCAAGGAATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAAC
TGAGTATTTGGAACAGAATGAGTCTCACAATCCTCAATCCAAACCAAATCATGTTTCTGAAGATGCGGTTTCTATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCTTCACAAATCAGCTTCCCTTTCAGCTCTCCATTTCCTTAACAAAGTCTTTCATCTTCCGCAGCTTTTCCCCAAATCTAAAACCACTCCCATCAATCTACTCTGC
TTCACCCTTTAAATCATCATCCAAAATTTCCAAATCCCAGAACCCAACAACAGAGTTCCAAGGATTCTGCTATTGGGATAGAATCAGAATTACTATTAGGGCTCAAGTGA
TATTACCCAAAAGGAGGGTGGCTCTATGTGCCTCAAGTGCACGAGCAAATGATGTAGCTACAACTGAAAAGGAAGAGCAAGAAGAGACGGAAGTTGCAGAGGGATATACC
ATCTCTCAATTTTGTGATAAAATAATTGATATTTTCATGAATGAGAAGCCAAAGACTAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATAGGGA
GAGCTTCTACAGTCATTGCCAAAGGCGGGCAGACTGGGAGAGTGATCCAATTATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAAGGATTGATGATGAAATGG
AAATCCACAGTGAACTTCTCAAGGAATTACAGGGCAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTTAAGTTCCTTACT
CTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCGGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCTTACGATCGAACATTAGAAAATGTGGA
GACATTGGATTCTGCACAGGCCAAATTTGATGATATACTGAATTCTCCCTCATTGGACGTGGCTTGTGAGAAGATTGCAAGTCTTGGAAAGGCAAAGGAACTTGACTCAT
CGTTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAATGAGCCATTATATTATTGTGGATTGTCAGTGTCCCTGGCCACAATATTA
ATTGCTGTTGTTAACTATGAAGTTCTTGAAAGTAACTCCCTGCAACATCCCCAGTTTTGCCCGAGTTGGAAACTTACTCTTATTTTACAATTCATTTCTAAGACGGTGAA
AGAAATAATGTATCATTTATACAAAACCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAACATTTGCTGAACATCGTAGATCCTGAAGAAC
GATTTTCTGCTTTAGCAACAGCCTTCGCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACTCCGACAGAGCTGCATAAGTGGATAAAGATAATG
CTTGATTCATACCATCTGAACCAGGAAGAAACAGACATAAGAGAAGCAAGGAATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAAC
TGAGTATTTGGAACAGAATGAGTCTCACAATCCTCAATCCAAACCAAATCATGTTTCTGAAGATGCGGTTTCTATATAG
Protein sequenceShow/hide protein sequence
MAFTNQLPFQLSISLTKSFIFRSFSPNLKPLPSIYSASPFKSSSKISKSQNPTTEFQGFCYWDRIRITIRAQVILPKRRVALCASSARANDVATTEKEEQEETEVAEGYT
ISQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHSELLKELQGSPTDINAIVAKRRKEFTEEFFKFLT
LISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILNSPSLDVACEKIASLGKAKELDSSLILLINSAWASAKESTTMKNEPLYYCGLSVSLATIL
IAVVNYEVLESNSLQHPQFCPSWKLTLILQFISKTVKEIMYHLYKTTKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQKDPNALYTTPTELHKWIKIM
LDSYHLNQEETDIREARNMTQPVVIQRLFILKDTIETEYLEQNESHNPQSKPNHVSEDAVSI