; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G21890 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G21890
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionEndo/exonuclease/phosphatase domain-containing protein
Genome locationChr6:19840874..19843118
RNA-Seq ExpressionCSPI06G21890
SyntenyCSPI06G21890
Gene Ontology termsGO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140218.2 uncharacterized protein LOC101212223 [Cucumis sativus]7.7e-274100Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS
        MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS

Query:  AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSER
        AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSER
Subjt:  AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSER

Query:  GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNV
        GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNV
Subjt:  GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNV

Query:  QCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQ
        QCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQ
Subjt:  QCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQ

Query:  GTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
        GTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
Subjt:  GTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT

XP_008449463.1 PREDICTED: uncharacterized protein LOC103491341 [Cucumis melo]4.5e-25895.01Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTS-QTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNS
        MLKFLNRKLRRLCSRLRWPRRR IRPRVL+IKKFGKTTS +T SHP+KTIDSFVN ASSPSAVHPNSQF+LL TQRPIRIATFNAASFSMAPAVPEKSNS
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTS-QTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNS

Query:  SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSE
        SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSIN+GVA+TKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDR+GM IAKSGTPLRWTVSMPSE
Subjt:  SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSE

Query:  RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN
        RG+YRCSRTVVEVLR+LDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN
Subjt:  RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN

Query:  VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSV
        VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKE+GGECESVVMIAKGQSV
Subjt:  VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSV

Query:  QGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
        QGTCKYGTRVDYI+ASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH+PPQPRPQP    QTQLHSHS+SPWKKRWT
Subjt:  QGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT

XP_022963872.1 uncharacterized protein LOC111464053 [Cucurbita moschata]1.4e-20683.16Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN
        ML FLNR LRRLCSRLRWPR R +RPRV++IKKFGKTTS+  + P  T+DSFVN AS  S VHP  QFH   TQRP+RIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN

Query:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA
        SSAKFRRSLDS+ RTKSVNDRPKSILKQSPLH NS+N  VA           + KPRVSINLPDNEISLLRNRQA  SEYEME E+ SSSGND  GMRIA
Subjt:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA

Query:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD
        KS  PLR  VSMP ER    +YRCSRTVVEVLRELDADILALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD
Subjt:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD

Query:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK
        FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLKS M YRDAK
Subjt:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK

Query:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH
        EFGGECESVVMIAKGQSVQGTCKYGTRVDYI+ASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH
Subjt:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH

XP_023554061.1 uncharacterized protein LOC111811442 [Cucurbita pepo subsp. pepo]6.9e-20682.94Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN
        ML FLNR LRRLCSRLRWPR R +RPRV++IKKFGKTTS+  + P  T+DSFVN AS  S VHP  QF  L T RP+RIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN

Query:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA
        SSAKFRRSLDS+ RT SVNDRPKSILKQSPLH NS+N  VA           + KPRVSINLPDNEISLLRNRQA  SEYEME E+ SSSGND  GMRIA
Subjt:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA

Query:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD
        KS  PLR  VSMPSER    +YRCSRTVVEVLRELDADILALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD
Subjt:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD

Query:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK
        FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLKS+M YRDAK
Subjt:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK

Query:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH
        EFGGECESVVMIAKGQSVQGTCKYGTRVDYI+ASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH
Subjt:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH

XP_038887606.1 uncharacterized protein LOC120077715 [Benincasa hispida]1.4e-23888.57Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS
        MLKFLNR LRRLCSRLRWPRRR IRPRV+IIKKFGKTTS+T S+P+K+IDSFVN ASSPSAVHPNSQFH L TQRPIRIATFNAASFSMAPAVPEKSNSS
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS

Query:  AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSI----NNGVARTKPRVSINLPDNEISLLRNRQA--SEYEMEENLSSSGNDRKGMRIAKSGTPLRWTV
        AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNS+    N+ +A+ KPRVSINLPDNEISLLRNRQA  SEYEMEEN SSSGNDRK MR AKS TPLRWTV
Subjt:  AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSI----NNGVARTKPRVSINLPDNEISLLRNRQA--SEYEMEENLSSSGNDRKGMRIAKSGTPLRWTV

Query:  SMPSE----RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATI
        SMPSE    R +YRCSRT+VEVLRELDADILALQDVKAEEEK+MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWK EKIFDDTDFRNVLK TI
Subjt:  SMPSE----RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATI

Query:  DVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESV
        DVEEVGEVNVQCT+LDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYS++RWMDIVKYYEEIGKPTPEAKVTKFLKS+MQYRDAKEFGGECESV
Subjt:  DVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESV

Query:  VMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
        VMIAKGQSVQGTCKYGTRVDYI+ASPDA+YEFVQGSYSV+SSKGTSDHHIVKVDFLKLP  PPQPRP      QTQLHSHS+SPW KRWT
Subjt:  VMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT

TrEMBL top hitse value%identityAlignment
A0A0A0KDU0 Endo/exonuclease/phosphatase domain-containing protein3.7e-274100Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS
        MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSS

Query:  AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSER
        AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSER
Subjt:  AKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSER

Query:  GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNV
        GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNV
Subjt:  GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNV

Query:  QCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQ
        QCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQ
Subjt:  QCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQ

Query:  GTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
        GTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
Subjt:  GTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT

A0A1S3BLG5 uncharacterized protein LOC1034913412.2e-25895.01Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTS-QTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNS
        MLKFLNRKLRRLCSRLRWPRRR IRPRVL+IKKFGKTTS +T SHP+KTIDSFVN ASSPSAVHPNSQF+LL TQRPIRIATFNAASFSMAPAVPEKSNS
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTS-QTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNS

Query:  SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSE
        SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSIN+GVA+TKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDR+GM IAKSGTPLRWTVSMPSE
Subjt:  SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSE

Query:  RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN
        RG+YRCSRTVVEVLR+LDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN
Subjt:  RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN

Query:  VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSV
        VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKE+GGECESVVMIAKGQSV
Subjt:  VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSV

Query:  QGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
        QGTCKYGTRVDYI+ASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH+PPQPRPQP    QTQLHSHS+SPWKKRWT
Subjt:  QGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT

A0A5A7UUR9 DNAse I-like superfamily protein2.2e-25895.01Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTS-QTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNS
        MLKFLNRKLRRLCSRLRWPRRR IRPRVL+IKKFGKTTS +T SHP+KTIDSFVN ASSPSAVHPNSQF+LL TQRPIRIATFNAASFSMAPAVPEKSNS
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTS-QTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNS

Query:  SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSE
        SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSIN+GVA+TKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDR+GM IAKSGTPLRWTVSMPSE
Subjt:  SAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSE

Query:  RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN
        RG+YRCSRTVVEVLR+LDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN
Subjt:  RGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVN

Query:  VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSV
        VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKE+GGECESVVMIAKGQSV
Subjt:  VQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSV

Query:  QGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT
        QGTCKYGTRVDYI+ASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH+PPQPRPQP    QTQLHSHS+SPWKKRWT
Subjt:  QGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT

A0A6J1HGD4 uncharacterized protein LOC1114640536.7e-20783.16Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN
        ML FLNR LRRLCSRLRWPR R +RPRV++IKKFGKTTS+  + P  T+DSFVN AS  S VHP  QFH   TQRP+RIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN

Query:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA
        SSAKFRRSLDS+ RTKSVNDRPKSILKQSPLH NS+N  VA           + KPRVSINLPDNEISLLRNRQA  SEYEME E+ SSSGND  GMRIA
Subjt:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA

Query:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD
        KS  PLR  VSMP ER    +YRCSRTVVEVLRELDADILALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD
Subjt:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD

Query:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK
        FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLKS M YRDAK
Subjt:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK

Query:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH
        EFGGECESVVMIAKGQSVQGTCKYGTRVDYI+ASPDA+YEFV+GSYSV+SSKGTSDHHIVKVDFLK PH
Subjt:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH

A0A6J1HR38 uncharacterized protein LOC1114670141.1e-20482.09Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN
        MLKFLNR LRRLC+RLRWPR R +RPRV++IKKFGKTTS+  + P  T+DSFVN  S  S VHP  QF  L   RP+RIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVP--EKSN

Query:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA
        SSAKFRRSLDS+ RTKSVNDRPKSILKQSPLH N++N  VA           + KPRVSINLPDNEISLLRNRQA  SEYEME E+ SSSGND  GMRIA
Subjt:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVA-----------RTKPRVSINLPDNEISLLRNRQA--SEYEME-ENLSSSGNDRKGMRIA

Query:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD
        KS  PLR  VSMPSER    +YRC+RTVVEVLRELDADILALQDVKA EEK MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TD
Subjt:  KSGTPLRWTVSMPSER---GTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTD

Query:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK
        FRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRS N+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLKS+M YRDAK
Subjt:  FRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAK

Query:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH
        EFGGECESVVMIAKGQSVQGTCKYGTRVDYI+ASPDA+Y+FV+GSYSV+SSKGTSDHHIVKVDFLK PH
Subjt:  EFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLKLPH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G48030.1 DNAse I-like superfamily protein8.3e-11757.27Show/hide
Query:  RRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDS------FVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSSAKFRRSLDSNSRT
        RRR  RPR   I         + +H   ++DS        N  SS +A+HPN         + I +ATFNAA FSMAPAVP  SN    F        R+
Subjt:  RRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDS------FVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSSAKFRRSLDSNSRT

Query:  KSVNDRPKSILK-----QSPLHTNSINNGVARTKP-RVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSERGTYRCSR
        KS  DRPKSILK      SP H +      A+++P RVSINLPDNEIS    RQ S  E  ++                 +PLR     P E G  R +R
Subjt:  KSVNDRPKSILK-----QSPLHTNSINNGVARTKP-RVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSERGTYRCSR

Query:  TVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNVQCTHLDH
        T +EVL ELDAD+LALQDVKA+E  QMRPLSDLA ALGM YVFAESWAPEYGNA+LS+WPIK   V +IFD TDFRNVLKA+I+V   GEV   CTHLDH
Subjt:  TVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNVQCTHLDH

Query:  LDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQGTCKYGT
        LDE WRMKQ+ +II+S  N PHIL G LNSLD +DYS +RW DIVKYYEE+GKP P+A+V +FLKS  +Y DAK+F GECESVV++AKGQSVQGTCKYGT
Subjt:  LDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQGTCKYGT

Query:  RVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLK
        RVDYI+AS D+ Y FV GSYSV+SSKGTSDHHIVKVD +K
Subjt:  RVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFLK

AT3G21530.1 DNAse I-like superfamily protein1.7e-11754.91Show/hide
Query:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKT--TSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSN
        ML    RKL  L SRLRW  ++ +R RV I+++F K    ++    P+          S  S++H +S      + R IR+ATFN A FS+AP V  ++ 
Subjt:  MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKT--TSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSN

Query:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPS
            F   LDS++ T      PK ILKQSPLH++++       KP+V INLPDNEISL ++   S   M EN  + G + +G    +S   L        
Subjt:  SSAKFRRSLDSNSRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPS

Query:  ERGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEV
            Y   R++ E+LRELDADILALQDVKAEEE  M+PLSDLA ALGMKYVFAESWAPEYGNA+LS+WPIK+W+V++I D  DFRNVLK T+++   G+V
Subjt:  ERGTYRCSRTVVEVLRELDADILALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEV

Query:  NVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQS
        NV CT LDHLDENWRMKQI +I R + + PHILLGGLNSLD +DYS  RW  IVKYYE+ GKPTP  +V +FLK    Y D+KEF GECE VV+IAKGQ+
Subjt:  NVQCTHLDHLDENWRMKQIKSIIRSNNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQS

Query:  VQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFL
        VQGTCKYGTRVDYI+ASP++ YEFV GSYSV+SSKGTSDHHIVKVD +
Subjt:  VQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHIVKVDFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCAAGTTCCTCAACCGCAAACTCCGCCGCCTCTGTTCCCGTCTCCGATGGCCCCGTCGCCGTACTATCAGACCTAGGGTACTCATCATCAAGAAGTTTGGAAAAAC
CACCTCCCAAACCACCTCTCATCCCGACAAAACCATCGACTCCTTCGTCAATATTGCTTCCTCGCCCTCTGCTGTTCATCCCAATTCTCAATTCCACCTTCTCACTACAC
AGAGACCCATACGAATTGCAACATTCAATGCCGCCTCCTTCTCCATGGCACCTGCTGTTCCTGAAAAATCCAATTCCTCTGCTAAATTCCGCCGGAGTTTGGATTCAAAT
TCACGGACAAAATCCGTAAACGATCGTCCCAAAAGCATTTTGAAACAATCTCCATTGCATACCAATTCCATTAATAATGGAGTCGCTAGAACCAAGCCCCGGGTTTCCAT
CAACCTGCCTGATAACGAAATATCTTTACTTAGAAATCGACAAGCGAGCGAGTACGAAATGGAGGAGAATCTTTCTTCTTCAGGTAATGATAGGAAGGGGATGCGGATAG
CTAAGAGTGGAACTCCTTTGAGATGGACTGTAAGCATGCCATCGGAACGGGGGACTTACCGATGCAGTCGGACGGTTGTGGAGGTTCTTAGAGAATTGGATGCCGATATA
TTGGCGTTGCAAGATGTGAAAGCGGAGGAAGAGAAACAGATGAGACCGCTTTCAGATTTGGCAGAGGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGTTGGGCGCCGGA
GTACGGAAACGCCGTCTTGTCGAGGTGGCCGATCAAACGGTGGAAAGTCGAGAAGATCTTTGACGACACCGATTTCAGGAATGTTTTAAAAGCAACCATTGATGTGGAAG
AAGTAGGAGAAGTAAATGTTCAGTGTACCCATTTGGATCATCTGGATGAGAATTGGAGGATGAAACAGATAAAGTCCATAATCCGATCAAACAACAATGAACCCCATATC
TTATTAGGAGGCCTCAATTCTCTGGATCCCACAGACTATTCTCAACAAAGGTGGATGGACATTGTGAAGTATTACGAAGAGATAGGAAAGCCGACTCCAGAAGCTAAAGT
CACCAAGTTTCTAAAAAGCAATATGCAATATAGGGATGCAAAAGAGTTTGGAGGAGAATGCGAATCGGTGGTGATGATCGCCAAAGGACAAAGTGTTCAAGGGACGTGTA
AGTACGGGACTCGTGTGGACTACATAATGGCCTCTCCCGATGCAAATTATGAGTTTGTACAAGGATCCTACTCCGTTATTTCTTCCAAAGGAACCTCTGATCATCACATT
GTCAAGGTTGATTTCCTCAAACTACCTCATCAGCCTCCACAGCCTCGGCCTCAGCCTCAGCCTCAAACTCAAACTCAACTTCATTCACATTCAATTTCCCCTTGGAAGAA
GAGATGGACATGA
mRNA sequenceShow/hide mRNA sequence
TGATGCCAATGCAATACAGTTAAGCAGAGTTGAATGTGACACTAACTTTCACGCTTTCACCCACCCCCTCAACTTTCTACAATGCTCAAGTTCCTCAACCGCAAACTCCG
CCGCCTCTGTTCCCGTCTCCGATGGCCCCGTCGCCGTACTATCAGACCTAGGGTACTCATCATCAAGAAGTTTGGAAAAACCACCTCCCAAACCACCTCTCATCCCGACA
AAACCATCGACTCCTTCGTCAATATTGCTTCCTCGCCCTCTGCTGTTCATCCCAATTCTCAATTCCACCTTCTCACTACACAGAGACCCATACGAATTGCAACATTCAAT
GCCGCCTCCTTCTCCATGGCACCTGCTGTTCCTGAAAAATCCAATTCCTCTGCTAAATTCCGCCGGAGTTTGGATTCAAATTCACGGACAAAATCCGTAAACGATCGTCC
CAAAAGCATTTTGAAACAATCTCCATTGCATACCAATTCCATTAATAATGGAGTCGCTAGAACCAAGCCCCGGGTTTCCATCAACCTGCCTGATAACGAAATATCTTTAC
TTAGAAATCGACAAGCGAGCGAGTACGAAATGGAGGAGAATCTTTCTTCTTCAGGTAATGATAGGAAGGGGATGCGGATAGCTAAGAGTGGAACTCCTTTGAGATGGACT
GTAAGCATGCCATCGGAACGGGGGACTTACCGATGCAGTCGGACGGTTGTGGAGGTTCTTAGAGAATTGGATGCCGATATATTGGCGTTGCAAGATGTGAAAGCGGAGGA
AGAGAAACAGATGAGACCGCTTTCAGATTTGGCAGAGGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGTTGGGCGCCGGAGTACGGAAACGCCGTCTTGTCGAGGTGGC
CGATCAAACGGTGGAAAGTCGAGAAGATCTTTGACGACACCGATTTCAGGAATGTTTTAAAAGCAACCATTGATGTGGAAGAAGTAGGAGAAGTAAATGTTCAGTGTACC
CATTTGGATCATCTGGATGAGAATTGGAGGATGAAACAGATAAAGTCCATAATCCGATCAAACAACAATGAACCCCATATCTTATTAGGAGGCCTCAATTCTCTGGATCC
CACAGACTATTCTCAACAAAGGTGGATGGACATTGTGAAGTATTACGAAGAGATAGGAAAGCCGACTCCAGAAGCTAAAGTCACCAAGTTTCTAAAAAGCAATATGCAAT
ATAGGGATGCAAAAGAGTTTGGAGGAGAATGCGAATCGGTGGTGATGATCGCCAAAGGACAAAGTGTTCAAGGGACGTGTAAGTACGGGACTCGTGTGGACTACATAATG
GCCTCTCCCGATGCAAATTATGAGTTTGTACAAGGATCCTACTCCGTTATTTCTTCCAAAGGAACCTCTGATCATCACATTGTCAAGGTTGATTTCCTCAAACTACCTCA
TCAGCCTCCACAGCCTCGGCCTCAGCCTCAGCCTCAAACTCAAACTCAACTTCATTCACATTCAATTTCCCCTTGGAAGAAGAGATGGACATGA
Protein sequenceShow/hide protein sequence
MLKFLNRKLRRLCSRLRWPRRRTIRPRVLIIKKFGKTTSQTTSHPDKTIDSFVNIASSPSAVHPNSQFHLLTTQRPIRIATFNAASFSMAPAVPEKSNSSAKFRRSLDSN
SRTKSVNDRPKSILKQSPLHTNSINNGVARTKPRVSINLPDNEISLLRNRQASEYEMEENLSSSGNDRKGMRIAKSGTPLRWTVSMPSERGTYRCSRTVVEVLRELDADI
LALQDVKAEEEKQMRPLSDLAEALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSNNNEPHI
LLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKSNMQYRDAKEFGGECESVVMIAKGQSVQGTCKYGTRVDYIMASPDANYEFVQGSYSVISSKGTSDHHI
VKVDFLKLPHQPPQPRPQPQPQTQTQLHSHSISPWKKRWT