CuGenDBv2

Chy1G013190 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy1G013190
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: Reverse transcriptase
Genome location: chrH01:16918214..16922076
RNA-Seq Expression: Chy1G013190
Synteny: Chy1G013190
Gene Ontology terms: GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
                     GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains: NA
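
The genome location field packs the chromosome and the coordinate range into one string. A minimal sketch of splitting it and deriving the span of the locus is given below; it assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` is a hypothetical helper name rather than anything provided by CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chrH01:16918214..16922076")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span covers the whole locus on the chromosome, so it is expected to be longer than the CDS listed further down if the gene contains introns.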


Homology

GenBank top hits
KAE8646221.1 hypothetical protein Csa_016557 [Cucumis sativus] (e value: 0.0; % identity: 92.75)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        RTKVGDLDRDRPLTCRRDSSPIS KER VSY KNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNS+DCFR ERASPVEN CKD+TSHHLVEKVT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR
        PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE GYGVISRLLSRLIPE+NQYKFNNNLEK+QR
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR

Query:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDITAAVENYRSFISSLFKPQYGLYDQDEHSHLRK
        LRGRCCPW DYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNS+DNNFQ+KYRTKEFDCN DRKMTLL++TAAVENYRSFISSLFKPQYGLYDQDEH HLRK
Subjt:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDITAAVENYRSFISSLFKPQYGLYDQDEHSHLRK

Query:  QKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVAEKDIDPTFNNLH
        QKL+PLLLGWDTDYIKDES SQLTELNTIAKSPISFADD QPT+HESFGA PLCSSPFP SNRTNLNSLPYSSLAS QIHGLSWQNVA +DI  TFNNLH
Subjt:  QKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVAEKDIDPTFNNLH

Query:  LNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSS
        LNFSSVPKFLHQ +S VDDG CHDLCAQNTDWVMNNVLDDGSQHPS+ESLCASG VFDFG KYLS SKEQ QTAYHILKYPLDEIQPTALTNE+W NDSS
Subjt:  LNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSS

Query:  DDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI
        DDVLVDYRPPFFIQPESFFQ GKVYSILTDKLSWDVARSEINVDDITEMNYI
Subjt:  DDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI

KAG6576069.1 hypothetical protein SDJN03_26708, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.87e-285; % identity: 73.85)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        R KVGDLD  RPLTCRRD+SP+SLKE HV+   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQSKKFNS+D FR ERA  VENR +DFTSH  VE VT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYK-------FNN
        P+NFNS+HLPLGNSSKIS+VDV HAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE  YGVIS LLSRLIPE NQYK       FNN
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYK-------FNN

Query:  NLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYG
        NLEKLQ L GRC P  DYEH LNNS SPCRLNKSRGR   HSDFSTN+DD+NF VKYRTKEFD + + KMTLLD      TAAVENYR  IS+LF  QYG
Subjt:  NLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYG

Query:  LYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA
         YDQ E  H+RKQ++EPLLLGWDTD IKD+S S+ TE +T A+ PISFADDHQP LHESFGAV LCSSPFP SN  +L SLPYSSLASYQIHGLS  NV 
Subjt:  LYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA

Query:  -EKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQ
         E+ ID TFNN+HLNFSSVPK L QCD+YVDD G C   CAQ+ +W MN  LDD  ++PS++S+CASG VFDFGWKYLSGSKE CQTAYH+L+YPLDE++
Subjt:  -EKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQ

Query:  PTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI
        PT+  NE+   DSS     +Y  PFFIQPESFFQEGKV S+LTDKLSWDV RSEINV  ITEM+Y+
Subjt:  PTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI

XP_008461985.1 PREDICTED: uncharacterized protein LOC103500465 [Cucumis melo] (e value: 0.0; % identity: 89.25)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        RTKVGDLDRDRPLTCRRDSSPISLKERHV+Y KNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFR E ASPVENRCKDFTSHHLVEKVT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR
        PVNFNSLHLPLGN SKISDVDV HAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYE GYGVISRLLSRLIPE+N YKFNNNLEK+Q+
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR

Query:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYGLYDQDEH
        LRGRC P  DYEHHLNNSLSPCRLN SRGRS  HSDFSTNS+DNNFQVKYRTKEFDC+ DRKMTLLD+     TAAVENYRSFISSLF PQY LYDQDEH
Subjt:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYGLYDQDEH

Query:  SHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA-EKDIDP
         HLRKQKLEPLLLGWDTDYIKDES SQLTELNT AKSPISFADDHQPTLHESFGAV LCSSPFP SNR N NSLPYS+LASYQI GLSWQNV+ E+DID 
Subjt:  SHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA-EKDIDP

Query:  TFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEK
        TFNNLHLNFSSVPK LHQC+SYVDDG CHDLCAQN DWVMNNV++D SQHPSVESLCASG VFDFGWKYLSGSKEQCQT+YHILKYPLDEIQPTAL NE+
Subjt:  TFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEK

Query:  WCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLS-WDVARSEINVDDITEMNY
        W NDSSDDVLVDY PPF+IQPESFFQEGKVYS+LTDKLS WDV RSEINVDDITEMNY
Subjt:  WCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLS-WDVARSEINVDDITEMNY

XP_011659159.1 uncharacterized protein LOC101207408 [Cucumis sativus] (e value: 0.0; % identity: 92.75)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        RTKVGDLDRDRPLTCRRDSSPIS KER VSY KNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNS+DCFR ERASPVEN CKD+TSHHLVEKVT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR
        PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE GYGVISRLLSRLIPE+NQYKFNNNLEK+QR
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR

Query:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDITAAVENYRSFISSLFKPQYGLYDQDEHSHLRK
        LRGRCCPW DYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNS+DNNFQ+KYRTKEFDCN DRKMTLL++TAAVENYRSFISSLFKPQYGLYDQDEH HLRK
Subjt:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDITAAVENYRSFISSLFKPQYGLYDQDEHSHLRK

Query:  QKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVAEKDIDPTFNNLH
        QKL+PLLLGWDTDYIKDES SQLTELNTIAKSPISFADD QPT+HESFGA PLCSSPFP SNRTNLNSLPYSSLAS QIHGLSWQNVA +DI  TFNNLH
Subjt:  QKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVAEKDIDPTFNNLH

Query:  LNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSS
        LNFSSVPKFLHQ +S VDDG CHDLCAQNTDWVMNNVLDDGSQHPS+ESLCASG VFDFG KYLS SKEQ QTAYHILKYPLDEIQPTALTNE+W NDSS
Subjt:  LNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSS

Query:  DDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI
        DDVLVDYRPPFFIQPESFFQ GKVYSILTDKLSWDVARSEINVDDITEMNYI
Subjt:  DDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI

XP_038897158.1 uncharacterized protein LOC120085308 isoform X3 [Benincasa hispida] (e value: 5.06e-284; % identity: 81.1)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        R+KVGDLD DRPLT RRDSSPISLKERHV+Y  NAKTSEFAFFKKF+EDA+HRFSSSLPRQKELQSK FN+SD FRAERA+PVENRCKDFTSHHLVE VT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR
        PVNFNS+HLPL NSSKIS+VDV H HKT KDI+SKQ NVENDDIFSRKRQKLRQFIQNMSFHG GESYE GYGV+S LLSRLIP+ NQ KFNN+LEKLQR
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR

Query:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYGLYDQDEH
        L GRCCP  DYEHHLNNS SPCRLN+SRGR FSHSDFSTNSDDNNFQ  YRTKEFDC+ + KMTLLD+     TAAV+NYRS ISSLFKPQYGLYDQ EH
Subjt:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYGLYDQDEH

Query:  SHLRKQKLEPLLLGWDTDYIKDESFS-QLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA-EKDID
         +LRKQ+LEPLLLGWDTD IKD SFS QLTELNT A+ PI+FADDHQP LH+SFGAV LCSSPFP SNR NL SLPYSSLASYQI GLSW NV  E+DID
Subjt:  SHLRKQKLEPLLLGWDTDYIKDESFS-QLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA-EKDID

Query:  PTFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEI
         TFNN+HLNFSSVPK L QCD+YV+D    D CAQ+ DWVMNNVLDD  Q+PSVESLCAS  VFDFG KYLSGSKEQCQTAYHIL+YP DE+
Subjt:  PTFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEI

TrEMBL top hits
A0A1S3CFT3 uncharacterized protein LOC103500465 (e value: 2.5e-290; % identity: 89.25)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        RTKVGDLDRDRPLTCRRDSSPISLKERHV+Y KNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFR E ASPVENRCKDFTSHHLVEKVT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR
        PVNFNSLHLPLGN SKISDVDV HAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYE GYGVISRLLSRLIPE+N YKFNNNLEK+Q+
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQR

Query:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYGLYDQDEH
        LRGRC P  DYEHHLNNSLSPCRLN SRGRS  HSDFSTNS+DNNFQVKYRTKEFDC+ DRKMTLLD+     TAAVENYRSFISSLF PQY LYDQDEH
Subjt:  LRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYGLYDQDEH

Query:  SHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA-EKDIDP
         HLRKQKLEPLLLGWDTDYIKDES SQLTELNT AKSPISFADDHQPTLHESFGAV LCSSPFP SNR N NSLPYS+LASYQI GLSWQNV+ E+DID 
Subjt:  SHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVA-EKDIDP

Query:  TFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEK
        TFNNLHLNFSSVPK LHQC+SYVDDG CHDLCAQN DWVMNNV++D SQHPSVESLCASG VFDFGWKYLSGSKEQCQT+YHILKYPLDEIQPTAL NE+
Subjt:  TFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEK

Query:  WCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLS-WDVARSEINVDDITEMNY
        W NDSSDDVLVDY PPF+IQPESFFQEGKVYS+LTDKLS WDV RSEINVDDITEMNY
Subjt:  WCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLS-WDVARSEINVDDITEMNY

A0A6J1GQP2 uncharacterized protein LOC111456585 isoform X1 (e value: 5.0e-222; % identity: 70.27)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        R KVGDLD  RPLTCRRD+SP+SLK  HV+   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQ KKFNS+D FRAERA  VENR +DFTSH  VE VT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE--------------------------NGYGV
        P+NFNS+HLPLGNSSKIS+VDV HAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE                          + YGV
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE--------------------------NGYGV

Query:  ISRLLSRLIPEKNQYK-------FNNNLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLD
        IS LLSRLIPE NQYK       FNNNLEKLQ L GRC P  DYEH LNNS SPCRLNKSRGR   HSDFSTN+DD+NF VKYRTKEFD + + KMTLLD
Subjt:  ISRLLSRLIPEKNQYK-------FNNNLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLD

Query:  I-----TAAVENYRSFISSLFKPQYGLYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSN
              TAAVENYR  IS+LF  QYG YDQ E  H+RKQ++EPLLLGWDTD IKD+S S+ TE +T A+ PISFADDHQP LHESFGAV LCSSPFP SN
Subjt:  I-----TAAVENYRSFISSLFKPQYGLYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSN

Query:  RTNLNSLPYSSLASYQIHGLSWQNV-AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFG
          +L SLPYSSLASYQIHGLS  NV  E+ ID T NN+HLNFSSVPK L QCD+YVDD G C   CAQ+ +W MN  LDD  ++PS++S+CASG VFDFG
Subjt:  RTNLNSLPYSSLASYQIHGLSWQNV-AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFG

Query:  WKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI
        WKYLSGSKE CQTAYH+L+YPLDE++PT+  NE+   DSS     +Y  PFFIQPESFFQEGKV S+LTDKLSWDV RSEINV  ITEM+Y+
Subjt:  WKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI

A0A6J1GS11 uncharacterized protein LOC111456585 isoform X3 (e value: 3.3e-226; % identity: 73.5)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        R KVGDLD  RPLTCRRD+SP+SLK  HV+   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQ KKFNS+D FRAERA  VENR +DFTSH  VE VT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYK-------FNN
        P+NFNS+HLPLGNSSKIS+VDV HAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE  YGVIS LLSRLIPE NQYK       FNN
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYK-------FNN

Query:  NLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYG
        NLEKLQ L GRC P  DYEH LNNS SPCRLNKSRGR   HSDFSTN+DD+NF VKYRTKEFD + + KMTLLD      TAAVENYR  IS+LF  QYG
Subjt:  NLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKPQYG

Query:  LYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNV-
         YDQ E  H+RKQ++EPLLLGWDTD IKD+S S+ TE +T A+ PISFADDHQP LHESFGAV LCSSPFP SN  +L SLPYSSLASYQIHGLS  NV 
Subjt:  LYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNV-

Query:  AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQ
         E+ ID T NN+HLNFSSVPK L QCD+YVDD G C   CAQ+ +W MN  LDD  ++PS++S+CASG VFDFGWKYLSGSKE CQTAYH+L+YPLDE++
Subjt:  AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQ

Query:  PTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI
        PT+  NE+   DSS     +Y  PFFIQPESFFQEGKV S+LTDKLSWDV RSEINV  ITEM+Y+
Subjt:  PTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI

A0A6J1GSJ6 uncharacterized protein LOC111456585 isoform X2 (e value: 3.6e-220; % identity: 70.1)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT
        R KVGDLD  RPLTCRRD+SP+SLK  HV+   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQ KKFNS+D FR ERA  VENR +DFTSH  VE VT
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVT

Query:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE--------------------------NGYGV
        P+NFNS+HLPLGNSSKIS+VDV HAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE                          + YGV
Subjt:  PVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE--------------------------NGYGV

Query:  ISRLLSRLIPEKNQYK-------FNNNLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLD
        IS LLSRLIPE NQYK       FNNNLEKLQ L GRC P  DYEH LNNS SPCRLNKSRGR   HSDFSTN+DD+NF VKYRTKEFD + + KMTLLD
Subjt:  ISRLLSRLIPEKNQYK-------FNNNLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLD

Query:  I-----TAAVENYRSFISSLFKPQYGLYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSN
              TAAVENYR  IS+LF  QYG YDQ E  H+RKQ++EPLLLGWDTD IKD+S S+ TE +T A+ PISFADDHQP LHESFGAV LCSSPFP SN
Subjt:  I-----TAAVENYRSFISSLFKPQYGLYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSN

Query:  RTNLNSLPYSSLASYQIHGLSWQNV-AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFG
          +L SLPYSSLASYQIHGLS  NV  E+ ID T NN+HLNFSSVPK L QCD+YVDD G C   CAQ+ +W MN  LDD  ++PS++S+CASG VFDFG
Subjt:  RTNLNSLPYSSLASYQIHGLSWQNV-AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDD-GHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFG

Query:  WKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI
        WKYLSGSKE CQTAYH+L+YPLDE++PT+  NE+   DSS     +Y  PFFIQPESFFQEGKV S+LTDKLSWDV RSEINV  ITEM+Y+
Subjt:  WKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI

A0A6J1JQC1 uncharacterized protein LOC111487990 isoform X4 (e value: 2.1e-220; % identity: 72.41)
Query:  RTKVGDLDRDRPLTCRRDSSPISLKERHVS---YEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVE
        R KVGDLD DRPLTCRRD+SP+SLKE HV+      NAKTSEFAFFKKFK DA+ RFSSSL RQKELQSK+FNS+D FRAERA  VENR +DF SH  VE
Subjt:  RTKVGDLDRDRPLTCRRDSSPISLKERHVS---YEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVE

Query:  KVTPVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYK-------
         VTP+NFNS+HLPLGNSSKIS+VDV HAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYE  YGVIS LLSRLIPE NQYK       
Subjt:  KVTPVNFNSLHLPLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYK-------

Query:  FNNNLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKP
        FNNNLEKLQ L GRC P  DYEH LNNS SPCRLNKSRGR   HSDFSTN+DD+NF VKYRTK+FD + + KMTLLD      TAAVENYRS I +LF P
Subjt:  FNNNLEKLQRLRGRCCPWFDYEHHLNNSLSPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDI-----TAAVENYRSFISSLFKP

Query:  QYGLYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQ
        QYG YDQ E  H+RKQ++EPLLLGWDTD IKD+S S++TE +T A+ PISFADDHQP L ESFGAV LCSSPFP S   NL  LPYSSL SYQIHGLS  
Subjt:  QYGLYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTIAKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQ

Query:  NV-AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDE
        NV  E+ ID TFNN+HLNFSSVPK L QCD+YV+D      CAQ+ +W+MN   +D  +HPS+ES+CASG VFDFGWKYLSGSKE CQTAYH+L+YPLDE
Subjt:  NV-AEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLDDGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDE

Query:  IQPTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEG-KVYSILTDKLSWDVARSEINVDDITEMNYI
        ++PT+  NE+   DSS     +YR PFFIQPESFFQEG KV S+LTDKLSWDV RSEINV  ITEM+Y+
Subjt:  IQPTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEG-KVYSILTDKLSWDVARSEINVDDITEMNYI

SwissProt top hits
No hits found

Arabidopsis top hits
AT2G20250.1 unknown protein (e value: 8.3e-04; % identity: 29.85)
Query:  DLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASH--RFSSSLPRQKELQSKKFNSSDCFRAERASP----VENRCKDFTSHHLVEKV
        DL R R ++         L E       +AKTSEFAFFKK K +  H    S S P  K  Q  K   S      RA P        C D          
Subjt:  DLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASH--RFSSSLPRQKELQSKKFNSSDCFRAERASP----VENRCKDFTSHHLVEKV

Query:  TPVN-----FNSLHLPL---------GNSSKISDVDVLHAHKTFKDIQSKQRNV--ENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLI
        TP++      + LH  L         G SS     D  +   +  +++S+   +  E  DIFS KR+KL Q++++       E   NG+ ++S LL+RL 
Subjt:  TPVN-----FNSLHLPL---------GNSSKISDVDVLHAHKTFKDIQSKQRNV--ENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLI

Query:  P
        P
Subjt:  P
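
In the alignment blocks above, the unlabeled middle line marks each column: a letter where the two residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how a percent identity can be derived from such a line, the example below counts those marks for the final segment of the KAE8646221.1 alignment. It assumes the middle line has the same length as the aligned segment, and `midline_stats` is a hypothetical helper. The percentages reported with each hit are computed over the whole alignment, so a single segment will not reproduce them.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned length) for one midline segment."""
    identities = sum(ch.isalpha() for ch in midline)   # letters mark identical residues
    positives = identities + midline.count("+")        # '+' marks conservative substitutions
    return identities, positives, len(midline)


# Final segment of the KAE8646221.1 alignment, copied from this record.
query = "DDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEMNYI"
midline = "DDVLVDYRPPFFIQPESFFQ GKVYSILTDKLSWDVARSEINVDDITEMNYI"

identities, positives, length = midline_stats(midline)
percent_identity = 100.0 * identities / length
print(f"{identities}/{length} identical ({percent_identity:.2f}%)")
```

To approximate the value reported for a hit, the counts from every segment of that hit would need to be summed before dividing.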


Sequences
CDS sequence
ATGAGAACAAAAGTTGGAGATCTTGATCGTGATAGACCTTTGACATGCAGAAGAGATTCTTCTCCCATATCCTTGAAAGAAAGACATGTTTCGTACGAAAAGAATGCAAA
AACTTCTGAGTTTGCATTTTTTAAAAAGTTCAAGGAAGATGCAAGCCATAGATTCAGCTCATCTCTTCCACGTCAGAAGGAACTTCAATCAAAAAAGTTCAACTCGAGTG
ATTGTTTCAGAGCTGAGAGAGCAAGCCCTGTTGAAAACCGGTGTAAAGACTTCACATCACATCATCTTGTTGAGAAGGTCACTCCTGTTAACTTTAACTCGTTGCATTTA
CCTCTGGGTAATTCATCCAAAATTTCAGATGTAGATGTCTTACATGCTCATAAAACATTTAAGGATATACAGAGCAAACAGAGGAACGTGGAAAATGATGATATTTTTAG
TAGAAAGAGGCAGAAATTACGTCAGTTCATTCAGAATATGTCGTTCCATGGTACTGGTGAATCTTATGAGAATGGGTATGGTGTTATTTCCAGGCTACTTAGCCGGCTTA
TACCAGAGAAAAATCAGTATAAGTTTAATAATAACTTGGAAAAATTACAAAGGTTGCGTGGAAGGTGCTGTCCTTGGTTTGATTATGAGCATCATTTGAATAATAGTTTA
TCACCTTGTCGCTTGAATAAATCAAGAGGAAGATCTTTTTCTCATTCTGATTTCTCAACCAATAGCGATGACAACAATTTCCAAGTTAAATACAGAACCAAGGAGTTCGA
CTGCAATGCAGACAGAAAAATGACTTTGCTTGATATCACTGCTGCAGTTGAAAACTATAGATCATTTATTTCCAGCCTTTTCAAGCCACAATACGGTTTATATGATCAAG
ATGAACATTCGCACCTAAGAAAGCAAAAGCTAGAACCTCTTCTGCTCGGTTGGGATACTGACTACATAAAAGATGAAAGTTTTTCTCAACTTACAGAGTTGAACACAATT
GCCAAGTCACCAATTTCATTCGCTGATGATCATCAGCCAACCTTGCACGAGAGTTTTGGTGCTGTTCCGCTGTGTTCATCCCCCTTCCCTTGCAGTAATCGTACAAACTT
AAACTCATTGCCATACTCCAGTTTAGCCAGCTATCAAATCCATGGATTAAGTTGGCAAAATGTAGCAGAGAAAGATATCGATCCCACTTTCAACAACCTGCATTTGAATT
TCTCATCAGTACCCAAATTTCTTCATCAATGCGATAGTTATGTCGATGACGGACACTGTCATGACTTGTGTGCACAAAACACTGATTGGGTTATGAATAACGTGTTGGAT
GACGGATCCCAACATCCTTCTGTTGAAAGTCTGTGTGCTTCTGGCCATGTCTTTGATTTTGGATGGAAATACCTCTCAGGCTCAAAGGAACAATGCCAAACAGCTTATCA
TATACTTAAGTACCCACTGGATGAAATACAACCCACAGCCCTAACCAATGAAAAATGGTGTAATGACAGTTCAGATGATGTGCTTGTGGATTATCGACCACCCTTCTTTA
TCCAACCCGAGTCATTCTTTCAAGAAGGAAAGGTATACTCTATACTGACTGATAAACTTAGCTGGGATGTAGCCAGAAGTGAAATAAATGTTGATGATATAACTGAAATG
AATTACATATGA
mRNA sequence
ATGAGAACAAAAGTTGGAGATCTTGATCGTGATAGACCTTTGACATGCAGAAGAGATTCTTCTCCCATATCCTTGAAAGAAAGACATGTTTCGTACGAAAAGAATGCAAA
AACTTCTGAGTTTGCATTTTTTAAAAAGTTCAAGGAAGATGCAAGCCATAGATTCAGCTCATCTCTTCCACGTCAGAAGGAACTTCAATCAAAAAAGTTCAACTCGAGTG
ATTGTTTCAGAGCTGAGAGAGCAAGCCCTGTTGAAAACCGGTGTAAAGACTTCACATCACATCATCTTGTTGAGAAGGTCACTCCTGTTAACTTTAACTCGTTGCATTTA
CCTCTGGGTAATTCATCCAAAATTTCAGATGTAGATGTCTTACATGCTCATAAAACATTTAAGGATATACAGAGCAAACAGAGGAACGTGGAAAATGATGATATTTTTAG
TAGAAAGAGGCAGAAATTACGTCAGTTCATTCAGAATATGTCGTTCCATGGTACTGGTGAATCTTATGAGAATGGGTATGGTGTTATTTCCAGGCTACTTAGCCGGCTTA
TACCAGAGAAAAATCAGTATAAGTTTAATAATAACTTGGAAAAATTACAAAGGTTGCGTGGAAGGTGCTGTCCTTGGTTTGATTATGAGCATCATTTGAATAATAGTTTA
TCACCTTGTCGCTTGAATAAATCAAGAGGAAGATCTTTTTCTCATTCTGATTTCTCAACCAATAGCGATGACAACAATTTCCAAGTTAAATACAGAACCAAGGAGTTCGA
CTGCAATGCAGACAGAAAAATGACTTTGCTTGATATCACTGCTGCAGTTGAAAACTATAGATCATTTATTTCCAGCCTTTTCAAGCCACAATACGGTTTATATGATCAAG
ATGAACATTCGCACCTAAGAAAGCAAAAGCTAGAACCTCTTCTGCTCGGTTGGGATACTGACTACATAAAAGATGAAAGTTTTTCTCAACTTACAGAGTTGAACACAATT
GCCAAGTCACCAATTTCATTCGCTGATGATCATCAGCCAACCTTGCACGAGAGTTTTGGTGCTGTTCCGCTGTGTTCATCCCCCTTCCCTTGCAGTAATCGTACAAACTT
AAACTCATTGCCATACTCCAGTTTAGCCAGCTATCAAATCCATGGATTAAGTTGGCAAAATGTAGCAGAGAAAGATATCGATCCCACTTTCAACAACCTGCATTTGAATT
TCTCATCAGTACCCAAATTTCTTCATCAATGCGATAGTTATGTCGATGACGGACACTGTCATGACTTGTGTGCACAAAACACTGATTGGGTTATGAATAACGTGTTGGAT
GACGGATCCCAACATCCTTCTGTTGAAAGTCTGTGTGCTTCTGGCCATGTCTTTGATTTTGGATGGAAATACCTCTCAGGCTCAAAGGAACAATGCCAAACAGCTTATCA
TATACTTAAGTACCCACTGGATGAAATACAACCCACAGCCCTAACCAATGAAAAATGGTGTAATGACAGTTCAGATGATGTGCTTGTGGATTATCGACCACCCTTCTTTA
TCCAACCCGAGTCATTCTTTCAAGAAGGAAAGGTATACTCTATACTGACTGATAAACTTAGCTGGGATGTAGCCAGAAGTGAAATAAATGTTGATGATATAACTGAAATG
AATTACATATGA
Protein sequence
MRTKVGDLDRDRPLTCRRDSSPISLKERHVSYEKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRAERASPVENRCKDFTSHHLVEKVTPVNFNSLHL
PLGNSSKISDVDVLHAHKTFKDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFHGTGESYENGYGVISRLLSRLIPEKNQYKFNNNLEKLQRLRGRCCPWFDYEHHLNNSL
SPCRLNKSRGRSFSHSDFSTNSDDNNFQVKYRTKEFDCNADRKMTLLDITAAVENYRSFISSLFKPQYGLYDQDEHSHLRKQKLEPLLLGWDTDYIKDESFSQLTELNTI
AKSPISFADDHQPTLHESFGAVPLCSSPFPCSNRTNLNSLPYSSLASYQIHGLSWQNVAEKDIDPTFNNLHLNFSSVPKFLHQCDSYVDDGHCHDLCAQNTDWVMNNVLD
DGSQHPSVESLCASGHVFDFGWKYLSGSKEQCQTAYHILKYPLDEIQPTALTNEKWCNDSSDDVLVDYRPPFFIQPESFFQEGKVYSILTDKLSWDVARSEINVDDITEM
NYI
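
The protein sequence corresponds to the CDS read three bases at a time. The sketch below is one way to check that correspondence, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 39 and last 12 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 39 bases and last 12 bases of the CDS listed in this record.
cds_start = "ATGAGAACAAAAGTTGGAGATCTTGATCGTGATAGACCT"
cds_end = "AATTACATATGA"

print(translate(cds_start))
print(translate(cds_end))
```

Joining the CDS lines into one string and passing it to `translate` would give the full-length check against the protein sequence.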