CuGenDBv2

Cmc01g0010321 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0010321
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr01:5153141..5169668
RNA-Seq Expression: Cmc01g0010321
Synteny: Cmc01g0010321
Gene Ontology terms: GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
    GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains: NA
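The genome location in the record above has the form `seqid:start..end`. The following is a minimal sketch of how such a string could be split into its parts. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

# Pattern for "seqid:start..end"; the seqid is taken to be everything before the colon.
LOCATION_RE = re.compile(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)")


def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, span).

    The span is computed as end - start + 1, which is only correct under the
    assumption that both coordinates are 1-based and inclusive.
    """
    match = LOCATION_RE.fullmatch(loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    start = int(match["start"])
    end = int(match["end"])
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return match["seqid"], start, end, end - start + 1


seqid, start, end, span = parse_location("CMiso1.1chr01:5153141..5169668")
print(seqid, start, end, span)
```

Under that assumption the gene region spans 16528 bp, which is much longer than the mRNA listed further down and is consistent with a gene model that contains introns.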


Homology
GenBank top hits (e value, % identity, alignment)
KAA0050636.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 6.9e-245; % identity: 89.38)
Query:  TPVNFNSLHLPLGNPSKISDVDVKH------------AHKTFEDIQSKQRNVE-NDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIPER
        TP+ FN     LGN      +D K               K  E I    R ++   DIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP+R
Subjt:  TPVNFNSLHLPLGNPSKISDVDVKH------------AHKTFEDIQSKQRNVE-NDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIPER

Query:  NHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVEN
        NHYK       FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVEN
Subjt:  NHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVEN

Query:  YRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNL
        YRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNL
Subjt:  YRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNL

Query:  ASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQT
        ASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQT
Subjt:  ASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQT

Query:  SYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
        SYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
Subjt:  SYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK

KAE8646221.1 hypothetical protein Csa_016557 [Cucumis sativus] (e value: 1.3e-280; % identity: 87.92)
Query:  DSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVENRCKDFTSHHL
        +SSR+KRTKVGDLDRDRPLTCRRDSSPIS KER V+Y KNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNS+DCFRE ASPVEN CKD+TSHHL
Subjt:  DSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVENRCKDFTSHHL

Query:  VEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIPERNHYKFNNNL
        VEKVTPVNFNSLHLPLGN SKISDVDV HAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYEKGYGVISRLLSRLIPERN YKFNNNL
Subjt:  VEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIPERNHYKFNNNL

Query:  EKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFISSLFNPQYSLY
        EKIQ+LRGRC P LDYEHHLNNSLSPCRLN SRGRS  HSDFSTNS DNNFQ+KYRTKEFDC+VDRKMTLL+V     TAAVENYRSFISSLF PQY LY
Subjt:  EKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFISSLFNPQYSLY

Query:  DQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQIQGLSWQNVSKE
        DQDEH HLRKQKL+PLLLGWDTDYIKDESSSQLTELNT AKSPISFADD QPT+HESFGA  LCSSPFPSSNR N NSLPYS+LAS QI GLSWQNV+  
Subjt:  DQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQIQGLSWQNVSKE

Query:  EDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHILKYPLDEIQPTA
        EDI  TFNNLHLNFSSVPK LHQ NS VDDGGCHDLCAQN DWVMNNV++D SQHPS+ESLCASGLVFDFG KYLS SKEQ QT+YHILKYPLDEIQPTA
Subjt:  EDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHILKYPLDEIQPTA

Query:  LINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY
        L NEEWSNDSSDDVLVDY PPF+IQPESFFQ GKVYS+LTDKLS WDV RSEINVDDITEMNY
Subjt:  LINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY

KAG6576069.1 hypothetical protein SDJN03_26708, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.8e-241; % identity: 74.74)
Query:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE
        MKR+ +P+IS DS+ RKR KVGDLD  RPLTCRRD+SP+SLKE HVT   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQSKKFNS+D FRE A  VE
Subjt:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE

Query:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP
        NR +DFTSH  VE VTP+NFNS+HLPLGN SKIS+VDVKHAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYEK YGVIS LLSRLIP
Subjt:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP

Query:  ERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAV
        E N YK       FNNNLEK+Q L GRC+PRLDYEH LNNS SPCRLN SRGR  FHSDFSTN++D+NF VKYRTKEFD DV+ KMTLLD N SP TAAV
Subjt:  ERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAV

Query:  ENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYS
        ENYR  IS+LFN QY  YDQ E LH+RKQ++EPLLLGWDTD IKD+ SS+ TE +TFA+ PISFADDHQP LHESFGAVALCSSPFPSSN  +  SLPYS
Subjt:  ENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYS

Query:  NLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQ
        +LASYQI GLS  NV KEE IDATFNN+HLNFSSVPKCL QC++YVDD G C   CAQ+A+W MN  ++DE ++PS++S+CASG VFDFGWKYLSGSKE 
Subjt:  NLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQ

Query:  CQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY
        CQT+YH+L+YPLDE++PT+ +NEE + DSS     +YG PF+IQPESFFQEGKV S+LTDKLS WDV RSEINV  ITEM+Y
Subjt:  CQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY

XP_008461985.1 PREDICTED: uncharacterized protein LOC103500465 [Cucumis melo] (e value: 0.0e+00; % identity: 100)
Query:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE
        MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE
Subjt:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE

Query:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP
        NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP
Subjt:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP

Query:  ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI
        ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI
Subjt:  ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI

Query:  SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI
        SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI
Subjt:  SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI

Query:  QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL
        QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL
Subjt:  QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL

Query:  KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
        KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
Subjt:  KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK

XP_011659159.1 uncharacterized protein LOC101207408 [Cucumis sativus] (e value: 4.2e-287; % identity: 88.15)
Query:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE
        MKRNFQP ISNDSSR+KRTKVGDLDRDRPLTCRRDSSPIS KER V+Y KNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNS+DCFRE ASPVE
Subjt:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE

Query:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP
        N CKD+TSHHLVEKVTPVNFNSLHLPLGN SKISDVDV HAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYEKGYGVISRLLSRLIP
Subjt:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP

Query:  ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI
        ERN YKFNNNLEKIQ+LRGRC P LDYEHHLNNSLSPCRLN SRGRS  HSDFSTNS DNNFQ+KYRTKEFDC+VDRKMTLL+V     TAAVENYRSFI
Subjt:  ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI

Query:  SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI
        SSLF PQY LYDQDEH HLRKQKL+PLLLGWDTDYIKDESSSQLTELNT AKSPISFADD QPT+HESFGA  LCSSPFPSSNR N NSLPYS+LAS QI
Subjt:  SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI

Query:  QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL
         GLSWQNV+  EDI  TFNNLHLNFSSVPK LHQ NS VDDGGCHDLCAQN DWVMNNV++D SQHPS+ESLCASGLVFDFG KYLS SKEQ QT+YHIL
Subjt:  QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL

Query:  KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY
        KYPLDEIQPTAL NEEWSNDSSDDVLVDY PPF+IQPESFFQ GKVYS+LTDKLS WDV RSEINVDDITEMNY
Subjt:  KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CFT3 uncharacterized protein LOC103500465 (e value: 0.0e+00; % identity: 100)
Query:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE
        MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE
Subjt:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE

Query:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP
        NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP
Subjt:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP

Query:  ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI
        ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI
Subjt:  ERNHYKFNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFI

Query:  SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI
        SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI
Subjt:  SSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQI

Query:  QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL
        QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL
Subjt:  QGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHIL

Query:  KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
        KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
Subjt:  KYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK

A0A5A7U8Y2 Reverse transcriptase (e value: 3.3e-245; % identity: 89.38)
Query:  TPVNFNSLHLPLGNPSKISDVDVKH------------AHKTFEDIQSKQRNVE-NDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIPER
        TP+ FN     LGN      +D K               K  E I    R ++   DIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIP+R
Subjt:  TPVNFNSLHLPLGNPSKISDVDVKH------------AHKTFEDIQSKQRNVE-NDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIPER

Query:  NHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVEN
        NHYK       FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVEN
Subjt:  NHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVEN

Query:  YRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNL
        YRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNL
Subjt:  YRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNL

Query:  ASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQT
        ASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQT
Subjt:  ASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQT

Query:  SYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
        SYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK
Subjt:  SYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNYK

A0A6J1GQP2 uncharacterized protein LOC111456585 isoform X1 (e value: 2.5e-232; % identity: 70.94)
Query:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFR-ESASPV
        MKR+ +P+IS DS+ RKR KVGDLD  RPLTCRRD+SP+SLK  HVT   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQ KKFNS+D FR E A  V
Subjt:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFR-ESASPV

Query:  ENRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEK-------------
        ENR +DFTSH  VE VTP+NFNS+HLPLGN SKIS+VDVKHAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYEK             
Subjt:  ENRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEK-------------

Query:  -------------GYGVISRLLSRLIPERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKY
                      YGVIS LLSRLIPE N YK       FNNNLEK+Q L GRC+PRLDYEH LNNS SPCRLN SRGR  FHSDFSTN++D+NF VKY
Subjt:  -------------GYGVISRLLSRLIPERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKY

Query:  RTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLH
        RTKEFD DV+ KMTLLD N SP TAAVENYR  IS+LFN QY  YDQ E LH+RKQ++EPLLLGWDTD IKD+ SS+ TE +TFA+ PISFADDHQP LH
Subjt:  RTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLH

Query:  ESFGAVALCSSPFPSSNRINFNSLPYSNLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQ
        ESFGAVALCSSPFPSSN  +  SLPYS+LASYQI GLS  NV KEE IDAT NN+HLNFSSVPKCL QC++YVDD G C   CAQ+A+W MN  ++DE +
Subjt:  ESFGAVALCSSPFPSSNRINFNSLPYSNLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQ

Query:  HPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEIN
        +PS++S+CASG VFDFGWKYLSGSKE CQT+YH+L+YPLDE++PT+ +NEE + DSS     +YG PF+IQPESFFQEGKV S+LTDKLS WDV RSEIN
Subjt:  HPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEIN

Query:  VDDITEMNY
        V  ITEM+Y
Subjt:  VDDITEMNY

A0A6J1GS11 uncharacterized protein LOC111456585 isoform X3 (e value: 1.3e-236; % identity: 74.1)
Query:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFR-ESASPV
        MKR+ +P+IS DS+ RKR KVGDLD  RPLTCRRD+SP+SLK  HVT   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQ KKFNS+D FR E A  V
Subjt:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFR-ESASPV

Query:  ENRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLI
        ENR +DFTSH  VE VTP+NFNS+HLPLGN SKIS+VDVKHAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYEK YGVIS LLSRLI
Subjt:  ENRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLI

Query:  PERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAA
        PE N YK       FNNNLEK+Q L GRC+PRLDYEH LNNS SPCRLN SRGR  FHSDFSTN++D+NF VKYRTKEFD DV+ KMTLLD N SP TAA
Subjt:  PERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAA

Query:  VENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPY
        VENYR  IS+LFN QY  YDQ E LH+RKQ++EPLLLGWDTD IKD+ SS+ TE +TFA+ PISFADDHQP LHESFGAVALCSSPFPSSN  +  SLPY
Subjt:  VENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPY

Query:  SNLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKE
        S+LASYQI GLS  NV KEE IDAT NN+HLNFSSVPKCL QC++YVDD G C   CAQ+A+W MN  ++DE ++PS++S+CASG VFDFGWKYLSGSKE
Subjt:  SNLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKE

Query:  QCQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY
         CQT+YH+L+YPLDE++PT+ +NEE + DSS     +YG PF+IQPESFFQEGKV S+LTDKLS WDV RSEINV  ITEM+Y
Subjt:  QCQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINVDDITEMNY

A0A6J1GSJ6 uncharacterized protein LOC111456585 isoform X2 (e value: 1.0e-233; % identity: 71.05)
Query:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE
        MKR+ +P+IS DS+ RKR KVGDLD  RPLTCRRD+SP+SLK  HVT   NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQ KKFNS+D FRE A  VE
Subjt:  MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVE

Query:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEK--------------
        NR +DFTSH  VE VTP+NFNS+HLPLGN SKIS+VDVKHAHKTF+DIQSKQRNVENDDIFSRKRQKLRQFIQNMSF GTGESYEK              
Subjt:  NRCKDFTSHHLVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEK--------------

Query:  ------------GYGVISRLLSRLIPERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYR
                     YGVIS LLSRLIPE N YK       FNNNLEK+Q L GRC+PRLDYEH LNNS SPCRLN SRGR  FHSDFSTN++D+NF VKYR
Subjt:  ------------GYGVISRLLSRLIPERNHYK-------FNNNLEKIQKLRGRCYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYR

Query:  TKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHE
        TKEFD DV+ KMTLLD N SP TAAVENYR  IS+LFN QY  YDQ E LH+RKQ++EPLLLGWDTD IKD+ SS+ TE +TFA+ PISFADDHQP LHE
Subjt:  TKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLGWDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHE

Query:  SFGAVALCSSPFPSSNRINFNSLPYSNLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQH
        SFGAVALCSSPFPSSN  +  SLPYS+LASYQI GLS  NV KEE IDAT NN+HLNFSSVPKCL QC++YVDD G C   CAQ+A+W MN  ++DE ++
Subjt:  SFGAVALCSSPFPSSNRINFNSLPYSNLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVDDGG-CHDLCAQNADWVMNNVVNDESQH

Query:  PSVESLCASGLVFDFGWKYLSGSKEQCQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINV
        PS++S+CASG VFDFGWKYLSGSKE CQT+YH+L+YPLDE++PT+ +NEE + DSS     +YG PF+IQPESFFQEGKV S+LTDKLS WDV RSEINV
Subjt:  PSVESLCASGLVFDFGWKYLSGSKEQCQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVLTDKLSCWDVVRSEINV

Query:  DDITEMNY
          ITEM+Y
Subjt:  DDITEMNY

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G20250.1 unknown protein (e value: 3.0e-04; % identity: 28.17)
Query:  SNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASH--RFSSSLPRQKELQSKKFNSSDCFRESASP----VENRC
        ++D  RR+R    D D   P           L E     + +AKTSEFAFFKK K +  H    S S P  K  Q  K      F   A P        C
Subjt:  SNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASH--RFSSSLPRQKELQSKKFNSSDCFRESASP----VENRC

Query:  KDFTSHHLVEKVTPVN-----FNSLHLPL---------GNPSKISDVDVKHAHKTFEDIQSKQRNV--ENDDIFSRKRQKLRQFIQNMSFRGTGESYEKG
         D          TP++      + LH  L         G  S     D ++   + ++++S+   +  E  DIFS KR+KL Q++++       E    G
Subjt:  KDFTSHHLVEKVTPVN-----FNSLHLPL---------GNPSKISDVDVKHAHKTFEDIQSKQRNV--ENDDIFSRKRQKLRQFIQNMSFRGTGESYEKG

Query:  YGVISRLLSRLIP
        + ++S LL+RL P
Subjt:  YGVISRLLSRLIP


Sequences
CDS sequence
ATGAAGCGCAATTTCCAACCTGCCATTTCTAATGATAGTAGCCGTCGGAAGAGAACAAAAGTTGGAGATCTTGATCGTGATAGACCTTTGACATGCAGAAGAGATTCTTC
TCCCATATCCTTGAAAGAAAGACATGTTACATACGCAAAGAATGCAAAAACTTCTGAGTTTGCATTTTTTAAAAAGTTCAAGGAAGATGCAAGCCATAGATTCAGCTCAT
CTCTTCCTCGTCAGAAGGAACTTCAATCAAAAAAGTTCAACTCGAGTGATTGTTTCAGAGAGAGCGCAAGCCCTGTTGAAAACCGCTGTAAAGACTTCACATCACATCAT
CTTGTTGAGAAGGTCACTCCTGTTAACTTTAACTCGTTGCATTTACCTCTGGGTAATCCATCCAAAATTTCAGATGTAGATGTGAAACACGCTCATAAAACATTTGAGGA
TATACAGAGCAAACAGAGGAACGTGGAAAATGATGATATTTTTAGTAGAAAGAGGCAGAAATTACGTCAGTTCATTCAGAATATGTCGTTCCGTGGAACGGGTGAATCTT
ATGAGAAGGGGTATGGTGTTATTTCCAGGCTACTTAGCCGGCTTATACCAGAGAGAAATCATTATAAGTTTAATAATAACTTGGAAAAAATACAAAAGTTGCGTGGAAGG
TGCTATCCAAGGCTTGATTATGAGCATCATTTGAATAATAGTTTATCACCTTGTCGTTTGAATAATTCAAGAGGAAGATCTTCTTTTCATTCTGATTTCTCTACCAATAG
CAATGACAACAATTTCCAAGTTAAGTACAGAACCAAGGAGTTCGACTGCGATGTAGACAGAAAAATGACTTTGCTGGATGTCAATGGTTCACCTCTCACCGCTGCAGTTG
AAAACTATAGATCATTTATTTCCAGCCTTTTCAATCCGCAATATAGTTTATATGATCAAGATGAACATTTGCACCTAAGAAAGCAAAAGCTAGAACCTCTTCTGCTTGGT
TGGGATACTGACTACATAAAAGATGAAAGTTCTTCTCAACTTACAGAGTTGAACACATTTGCCAAGTCACCAATTTCATTCGCTGATGATCATCAGCCAACCTTGCACGA
GAGTTTTGGTGCTGTTGCGCTGTGTTCATCCCCTTTCCCTTCCAGTAATCGTATAAACTTCAACTCATTGCCATACTCCAATTTAGCCAGCTATCAAATCCAAGGATTAA
GTTGGCAAAATGTATCAAAGGAGGAAGATATTGATGCCACTTTCAACAACCTGCATTTGAATTTCTCATCAGTACCCAAATGTCTTCATCAATGCAATAGCTATGTTGAT
GACGGAGGCTGTCATGACTTGTGTGCACAAAACGCTGATTGGGTTATGAATAATGTGGTGAATGACGAATCCCAACATCCTTCTGTTGAAAGTCTGTGTGCTTCTGGCCT
TGTCTTTGATTTTGGATGGAAATACCTCTCAGGCTCAAAGGAACAATGCCAAACATCTTATCATATACTTAAGTACCCACTGGATGAAATACAACCCACAGCCCTAATCA
ATGAAGAATGGAGTAATGACAGTTCAGATGATGTGCTTGTGGATTATGGACCGCCCTTCTATATCCAACCCGAGTCATTCTTTCAAGAAGGGAAGGTATACTCTGTATTG
ACTGATAAACTTAGCTGCTGGGATGTAGTCAGAAGTGAAATAAATGTTGATGATATAACTGAAATGAATTACAAATGA
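The CDS above and the protein sequence listed on this page should correspond codon by codon. The following is a minimal sketch of checking that by translation. It assumes the standard nuclear genetic code and uses a hypothetical `translate` helper written for illustration, not a CuGenDB or library function. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAAGCGCAATTTCCAACCTGCCATTTCT"  # first 30 nt of the CDS
cds_end = "AATTACAAATGA"                      # last 12 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MKRNFQPAIS and NYK, which match the start and end of the protein sequence given on this page. Passing the full CDS, with its line breaks, to the same function would allow the whole protein to be compared.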
mRNA sequence
GTTGGTTTGTAATGAGATAGGGTTGACGTATTTAGATTGACTGTTCAACGTTGTAATATTATTTGCAATGGAGAGTTATTGAAAGCAATGTGGATTTTGGCGACCAATTT
TCCTCCTAACCCTTTCCTTGTCGCCATTCCAACAAAAACCATAATCCATTGTTTTATGGAGCACACATTACAATGAAGCGCAATTTCCAACCTGCCATTTCTAATGATAG
TAGCCGTCGGAAGAGAACAAAAGTTGGAGATCTTGATCGTGATAGACCTTTGACATGCAGAAGAGATTCTTCTCCCATATCCTTGAAAGAAAGACATGTTACATACGCAA
AGAATGCAAAAACTTCTGAGTTTGCATTTTTTAAAAAGTTCAAGGAAGATGCAAGCCATAGATTCAGCTCATCTCTTCCTCGTCAGAAGGAACTTCAATCAAAAAAGTTC
AACTCGAGTGATTGTTTCAGAGAGAGCGCAAGCCCTGTTGAAAACCGCTGTAAAGACTTCACATCACATCATCTTGTTGAGAAGGTCACTCCTGTTAACTTTAACTCGTT
GCATTTACCTCTGGGTAATCCATCCAAAATTTCAGATGTAGATGTGAAACACGCTCATAAAACATTTGAGGATATACAGAGCAAACAGAGGAACGTGGAAAATGATGATA
TTTTTAGTAGAAAGAGGCAGAAATTACGTCAGTTCATTCAGAATATGTCGTTCCGTGGAACGGGTGAATCTTATGAGAAGGGGTATGGTGTTATTTCCAGGCTACTTAGC
CGGCTTATACCAGAGAGAAATCATTATAAGTTTAATAATAACTTGGAAAAAATACAAAAGTTGCGTGGAAGGTGCTATCCAAGGCTTGATTATGAGCATCATTTGAATAA
TAGTTTATCACCTTGTCGTTTGAATAATTCAAGAGGAAGATCTTCTTTTCATTCTGATTTCTCTACCAATAGCAATGACAACAATTTCCAAGTTAAGTACAGAACCAAGG
AGTTCGACTGCGATGTAGACAGAAAAATGACTTTGCTGGATGTCAATGGTTCACCTCTCACCGCTGCAGTTGAAAACTATAGATCATTTATTTCCAGCCTTTTCAATCCG
CAATATAGTTTATATGATCAAGATGAACATTTGCACCTAAGAAAGCAAAAGCTAGAACCTCTTCTGCTTGGTTGGGATACTGACTACATAAAAGATGAAAGTTCTTCTCA
ACTTACAGAGTTGAACACATTTGCCAAGTCACCAATTTCATTCGCTGATGATCATCAGCCAACCTTGCACGAGAGTTTTGGTGCTGTTGCGCTGTGTTCATCCCCTTTCC
CTTCCAGTAATCGTATAAACTTCAACTCATTGCCATACTCCAATTTAGCCAGCTATCAAATCCAAGGATTAAGTTGGCAAAATGTATCAAAGGAGGAAGATATTGATGCC
ACTTTCAACAACCTGCATTTGAATTTCTCATCAGTACCCAAATGTCTTCATCAATGCAATAGCTATGTTGATGACGGAGGCTGTCATGACTTGTGTGCACAAAACGCTGA
TTGGGTTATGAATAATGTGGTGAATGACGAATCCCAACATCCTTCTGTTGAAAGTCTGTGTGCTTCTGGCCTTGTCTTTGATTTTGGATGGAAATACCTCTCAGGCTCAA
AGGAACAATGCCAAACATCTTATCATATACTTAAGTACCCACTGGATGAAATACAACCCACAGCCCTAATCAATGAAGAATGGAGTAATGACAGTTCAGATGATGTGCTT
GTGGATTATGGACCGCCCTTCTATATCCAACCCGAGTCATTCTTTCAAGAAGGGAAGGTATACTCTGTATTGACTGATAAACTTAGCTGCTGGGATGTAGTCAGAAGTGA
AATAAATGTTGATGATATAACTGAAATGAATTACAAATGATAGCAGCTTCAATTTTTAATGACCCTCTATGCTTAATCCTGTGTCCTCTTTTGGCCTATCTTTTTATTTT
ACATTGTGGGGTAAAGAGAGATGGAGATTTCTGGCTAGAAGATCATTTCCACTAACATAATTGGGGAGAGGCGGAGGTTGCTTGGTTGAGAGAGAACTCAACATGATTTT
CTTGGTGGAGAATGAATGAAATAAACTCTGTCAGGTAGATTTTTCAGGCTTGGATGAAAAAACATTTCAAAGAGCTTCAAGTTGTGTTGTAAACAAAAAGATTCTTGGAT
TGAGTCATTTCTTGAGAAAACCACTCTTATACATCCTGTCCACATATTGCTTTTTAGTTGTTGCATTTGGATGAAAATGGTATTTGGAAGTCATGGGAACACTGTGCATT
TTAGTGAAGCTAAATGATCTGAAGCCGAGCATAGTTTTGTGTATAAAGACATCAATTGCCATCGTTTGTTTGTCAGAAGAAGAGGGTTGGTTAATTGAACAAGTGACAAC
CACCTAAGAGGCCCTGCCTAGGCCAACGGCTCAACCTGCCACTGATACCTACCCGAGTCCTTTGGAAAATGTATGAATTTTTTTTAATCCTTATAATCAACTTTAGAAAA
TAACTGAAAGTATAGTATTTCTTTTCTTTAGAAAAATGTATAAATAACTGGTATTCCTATGCAATGATTGATAGTAGCGACATATTCTTTTAAGTTCTATTTTTGTAATA
TTTTATGCATATTAATGCTAGCTTTGCCCATTACTCCTCATAAATATATATACCATATATGTTACTCTTTTCAAGCACTTTTACAACATTGAGAGC
Protein sequence
MKRNFQPAISNDSSRRKRTKVGDLDRDRPLTCRRDSSPISLKERHVTYAKNAKTSEFAFFKKFKEDASHRFSSSLPRQKELQSKKFNSSDCFRESASPVENRCKDFTSHH
LVEKVTPVNFNSLHLPLGNPSKISDVDVKHAHKTFEDIQSKQRNVENDDIFSRKRQKLRQFIQNMSFRGTGESYEKGYGVISRLLSRLIPERNHYKFNNNLEKIQKLRGR
CYPRLDYEHHLNNSLSPCRLNNSRGRSSFHSDFSTNSNDNNFQVKYRTKEFDCDVDRKMTLLDVNGSPLTAAVENYRSFISSLFNPQYSLYDQDEHLHLRKQKLEPLLLG
WDTDYIKDESSSQLTELNTFAKSPISFADDHQPTLHESFGAVALCSSPFPSSNRINFNSLPYSNLASYQIQGLSWQNVSKEEDIDATFNNLHLNFSSVPKCLHQCNSYVD
DGGCHDLCAQNADWVMNNVVNDESQHPSVESLCASGLVFDFGWKYLSGSKEQCQTSYHILKYPLDEIQPTALINEEWSNDSSDDVLVDYGPPFYIQPESFFQEGKVYSVL
TDKLSCWDVVRSEINVDDITEMNYK