; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G09960 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G09960
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr09:8598614..8602187
RNA-Seq ExpressionClc09G09960
SyntenyClc09G09960
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646221.1 hypothetical protein Csa_016557 [Cucumis sativus]6.9e-23774.83Show/hide
Query:  MIKFIDSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDF
        ++  ++S R+KRTKV DLDRDRP TCRRDSSPIS KER V+Y  NAKTSEFAFFKK KEDA+HRFSSSLP QKELQSKKFNS+D FRERASPVEN  KD+
Subjt:  MIKFIDSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDF

Query:  RSHHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGV
         SHHLVE VTPVNFNSLHL LGNSSKIS+VDV HAHKT KD QSKQRNVEND                DIFSRKRQKLRQFIQ+MSFH TG+SYEKGYGV
Subjt:  RSHHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGV

Query:  ISALLSRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLT
        IS LLSRLIPE NQYKFNNN EK+QRL GRCCP L YEHHLNNS SPC LNKS G  F HS FST+S+DNNFQ++YRTKEFDC+V+RKMTLL+V     T
Subjt:  ISALLSRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLT

Query:  AAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSL
        AAVENYRS IS LFKPQYGLYDQ E  H+RKQ+L+PLLLGWDTD IKDESSSQ+TELNT A+ PISFADD QP +H+SFGA  LCSSPFPSSNR N+ SL
Subjt:  AAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSL

Query:  PCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---K
        P SSL S QIHGLSW NV   EDI  TF N+HLNFSSVPK LHQ ++ VDDGG HD CAQ+ DW+M +VLDD  ++PS+ESLCASG+VFDFG KYL   K
Subjt:  PCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---K

Query:  EQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKV-DITEMNYI
        EQ QTAYHILKYPLDE++ TA  NE+ +NDSSDDVL DYRP FFIQPESFFQ GKV SI TDKLSWD ARSEI V DITEMNYI
Subjt:  EQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKV-DITEMNYI

KAG6576069.1 hypothetical protein SDJN03_26708, partial [Cucurbita argyrosperma subsp. sororia]2.1e-23374.23Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL
        DS  RKR KV DLD  RP TCRRD+SP+SLKE HVT PNNAKTSEFAFFKK KEDAN RFSSSL  QKELQSKKFNS+D FRERA  VENR +DF SH  
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL

Query:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL
        VENVTP+NFNS+HL LGNSSKISEVDVKHAHKT KD QSKQRNVEND                DIFSRKRQKLRQFIQ+MSFH TG+SYEK YGVIS LL
Subjt:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL

Query:  SRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSP
        SRLIPESNQYK       FNNN EKLQ LPGRC PRL YEH LNNSSSPC LNKS G +  HS FST++DD+NF V+YRTKEFD DVE KMTLLD N SP
Subjt:  SRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSP

Query:  LTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIY
         TAAVENYR LIS LF  QYG YDQGE LHIRKQE+EPLLLGWDTDDIKD+ SS+ TE +TFAEPPISFADDHQPNLH+SFGAVALCSSPFPSSN R++Y
Subjt:  LTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIY

Query:  SLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGG-SHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL-
        SLP SSL SYQIHGLS  NVEKEE IDATF NVHLNFSSVPKCL QCDNYVDD G    FCAQSA+W M   LDDE R PS++S+CASG VFDFG KYL 
Subjt:  SLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGG-SHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL-

Query:  --KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKVDITEMNYI
          KE CQTAYH+L+YPLDEM+ T+ +NE+CN DSS     +Y   FFIQPESFFQEGKV S+ TDKLSWD  RSEI V ITEM+Y+
Subjt:  --KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKVDITEMNYI

XP_008461985.1 PREDICTED: uncharacterized protein LOC103500465 [Cucumis melo]1.4e-25378.41Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL
        DS RRKRTKV DLDRDRP TCRRDSSPISLKERHVTY  NAKTSEFAFFKK KEDA+HRFSSSLP QKELQSKKFNSSD FRE ASPVENR KDF SHHL
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL

Query:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL
        VE VTPVNFNSLHL LGN SKIS+VDVKHAHKT +D QSKQRNVEND                DIFSRKRQKLRQFIQ+MSF  TG+SYEKGYGVIS LL
Subjt:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL

Query:  SRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVEN
        SRLIPE N YKFNNN EK+Q+L GRC PRL YEHHLNNS SPC LN S G    HS FST+S+DNNFQV+YRTKEFDCDV+RKMTLLDVNGSPLTAAVEN
Subjt:  SRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVEN

Query:  YRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSL
        YRS IS LF PQY LYDQ E LH+RKQ+LEPLLLGWDTD IKDESSSQ+TELNTFA+ PISFADDHQP LH+SFGAVALCSSPFPSSNR N  SLP S+L
Subjt:  YRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSL

Query:  DSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQCQT
         SYQI GLSW NV KEEDIDATF N+HLNFSSVPKCLHQC++YVDDGG HD CAQ+ADW+M +V++DE ++PSVESLCASG+VFDFG KYL   KEQCQT
Subjt:  DSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQCQT

Query:  AYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLS-WDAARSEIKV-DITEMNY
        +YHILKYPLDE++ TA +NE+ +NDSSDDVL DY P F+IQPESFFQEGKV S+ TDKLS WD  RSEI V DITEMNY
Subjt:  AYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLS-WDAARSEIKV-DITEMNY

XP_011659159.1 uncharacterized protein LOC101207408 [Cucumis sativus]6.9e-23775.65Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL
        DS R+KRTKV DLDRDRP TCRRDSSPIS KER V+Y  NAKTSEFAFFKK KEDA+HRFSSSLP QKELQSKKFNS+D FRERASPVEN  KD+ SHHL
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL

Query:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL
        VE VTPVNFNSLHL LGNSSKIS+VDV HAHKT KD QSKQRNVEND                DIFSRKRQKLRQFIQ+MSFH TG+SYEKGYGVIS LL
Subjt:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL

Query:  SRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVEN
        SRLIPE NQYKFNNN EK+QRL GRCCP L YEHHLNNS SPC LNKS G  F HS FST+S+DNNFQ++YRTKEFDC+V+RKMTLL+V     TAAVEN
Subjt:  SRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVEN

Query:  YRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSL
        YRS IS LFKPQYGLYDQ E  H+RKQ+L+PLLLGWDTD IKDESSSQ+TELNT A+ PISFADD QP +H+SFGA  LCSSPFPSSNR N+ SLP SSL
Subjt:  YRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSL

Query:  DSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQCQT
         S QIHGLSW NV   EDI  TF N+HLNFSSVPK LHQ ++ VDDGG HD CAQ+ DW+M +VLDD  ++PS+ESLCASG+VFDFG KYL   KEQ QT
Subjt:  DSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQCQT

Query:  AYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKV-DITEMNYI
        AYHILKYPLDE++ TA  NE+ +NDSSDDVL DYRP FFIQPESFFQ GKV SI TDKLSWD ARSEI V DITEMNYI
Subjt:  AYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKV-DITEMNYI

XP_038897158.1 uncharacterized protein LOC120085308 isoform X3 [Benincasa hispida]4.4e-23181.01Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFRSHH
        DS R KR+KV DLD DRP T RRDSSPISLKERHVTYPNNAKTSEFAFFKK +EDANHRFSSSLP QKELQSK FN+SD FR ERA+PVENR KDF SHH
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFRSHH

Query:  LVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISAL
        LVENVTPVNFNS+HL L NSSKISEVDVKH HKTLKD +SKQ NVEND                DIFSRKRQKLRQFIQ+MSFH  G+SYEKGYGV+S L
Subjt:  LVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISAL

Query:  LSRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVE
        LSRLIP+SNQ KFNN+ EKLQRLPGRCCPRL YEHHLNNSSSPC LN+S G +F HS FST+SDDNNFQ  YRTKEFDCDVE KMTLLDVN SPLTAAV+
Subjt:  LSRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVE

Query:  NYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDES-SSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCS
        NYRSLIS LFKPQYGLYDQGE L++RKQELEPLLLGWDTD+IKD S SSQ+TELNTFAE PI+FADDHQPNLHKSFGAVALCSSPFPSSNRRN+YSLP S
Subjt:  NYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDES-SSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCS

Query:  SLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQC
        SL SYQI GLSWHNVEKEEDIDATF N+HLNFSSVPKCL QCDNYV+D GS DFCAQSADW+M +VLDD+ +NPSVESLCAS IVFDFG+KYL   KEQC
Subjt:  SLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQC

Query:  QTAYHILKYPLDEMKL
        QTAYHIL+YP DEM L
Subjt:  QTAYHILKYPLDEMKL

TrEMBL top hitse value%identityAlignment
A0A1S3CFT3 uncharacterized protein LOC1035004656.7e-25478.41Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL
        DS RRKRTKV DLDRDRP TCRRDSSPISLKERHVTY  NAKTSEFAFFKK KEDA+HRFSSSLP QKELQSKKFNSSD FRE ASPVENR KDF SHHL
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHL

Query:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL
        VE VTPVNFNSLHL LGN SKIS+VDVKHAHKT +D QSKQRNVEND                DIFSRKRQKLRQFIQ+MSF  TG+SYEKGYGVIS LL
Subjt:  VENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALL

Query:  SRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVEN
        SRLIPE N YKFNNN EK+Q+L GRC PRL YEHHLNNS SPC LN S G    HS FST+S+DNNFQV+YRTKEFDCDV+RKMTLLDVNGSPLTAAVEN
Subjt:  SRLIPESNQYKFNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVEN

Query:  YRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSL
        YRS IS LF PQY LYDQ E LH+RKQ+LEPLLLGWDTD IKDESSSQ+TELNTFA+ PISFADDHQP LH+SFGAVALCSSPFPSSNR N  SLP S+L
Subjt:  YRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSL

Query:  DSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQCQT
         SYQI GLSW NV KEEDIDATF N+HLNFSSVPKCLHQC++YVDDGG HD CAQ+ADW+M +V++DE ++PSVESLCASG+VFDFG KYL   KEQCQT
Subjt:  DSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL---KEQCQT

Query:  AYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLS-WDAARSEIKV-DITEMNY
        +YHILKYPLDE++ TA +NE+ +NDSSDDVL DY P F+IQPESFFQEGKV S+ TDKLS WD  RSEI V DITEMNY
Subjt:  AYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLS-WDAARSEIKV-DITEMNY

A0A6J1GS11 uncharacterized protein LOC111456585 isoform X33.4e-22973.59Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFRSHH
        DS  RKR KV DLD  RP TCRRD+SP+SLK  HVT PNNAKTSEFAFFKK KEDAN RFSSSL  QKELQ KKFNS+D FR ERA  VENR +DF SH 
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFRSHH

Query:  LVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISAL
         VENVTP+NFNS+HL LGNSSKISEVDVKHAHKT KD QSKQRNVEND                DIFSRKRQKLRQFIQ+MSFH TG+SYEK YGVIS L
Subjt:  LVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISAL

Query:  LSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGS
        LSRLIPESNQYK       FNNN EKLQ LPGRC PRL YEH LNNSSSPC LNKS G +  HS FST++DD+NF V+YRTKEFD DVE KMTLLD N S
Subjt:  LSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGS

Query:  PLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNI
        P TAAVENYR LIS LF  QYG YDQGE LHIRKQE+EPLLLGWDTDDIKD+ SS+ TE +TFAEPPISFADDHQPNLH+SFGAVALCSSPFPSSN R++
Subjt:  PLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNI

Query:  YSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGG-SHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL
        YSLP SSL SYQIHGLS  NVEKEE IDAT  NVHLNFSSVPKCL QCDNYVDD G    FCAQSA+W M   LDDE R PS++S+CASG VFDFG KYL
Subjt:  YSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGG-SHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYL

Query:  ---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKVDITEMNYI
           KE CQTAYH+L+YPLDEM+ T+ +NE+CN DSS     +Y   FFIQPESFFQEGKV S+ TDKLSWD  RSEI V ITEM+Y+
Subjt:  ---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEGKVCSIHTDKLSWDAARSEIKVDITEMNYI

A0A6J1JQC1 uncharacterized protein LOC111487990 isoform X43.0e-23073.9Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYP---NNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFR
        DSI RKR KV DLD DRP TCRRD+SP+SLKE HVT P   NNAKTSEFAFFKK K DAN RFSSSL  QKELQSK+FNS+D FR ERA  VENR +DF 
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYP---NNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFR

Query:  SHHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVI
        SH  VENVTP+NFNS+HL LGNSSKISEVDVKHAHKT KD QSKQRNVEND                DIFSRKRQKLRQFIQ+MSFH TG+SYEK YGVI
Subjt:  SHHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVI

Query:  SALLSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDV
        S LLSRLIPESNQYK       FNNN EKLQ LPGRC PRL YEH LNNSSSPC LNKS G +  HS FST++DD+NF V+YRTK+FD DVE KMTLLD 
Subjt:  SALLSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDV

Query:  NGSPLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNR
        N SP TAAVENYRSLI+ LF PQYG YDQGE LHIRKQE+EPLLLGWDTDDIKD+ SS+VTE +TFAEPPISFADDHQPNL +SFGAVALCSSPFPSS  
Subjt:  NGSPLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNR

Query:  RNIYSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQK
        RN+Y LP SSLDSYQIHGLS HNVEKEE IDATF NVHLNFSSVPKCL QCDNYV+D G   FCAQSA+WLM    +DE R+PS+ES+CASG VFDFG K
Subjt:  RNIYSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQK

Query:  YL---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEG-KVCSIHTDKLSWDAARSEIKVDITEMNYI
        YL   KE CQTAYH+L+YPLDEM+ T+ +NE+CN DSS     +YR  FFIQPESFFQEG KV S+ TDKLSWD  RSEI V ITEM+Y+
Subjt:  YL---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEG-KVCSIHTDKLSWDAARSEIKVDITEMNYI

A0A6J1JSJ6 uncharacterized protein LOC111487990 isoform X31.4e-22771.13Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFRSHH
        DSI RKR KV DLD DRP TCRRD+SP+SLKE HVT PNNAKTSEFAFFKK K DAN RFSSSL  QKELQSK+FNS+D FR ERA  VENR +DF SH 
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFR-ERASPVENRYKDFRSHH

Query:  LVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEK--------
         VENVTP+NFNS+HL LGNSSKISEVDVKHAHKT KD QSKQRNVEND                DIFSRKRQKLRQFIQ+MSFH TG+SYEK        
Subjt:  LVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEK--------

Query:  ------------------GYGVISALLSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNN
                           YGVIS LLSRLIPESNQYK       FNNN EKLQ LPGRC PRL YEH LNNSSSPC LNKS G +  HS FST++DD+N
Subjt:  ------------------GYGVISALLSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNN

Query:  FQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDH
        F V+YRTK+FD DVE KMTLLD N SP TAAVENYRSLI+ LF PQYG YDQGE LHIRKQE+EPLLLGWDTDDIKD+ SS+VTE +TFAEPPISFADDH
Subjt:  FQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDH

Query:  QPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLD
        QPNL +SFGAVALCSSPFPSS  RN+Y LP SSLDSYQIHGLS HNVEKEE IDATF NVHLNFSSVPKCL QCDNYV+D G   FCAQSA+WLM    +
Subjt:  QPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSVLD

Query:  DERRNPSVESLCASGIVFDFGQKYL---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEG-KVCSIHTDKLSWDAAR
        DE R+PS+ES+CASG VFDFG KYL   KE CQTAYH+L+YPLDEM+ T+ +NE+CN DSS     +YR  FFIQPESFFQEG KV S+ TDKLSWD  R
Subjt:  DERRNPSVESLCASGIVFDFGQKYL---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEG-KVCSIHTDKLSWDAAR

Query:  SEIKVDITEMNYI
        SEI V ITEM+Y+
Subjt:  SEIKVDITEMNYI

A0A6J1JUE9 uncharacterized protein LOC111487990 isoform X22.4e-22770.89Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYP---NNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRS
        DSI RKR KV DLD DRP TCRRD+SP+SLKE HVT P   NNAKTSEFAFFKK K DAN RFSSSL  QKELQSK+FNS+D FRERA  VENR +DF S
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYP---NNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRS

Query:  HHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEK------
        H  VENVTP+NFNS+HL LGNSSKISEVDVKHAHKT KD QSKQRNVEND                DIFSRKRQKLRQFIQ+MSFH TG+SYEK      
Subjt:  HHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEK------

Query:  --------------------GYGVISALLSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDD
                             YGVIS LLSRLIPESNQYK       FNNN EKLQ LPGRC PRL YEH LNNSSSPC LNKS G +  HS FST++DD
Subjt:  --------------------GYGVISALLSRLIPESNQYK-------FNNNSEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDD

Query:  NNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFAD
        +NF V+YRTK+FD DVE KMTLLD N SP TAAVENYRSLI+ LF PQYG YDQGE LHIRKQE+EPLLLGWDTDDIKD+ SS+VTE +TFAEPPISFAD
Subjt:  NNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVENYRSLISGLFKPQYGLYDQGEDLHIRKQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFAD

Query:  DHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSV
        DHQPNL +SFGAVALCSSPFPSS  RN+Y LP SSLDSYQIHGLS HNVEKEE IDATF NVHLNFSSVPKCL QCDNYV+D G   FCAQSA+WLM   
Subjt:  DHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSLDSYQIHGLSWHNVEKEEDIDATF-NVHLNFSSVPKCLHQCDNYVDDGGSHDFCAQSADWLMYSV

Query:  LDDERRNPSVESLCASGIVFDFGQKYL---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEG-KVCSIHTDKLSWDA
         +DE R+PS+ES+CASG VFDFG KYL   KE CQTAYH+L+YPLDEM+ T+ +NE+CN DSS     +YR  FFIQPESFFQEG KV S+ TDKLSWD 
Subjt:  LDDERRNPSVESLCASGIVFDFGQKYL---KEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEG-KVCSIHTDKLSWDA

Query:  ARSEIKVDITEMNYI
         RSEI V ITEM+Y+
Subjt:  ARSEIKVDITEMNYI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G20250.1 unknown protein1.4e-0627.52Show/hide
Query:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANH--RFSSSLPSQKELQSKKFNSSDDFRERASPVE-------NR
        D  RR+R   +D D   P           L E       +AKTSEFAFFKKLK +  H    S S PS K  Q  K      F  RA P +       + 
Subjt:  DSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANH--RFSSSLPSQKELQSKKFNSSDDFRERASPVE-------NR

Query:  YKDFRSHHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEK
            +S+     +      S  LH    + +++ D    +     + S+ R     G++ +     +     DIFS KR+KL Q+++D    E  +    
Subjt:  YKDFRSHHLVENVTPVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEK

Query:  GYGVISALLSRLIPESNQ
        G+ ++S LL+RL P + +
Subjt:  GYGVISALLSRLIPESNQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAAATTTATAGATAGTATCCGTCGGAAGAGAACAAAAGTTGAAGATCTTGATCGTGATAGACCTTGGACATGCAGAAGAGATTCTTCTCCCATATCCTTGAAAGA
AAGACATGTTACATACCCAAATAATGCAAAAACTTCTGAGTTTGCATTTTTTAAGAAGTTGAAGGAAGATGCAAACCATAGATTCAGTTCATCTCTTCCAAGTCAAAAGG
AACTTCAATCCAAGAAGTTCAACTCGAGTGATGATTTCAGAGAGAGAGCAAGCCCTGTTGAAAACCGGTATAAAGACTTCAGATCACATCATCTTGTTGAGAACGTCACT
CCTGTTAACTTTAACTCCTTGCATTTACATCTGGGAAATTCATCCAAAATTTCAGAGGTAGATGTGAAACACGCTCATAAAACATTGAAGGATACACAGAGCAAACAGAG
AAACGTGGAAAATGATGGTAATTCTTTCAAAGAGAGATGGATAATCATGATAGCTATTCATGCAGATATTTTTAGTAGGAAGAGGCAGAAATTACGTCAGTTCATTCAGG
ATATGTCGTTCCATGAAACTGGCGACTCATATGAGAAGGGGTATGGTGTTATTTCCGCACTACTTAGCCGGCTTATACCTGAGAGCAATCAGTATAAGTTTAATAATAAC
TCGGAAAAGCTACAAAGGTTGCCTGGAAGGTGCTGTCCAAGGCTTGGTTATGAGCATCATTTGAATAATAGTTCATCACCTTGTAGTTTGAATAAATCAAGTGGAAGTCT
TTTTCCCCATTCTGTTTTCTCAACCAGTAGCGATGATAACAATTTCCAAGTACAGTACAGAACCAAGGAGTTTGACTGTGATGTAGAAAGAAAAATGACTTTGCTGGATG
TCAATGGTTCTCCTCTTACTGCTGCAGTTGAAAACTATAGATCACTCATTTCCGGCCTTTTCAAACCGCAATATGGTTTATATGATCAAGGTGAAGATTTGCACATAAGA
AAGCAAGAGCTAGAACCTCTTCTGCTAGGTTGGGATACGGACGACATAAAAGATGAAAGTTCTTCCCAAGTTACAGAATTGAACACATTTGCCGAGCCACCAATTTCATT
CGCTGATGATCATCAGCCAAACTTGCACAAGAGTTTTGGTGCTGTTGCGCTGTGTTCATCCCCTTTCCCTTCTAGTAATCGTAGAAACATATACTCATTGCCATGCTCCA
GTTTAGACAGCTATCAAATTCACGGATTAAGTTGGCATAATGTAGAAAAGGAGGAAGATATAGATGCCACCTTCAACGTGCATTTGAATTTCTCATCTGTACCCAAATGT
CTTCATCAGTGTGATAACTATGTCGATGATGGAGGCAGCCATGACTTCTGTGCACAAAGTGCTGATTGGCTTATGTATAGTGTGTTGGATGACGAACGCCGAAATCCTTC
TGTAGAAAGTCTGTGTGCTTCTGGCATCGTCTTTGATTTTGGACAGAAATATCTAAAGGAGCAATGCCAAACAGCTTATCATATACTTAAGTACCCACTGGATGAAATGA
AACTTACAGCCTTTCTCAATGAAGATTGTAATAATGACAGTTCAGATGATGTCCTCGCGGATTATCGGCCGACCTTCTTTATCCAACCTGAGTCATTCTTTCAAGAAGGG
AAGGTATGCTCCATACATACTGATAAACTTAGCTGGGATGCAGCCAGAAGTGAAATAAAGGTTGATATAACTGAAATGAATTACATATTATAG
mRNA sequenceShow/hide mRNA sequence
ATGATCAAATTTATAGATAGTATCCGTCGGAAGAGAACAAAAGTTGAAGATCTTGATCGTGATAGACCTTGGACATGCAGAAGAGATTCTTCTCCCATATCCTTGAAAGA
AAGACATGTTACATACCCAAATAATGCAAAAACTTCTGAGTTTGCATTTTTTAAGAAGTTGAAGGAAGATGCAAACCATAGATTCAGTTCATCTCTTCCAAGTCAAAAGG
AACTTCAATCCAAGAAGTTCAACTCGAGTGATGATTTCAGAGAGAGAGCAAGCCCTGTTGAAAACCGGTATAAAGACTTCAGATCACATCATCTTGTTGAGAACGTCACT
CCTGTTAACTTTAACTCCTTGCATTTACATCTGGGAAATTCATCCAAAATTTCAGAGGTAGATGTGAAACACGCTCATAAAACATTGAAGGATACACAGAGCAAACAGAG
AAACGTGGAAAATGATGGTAATTCTTTCAAAGAGAGATGGATAATCATGATAGCTATTCATGCAGATATTTTTAGTAGGAAGAGGCAGAAATTACGTCAGTTCATTCAGG
ATATGTCGTTCCATGAAACTGGCGACTCATATGAGAAGGGGTATGGTGTTATTTCCGCACTACTTAGCCGGCTTATACCTGAGAGCAATCAGTATAAGTTTAATAATAAC
TCGGAAAAGCTACAAAGGTTGCCTGGAAGGTGCTGTCCAAGGCTTGGTTATGAGCATCATTTGAATAATAGTTCATCACCTTGTAGTTTGAATAAATCAAGTGGAAGTCT
TTTTCCCCATTCTGTTTTCTCAACCAGTAGCGATGATAACAATTTCCAAGTACAGTACAGAACCAAGGAGTTTGACTGTGATGTAGAAAGAAAAATGACTTTGCTGGATG
TCAATGGTTCTCCTCTTACTGCTGCAGTTGAAAACTATAGATCACTCATTTCCGGCCTTTTCAAACCGCAATATGGTTTATATGATCAAGGTGAAGATTTGCACATAAGA
AAGCAAGAGCTAGAACCTCTTCTGCTAGGTTGGGATACGGACGACATAAAAGATGAAAGTTCTTCCCAAGTTACAGAATTGAACACATTTGCCGAGCCACCAATTTCATT
CGCTGATGATCATCAGCCAAACTTGCACAAGAGTTTTGGTGCTGTTGCGCTGTGTTCATCCCCTTTCCCTTCTAGTAATCGTAGAAACATATACTCATTGCCATGCTCCA
GTTTAGACAGCTATCAAATTCACGGATTAAGTTGGCATAATGTAGAAAAGGAGGAAGATATAGATGCCACCTTCAACGTGCATTTGAATTTCTCATCTGTACCCAAATGT
CTTCATCAGTGTGATAACTATGTCGATGATGGAGGCAGCCATGACTTCTGTGCACAAAGTGCTGATTGGCTTATGTATAGTGTGTTGGATGACGAACGCCGAAATCCTTC
TGTAGAAAGTCTGTGTGCTTCTGGCATCGTCTTTGATTTTGGACAGAAATATCTAAAGGAGCAATGCCAAACAGCTTATCATATACTTAAGTACCCACTGGATGAAATGA
AACTTACAGCCTTTCTCAATGAAGATTGTAATAATGACAGTTCAGATGATGTCCTCGCGGATTATCGGCCGACCTTCTTTATCCAACCTGAGTCATTCTTTCAAGAAGGG
AAGGTATGCTCCATACATACTGATAAACTTAGCTGGGATGCAGCCAGAAGTGAAATAAAGGTTGATATAACTGAAATGAATTACATATTATAG
Protein sequenceShow/hide protein sequence
MIKFIDSIRRKRTKVEDLDRDRPWTCRRDSSPISLKERHVTYPNNAKTSEFAFFKKLKEDANHRFSSSLPSQKELQSKKFNSSDDFRERASPVENRYKDFRSHHLVENVT
PVNFNSLHLHLGNSSKISEVDVKHAHKTLKDTQSKQRNVENDGNSFKERWIIMIAIHADIFSRKRQKLRQFIQDMSFHETGDSYEKGYGVISALLSRLIPESNQYKFNNN
SEKLQRLPGRCCPRLGYEHHLNNSSSPCSLNKSSGSLFPHSVFSTSSDDNNFQVQYRTKEFDCDVERKMTLLDVNGSPLTAAVENYRSLISGLFKPQYGLYDQGEDLHIR
KQELEPLLLGWDTDDIKDESSSQVTELNTFAEPPISFADDHQPNLHKSFGAVALCSSPFPSSNRRNIYSLPCSSLDSYQIHGLSWHNVEKEEDIDATFNVHLNFSSVPKC
LHQCDNYVDDGGSHDFCAQSADWLMYSVLDDERRNPSVESLCASGIVFDFGQKYLKEQCQTAYHILKYPLDEMKLTAFLNEDCNNDSSDDVLADYRPTFFIQPESFFQEG
KVCSIHTDKLSWDAARSEIKVDITEMNYIL