; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G14890 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G14890
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionNopRA1 domain-containing protein
Genome locationChr7:13537519..13542485
RNA-Seq ExpressionCSPI07G14890
SyntenyCSPI07G14890
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR039844 - Nucleolar pre-ribosomal-associated protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458087.1 PREDICTED: uncharacterized protein LOC103497624 isoform X1 [Cucumis melo]5.4e-22596.9Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVENGLFSWLCSI+STSSRRLTEDQKSIF KQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDP RCSGFLSWAVSTALEFDSRM+A ESHLG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
        LISESDEEH DESLTSKLLRWLSASAILGKVSLKF CM+LRTSERLS  TLYSLLEHVKNTRD NSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA

Query:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW
        LCLLLFDALISADLFHS+GADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQ LLPQDIEISRVFEW
Subjt:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW

Query:  ERNLIRTQDSNPQQKKVSL
        ERNLIRTQDSNPQQKKVSL
Subjt:  ERNLIRTQDSNPQQKKVSL

XP_008458089.1 PREDICTED: uncharacterized protein LOC103497624 isoform X3 [Cucumis melo]5.4e-22596.9Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVENGLFSWLCSI+STSSRRLTEDQKSIF KQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDP RCSGFLSWAVSTALEFDSRM+A ESHLG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
        LISESDEEH DESLTSKLLRWLSASAILGKVSLKF CM+LRTSERLS  TLYSLLEHVKNTRD NSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA

Query:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW
        LCLLLFDALISADLFHS+GADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQ LLPQDIEISRVFEW
Subjt:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW

Query:  ERNLIRTQDSNPQQKKVSL
        ERNLIRTQDSNPQQKKVSL
Subjt:  ERNLIRTQDSNPQQKKVSL

XP_011659212.1 uncharacterized protein LOC101215477 isoform X1 [Cucumis sativus]8.3e-234100Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISAL
        LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISAL
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISAL

Query:  CLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWE
        CLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWE
Subjt:  CLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWE

Query:  RNLIRTQDSNPQQKKVSL
        RNLIRTQDSNPQQKKVSL
Subjt:  RNLIRTQDSNPQQKKVSL

XP_011659213.1 uncharacterized protein LOC101215477 isoform X2 [Cucumis sativus]8.3e-234100Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISAL
        LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISAL
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISAL

Query:  CLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWE
        CLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWE
Subjt:  CLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWE

Query:  RNLIRTQDSNPQQKKVSL
        RNLIRTQDSNPQQKKVSL
Subjt:  RNLIRTQDSNPQQKKVSL

XP_038897459.1 uncharacterized protein LOC120085516 isoform X1 [Benincasa hispida]4.3e-19889.08Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVENGLFSWLCSIIS SS   TEDQKSIF KQL LVLEVVNNVISFRNI +WLQKDALEQLMEFSSNIFKIL+GGE+LLLIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        NQIL+IITSVLRISQKRKI+QPH+TFSIEGLFHIYQAV+KLDCT  GSNSA  LKMILMNMPQISLLRMD KRCS FLSWAVSTALEFDSR+IAKESHL 
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
        L+SESDEEHFDESLTSKLLRWLSASAILGKVS K D  +LRTSER S  TLYSLLEHVKNTRDDNSLQEFGCE LLAANIFYL QHLQSSFMVLP+ +SA
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA

Query:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW
        LCLLL D LISA LFHS GADLAQ LSKIRCPEEVNPAWRWTFYQPWKDYSLELT+LQKMDEVHACQTLQLVISNILSKKP DLQVLLPQDIEISRVFEW
Subjt:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW

Query:  ERNLIRTQDSNP
        ERNLI TQ+SNP
Subjt:  ERNLIRTQDSNP

TrEMBL top hitse value%identityAlignment
A0A0A0K4N8 Uncharacterized protein1.2e-233100Show/hide
Query:  MKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALVN
        MKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALVN
Subjt:  MKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALVN

Query:  QILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLGL
        QILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLGL
Subjt:  QILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLGL

Query:  ISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISALC
        ISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISALC
Subjt:  ISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISALC

Query:  LLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWER
        LLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWER
Subjt:  LLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWER

Query:  NLIRTQDSNPQQKKVSL
        NLIRTQDSNPQQKKVSL
Subjt:  NLIRTQDSNPQQKKVSL

A0A1S3C6K1 uncharacterized protein LOC103497624 isoform X32.6e-22596.9Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVENGLFSWLCSI+STSSRRLTEDQKSIF KQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDP RCSGFLSWAVSTALEFDSRM+A ESHLG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
        LISESDEEH DESLTSKLLRWLSASAILGKVSLKF CM+LRTSERLS  TLYSLLEHVKNTRD NSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA

Query:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW
        LCLLLFDALISADLFHS+GADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQ LLPQDIEISRVFEW
Subjt:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW

Query:  ERNLIRTQDSNPQQKKVSL
        ERNLIRTQDSNPQQKKVSL
Subjt:  ERNLIRTQDSNPQQKKVSL

A0A1S3C7M7 uncharacterized protein LOC103497624 isoform X12.6e-22596.9Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVENGLFSWLCSI+STSSRRLTEDQKSIF KQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDP RCSGFLSWAVSTALEFDSRM+A ESHLG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
        LISESDEEH DESLTSKLLRWLSASAILGKVSLKF CM+LRTSERLS  TLYSLLEHVKNTRD NSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA

Query:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW
        LCLLLFDALISADLFHS+GADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQ LLPQDIEISRVFEW
Subjt:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW

Query:  ERNLIRTQDSNPQQKKVSL
        ERNLIRTQDSNPQQKKVSL
Subjt:  ERNLIRTQDSNPQQKKVSL

A0A6J1DRJ3 uncharacterized protein LOC111023605 isoform X12.2e-18782.52Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        +MKKSVKLQRMAFYLVE+GL SWLCSIIST  RR T+DQKS F KQL LVLEVVNNVISFRNICEWLQKDALEQLMEFSSN+FKIL+G EQ   IEG LV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        N ILQIITSVLRISQKRKI+QPH+T SIEGLF++YQAVH+LDC  LG NSA GLK+ILMNMPQI+LLRMDPK+CS FLSWA+STALE DSRM+AKES LG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERL-SGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
         +SE DEEH DESLTSKLLRWLSAS I+G++S K D ++L TSERL + TLYSLLEH+KNT DD+SLQEFGCE LLAANIFYLQQHL+SSF VLPVVI+A
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERL-SGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA

Query:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW
        LCLLLFDALISA LFH+ GADLAQ LSKIRCPEEVNPAWRWTFYQPWKDYSLELT+LQK+DE+HACQTLQ+VISNILSKKPLDLQVLLPQDIEIS VF+W
Subjt:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW

Query:  ERNLIRTQDSNP
        ER+LIRTQ+SNP
Subjt:  ERNLIRTQDSNP

A0A6J1H985 uncharacterized protein LOC1114607771.3e-18983.74Show/hide
Query:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV
        VMKKSVKLQRMAFYLVE+GLFSWLCSIISTSSRRL EDQKS F KQL LVLEVVNNVISFRNICEWLQKDALEQLMEFSS +FK+L+GGE+  LIEGALV
Subjt:  VMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGALV

Query:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG
        N++LQIITSVLRISQKRK++QPH+T SIEGLFHIYQAVH+LD TRLGSNSA+GLK+ILMN+PQ +LL +  K+CS FLSWA+STALE DSRMI KESHLG
Subjt:  NQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLG

Query:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA
        L+SESDEEHFDESLTSKLLRWLSAS I G++S K D ++L T+E+ S  TLYSLLEHVKNT DD+SL EFGCE LLAANIFYLQQHL+SSFMVLPVVISA
Subjt:  LISESDEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSG-TLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISA

Query:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW
        LCLLL DALISA LFHS GADLAQ LSKIRCPEEVNPAWRWTFYQPWKDYSLELT+LQK+DEVHACQTLQ+VISNILSKKPLD Q LLPQD EISRVFEW
Subjt:  LCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEW

Query:  ERNLIRTQDSNP
        ERNL+RTQ+SNP
Subjt:  ERNLIRTQDSNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)7.9e-5736.67Show/hide
Query:  VMKKSVKLQRMAFYLVEN-GLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGAL
        V++KSVKL +MA +LVEN GL SW  S  S  + + T D+ S F     +VLEV+ + ++ RN  EW Q+ ALE LME SS ++ +L  G   +   G  
Subjt:  VMKKSVKLQRMAFYLVEN-GLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGAL

Query:  VNQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHL
              I+++ L+IS KRK  QPHFT +IEG+F +++A    D  ++ +++   L  ILM+ P + ++ MD  R   FL W  STAL+ D   + K S  
Subjt:  VNQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHL

Query:  GLISESDEEHFDESLTSKLLRWLSASAILGKV-SLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHL--QSSFMVLPVV
        G   +  + H +E++ +K LRWL AS ILGK+ S   D   +  SE    TL +LLE++K      S+     E ++   I YLQ+HL  ++  ++LP V
Subjt:  GLISESDEEHFDESLTSKLLRWLSASAILGKV-SLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHL--QSSFMVLPVV

Query:  ISALCLLLFDALISADLFHSEGA--DLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQ--VLLPQDIE
        + AL L++    +      SEG    +    S+I  P E  P WRW+++Q     S   T+ +++DE++ACQ L L+ S++L + P + Q  +LL +  +
Subjt:  ISALCLLLFDALISADLFHSEGA--DLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQ--VLLPQDIE

Query:  ISRVFEWER
        +S VFEWER
Subjt:  ISRVFEWER

AT4G27010.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)2.4e-6940Show/hide
Query:  VMKKSVKLQRMAFYLVEN-GLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGAL
        V++KSVK  ++A +LVEN GLFSW  S IS  + +   D+       L +VLE++ +V++ RNI EWLQ+  LE LME SS ++K+L GG   +   G  
Subjt:  VMKKSVKLQRMAFYLVEN-GLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGAL

Query:  VNQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAK--ES
        V+ ILQI+++ L+ISQKR ++QPHFT +IEG+F +++ V      ++ +++ SGL  ILM+ P + +L MD  +   FL W  STAL+ D +  +K  ES
Subjt:  VNQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAK--ES

Query:  H--LGLISESDEEHFDESLTSKLLRWLSASAILGK-VSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFM-VL
        H    ++ E  +E   E++ +K LRWLSAS ILGK  S   D      S+    TL + LE+ K    ++S+Q    E ++   I +LQQ L +++M +L
Subjt:  H--LGLISESDEEHFDESLTSKLLRWLSASAILGK-VSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFM-VL

Query:  PVVISALCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLE-LTNLQKMDEVHACQTLQLVISNILSKKPLD-LQVLLPQDI
        P V+ AL L+L    +       +   +    SKI  P E  P WRW++YQ W+D S E  T+L K++E+HACQ L L+ S +L + P +  QVLL +  
Subjt:  PVVISALCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLE-LTNLQKMDEVHACQTLQLVISNILSKKPLD-LQVLLPQDI

Query:  EISRVFEWERNLIRT
        ++S VFEWER+L+ T
Subjt:  EISRVFEWERNLIRT

AT4G27010.2 INVOLVED IN: biological_process unknown2.4e-6940Show/hide
Query:  VMKKSVKLQRMAFYLVEN-GLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGAL
        V++KSVK  ++A +LVEN GLFSW  S IS  + +   D+       L +VLE++ +V++ RNI EWLQ+  LE LME SS ++K+L GG   +   G  
Subjt:  VMKKSVKLQRMAFYLVEN-GLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLIEGAL

Query:  VNQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAK--ES
        V+ ILQI+++ L+ISQKR ++QPHFT +IEG+F +++ V      ++ +++ SGL  ILM+ P + +L MD  +   FL W  STAL+ D +  +K  ES
Subjt:  VNQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAK--ES

Query:  H--LGLISESDEEHFDESLTSKLLRWLSASAILGK-VSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFM-VL
        H    ++ E  +E   E++ +K LRWLSAS ILGK  S   D      S+    TL + LE+ K    ++S+Q    E ++   I +LQQ L +++M +L
Subjt:  H--LGLISESDEEHFDESLTSKLLRWLSASAILGK-VSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFM-VL

Query:  PVVISALCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLE-LTNLQKMDEVHACQTLQLVISNILSKKPLD-LQVLLPQDI
        P V+ AL L+L    +       +   +    SKI  P E  P WRW++YQ W+D S E  T+L K++E+HACQ L L+ S +L + P +  QVLL +  
Subjt:  PVVISALCLLLFDALISADLFHSEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLE-LTNLQKMDEVHACQTLQLVISNILSKKPLD-LQVLLPQDI

Query:  EISRVFEWERNLIRT
        ++S VFEWER+L+ T
Subjt:  EISRVFEWERNLIRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCACAATGTAACGAGGGAGTTTATCGAATCCCTAGCTTCTTGGTGTGGTGGTCTTTCTAAAAAAGAACCTTCTTTTAAGTCTAATTGGTTAAAGGAAGTGAGCTT
TAAGGCAAACGACAATGAAGATGGGTTTATTTTGGTCGAATGTATTGTGAATCACCAGGGGTTCTCCTTTTCGGTGAGAAGAGGAAGGATAGGGGAAGTCACGTTGAATT
CCACTAATAGTGATGGGCCGGCGGTGGTTAAAGGCCACAATGGGCATAAGAAGGGTAAAGACAAGCCATTGGGTACTGTTGATTTGGTCTCTCAAGAAATCCCTGTTGAG
CAAAGGAAATATTCAAAGTGGGCTGTCGAGCATTTGGTGACTGAAAATTTACCAGCTAAGGAAGGTGACGAGCATCCGAGGGTTCTTTTGGAAAAGAAGTTTAGATTTGG
GCCATTGGATTTGGTTTTTGACAATGGAGATTTTGATTCGTACTTGTCGATTGTAAGTTATTATGATGATTATGAGTTTTTGTTGACTCCAGTTAGAAAGAATTCTTCGA
AGGTGGAGATGGAAGATTTGTTTGAAAATGGGGATGGTTCAAGAGAGTTTTCATCTTCGCTCGTGTTGATCAAGAAGTTTACTGAGCAAGTTATTCCTTTCAAGGGGGCT
TCAGAGGGTGACCCTGTTAAGAGTATTCCTAAAGAGGACTTGTCAAAGTTGGATGTTTGCATTGTGAAGACTTTGAAATCGCCTTTAGCAAGTGGGTTTTCTCCCAACTC
CAATCAAGTGGAGAGCTCGTGTGGTGGGACAGATCAGTTGAATGTTTTCTCTCCTTCGGGTTGTGCCTTATCCCTTGAGCCTGTCTATTCTTTAGGAGACAGGTGGCCAG
AAGAAGTGAGTTTTGCAGTTGGAGGATTGCTAGTATTTGGAAGCATGGGAAGGTGGAGTAGAAAGTTCTTGAGGTTGAAGGTGGGTGGGATTCTGATGCTGTGGAACTCA
AGATCGTGTGTTGCCATTAAAGTGATTCCTAGTAGACATTCCGTTACTATTGCTTTTCTGGATGGTGAAGAGTTTTGGATGGTGGTGATGGGTAATTTTAATGTTGTTCA
GTTTCCAAATGAGAGTTTTGGAGAGACCCTCCTCTTGTCGGGGGCAAATTTACTTGGGCCAATAGTAGGGTGGCTATTGATAGGGTCCTTATGTCGGAAGCTTGGATTGA
GAGGTTTGGGAACCCTAGTCCTTTTTCGTTTCGAGAATATGTGGTTTGATCATCCTTCCTTTGAAGCGAATATATTTCTAGGATGGTGGGTTATACCAATTACTGTGGAT
GAAGCGAATATATTTCTAGGATGGTGGGCTCGTCCTGCTGGGACGGTAATGAAGAAGTCTGTTAAGTTGCAAAGGATGGCATTTTATTTGGTGGAAAATGGCCTGTTTTC
ATGGTTATGCTCTATCATTTCAACCTCTAGCAGGAGACTTACTGAAGATCAGAAATCCATTTTTCCGAAGCAGCTGGCTTTGGTATTAGAGGTTGTCAATAATGTCATTT
CATTCAGAAACATCTGTGAGTGGTTGCAAAAAGATGCCCTCGAGCAGCTAATGGAATTTTCATCTAATATTTTCAAAATCCTGGTTGGTGGTGAGCAATTGCTGCTAATA
GAAGGGGCACTTGTTAATCAAATTTTGCAGATAATAACATCCGTGCTCAGAATATCACAGAAAAGAAAAATTTTCCAGCCCCATTTTACGTTTTCCATTGAGGGCCTGTT
CCACATTTATCAGGCTGTTCACAAATTAGATTGTACAAGACTGGGTTCAAATTCAGCTAGCGGACTCAAAATGATACTGATGAACATGCCACAAATATCACTTCTACGCA
TGGACCCAAAGAGGTGCTCAGGTTTTCTGTCATGGGCAGTTTCCACCGCTCTAGAGTTTGATTCTAGAATGATAGCCAAAGAATCTCATTTGGGTTTAATAAGTGAATCT
GATGAAGAGCATTTTGACGAATCCTTGACATCAAAGCTTTTACGTTGGTTATCTGCTTCTGCAATTCTTGGAAAGGTTTCCCTGAAATTTGATTGCATGCATCTCCGAAC
TTCAGAAAGATTGAGTGGAACGCTCTACTCTCTATTGGAACATGTTAAAAATACACGTGATGACAATAGTCTACAAGAATTTGGTTGTGAGGGGCTCTTAGCTGCAAATA
TTTTCTATCTTCAACAACATCTTCAGAGTAGTTTCATGGTGCTCCCCGTGGTGATATCTGCCCTTTGTCTCCTCCTTTTTGATGCTCTCATTTCTGCAGATCTCTTTCAT
AGTGAAGGAGCTGATTTGGCACAACATTTGTCAAAGATACGTTGTCCGGAAGAAGTTAATCCTGCATGGAGATGGACGTTCTACCAACCATGGAAAGATTATTCATTGGA
ATTAACAAATTTGCAGAAGATGGATGAAGTTCATGCATGTCAAACTCTTCAACTCGTGATCTCAAATATTCTGAGTAAAAAGCCTCTCGATCTTCAAGTTTTGTTACCTC
AAGACATTGAGATTTCACGAGTATTTGAGTGGGAGAGAAATCTCATAAGAACCCAAGATTCAAATCCTCAACAGAAAAAGGTTAGTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTCACAATGTAACGAGGGAGTTTATCGAATCCCTAGCTTCTTGGTGTGGTGGTCTTTCTAAAAAAGAACCTTCTTTTAAGTCTAATTGGTTAAAGGAAGTGAGCTT
TAAGGCAAACGACAATGAAGATGGGTTTATTTTGGTCGAATGTATTGTGAATCACCAGGGGTTCTCCTTTTCGGTGAGAAGAGGAAGGATAGGGGAAGTCACGTTGAATT
CCACTAATAGTGATGGGCCGGCGGTGGTTAAAGGCCACAATGGGCATAAGAAGGGTAAAGACAAGCCATTGGGTACTGTTGATTTGGTCTCTCAAGAAATCCCTGTTGAG
CAAAGGAAATATTCAAAGTGGGCTGTCGAGCATTTGGTGACTGAAAATTTACCAGCTAAGGAAGGTGACGAGCATCCGAGGGTTCTTTTGGAAAAGAAGTTTAGATTTGG
GCCATTGGATTTGGTTTTTGACAATGGAGATTTTGATTCGTACTTGTCGATTGTAAGTTATTATGATGATTATGAGTTTTTGTTGACTCCAGTTAGAAAGAATTCTTCGA
AGGTGGAGATGGAAGATTTGTTTGAAAATGGGGATGGTTCAAGAGAGTTTTCATCTTCGCTCGTGTTGATCAAGAAGTTTACTGAGCAAGTTATTCCTTTCAAGGGGGCT
TCAGAGGGTGACCCTGTTAAGAGTATTCCTAAAGAGGACTTGTCAAAGTTGGATGTTTGCATTGTGAAGACTTTGAAATCGCCTTTAGCAAGTGGGTTTTCTCCCAACTC
CAATCAAGTGGAGAGCTCGTGTGGTGGGACAGATCAGTTGAATGTTTTCTCTCCTTCGGGTTGTGCCTTATCCCTTGAGCCTGTCTATTCTTTAGGAGACAGGTGGCCAG
AAGAAGTGAGTTTTGCAGTTGGAGGATTGCTAGTATTTGGAAGCATGGGAAGGTGGAGTAGAAAGTTCTTGAGGTTGAAGGTGGGTGGGATTCTGATGCTGTGGAACTCA
AGATCGTGTGTTGCCATTAAAGTGATTCCTAGTAGACATTCCGTTACTATTGCTTTTCTGGATGGTGAAGAGTTTTGGATGGTGGTGATGGGTAATTTTAATGTTGTTCA
GTTTCCAAATGAGAGTTTTGGAGAGACCCTCCTCTTGTCGGGGGCAAATTTACTTGGGCCAATAGTAGGGTGGCTATTGATAGGGTCCTTATGTCGGAAGCTTGGATTGA
GAGGTTTGGGAACCCTAGTCCTTTTTCGTTTCGAGAATATGTGGTTTGATCATCCTTCCTTTGAAGCGAATATATTTCTAGGATGGTGGGTTATACCAATTACTGTGGAT
GAAGCGAATATATTTCTAGGATGGTGGGCTCGTCCTGCTGGGACGGTAATGAAGAAGTCTGTTAAGTTGCAAAGGATGGCATTTTATTTGGTGGAAAATGGCCTGTTTTC
ATGGTTATGCTCTATCATTTCAACCTCTAGCAGGAGACTTACTGAAGATCAGAAATCCATTTTTCCGAAGCAGCTGGCTTTGGTATTAGAGGTTGTCAATAATGTCATTT
CATTCAGAAACATCTGTGAGTGGTTGCAAAAAGATGCCCTCGAGCAGCTAATGGAATTTTCATCTAATATTTTCAAAATCCTGGTTGGTGGTGAGCAATTGCTGCTAATA
GAAGGGGCACTTGTTAATCAAATTTTGCAGATAATAACATCCGTGCTCAGAATATCACAGAAAAGAAAAATTTTCCAGCCCCATTTTACGTTTTCCATTGAGGGCCTGTT
CCACATTTATCAGGCTGTTCACAAATTAGATTGTACAAGACTGGGTTCAAATTCAGCTAGCGGACTCAAAATGATACTGATGAACATGCCACAAATATCACTTCTACGCA
TGGACCCAAAGAGGTGCTCAGGTTTTCTGTCATGGGCAGTTTCCACCGCTCTAGAGTTTGATTCTAGAATGATAGCCAAAGAATCTCATTTGGGTTTAATAAGTGAATCT
GATGAAGAGCATTTTGACGAATCCTTGACATCAAAGCTTTTACGTTGGTTATCTGCTTCTGCAATTCTTGGAAAGGTTTCCCTGAAATTTGATTGCATGCATCTCCGAAC
TTCAGAAAGATTGAGTGGAACGCTCTACTCTCTATTGGAACATGTTAAAAATACACGTGATGACAATAGTCTACAAGAATTTGGTTGTGAGGGGCTCTTAGCTGCAAATA
TTTTCTATCTTCAACAACATCTTCAGAGTAGTTTCATGGTGCTCCCCGTGGTGATATCTGCCCTTTGTCTCCTCCTTTTTGATGCTCTCATTTCTGCAGATCTCTTTCAT
AGTGAAGGAGCTGATTTGGCACAACATTTGTCAAAGATACGTTGTCCGGAAGAAGTTAATCCTGCATGGAGATGGACGTTCTACCAACCATGGAAAGATTATTCATTGGA
ATTAACAAATTTGCAGAAGATGGATGAAGTTCATGCATGTCAAACTCTTCAACTCGTGATCTCAAATATTCTGAGTAAAAAGCCTCTCGATCTTCAAGTTTTGTTACCTC
AAGACATTGAGATTTCACGAGTATTTGAGTGGGAGAGAAATCTCATAAGAACCCAAGATTCAAATCCTCAACAGAAAAAGGTTAGTCTTTAG
Protein sequenceShow/hide protein sequence
MGHNVTREFIESLASWCGGLSKKEPSFKSNWLKEVSFKANDNEDGFILVECIVNHQGFSFSVRRGRIGEVTLNSTNSDGPAVVKGHNGHKKGKDKPLGTVDLVSQEIPVE
QRKYSKWAVEHLVTENLPAKEGDEHPRVLLEKKFRFGPLDLVFDNGDFDSYLSIVSYYDDYEFLLTPVRKNSSKVEMEDLFENGDGSREFSSSLVLIKKFTEQVIPFKGA
SEGDPVKSIPKEDLSKLDVCIVKTLKSPLASGFSPNSNQVESSCGGTDQLNVFSPSGCALSLEPVYSLGDRWPEEVSFAVGGLLVFGSMGRWSRKFLRLKVGGILMLWNS
RSCVAIKVIPSRHSVTIAFLDGEEFWMVVMGNFNVVQFPNESFGETLLLSGANLLGPIVGWLLIGSLCRKLGLRGLGTLVLFRFENMWFDHPSFEANIFLGWWVIPITVD
EANIFLGWWARPAGTVMKKSVKLQRMAFYLVENGLFSWLCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIFKILVGGEQLLLI
EGALVNQILQIITSVLRISQKRKIFQPHFTFSIEGLFHIYQAVHKLDCTRLGSNSASGLKMILMNMPQISLLRMDPKRCSGFLSWAVSTALEFDSRMIAKESHLGLISES
DEEHFDESLTSKLLRWLSASAILGKVSLKFDCMHLRTSERLSGTLYSLLEHVKNTRDDNSLQEFGCEGLLAANIFYLQQHLQSSFMVLPVVISALCLLLFDALISADLFH
SEGADLAQHLSKIRCPEEVNPAWRWTFYQPWKDYSLELTNLQKMDEVHACQTLQLVISNILSKKPLDLQVLLPQDIEISRVFEWERNLIRTQDSNPQQKKVSL