CuGenDBv2

CSPI04G13000 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G13000
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Protein of unknown function (DUF620)
Genome location: Chr4:11218347..11221359
RNA-Seq Expression: CSPI04G13000
Synteny: CSPI04G13000
Gene Ontology terms: NA
InterPro domains: IPR006873 - Protein of unknown function DUF620
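
The genome location above follows a `Chr:start..end` layout. The following is a minimal Python sketch for splitting such a string into its parts. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr4:11218347..11221359' (hypothetical helper)."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(chrom, start, end)


loc = parse_location("Chr4:11218347..11221359")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 3013 bases on Chr4. With half-open coordinates it would be one base shorter.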


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6572489.1 hypothetical protein SDJN03_29217, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.8e-215; % identity: 90.12
Query:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
        MATAV IG GGSSSSSSSS   R +QIWYSQPLTPLMEGPDPQFQDQE NKKDSS SNWEFLRDWFKIQRNL PS+  SSFTN+PNSKTQDLKLLLGVLA
Subjt:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA

Query:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
        CPLAPIPLHS+++ P  S+FP   PLETSV HYIIQQYLAATGCLKQQKCAKNMY TGSV+MIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLP MWSL
Subjt:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
Subjt:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
        CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEE+WSIDDVMFNVAGLS+DYF+PPADI D
Subjt:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD

Query:  SLHSHSHPHSHSHSP
        +LHSHS     SHSP
Subjt:  SLHSHSHPHSHSHSP

XP_004142167.1 uncharacterized protein LOC101217200 [Cucumis sativus]; e value: 1.4e-242; % identity: 100
Query:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
        MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
Subjt:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA

Query:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
        CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
Subjt:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
Subjt:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
        CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
Subjt:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD

Query:  SLHSHSHPHSHSHSP
        SLHSHSHPHSHSHSP
Subjt:  SLHSHSHPHSHSHSP

XP_008449811.1 PREDICTED: uncharacterized protein LOC103491587 [Cucumis melo]; e value: 8.5e-237; % identity: 98.8
Query:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
        MATAVEIGNGG  SSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
Subjt:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA

Query:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
        CPLAPIPLHS NSPPQTS+FPPHIPLETSV HYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
Subjt:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
Subjt:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
        CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
Subjt:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD

Query:  SLHSHSHPHSHSHSP
        SLHSHSHPHSHSHSP
Subjt:  SLHSHSHPHSHSHSP

XP_022969378.1 uncharacterized protein LOC111468400 [Cucurbita maxima]; e value: 3.5e-214; % identity: 90.67
Query:  MATAVEIGNGGSSSSSSSSKSRRS--KQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGV
        MATAV IG GGSSSSSSSS S +S  +QIWYSQPLTPLMEGPDPQFQDQE NKKDSS SNWEFLRDWFKIQRNL PS+  SSFTN+PNSKTQDLKLLLGV
Subjt:  MATAVEIGNGGSSSSSSSSKSRRS--KQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGV

Query:  LACPLAPIPLHS-NNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAM
        LACPLAPIPLHS +NSPPQ S+FP   PLETSV HYIIQQYLAATGCLKQQKCAKNMY TGSVKMIRCETEVSSGKSVK VGTRCEDTGCFVLWQMLP M
Subjt:  LACPLAPIPLHS-NNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAM

Query:  WSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLY
        WSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVIRHVLY
Subjt:  WSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLY

Query:  GYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPAD
        GYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEE+WSIDDVMFNVAGLS+DYF+PPAD
Subjt:  GYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPAD

Query:  IFDSLHSHSHPHSHSHSP
        I D+LHSHS     SHSP
Subjt:  IFDSLHSHSHPHSHSHSP

XP_038887014.1 uncharacterized protein LOC120077181 [Benincasa hispida]; e value: 5.0e-221; % identity: 92.86
Query:  MATAVEIGNGGSSSSSSSSKS---RRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTP--SISQSSFTNLPNSKTQDLKLL
        MATAV IG+GGSSSSSSSS S   R SKQIWYSQPLTPLMEGPDPQFQDQE NKKDSS SNWEFLRDWFKIQRNLTP  SI  SSFTNLPNSKTQDLKLL
Subjt:  MATAVEIGNGGSSSSSSSSKS---RRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTP--SISQSSFTNLPNSKTQDLKLL

Query:  LGVLACPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLP
        LGVLACPLAPIPLHS +SPPQ S+FP   PLETSV HYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCED+GCFVLWQMLP
Subjt:  LGVLACPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLP

Query:  AMWSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHV
         MWSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIG+D+CFVLKVSAEREAVMERNEGPAEVIRHV
Subjt:  AMWSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHV

Query:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPP
        LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCI DY+DVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEE+WSIDDVMFNVAGLS+DYFIPP
Subjt:  LYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPP

Query:  ADIFDSLHSHSHPHSHSHSP
        ADIFD+LHSHSHPHSHSHSP
Subjt:  ADIFDSLHSHSHPHSHSHSP

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0L1R2 Uncharacterized protein; e value: 6.5e-243; % identity: 100
Query:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
        MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
Subjt:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA

Query:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
        CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
Subjt:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
Subjt:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
        CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
Subjt:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD

Query:  SLHSHSHPHSHSHSP
        SLHSHSHPHSHSHSP
Subjt:  SLHSHSHPHSHSHSP

A0A1S3BNV0 uncharacterized protein LOC103491587; e value: 4.1e-237; % identity: 98.8
Query:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
        MATAVEIGNGG  SSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
Subjt:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA

Query:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
        CPLAPIPLHS NSPPQTS+FPPHIPLETSV HYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
Subjt:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
Subjt:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
        CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
Subjt:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD

Query:  SLHSHSHPHSHSHSP
        SLHSHSHPHSHSHSP
Subjt:  SLHSHSHPHSHSHSP

A0A6J1F0N4 uncharacterized protein LOC111441252; e value: 4.7e-201; % identity: 87.26
Query:  GGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFT----NLPNSK-TQDLKLLLGVLACPLA
        GG  SSSSSSKSRR K IWYSQPLTPLMEGP PQFQDQEPNKKDS  SNWEF RDWFKIQRNL      +SFT    N+PNSK + DLKLLLGVLACPLA
Subjt:  GGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFT----NLPNSK-TQDLKLLLGVLACPLA

Query:  PIPLHSNNSPPQTSYFP---PHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLE
        PIPLHS   P    +FP      PLETSV HYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGK+VKTVGTR ED GCFVLWQM+PAMWSLE
Subjt:  PIPLHSNNSPPQTSYFP---PHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLE

Query:  LVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFC
        LVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRI+QGLDPKSTARLFEKAQCLGEKRIG+DDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFC
Subjt:  LVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFC

Query:  QKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFDS
        QKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMS QFSRTRMEE+WSIDDVMFNVAGLSMDYFIPPAD FD+
Subjt:  QKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFDS

Query:  LHSHSHPHSH--SHSP
        +HSHSH HSH  SHSP
Subjt:  LHSHSHPHSH--SHSP

A0A6J1GN81 uncharacterized protein LOC111455462; e value: 2.9e-214; % identity: 90.12
Query:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA
        MATAV IG GGSSSSSS S   R +QIWYSQPLTPLMEGPDPQFQDQE NKKDSS SNWEFLRDWFKIQRNL PS+  SSFTN+PNSKTQDLKLLLGVLA
Subjt:  MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLA

Query:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL
        CPLAPIPLHS+++ P  S+FP   PLETSV HYIIQQYLAATGCLKQQKCAKNMY TGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLP MWSL
Subjt:  CPLAPIPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
Subjt:  ELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD
        CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEE+WSIDDVMFNVAGLS+DYF+PPADI D
Subjt:  CQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFD

Query:  SLHSHSHPHSHSHSP
        +LHSHS     SHSP
Subjt:  SLHSHSHPHSHSHSP

A0A6J1I0T2 uncharacterized protein LOC111468400; e value: 1.7e-214; % identity: 90.67
Query:  MATAVEIGNGGSSSSSSSSKSRRS--KQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGV
        MATAV IG GGSSSSSSSS S +S  +QIWYSQPLTPLMEGPDPQFQDQE NKKDSS SNWEFLRDWFKIQRNL PS+  SSFTN+PNSKTQDLKLLLGV
Subjt:  MATAVEIGNGGSSSSSSSSKSRRS--KQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGV

Query:  LACPLAPIPLHS-NNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAM
        LACPLAPIPLHS +NSPPQ S+FP   PLETSV HYIIQQYLAATGCLKQQKCAKNMY TGSVKMIRCETEVSSGKSVK VGTRCEDTGCFVLWQMLP M
Subjt:  LACPLAPIPLHS-NNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAM

Query:  WSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLY
        WSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVIRHVLY
Subjt:  WSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLY

Query:  GYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPAD
        GYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEE+WSIDDVMFNVAGLS+DYF+PPAD
Subjt:  GYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPAD

Query:  IFDSLHSHSHPHSHSHSP
        I D+LHSHS     SHSP
Subjt:  IFDSLHSHSHPHSHSHSP

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G27690.1 Protein of unknown function (DUF620); e value: 2.6e-98; % identity: 47.83
Query:  RRSKQIWYSQP-------LTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAPIPLHSNNSP
        RR K  + +QP       L P++EGPDP  +D       SSG    F R W+   +   P ++  S ++  + K  DL+LLLGVL  PL P+ + + +  
Subjt:  RRSKQIWYSQP-------LTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAPIPLHSNNSP

Query:  PQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEV-SSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVGGSKVVAGS
        P  S    + P+ETS   YI+QQY AA+G  K     +N Y  G ++ +  E E  S G   K   ++  ++G FVLW M P MW +ELV+GGSKV+AG 
Subjt:  PQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEV-SSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVGGSKVVAGS

Query:  DGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDS
        DG  VWRHTPWLG HAAKGP RPLRR +QGLDP++TA +F  A+C+GEK+I  +DCF+LK+ A+   +  R+EG +E IRH L+GYF QK+G+LV+LEDS
Subjt:  DGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDS

Query:  HLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMST-QFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI-FDS
         LTR+Q   G+AVYWETTI S + DY+ V+G++IAH GRS+AT+ +FG+MS+   ++T M+E W ID++ FNV GLS+D FIPP+++ FDS
Subjt:  HLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMST-QFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI-FDS

AT1G49840.1 Protein of unknown function (DUF620); e value: 9.4e-101; % identity: 49.46
Query:  SQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAPIPLHSNNSPPQTSYFPPHIPLETS
        S  L P+MEGPDP   +         GS    L  W K Q +  PS++ ++    P  +  DL+LLLGV+  PLAPI + S++     +      P ETS
Subjt:  SQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAPIPLHSNNSPPQTSYFPPHIPLETS

Query:  VPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVGGSKVVAGSDGNTVWRHTPWLGTHA
           YI+QQY AA G  K     KN YA G +KMI  E E  +G +V+   +   +TG FVLWQM P MW +EL VGGSKV AG +G  VWRHTPWLG+H 
Subjt:  VPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVGGSKVVAGSDGNTVWRHTPWLGTHA

Query:  AKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQT-EGDAVYWE
        AKGP RPLRR +QGLDP++TA +F +++C+GE+++  +DCF+LK+  + E +  R+EGPAE++RH+L+GYF Q++G+L  +EDS LTR+Q+ +GDAVYWE
Subjt:  AKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQT-EGDAVYWE

Query:  TTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI
        TTI S + DY+ V+G++IAH GRS+ T+F+FGE++   +RT+MEE W+I++V FNV GLS+D FIPPAD+
Subjt:  TTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI

AT1G79420.1 Protein of unknown function (DUF620); e value: 4.6e-148; % identity: 64.48
Query:  SSSKSRRSKQIWYSQP--LTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNS-----KTQDLKLLLGVLACPLAPIPLH
        S+SKS   KQ W + P  LTPLMEGPDP  QD+   K+    S+WE +R+WFK+ + ++ ++S  S   L NS     K QDL+LLLGVL CPLAPI + 
Subjt:  SSSKSRRSKQIWYSQP--LTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNS-----KTQDLKLLLGVLACPLAPIPLH

Query:  SN----NSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVG----TRCEDTGCFVLWQMLPAMWSLE
         +    + P   S+   ++P ETS  HYIIQQYLAATGCLK+ K AKNMYATG +KM  CETE+++GKSVKT+G     R  D+GCFVLWQM P MWSLE
Subjt:  SN----NSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVG----TRCEDTGCFVLWQMLPAMWSLE

Query:  LVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNE--GPAEVIRHVLYGY
        LV+GG+K+++GSDG TVWRHTPWLGTHAAKGPQRPLRR+IQGLDPK+TA LF KAQCLGE+RIG+DDCFVLKVSA+R++++ERN+   PAEVIRH LYGY
Subjt:  LVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNE--GPAEVIRHVLYGY

Query:  FCQKSGVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPA
        FCQKSG+LVYLEDSHLTRV T   E +AVYWETTIG+ IGDYRDVDGV +AH GR++ATVF+FGE S Q+SRTRMEEIW IDDV+F+V GLS+D FIPPA
Subjt:  FCQKSGVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPA

Query:  DIFDSLHSHSH
        DIF+  + +++
Subjt:  DIFDSLHSHSH

AT3G19540.1 Protein of unknown function (DUF620); e value: 4.5e-103; % identity: 49.24
Query:  GNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSS--GSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAP
        G GG         ++  + I  S  L P+MEGPDP       N  +S   GS    L  W K Q +  PS++ ++       +  DL+LLLGV+  PLAP
Subjt:  GNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSS--GSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAP

Query:  IPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVG
        I + S++  P  S    + P+ETS   YI+QQY AA+G  K Q   KN YA G +KMI  E E ++ ++V+       +TG FVLWQM P MW +EL VG
Subjt:  IPLHSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVG

Query:  GSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSG
        GSKV AG +G  VWRHTPWLG+H AKGP RPLRR +QGLDP++TA +F +A+C+GEK++  +DCF+LK+  + E +  R+EGPAE+IRHVL+GYF QK+G
Subjt:  GSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSG

Query:  VLVYLEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI
        +LV++EDSHLTR+Q+  G+ V+WETT  S + DYR V+G++IAH G S+ T+F+FGE++T  +RT+MEE W+I++V FNV GLS+D FIPPAD+
Subjt:  VLVYLEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI

AT5G05840.1 Protein of unknown function (DUF620); e value: 4.7e-84; % identity: 44.09
Query:  WFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAPIPL---HSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVK
        W K     T + + ++ T L   +  +++LLLGV+  PL P+P+   H N+            PLE S+  YI++QY+AA G  +     ++MYA G V+
Subjt:  WFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAPIPL---HSNNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVK

Query:  MIRCE---------TEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARL
        M   E         +++   +S+K+ G    + G FVLWQ    +W LELVV G K+ AGSD    WR TPW  +HA++GP RPLRR +QGLDPKSTA L
Subjt:  MIRCE---------TEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVGGSKVVAGSDGNTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARL

Query:  FEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHRGR
        F ++ C+GEK+I ++DCF+LK+ AE  A+  R+    E+IRH ++G F Q++G+L+ LEDSHL R++ + D +++WETT+ S I DYR VDG+L+AH G+
Subjt:  FEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHRGR

Query:  SIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI
        S  ++F+FGE S   SRTRMEE W I+++ FN+ GLSMD F+PP+D+
Subjt:  SIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADI
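
In the alignments above, the unlabelled middle line of each block marks identical residues with a letter, conservative substitutions with `+`, and all other columns with a space. The following is a minimal Python sketch, under those assumptions, for tallying one block. The helper names `midline_for` and `midline_stats` are hypothetical, and the 8-character label width is taken from the `Query:  ` prefix used on this page. The reported % identity covers the whole alignment, so per-block figures from this sketch are only partial tallies.

```python
PREFIX_WIDTH = len("Query:  ")  # label width used by the alignment lines on this page


def midline_for(query_line: str, midline_line: str) -> str:
    """Cut the midline to the same columns as the query residues (hypothetical helper)."""
    query_seq = query_line[PREFIX_WIDTH:]
    # Trailing spaces may have been lost in transit, so pad before slicing.
    return midline_line[PREFIX_WIDTH:].ljust(len(query_seq))[: len(query_seq)]


def midline_stats(midline: str) -> dict:
    """Tally identical, positive ('+') and remaining columns in a midline."""
    length = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positive": positive,
        "other": length - identical - positive,
        "percent_identity": round(100.0 * identical / length, 2) if length else 0.0,
    }


# Final block of the first GenBank hit on this page.
query_line = "Query:  SLHSHSHPHSHSHSP"
mid_line = "        +LHSHS     SHSP"
stats = midline_stats(midline_for(query_line, mid_line))
print(stats)
```

Summing `identical` and `length` over every block of a hit, then dividing, gives an alignment-wide estimate that can be compared with the % identity reported for that hit.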


Sequences
CDS sequence
ATGGCGACGGCGGTTGAGATTGGGAACGGGGGATCTTCTTCTTCTTCTTCTTCTTCAAAGTCGAGGAGATCGAAACAAATTTGGTACTCACAGCCTTTGACTCCATTAAT
GGAAGGCCCAGATCCCCAATTCCAAGACCAAGAACCCAACAAAAAAGACTCCTCCGGTTCGAACTGGGAATTCCTCCGCGACTGGTTTAAGATCCAACGCAACCTCACCC
CCTCCATCTCCCAGTCTTCCTTCACAAACCTTCCTAACTCCAAAACCCAGGATTTAAAGCTCTTGCTCGGCGTCCTCGCATGTCCTCTCGCTCCCATTCCTCTCCATTCC
AATAATTCCCCTCCACAAACCTCCTACTTCCCGCCTCATATTCCTCTTGAAACTTCCGTGCCTCATTACATTATACAACAATACTTGGCCGCCACAGGATGTCTGAAACA
GCAAAAGTGCGCCAAGAACATGTACGCCACCGGAAGTGTGAAGATGATTCGTTGCGAAACAGAGGTTTCTTCTGGTAAATCTGTGAAGACGGTTGGAACAAGATGTGAGG
ACACTGGCTGCTTTGTTCTTTGGCAAATGCTACCAGCTATGTGGTCACTTGAATTGGTCGTTGGAGGTAGTAAGGTGGTGGCCGGCAGCGACGGCAATACTGTCTGGCGT
CACACTCCCTGGCTCGGCACCCATGCTGCCAAAGGCCCCCAACGTCCCCTCCGTCGCATCATTCAGGGGCTAGATCCGAAGAGCACAGCGAGGCTGTTTGAGAAAGCTCA
ATGTCTGGGAGAAAAGAGAATCGGAGAAGACGATTGCTTTGTGCTGAAAGTATCGGCAGAGAGGGAGGCAGTGATGGAGAGGAATGAGGGGCCTGCGGAAGTGATAAGGC
ATGTACTATATGGCTACTTCTGTCAAAAGAGTGGAGTGTTAGTGTACCTGGAGGACTCGCATCTAACCAGGGTCCAGACTGAAGGCGATGCTGTGTACTGGGAAACGACC
ATTGGAAGCTGCATTGGGGATTACAGAGACGTGGATGGGGTCCTCATCGCTCACCGTGGCAGGTCTATAGCTACTGTGTTCAAGTTTGGGGAAATGTCCACCCAATTTAG
CAGGACTCGCATGGAAGAGATTTGGAGTATTGACGATGTGATGTTTAACGTTGCTGGTCTTAGCATGGACTACTTTATTCCTCCTGCTGATATTTTTGATAGCCTTCACT
CCCATTCTCACCCTCACTCCCATTCTCATTCTCCATGA
mRNA sequence
GTAATGATTTCAGTTTCCATTTAATATTTATAATCACACTCTCCCACCTTTGTCTTTTCCCAATCTTCTCTTTTATACTTTCATCTCCTTCATCCTCTCATCTTCATCTT
TCCTTCTCTCACATTCTCCTTAACTCCGGCCACCACCCACTGTGAGACTTTCACACTTCATAGTCCCCATTCTTTTCTCTTTCTCTTTTTTTTCCCTTTCATTTCTTCAC
TGGATCTTCTTTTTTTCAGTACAAAACCGTCCAAATCAACTGAAATCTTTATTTCCTGGTCGGTGTCAGCAGCAATGTTACTAAAACACGTTGGTAATCGACTCAGTTTT
GAAGCTTAATTGATTGGTAAGTTTGTGTGAAATTATATCTGGTTGAAGGCGAGGTAATTAAAGGTGGAATGGCGACGGCGGTTGAGATTGGGAACGGGGGATCTTCTTCT
TCTTCTTCTTCTTCAAAGTCGAGGAGATCGAAACAAATTTGGTACTCACAGCCTTTGACTCCATTAATGGAAGGCCCAGATCCCCAATTCCAAGACCAAGAACCCAACAA
AAAAGACTCCTCCGGTTCGAACTGGGAATTCCTCCGCGACTGGTTTAAGATCCAACGCAACCTCACCCCCTCCATCTCCCAGTCTTCCTTCACAAACCTTCCTAACTCCA
AAACCCAGGATTTAAAGCTCTTGCTCGGCGTCCTCGCATGTCCTCTCGCTCCCATTCCTCTCCATTCCAATAATTCCCCTCCACAAACCTCCTACTTCCCGCCTCATATT
CCTCTTGAAACTTCCGTGCCTCATTACATTATACAACAATACTTGGCCGCCACAGGATGTCTGAAACAGCAAAAGTGCGCCAAGAACATGTACGCCACCGGAAGTGTGAA
GATGATTCGTTGCGAAACAGAGGTTTCTTCTGGTAAATCTGTGAAGACGGTTGGAACAAGATGTGAGGACACTGGCTGCTTTGTTCTTTGGCAAATGCTACCAGCTATGT
GGTCACTTGAATTGGTCGTTGGAGGTAGTAAGGTGGTGGCCGGCAGCGACGGCAATACTGTCTGGCGTCACACTCCCTGGCTCGGCACCCATGCTGCCAAAGGCCCCCAA
CGTCCCCTCCGTCGCATCATTCAGGGGCTAGATCCGAAGAGCACAGCGAGGCTGTTTGAGAAAGCTCAATGTCTGGGAGAAAAGAGAATCGGAGAAGACGATTGCTTTGT
GCTGAAAGTATCGGCAGAGAGGGAGGCAGTGATGGAGAGGAATGAGGGGCCTGCGGAAGTGATAAGGCATGTACTATATGGCTACTTCTGTCAAAAGAGTGGAGTGTTAG
TGTACCTGGAGGACTCGCATCTAACCAGGGTCCAGACTGAAGGCGATGCTGTGTACTGGGAAACGACCATTGGAAGCTGCATTGGGGATTACAGAGACGTGGATGGGGTC
CTCATCGCTCACCGTGGCAGGTCTATAGCTACTGTGTTCAAGTTTGGGGAAATGTCCACCCAATTTAGCAGGACTCGCATGGAAGAGATTTGGAGTATTGACGATGTGAT
GTTTAACGTTGCTGGTCTTAGCATGGACTACTTTATTCCTCCTGCTGATATTTTTGATAGCCTTCACTCCCATTCTCACCCTCACTCCCATTCTCATTCTCCATGACTAA
GTACTCTTCTAATTCTCTGCTACTGCTTTAATTAACCCAATGCTAATCACTTCTCACGGGTATACATCATCCATCTGTAACAACTTTTCTTCTCCTGAATTTGCATAAAT
TGTTTAGATGGGACCAAAATCTTCCTTACTCCA
Protein sequence
MATAVEIGNGGSSSSSSSSKSRRSKQIWYSQPLTPLMEGPDPQFQDQEPNKKDSSGSNWEFLRDWFKIQRNLTPSISQSSFTNLPNSKTQDLKLLLGVLACPLAPIPLHS
NNSPPQTSYFPPHIPLETSVPHYIIQQYLAATGCLKQQKCAKNMYATGSVKMIRCETEVSSGKSVKTVGTRCEDTGCFVLWQMLPAMWSLELVVGGSKVVAGSDGNTVWR
HTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDDCFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETT
IGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSTQFSRTRMEEIWSIDDVMFNVAGLSMDYFIPPADIFDSLHSHSHPHSHSHSP
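
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` is hypothetical, and the two inputs are the first 30 and last 18 bases of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to the first stop codon (hypothetical helper)."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i : i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGACGGCGGTTGAGATTGGGAACGGG"))  # MATAVEIGNG
print(translate("CATTCTCATTCTCCATGA"))              # HSHSP
```

Passing the full CDS, with line breaks left in, returns a string that can be compared directly with the listed protein sequence once its line breaks are removed.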