; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002169 (gene) of Snake gourd v1 genome

Gene IDTan0002169
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSNF2 domain protein
Genome locationLG09:63988122..63991529
RNA-Seq ExpressionTan0002169
SyntenyTan0002169
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589981.1 hypothetical protein SDJN03_15404, partial [Cucurbita argyrosperma subsp. sororia]1.0e-23780.63Show/hide
Query:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP
        M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S   E FYELIQAR DPP+E+IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAP
Subjt:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP

Query:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV
        C SSKTIALL+PVV+EVYKLI DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVDLISIW + NEGLDQFLPLV+SEV
Subjt:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV

Query:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD
         GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSRQ LE +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Subjt:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD

Query:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF
        YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQIIRWV+SQI SN NVN P GSSPKI L+WLLK ED GVRVF
Subjt:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF

Query:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL
        D+TIS+RR KLVLD SKSVSGHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N S K+KR+R A +K KKIKF KYDL
Subjt:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL

Query:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        +PNSD  +LRSAV++ND++SEGEVHNPH DEDS+ +E
Subjt:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

KAG7023645.1 hypothetical protein SDJN02_14671, partial [Cucurbita argyrosperma subsp. argyrosperma]8.2e-24080.74Show/hide
Query:  VESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSC
        VESM+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S   E FYELIQAR DPP+E+IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSC
Subjt:  VESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSC

Query:  SAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVN
        SAPC SSKTIALL+PVV+EVYKLI DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVDLISIW + NEGLDQFLPLV+
Subjt:  SAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVN

Query:  SEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALI
        SEV GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSRQ LE +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALI
Subjt:  SEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALI

Query:  LVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGV
        LVDYSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQIIRWV+SQI SN N N P GSSPKI L+WLLK ED GV
Subjt:  LVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGV

Query:  RVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIK
        RVFD+TIS+RR KLVLD SKSVSGHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N SGK+KR+R A +K KKIKF+K
Subjt:  RVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIK

Query:  YDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        YDL+PNSD  +LRSAV++ND++SEGEVHNPH DEDS+ +E
Subjt:  YDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

XP_022147658.1 uncharacterized protein LOC111016527 [Momordica charantia]6.1e-24382.54Show/hide
Query:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ
        MALALVESMDSMN  NQNPFLGENYE TL+QSI+NVLAEIR+GNLG  H TE FY+L+QAR DPPME+IWFYSAL FRS SS  KGDFLDR+AAMKVLFQ
Subjt:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ

Query:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKT+A LAPVVFEVYKLI DML KDLA KREKKAMREVK+LVE I+GFINLSSCK SDQN E LDFNLITPF+DLISIWTHPNEGLDQF
Subjt:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF

Query:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL
        LPLV+SEV G F SGVC VR LAGVVIAEAFLMKLCLDF+SGRSRQ LEK+LR+W VGSIT IRN Y FETL+RFLL  TLPVMSLL STEDE LLRKVL
Subjt:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL

Query:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT
        YDALILVDYSFL+P KAI+L AEHVA LAVKRLILTH+AIEF+REHGDQSRA+SYLNAFSSS V SQ+IRWVRSQI SN NVNRPNGSSPKILL+WL K 
Subjt:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT

Query:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKI
        EDQGVRVFDNTISD R KLVLD+SKS S HP LEG+KVDD LLFY+DKQGE EN SEE K MDESVN A VTVARTMS  EN SGK+KR+RK+ERKN KI
Subjt:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKI

Query:  KFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        KF+KYDL PN DAAQLRSAVDNND NSEGEVHNPH DEDS+MEE
Subjt:  KFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

XP_022987671.1 uncharacterized protein LOC111485155 [Cucurbita maxima]1.7e-23780.82Show/hide
Query:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP
        M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNL  S   E FYELIQARADPP+E+IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAP
Subjt:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP

Query:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV
        C SSKTIALLAPVV+EVYKLI DML KDL  KREKKAMREVKSLVE I+GFINLSSCKDSDQN ESLDFNL+TPFVDLISIWT+ NEGLDQFLPLV+SEV
Subjt:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV

Query:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD
         GEFSSGVC +RRLAGVVIAE FL+KLCLD NSGRSRQ LE +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Subjt:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD

Query:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF
        YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQIIRWV+SQI S+ NVN P GSSPKI L+WL K ED GVRVF
Subjt:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF

Query:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL
        D+TIS+RR KLVLD SKSVSGHPT EG+ VDD+LLFYIDKQGENENGS EE + MDE+VN A V+ A TMSTT+N   K+KRRR A +K KKIKF KYDL
Subjt:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL

Query:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        +PNSDA +LRSAVD+ND++S+ EVHNPHLDEDS+M+E
Subjt:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

XP_038880003.1 uncharacterized protein LOC120071696 [Benincasa hispida]1.2e-23882.02Show/hide
Query:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ
        MALALVESMDSMN   +NPFLGENYE TL QSIQNV+AEIRKGN G S  TE FYELIQARADPP+E+IWFYSALTFRSR    KGDFL+RVAAMKVLFQ
Subjt:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ

Query:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQN-CESLDFNLITPFVDLISIWTHPNEGLDQ
        LV SCSAPC SSKTI LL+PVV EVYKLI DML KDLA KREKKAMREVKSLVE I+GFINLSSCKDSD+N  ESLDFNLITPFVDLIS+WTHPNEGLDQ
Subjt:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQN-CESLDFNLITPFVDLISIWTHPNEGLDQ

Query:  FLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKV
        FLPLV+SEV GEFSSGVC VRRLAGVVIAE FLMKLCLDFN+G SRQ LEK+LRIWTVGSITRIRNFYFFETLVRFLLEATLPV SLL STEDEALLRKV
Subjt:  FLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKV

Query:  LYDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLK
        LYD+LILV+YSFL PEKAI+LPAEHVA LAVKRLILTHEAIEFYREHGDQSRA+SYLNAFSSS VSSQIIRWV+SQ+ SN NV RPNGSSPKI+L+WLL+
Subjt:  LYDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLK

Query:  TEDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKK
         EDQGVRVFD TIS+R  KLVLD SKSVS    LEGDKVDDDLLFYIDKQGE+ENGSE+  TMDESVN A V+VARTMSTTEN SGK KR+R  +RKN+K
Subjt:  TEDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKK

Query:  IKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        IKF+KYDL+P+SD  Q RS  DNND++SEG+VHNPH D+DS+++E
Subjt:  IKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

TrEMBL top hitse value%identityAlignment
A0A0A0M1W0 Uncharacterized protein1.5e-21568.89Show/hide
Query:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ
        MAL LVESM+S+N   QNPFLGENYE TL QSIQNVLAEIRKGN+  S  T+RFY+LIQARADPP+E+IWFYSAL FRS S   KGDFL+RVAAMKVLFQ
Subjt:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ

Query:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKTI LL+PVV EVYKL+ DM  KDL   REKKAMREVKSLVE I+GF+NLSS +DSD+N +SLDF+LITPF+DLISIWT PNEGLDQF
Subjt:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF

Query:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL
        LPLV SEV  EFSSG C VRRLAGVVIAE FLMKLCLDFN GRSRQ LEK+L  W VGSIT+IRNFY FETLVR LLEATLPV SLL ST++EALLRKVL
Subjt:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL

Query:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT
        YDALILVDYSFL PE AINLPAEHVA LAVKRLILT+EAIEFYREHGDQ+RA+SYLNAFSSSLVSSQIIRW++SQ+ SN N+N PNG SPK+ L+WLLK 
Subjt:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT

Query:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRR----------
        EDQGVRVFDNTIS+RR+KLVLD SKSVS     EGDKVDDDLLFYIDKQG N NGSEE  TMDESVN A  + A TMSTTEN+S K+  R          
Subjt:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRR----------

Query:  ------------------------------------------------------------RKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGE
                                                                    RKA+RKNKK K +KYDL+PN+DA QL+SAV+NND++SEGE
Subjt:  ------------------------------------------------------------RKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGE

Query:  VHNPHLDEDSEMEE
        VHNPH D+DS+M++
Subjt:  VHNPHLDEDSEMEE

A0A5D3CFU4 Pentatricopeptide repeat-containing protein3.9e-21175Show/hide
Query:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ
        MAL LVESM+S+N   +N FLGENYE TL QSIQNVLAEIRKGN+  S  TE FY+LIQARADPP+E+IWFYSALTFRS S   KGDFL+RVAAMKVLFQ
Subjt:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ

Query:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKTI LL+PVV EVYKL+ DM  KDL  KREKKAMREVKSLVE I+G  NLSSC+DS++N +SLDFN ITPFVDLISIWTHPNEGLDQF
Subjt:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF

Query:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL
        LPLV SEV  EFSSG C VRRLAGVVIAE FL+KLCLDFN G SRQ LE++LR WTVGSITRIRNFYFFETLVR LLEATLPV SLL ST+DEALLRKVL
Subjt:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL

Query:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT
         DALILVDYSFL PEKAINLPAEH A LAVKRLILT+EA EFYR+HGDQ+RA+SYLNAFSSSLVSSQIIRWV+SQ+ SN N+N  NGSSPK+ L+WLLK 
Subjt:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT

Query:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKI
        EDQGVRVFDNTIS+ R K+VLD SKSV      EGDKVDDDLLFYIDKQGENENG EE KTMD+SVN A V+VA TMSTTEN+S K KR RKA+++NKK 
Subjt:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKI

Query:  KFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
                 N+D +QL+SAV+NND+N +         ED+ M+E
Subjt:  KFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

A0A6J1D1X1 uncharacterized protein LOC1110165272.9e-24382.54Show/hide
Query:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ
        MALALVESMDSMN  NQNPFLGENYE TL+QSI+NVLAEIR+GNLG  H TE FY+L+QAR DPPME+IWFYSAL FRS SS  KGDFLDR+AAMKVLFQ
Subjt:  MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQ

Query:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKT+A LAPVVFEVYKLI DML KDLA KREKKAMREVK+LVE I+GFINLSSCK SDQN E LDFNLITPF+DLISIWTHPNEGLDQF
Subjt:  LVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQF

Query:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL
        LPLV+SEV G F SGVC VR LAGVVIAEAFLMKLCLDF+SGRSRQ LEK+LR+W VGSIT IRN Y FETL+RFLL  TLPVMSLL STEDE LLRKVL
Subjt:  LPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL

Query:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT
        YDALILVDYSFL+P KAI+L AEHVA LAVKRLILTH+AIEF+REHGDQSRA+SYLNAFSSS V SQ+IRWVRSQI SN NVNRPNGSSPKILL+WL K 
Subjt:  YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKT

Query:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKI
        EDQGVRVFDNTISD R KLVLD+SKS S HP LEG+KVDD LLFY+DKQGE EN SEE K MDESVN A VTVARTMS  EN SGK+KR+RK+ERKN KI
Subjt:  EDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKI

Query:  KFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        KF+KYDL PN DAAQLRSAVDNND NSEGEVHNPH DEDS+MEE
Subjt:  KFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

A0A6J1H8Q6 uncharacterized protein LOC1114615264.1e-23780.45Show/hide
Query:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP
        M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S   E FYELIQAR DPP+E+IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAP
Subjt:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP

Query:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV
        C SSKTIALL+PVV+EVYKLI DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVDLISIW + NEGLDQFLPLV+SEV
Subjt:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV

Query:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD
         GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSRQ LE +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Subjt:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD

Query:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF
        YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQIIRWV+SQI SN N N P GSSPKI L+WLLK ED GVRVF
Subjt:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF

Query:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL
        D+TIS+RR KLVLD SKSVSGHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N SGK+KR+R A +K KKIKF KYDL
Subjt:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL

Query:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        + NSD  +LRSAV++ND++SEGEVHNPH DEDS+ +E
Subjt:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

A0A6J1JEZ4 uncharacterized protein LOC1114851558.3e-23880.82Show/hide
Query:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP
        M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNL  S   E FYELIQARADPP+E+IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAP
Subjt:  MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAP

Query:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV
        C SSKTIALLAPVV+EVYKLI DML KDL  KREKKAMREVKSLVE I+GFINLSSCKDSDQN ESLDFNL+TPFVDLISIWT+ NEGLDQFLPLV+SEV
Subjt:  CDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEV

Query:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD
         GEFSSGVC +RRLAGVVIAE FL+KLCLD NSGRSRQ LE +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Subjt:  CGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD

Query:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF
        YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQIIRWV+SQI S+ NVN P GSSPKI L+WL K ED GVRVF
Subjt:  YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVF

Query:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL
        D+TIS+RR KLVLD SKSVSGHPT EG+ VDD+LLFYIDKQGENENGS EE + MDE+VN A V+ A TMSTT+N   K+KRRR A +K KKIKF KYDL
Subjt:  DNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDL

Query:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
        +PNSDA +LRSAVD+ND++S+ EVHNPHLDEDS+M+E
Subjt:  IPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G11780.1 unknown protein3.0e-3827.53Show/hide
Query:  LEQSIQNVLAEIRKGNLGSSHLTERFYELIQARAD-PPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYK
        L  SI+ +L + R G    S     F  ++    + PP+E +WFYSA+ F S     +    D V      FQL+ S S      K ++LL+PVV+++ +
Subjt:  LEQSIQNVLAEIRKGNLGSSHLTERFYELIQARAD-PPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYK

Query:  LIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLIT--PFVDLISIWT--------HPNEGLDQFLPLVNSEVCGEFSSGVC
        L+         + R + A+    SL+E I+ +I++  C D   N E  D  +++   F DL  +W            + L+ F+P  +  +  E  S  C
Subjt:  LIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLIT--PFVDLISIWT--------HPNEGLDQFLPLVNSEVCGEFSSGVC

Query:  VVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQST-EDEALLRKVLYDALI-LVDYSFLDPE
         V  LAG+V ++ FL+ LC  F+    R  L+K+L+   +  I+   + +FF+ +++ LLE  L + SL+    EDEA L +++ +A+I  V+  FL+P 
Subjt:  VVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQST-EDEALLRKVLYDALI-LVDYSFLDPE

Query:  KAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDR
           +  + H+  +A+  L L  + +   R + DQ +   Y N FS+SL+   +I WV SQ     + +     +P   ++WL+  E+QG RVF+   S  
Subjt:  KAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDR

Query:  RTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNK----KIKFIKY
          K V+  S+               DL      Q + E   ++    DE  N + +++            +  R+RK ER NK    K+K  K+
Subjt:  RTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNK----KIKFIKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTT
AGCTGAAATTCGCAAAGGAAATCTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCG
CATTAACGTTTCGTAGCCGTAGCTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGAT
TCTTCGAAGACCATTGCGTTGCTTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAG
AGAGGTTAAATCTTTAGTTGAAGTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTG
TGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGT
CGCTTGGCTGGAGTTGTAATTGCTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGT
TGGTTCTATAACTCGGATTAGGAACTTCTACTTTTTTGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGCAGAGCACTGAAGATGAAG
CTCTGTTAAGGAAGGTTCTATATGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTT
AAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCA
AATTATTAGATGGGTCAGAAGCCAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAAGTGGCTTCTCAAGACTGAAGATCAAG
GTGTGAGAGTATTTGACAATACCATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGAC
GATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCAT
GTCAACGACAGAAAACGCTTCAGGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTG
CCCAATTGAGGTCAGCTGTTGATAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAA
mRNA sequenceShow/hide mRNA sequence
TGAATTCATACAGAAAAAACTTTGCTTTCTTCGAGTCCCAATTCTTTTCATAGTAATTGATTCAATTCGATAAGTTTCAGCGAGTCTCATGGCTTTGGCTCTTGTTGAAT
CGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTTAGCTGAAATTCGCAAAGGAAAT
CTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAG
CTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGATTCTTCGAAGACCATTGCGTTGC
TTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAGAGAGGTTAAATCTTTAGTTGAA
GTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTGTGGATTTAATTAGTATTTGGAC
GCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGTCGCTTGGCTGGAGTTGTAATTG
CTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGTTGGTTCTATAACTCGGATTAGG
AACTTCTACTTTTTTGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGCAGAGCACTGAAGATGAAGCTCTGTTAAGGAAGGTTCTATA
TGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTTAAGAGATTGATTCTTACTCATG
AGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCAAATTATTAGATGGGTCAGAAGC
CAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAAGTGGCTTCTCAAGACTGAAGATCAAGGTGTGAGAGTATTTGACAATAC
CATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGACGATCTTTTGTTTTACATTGACA
AGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCATGTCAACGACAGAAAACGCTTCA
GGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTGCCCAATTGAGGTCAGCTGTTGA
TAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAATATGATCTAAAGAGAAAGGTCGTATTTGGTTGTCACA
ACACAACTTCAGATTCAGTTTGTTATCTGTTGATGACCTTTCTGGAGGAGCTGGCCAAGATACAAGTGCGAGCTCGTGAGGTTCGAGTAGGCAGGTCGATATGCCAAGAA
TGTTGGATTTTCAACACAAGTGCAACAATTCAACCAAGGTTGCAAATGTAAGTTGATCTATGATCAAATCAAAGTAAGTGAAAAAAGGAAAAGACATTTTTGGTACATTA
TGATACTGCTGTTTTTCATGAAGATGAATCAAATATTTAACTGCACTTGATGCGCTGTATTTTATTATTAATGCTTATTTTGACAACTTGACAAAACCATTGTTCAAACT
TGTTCTCAACATGAGGAAAAACTAAAAGGAAAAAAACGAC
Protein sequenceShow/hide protein sequence
MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCD
SSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVR
RLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVDYSFLDPEKAINLPAEHVALLAV
KRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDD
DLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE