; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019749 (gene) of Snake gourd v1 genome

Gene IDTan0019749
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein KOKOPELLI-like isoform X1
Genome locationLG07:4550674..4559902
RNA-Seq ExpressionTan0019749
SyntenyTan0019749
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022958322.1 uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata]4.5e-16964.66Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV
        ME DELYLDLLALR+LY+ LLK CLRDANSEL +  RA+IL KHLLDDA+ G+LEF SK L      FYNFL KDDK T PLDEKVAEWME NQTAR M 
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV

Query:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH
        NPEKIEH P RDRASA+NVAANDL +GISSA+RRIELHILSLQRY      TRSHI+ETK AY GQSV QGNE+LN                        
Subjt:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH

Query:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR
                     QK++P+V +HCS+FVHGFRIPL QD +EA+          KQH+L  P TL+DKSG    SKAT R  MKLN+T IQEKRS NS GR
Subjt:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR

Query:  MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKK
        +VMRPTL             HNKTHLA +QESE+TNSESESA SSS AT+QTSESETT DSSSP  Q SP ATGSEASS+  +SSS+IS +AFKF+HGKK
Subjt:  MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKK

Query:  ESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKE
        ESK+A+GR K L+NKL LIF  HHHHHH+HNGHN MWKQ+R++FH T  K+L  KEE+ G L+KT IRSV   NQVGKFQALAEG+RSHVW+SKAMKKKE
Subjt:  ESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKE

Query:  LRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
         R LNCGK  G KKLHWWKM RR RGVKLPNKG VKIGYVN+K  +K++
Subjt:  LRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV

XP_022996025.1 uncharacterized protein LOC111491355 isoform X1 [Cucurbita maxima]1.1e-17566.06Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV
        ME DELYLDLLALR+LY  LLK CLRDANSEL +  RA+ILLKHLLDDA+ G+LEF SK LA     FYNFL KDDK T PLDEKVAEWME NQTAR+M 
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV

Query:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH
        NPEKIEH P+RDRASA+NVAANDL +GI+SA+RRIELHILSLQRY      TRSHI+ETK AY GQSV QGNE+ NQ                       
Subjt:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH

Query:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR
                     QK++P+V +HCS+FV+GFRIPL QD DEA+          KQH+LV P TL+DKSG    SKAT R  MKLN+T IQEKRS NS GR
Subjt:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR

Query:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK
        +VM+PTL  HPSREVRKEQT HN+ HLA +QESEFTN  SESAS SS AT QTSESETTDDSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGK
Subjt:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK

Query:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA
        KES  A+GR K L+NKL LIFHHH    HHHHHHH+GHN MWKQ+R +FH TD K+L  KEE+ GKL+KT IRSV   NQVGKFQAL EG+RSHVW+SKA
Subjt:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA

Query:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
        MKKKE R LNCG     KKLHWWKM RR RGVK PNKG VKIGYVNRK  +K++
Subjt:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV

XP_022996027.1 uncharacterized protein LOC111491355 isoform X2 [Cucurbita maxima]1.1e-17566.06Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV
        ME DELYLDLLALR+LY  LLK CLRDANSEL +  RA+ILLKHLLDDA+ G+LEF SK LA     FYNFL KDDK T PLDEKVAEWME NQTAR+M 
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV

Query:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH
        NPEKIEH P+RDRASA+NVAANDL +GI+SA+RRIELHILSLQRY      TRSHI+ETK AY GQSV QGNE+ NQ                       
Subjt:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH

Query:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR
                     QK++P+V +HCS+FV+GFRIPL QD DEA+          KQH+LV P TL+DKSG    SKAT R  MKLN+T IQEKRS NS GR
Subjt:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR

Query:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK
        +VM+PTL  HPSREVRKEQT HN+ HLA +QESEFTN  SESAS SS AT QTSESETTDDSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGK
Subjt:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK

Query:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA
        KES  A+GR K L+NKL LIFHHH    HHHHHHH+GHN MWKQ+R +FH TD K+L  KEE+ GKL+KT IRSV   NQVGKFQAL EG+RSHVW+SKA
Subjt:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA

Query:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
        MKKKE R LNCG     KKLHWWKM RR RGVK PNKG VKIGYVNRK  +K++
Subjt:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV

XP_038877121.1 protein KOKOPELLI-like isoform X1 [Benincasa hispida]1.6e-17165.54Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVN
        M+VD+LYLDLLALRELYILLLKSCL DANSELLDERAQILLKHLLDDA+AGVLEF S +LATNS +F NFLHKDDK   PL +KV EWM+ NQT RKM N
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVN

Query:  PEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHN
        PE       RDRASA+NVA N+L + ISSA+RRIELHILSLQ  T+Q R TR H          QSVLQ NE+LNQQ    RT  S LR+RF + IKG  
Subjt:  PEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHN

Query:  LSSQLRSHLVGGQ-KIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGY-SVESKATVRSGMKLNQTI--QEKRSHNSY
             R H VG Q K++P   +HCSE+VHGFRIPL Q NDEA+KP T+ET I+KQHK+VNPMTLIDKSGY SV SKAT R  MKLNQT   Q KR+ NSY
Subjt:  LSSQLRSHLVGGQ-KIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGY-SVESKATVRSGMKLNQTI--QEKRSHNSY

Query:  GRMVMRPTLLD-HPSREVRKEQTHNKTHL-ATEQESEFTNSE--SESASSSSWATQQTSESETT-----DDSSSPHHQDSPLATGSEASSRYRSSSSSIS
        G+MVM PTLLD HPS+E R E+ ++KTHL AT+QESEFT+SE  S S+SSSSW TQ+TS SET       + SSP HQD PL+T S++SS          
Subjt:  GRMVMRPTLLD-HPSREVRKEQTHNKTHL-ATEQESEFTNSE--SESASSSSWATQQTSESETT-----DDSSSPHHQDSPLATGSEASSRYRSSSSSIS

Query:  TKAFKFNHGKKESKRAIGRAKRLKNKLRLIF-HHHHHHHHHHNGHNFMWK-QLRKIFHCTDNKK-LAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI
        TK F    GK ESK+ +GR KRLKNKL ++F HHHHHHHHHHN +NFMWK QLRKIFH  DNK+ L  KE+   K+KK AIR+V  KNQVGKFQALAEG+
Subjt:  TKAFKFNHGKKESKRAIGRAKRLKNKLRLIF-HHHHHHHHHHNGHNFMWK-QLRKIFHCTDNKK-LAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI

Query:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQL
        RSHVWRSKAMK+K ++ + CG KKGVKKLHWWKMFR  RGV+LPNKGH+KIGYVN+K +L
Subjt:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQL

XP_038877123.1 protein KOKOPELLI-like isoform X3 [Benincasa hispida]1.6e-17165.54Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVN
        M+VD+LYLDLLALRELYILLLKSCL DANSELLDERAQILLKHLLDDA+AGVLEF S +LATNS +F NFLHKDDK   PL +KV EWM+ NQT RKM N
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVN

Query:  PEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHN
        PE       RDRASA+NVA N+L + ISSA+RRIELHILSLQ  T+Q R TR H          QSVLQ NE+LNQQ    RT  S LR+RF + IKG  
Subjt:  PEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHN

Query:  LSSQLRSHLVGGQ-KIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGY-SVESKATVRSGMKLNQTI--QEKRSHNSY
             R H VG Q K++P   +HCSE+VHGFRIPL Q NDEA+KP T+ET I+KQHK+VNPMTLIDKSGY SV SKAT R  MKLNQT   Q KR+ NSY
Subjt:  LSSQLRSHLVGGQ-KIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGY-SVESKATVRSGMKLNQTI--QEKRSHNSY

Query:  GRMVMRPTLLD-HPSREVRKEQTHNKTHL-ATEQESEFTNSE--SESASSSSWATQQTSESETT-----DDSSSPHHQDSPLATGSEASSRYRSSSSSIS
        G+MVM PTLLD HPS+E R E+ ++KTHL AT+QESEFT+SE  S S+SSSSW TQ+TS SET       + SSP HQD PL+T S++SS          
Subjt:  GRMVMRPTLLD-HPSREVRKEQTHNKTHL-ATEQESEFTNSE--SESASSSSWATQQTSESETT-----DDSSSPHHQDSPLATGSEASSRYRSSSSSIS

Query:  TKAFKFNHGKKESKRAIGRAKRLKNKLRLIF-HHHHHHHHHHNGHNFMWK-QLRKIFHCTDNKK-LAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI
        TK F    GK ESK+ +GR KRLKNKL ++F HHHHHHHHHHN +NFMWK QLRKIFH  DNK+ L  KE+   K+KK AIR+V  KNQVGKFQALAEG+
Subjt:  TKAFKFNHGKKESKRAIGRAKRLKNKLRLIF-HHHHHHHHHHNGHNFMWK-QLRKIFHCTDNKK-LAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI

Query:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQL
        RSHVWRSKAMK+K ++ + CG KKGVKKLHWWKMFR  RGV+LPNKGH+KIGYVN+K +L
Subjt:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQL

TrEMBL top hitse value%identityAlignment
A0A6J1ETH9 protein KOKOPELLI-like isoform X14.8e-16162.75Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVN
        M+VDE YLDLLALRELYILLLKSCLRDA SELLDERAQILLK+LLDDA+A VLEF  KN+AT+SG+FY FLHKDDK + PLDEKV EWM           
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVN

Query:  PEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHN
            +  PKR R SA+N   + +L GISSAIRRIE HILSLQRYT+QS+  RSHI     +YCG+SVL+GNET N+QK QSRTDHS +  R         
Subjt:  PEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHN

Query:  LSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQTIQEKRSHNSYGRMV
           Q++  LVGGQ  + +VT HCSEFVHGFR+PL Q + E  KP  VET +SKQHKLVNPMTLIDK G SV SKAT+R   K +Q+ + K+S NSYG MV
Subjt:  LSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQTIQEKRSHNSYGRMV

Query:  MRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKES
        M+PTLLDHPSREVRKE+T  KTHLAT+ ESEFT    +SA SSSW TQQTSES T DD SSP HQD   A  SE SS              +++ GKKES
Subjt:  MRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKES

Query:  KRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELR
        KRAIGR KRLKNKL +IF HHHHHHHHHN H+FMW ++RKIFH T+NKKL   E+RY K K TAIRS    NQVGKFQA+A+ +RSHV RSKA+ KK+  
Subjt:  KRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELR

Query:  VLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHV-KIGYVNRKTQL
         + CG KKGVKKLHWWK+FR   GV+L NKG + +I YVN+K QL
Subjt:  VLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHV-KIGYVNRKTQL

A0A6J1H1S0 uncharacterized protein LOC111459571 isoform X12.2e-16964.66Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV
        ME DELYLDLLALR+LY+ LLK CLRDANSEL +  RA+IL KHLLDDA+ G+LEF SK L      FYNFL KDDK T PLDEKVAEWME NQTAR M 
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV

Query:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH
        NPEKIEH P RDRASA+NVAANDL +GISSA+RRIELHILSLQRY      TRSHI+ETK AY GQSV QGNE+LN                        
Subjt:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH

Query:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR
                     QK++P+V +HCS+FVHGFRIPL QD +EA+          KQH+L  P TL+DKSG    SKAT R  MKLN+T IQEKRS NS GR
Subjt:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR

Query:  MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKK
        +VMRPTL             HNKTHLA +QESE+TNSESESA SSS AT+QTSESETT DSSSP  Q SP ATGSEASS+  +SSS+IS +AFKF+HGKK
Subjt:  MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKK

Query:  ESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKE
        ESK+A+GR K L+NKL LIF  HHHHHH+HNGHN MWKQ+R++FH T  K+L  KEE+ G L+KT IRSV   NQVGKFQALAEG+RSHVW+SKAMKKKE
Subjt:  ESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKE

Query:  LRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
         R LNCGK  G KKLHWWKM RR RGVKLPNKG VKIGYVN+K  +K++
Subjt:  LRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV

A0A6J1H2T7 uncharacterized protein LOC111459571 isoform X22.2e-16964.66Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV
        ME DELYLDLLALR+LY+ LLK CLRDANSEL +  RA+IL KHLLDDA+ G+LEF SK L      FYNFL KDDK T PLDEKVAEWME NQTAR M 
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV

Query:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH
        NPEKIEH P RDRASA+NVAANDL +GISSA+RRIELHILSLQRY      TRSHI+ETK AY GQSV QGNE+LN                        
Subjt:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH

Query:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR
                     QK++P+V +HCS+FVHGFRIPL QD +EA+          KQH+L  P TL+DKSG    SKAT R  MKLN+T IQEKRS NS GR
Subjt:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR

Query:  MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKK
        +VMRPTL             HNKTHLA +QESE+TNSESESA SSS AT+QTSESETT DSSSP  Q SP ATGSEASS+  +SSS+IS +AFKF+HGKK
Subjt:  MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKK

Query:  ESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKE
        ESK+A+GR K L+NKL LIF  HHHHHH+HNGHN MWKQ+R++FH T  K+L  KEE+ G L+KT IRSV   NQVGKFQALAEG+RSHVW+SKAMKKKE
Subjt:  ESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKE

Query:  LRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
         R LNCGK  G KKLHWWKM RR RGVKLPNKG VKIGYVN+K  +K++
Subjt:  LRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV

A0A6J1K0S1 uncharacterized protein LOC111491355 isoform X25.3e-17666.06Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV
        ME DELYLDLLALR+LY  LLK CLRDANSEL +  RA+ILLKHLLDDA+ G+LEF SK LA     FYNFL KDDK T PLDEKVAEWME NQTAR+M 
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV

Query:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH
        NPEKIEH P+RDRASA+NVAANDL +GI+SA+RRIELHILSLQRY      TRSHI+ETK AY GQSV QGNE+ NQ                       
Subjt:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH

Query:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR
                     QK++P+V +HCS+FV+GFRIPL QD DEA+          KQH+LV P TL+DKSG    SKAT R  MKLN+T IQEKRS NS GR
Subjt:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR

Query:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK
        +VM+PTL  HPSREVRKEQT HN+ HLA +QESEFTN  SESAS SS AT QTSESETTDDSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGK
Subjt:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK

Query:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA
        KES  A+GR K L+NKL LIFHHH    HHHHHHH+GHN MWKQ+R +FH TD K+L  KEE+ GKL+KT IRSV   NQVGKFQAL EG+RSHVW+SKA
Subjt:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA

Query:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
        MKKKE R LNCG     KKLHWWKM RR RGVK PNKG VKIGYVNRK  +K++
Subjt:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV

A0A6J1K5J4 uncharacterized protein LOC111491355 isoform X15.3e-17666.06Show/hide
Query:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV
        ME DELYLDLLALR+LY  LLK CLRDANSEL +  RA+ILLKHLLDDA+ G+LEF SK LA     FYNFL KDDK T PLDEKVAEWME NQTAR+M 
Subjt:  MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMV

Query:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH
        NPEKIEH P+RDRASA+NVAANDL +GI+SA+RRIELHILSLQRY      TRSHI+ETK AY GQSV QGNE+ NQ                       
Subjt:  NPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGH

Query:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR
                     QK++P+V +HCS+FV+GFRIPL QD DEA+          KQH+LV P TL+DKSG    SKAT R  MKLN+T IQEKRS NS GR
Subjt:  NLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR

Query:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK
        +VM+PTL  HPSREVRKEQT HN+ HLA +QESEFTN  SESAS SS AT QTSESETTDDSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGK
Subjt:  MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGK

Query:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA
        KES  A+GR K L+NKL LIFHHH    HHHHHHH+GHN MWKQ+R +FH TD K+L  KEE+ GKL+KT IRSV   NQVGKFQAL EG+RSHVW+SKA
Subjt:  KESKRAIGRAKRLKNKLRLIFHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKA

Query:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
        MKKKE R LNCG     KKLHWWKM RR RGVK PNKG VKIGYVNRK  +K++
Subjt:  MKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV

SwissProt top hitse value%identityAlignment
Q9FFP2 Protein KOKOPELLI1.4e-1631.89Show/hide
Query:  VMRPTLLDH-------PSREVRKEQTHNKTHLATEQE----SEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSIST
        +M+PTL+D         S E   +QT + T   +E E    S+  + E+ S+S S W TQ  +++E+  +SS P   D  ++           S+S   T
Subjt:  VMRPTLLDH-------PSREVRKEQTHNKTHLATEQE----SEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSIST

Query:  KAFKFNHGKKESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHN----FMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI
                 K+ +  +GR KR+KNK+  IFHHHHHHHHHH+ H+      W +L+  FH    +K   KE +    +   + +   ++Q G F AL EG+
Subjt:  KAFKFNHGKKESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHN----FMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI

Query:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKGHVKIG
          H   SK  K +         K   KK  WWK+ ++ +  GVK+P +G VK+G
Subjt:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKGHVKIG

Arabidopsis top hitse value%identityAlignment
AT5G63720.1 kokopelli1.0e-1731.89Show/hide
Query:  VMRPTLLDH-------PSREVRKEQTHNKTHLATEQE----SEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSIST
        +M+PTL+D         S E   +QT + T   +E E    S+  + E+ S+S S W TQ  +++E+  +SS P   D  ++           S+S   T
Subjt:  VMRPTLLDH-------PSREVRKEQTHNKTHLATEQE----SEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSIST

Query:  KAFKFNHGKKESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHN----FMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI
                 K+ +  +GR KR+KNK+  IFHHHHHHHHHH+ H+      W +L+  FH    +K   KE +    +   + +   ++Q G F AL EG+
Subjt:  KAFKFNHGKKESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHN----FMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGI

Query:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKGHVKIG
          H   SK  K +         K   KK  WWK+ ++ +  GVK+P +G VK+G
Subjt:  RSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKGHVKIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTGACGAGTTATATCTTGATCTCCTAGCACTGAGGGAATTATACATCCTTCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGC
ACAGATTTTATTGAAGCATTTGCTCGATGATGCATCTGCTGGAGTTCTCGAGTTCCAATCAAAGAACTTGGCAACAAACTCAGGCGTTTTTTACAACTTTCTACACAAAG
ATGATAAACACACAAACCCACTGGACGAGAAAGTTGCTGAATGGATGGAACGCAATCAAACTGCAAGAAAGATGGTAAATCCAGAGAAGATTGAACACAATCCCAAAAGA
GACAGAGCTTCAGCTACAAATGTTGCCGCTAATGACTTATTAAATGGCATCAGTTCAGCAATCAGAAGAATTGAACTCCACATTTTATCCCTACAACGTTACACAAATCA
AAGTAGGAACACAAGAAGCCATATCAATGAAACTAAATTTGCTTACTGCGGACAGTCTGTTCTTCAAGGGAATGAGACATTAAACCAGCAGAAAGATCAGTCAAGGACAG
ATCACTCAGCTTTAAGGACCAGATTTGCTGAGTCGATTAAAGGCCATAACTTGAGTAGTCAGTTAAGAAGTCATCTTGTCGGTGGACAGAAAATTGAGCCGATAGTGACC
AGCCATTGTTCTGAGTTTGTTCATGGATTCAGAATACCTCTGAGGCAAGACAATGATGAGGCCATCAAACCTCCAACAGTTGAAACTTGCATATCTAAACAACACAAACT
TGTAAATCCAATGACTCTGATAGATAAATCTGGATATTCAGTAGAGTCCAAGGCAACCGTCAGGTCCGGAATGAAGCTGAATCAAACTATACAAGAAAAGAGGAGCCATA
ATTCATATGGTCGTATGGTAATGAGGCCAACTTTGCTGGATCACCCCTCTCGAGAAGTAAGAAAGGAACAAACTCATAACAAGACCCATTTGGCCACTGAACAAGAATCA
GAATTCACGAACTCAGAATCAGAATCAGCTTCTTCTTCAAGTTGGGCAACTCAACAAACCAGTGAAAGTGAAACCACTGATGATTCTTCTTCTCCACATCACCAAGACAG
TCCACTGGCAACTGGTTCAGAGGCAAGTAGCCGGTACAGAAGCAGCAGTAGCAGCATTTCAACAAAAGCATTCAAATTCAACCATGGGAAAAAAGAGTCCAAAAGAGCAA
TAGGACGGGCCAAGAGACTCAAAAACAAACTAAGACTTATCTTCCACCACCACCATCATCATCACCACCACCATAACGGCCATAACTTCATGTGGAAGCAGCTAAGAAAG
ATCTTCCATTGCACAGATAACAAAAAACTAGCAGGTAAAGAAGAAAGATATGGGAAGCTAAAGAAAACAGCAATCAGAAGTGTGCCTTGCAAGAACCAAGTTGGGAAGTT
TCAGGCACTTGCTGAAGGGATTCGAAGCCATGTATGGAGATCAAAAGCCATGAAGAAGAAAGAGCTTAGGGTGCTTAATTGTGGGAAGAAGAAGGGTGTAAAGAAGTTGC
ATTGGTGGAAAATGTTTCGTCGCCACCGTGGAGTGAAGTTGCCCAATAAAGGGCATGTGAAAATAGGATATGTAAATAGAAAAACACAGCTTAAAATGGTTTAG
mRNA sequenceShow/hide mRNA sequence
GAGAAACTGAAAAATTGGAGGAGAAACAACCAGAAGGACGACTCGAAGTTTTACAAATGCCAGAAACGAGAAAATTTACAAGATGGAAGTTGACGAGTTATATCTTGATC
TCCTAGCACTGAGGGAATTATACATCCTTCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGCACAGATTTTATTGAAGCATTTGCTCGAT
GATGCATCTGCTGGAGTTCTCGAGTTCCAATCAAAGAACTTGGCAACAAACTCAGGCGTTTTTTACAACTTTCTACACAAAGATGATAAACACACAAACCCACTGGACGA
GAAAGTTGCTGAATGGATGGAACGCAATCAAACTGCAAGAAAGATGGTAAATCCAGAGAAGATTGAACACAATCCCAAAAGAGACAGAGCTTCAGCTACAAATGTTGCCG
CTAATGACTTATTAAATGGCATCAGTTCAGCAATCAGAAGAATTGAACTCCACATTTTATCCCTACAACGTTACACAAATCAAAGTAGGAACACAAGAAGCCATATCAAT
GAAACTAAATTTGCTTACTGCGGACAGTCTGTTCTTCAAGGGAATGAGACATTAAACCAGCAGAAAGATCAGTCAAGGACAGATCACTCAGCTTTAAGGACCAGATTTGC
TGAGTCGATTAAAGGCCATAACTTGAGTAGTCAGTTAAGAAGTCATCTTGTCGGTGGACAGAAAATTGAGCCGATAGTGACCAGCCATTGTTCTGAGTTTGTTCATGGAT
TCAGAATACCTCTGAGGCAAGACAATGATGAGGCCATCAAACCTCCAACAGTTGAAACTTGCATATCTAAACAACACAAACTTGTAAATCCAATGACTCTGATAGATAAA
TCTGGATATTCAGTAGAGTCCAAGGCAACCGTCAGGTCCGGAATGAAGCTGAATCAAACTATACAAGAAAAGAGGAGCCATAATTCATATGGTCGTATGGTAATGAGGCC
AACTTTGCTGGATCACCCCTCTCGAGAAGTAAGAAAGGAACAAACTCATAACAAGACCCATTTGGCCACTGAACAAGAATCAGAATTCACGAACTCAGAATCAGAATCAG
CTTCTTCTTCAAGTTGGGCAACTCAACAAACCAGTGAAAGTGAAACCACTGATGATTCTTCTTCTCCACATCACCAAGACAGTCCACTGGCAACTGGTTCAGAGGCAAGT
AGCCGGTACAGAAGCAGCAGTAGCAGCATTTCAACAAAAGCATTCAAATTCAACCATGGGAAAAAAGAGTCCAAAAGAGCAATAGGACGGGCCAAGAGACTCAAAAACAA
ACTAAGACTTATCTTCCACCACCACCATCATCATCACCACCACCATAACGGCCATAACTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATTGCACAGATAACAAAAAAC
TAGCAGGTAAAGAAGAAAGATATGGGAAGCTAAAGAAAACAGCAATCAGAAGTGTGCCTTGCAAGAACCAAGTTGGGAAGTTTCAGGCACTTGCTGAAGGGATTCGAAGC
CATGTATGGAGATCAAAAGCCATGAAGAAGAAAGAGCTTAGGGTGCTTAATTGTGGGAAGAAGAAGGGTGTAAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGCCACCG
TGGAGTGAAGTTGCCCAATAAAGGGCATGTGAAAATAGGATATGTAAATAGAAAAACACAGCTTAAAATGGTTTAGTTTAGTGGGACAATTTTGAAGTTTCTGCAAGCTT
CCTTTCATGAGAGTTAGAACATTTTTATCCAACAGTTATCAAATGGAGTCTATGTAGTCATATGCATATACCAAAAATCAAATTTTCTCAATCGTTCACTTTCCTACACA
AACTAATAACTTCTCTTTCCATCTCTCCTATAAAATCAGGAGCCAAGAAAACTTCTGTTTCTTCAAAAACTCGATTAAAATTA
Protein sequenceShow/hide protein sequence
MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKR
DRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVT
SHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQTIQEKRSHNSYGRMVMRPTLLDHPSREVRKEQTHNKTHLATEQES
EFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRK
IFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV