; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019638 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019638
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr5:44110729..44113323
RNA-Seq ExpressionLag0019638
SyntenyLag0019638
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022922692.1 uncharacterized protein LOC111430613 isoform X1 [Cucurbita moschata]6.1e-10434.6Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG
        E  EVA     V  G+  + P           ++G  S  +  C  +  +C  RK   G+       +  L +   + G  L +E E    D   L K  
Subjt:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG

Query:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE
        SG CC  +KC P F  CG RKVA+  + L   D     EV  VAN   L KEG            SG CC   +C P+F  CGR+  V    +     + 
Subjt:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE

Query:  VVAADVSDLSKKG-----------SGGCC---------FRC---------------------------------------------TPTFQLCG------
           A+  DL+K+G             GCC         F+                                               P FQ+C       
Subjt:  VVAADVSDLSKKG-----------SGGCC---------FRC---------------------------------------------TPTFQLCG------

Query:  -----------------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVK
                               +++KE              SS  NLH+E+E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + K
Subjt:  -----------------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVK

Query:  VVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKG
        VV+G SD+ E        VAAA   +   E Q             C    W + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG
Subjt:  VVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKG

Query:  VATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGC
         A GV D H+E+G VA        ++QVA  DIP+  KEEK SAGD L+L ++EK ++S                   KEEE ++GS VSD G+EE G C
Subjt:  VATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGC

Query:  SCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVA
          FK  GK+        G  RHRR+SS SRE CWPFQICG+ WL   +FLIC  RKK+S SI +         NG +       V+ V G S     V+A
Subjt:  SCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVA

Query:  VAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR
        VAG +G++A  + S+S      GC CW SK     P R++V+  KE+G S+ +K SKRRRG     GR+E+ G+ER+R
Subjt:  VAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR

XP_022922693.1 uncharacterized protein LOC111430613 isoform X2 [Cucurbita moschata]3.3e-10234.64Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVAT---DVVHGKKRVDP---------------------------------IVVN---------------MQEGVSKEDGFRSAQRAG-------CF
        E  EVA     V  G+  + P                                  VVN               +QEG    D     +R+G       C 
Subjt:  EEIEVAT---DVVHGKKRVDP---------------------------------IVVN---------------MQEGVSKEDGFRSAQRAG-------CF

Query:  QAFHIC----AGRKKVVGDLDT---LEEGVASGGLNLHKEDEVVVADVSDLSKEGSGGCC-CFKCTPTFQFCGRR-KVASGGLNLHKEDEVVVANVSNLS
         AF +C    A  K+ + D+D    LE    + G  L KE E  VA      KE SG C   +KC P+F  CGR+  V    + +   +   VAN  +L+
Subjt:  QAFHIC----AGRKKVVGDLDT---LEEGVASGGLNLHKEDEVVVADVSDLSKEGSGGCC-CFKCTPTFQFCGRR-KVASGGLNLHKEDEVVVANVSNLS

Query:  KEG-----------SGGCCCVRCSPTFKLCGRRKVASGGLNRHKEDEVVAADV----SDLSKKGSGGCCFR----CTPTFQLCG----------------
        KEG             GCC  R     +   R    + G      D     +V    +D+  K    C  +      P FQ+C                 
Subjt:  KEG-----------SGGCCCVRCSPTFKLCGRRKVASGGLNRHKEDEVVAADV----SDLSKKGSGGCCFR----CTPTFQLCG----------------

Query:  -------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE
                     +++KE              SS  NLH+E+E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + KVV+G SD+ E
Subjt:  -------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE

Query:  ------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHK
                VAAA   +   E Q             C    W + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG A GV D H+
Subjt:  ------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHK

Query:  ENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK-
        E+G VA        ++QVA  DIP+  KEEK SAGD L+L ++EK ++S                   KEEE ++GS VSD G+EE G C  FK  GK+ 
Subjt:  ENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK-

Query:  -------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAA
               G  RHRR+SS SRE CWPFQICG+ WL   +FLIC  RKK+S SI +         NG +       V+ V G S     V+AVAG +G++A 
Subjt:  -------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAA

Query:  ASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR
         + S+S      GC CW SK     P R++V+  KE+G S+ +K SKRRRG     GR+E+ G+ER+R
Subjt:  ASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR

XP_022922694.1 uncharacterized protein LOC111430613 isoform X3 [Cucurbita moschata]1.8e-10836.03Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG
        E  EVA     V  G+  + P           ++G  S  +  C  +  +C  RK   G+       +  L +   + G  L +E E    D   L K  
Subjt:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG

Query:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE
        SG CC  +KC P F  CG RKVA+  + L   D     EV  VAN   L KEG            SG CC   +C P+F  CGR+  V    +     + 
Subjt:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE

Query:  VVAADVSDLSKKG-----------SGGCC---FRCTPTFQL-----------------------------CGRRRKE-------------ASSDLNLHKE
           A+  DL+K+G             GCC    +    FQ+                               +++KE              SS  NLH+E
Subjt:  VVAADVSDLSKKG-----------SGGCC---FRCTPTFQL-----------------------------CGRRRKE-------------ASSDLNLHKE

Query:  DEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCW
        +E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + KVV+G SD+ E        VAAA   +   E Q             C    W
Subjt:  DEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCW

Query:  SNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLH
         + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG A GV D H+E+G VA        ++QVA  DIP+  KEEK SAGD L+L 
Subjt:  SNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLH

Query:  KEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLI
        ++EK ++S                   KEEE ++GS VSD G+EE G C  FK  GK+        G  RHRR+SS SRE CWPFQICG+ WL   +FLI
Subjt:  KEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLI

Query:  CRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQ
        C  RKK+S SI +         NG +       V+ V G S     V+AVAG +G++A  + S+S      GC CW SK     P R++V+  KE+G S+
Subjt:  CRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQ

Query:  HAK-SKRRRG---CLGRRERRGRERQR
         +K SKRRRG     GR+E+ G+ER+R
Subjt:  HAK-SKRRRG---CLGRRERRGRERQR

XP_022922695.1 uncharacterized protein LOC111430613 isoform X4 [Cucurbita moschata]4.5e-10737.45Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVATDVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGDLDTLEEGVASGGLN---LHKEDEVVVADVSDLSKEGSGGCCCF
        E  EVA    +G + V      +  G  KE+      +  C  +F  C  +  VV +  T+++       N   L KE E  VA   D  KEGS GCC  
Subjt:  EEIEVATDVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGDLDTLEEGVASGGLN---LHKEDEVVVADVSDLSKEGSGGCCCF

Query:  KCTPTFQFCGRR-KVASG--GLNLHKEDEVVVANVSNLSKEGSGGCCCVRCS----PTFKLCGRRKVASG------GLNRHKEDEVVAADVSDLSKKGSG
        +     +F  R  KVA+G   + +   + + V  V+N         C  +      P F++C   KVA+G       +    E E VA DV    KK   
Subjt:  KCTPTFQFCGRR-KVASG--GLNLHKEDEVVVANVSNLSKEGSGGCCCVRCS----PTFKLCGRRKVASG------GLNRHKEDEVVAADVSDLSKKGSG

Query:  GCCFRCTPTFQLCGRRRKEASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAA
        G      P     G      SS  NLH+E+E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + KVV+G SD+ E        VAAA
Subjt:  GCCFRCTPTFQLCGRRRKEASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAA

Query:  GVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDH
           +   E Q             C    W + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG A GV D H+E+G VA      
Subjt:  GVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDH

Query:  RKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRH
          ++QVA  DIP+  KEEK SAGD L+L ++EK ++S                   KEEE ++GS VSD G+EE G C  FK  GK+        G  RH
Subjt:  RKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRH

Query:  RRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TE
        RR+SS SRE CWPFQICG+ WL   +FLIC  RKK+S SI +         NG +       V+ V G S     V+AVAG +G++A  + S+S      
Subjt:  RRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TE

Query:  GCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR
        GC CW SK     P R++V+  KE+G S+ +K SKRRRG     GR+E+ G+ER+R
Subjt:  GCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR

XP_038880648.1 uncharacterized protein LOC120072275 [Benincasa hispida]2.5e-9736.79Show/hide
Query:  MATHKQREESCTRKPRHRKNLIMEELPTFNKWVPS-NRPVPPEDPSSKSKRVLGYGSVQRKPVVLRDSPEEAGGVAADVSSLPQGEGERI-AMADSCFAR
        M T KQ EE  +RKPR R+NL MEELPTF KW+ S        D +SKSK V G   V R PVV+ DS E+ G V ADV        +R+ A+ DS  AR
Subjt:  MATHKQREESCTRKPRHRKNLIMEELPTFNKWVPS-NRPVPPEDPSSKSKRVLGYGSVQRKPVVLRDSPEEAGGVAADVSSLPQGEGERI-AMADSCFAR

Query:  GGCCCWQRGDGKSR------------RRVVDNALHEGVAAGVLDMREEGR-----------GCWCCRCAGRGCLPTFEICGRRKVPAGVPNLADEGSGRR
         GCCCWQ      R            R+VV N     V   V ++REE             GC C       C PTF+ICGRRK    +  L  E     
Subjt:  GGCCCWQRGDGKSR------------RRVVDNALHEGVAAGVLDMREEGR-----------GCWCCRCAGRGCLPTFEICGRRKVPAGVPNLADEGSGRR

Query:  HSQTAKRRCLPTFHICGRRKKVAAGVPDLHKEG--VAIDVPDLRKGR-SGGCCARWECLPAFQICRRKRKVAADEEVTADVPNVPEEIEVATDVVHGKKR
                         R ++VA   PDL KE   V +   DLRK   SG CC R  C PAFQICRR+  VA+ EE   DVP V E  EVA D       
Subjt:  HSQTAKRRCLPTFHICGRRKKVAAGVPDLHKEG--VAIDVPDLRKGR-SGGCCARWECLPAFQICRRKRKVAADEEVTADVPNVPEEIEVATDVVHGKKR

Query:  VDPIVVNMQEGVSKEDGFRSAQRAGCFQAF--HI-CAGRKKVVGDLDTLEEGVASGGLNLHKEDEVVVADVSDLSKEGSGGCC-CFKCTPTFQFCGRRKV
             VN QEG S E          C QAF  HI CAGRK V  +  T E+        L  ED   V DV DL KEGSGGCC CFKC PT   CGRR+ 
Subjt:  VDPIVVNMQEGVSKEDGFRSAQRAGCFQAF--HI-CAGRKKVVGDLDTLEEGVASGGLNLHKEDEVVVADVSDLSKEGSGGCC-CFKCTPTFQFCGRRKV

Query:  ASGGLNLHKEDEVVVANVSNLSKEGSGGCCCVRCSPTFKLCGRRKVASGGLNRHKEDEVVAADVSDLSKKGSGGCCFRCTPTFQLCGRRRKEASSDLNLH
        A   +     +E VV +VS+   EG                               +EVVAAD    SK   GG C+                       
Subjt:  ASGGLNLHKEDEVVVANVSNLSKEGSGGCCCVRCSPTFKLCGRRKVASGGLNRHKEDEVVAADVSDLSKKGSGGCCFRCTPTFQLCGRRRKEASSDLNLH

Query:  KEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPEKKVAAAGVLELRGEVQFCVGGCWSNFLLCGRRTAVDASN
                                     FQ+C R                                              G    F LCG R  VDASN
Subjt:  KEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPEKKVAAAGVLELRGEVQFCVGGCWSNFLLCGRRTAVDASN

Query:  HQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREE
        H+ +EE+ P  VQ    ++EV  VIP             DP KE+  VA GIPD  KE+QVA+ DIP+ + EEK+SAG      KEE  S+ S+ +  E 
Subjt:  HQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREE

Query:  EAVATGVHDLQKEEEASLGSGVSDLGEEEEGG-CSCFKPAGKKGGRRHRRRSSRSRERCWPFQICGRKWLSFL-ICRGRKKVSVSIPNLREEEGLVVNGA
                               DL +E+EGG C CFK  GK+G RR  RRS +SRE  W FQICG  WL  L +CRGRKK SVSI  L E+EG      
Subjt:  EAVATGVHDLQKEEEASLGSGVSDLGEEEEGG-CSCFKPAGKKGGRRHRRRSSRSRERCWPFQICGRKWLSFL-ICRGRKKVSVSIPNLREEEGLVVNGA

Query:  ASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSESTEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAKSKRRRGCL---GRRERRGRERQR
                 LV NG S+VH+EVV  AG TGVVA   HS ST  C CW SK  R    R+ V   K+ G  + +KSKRR+G L   GR++R G+ER++
Subjt:  ASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSESTEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAKSKRRRGCL---GRRERRGRERQR

TrEMBL top hitse value%identityAlignment
A0A6J1E412 uncharacterized protein LOC111430613 isoform X12.9e-10434.6Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG
        E  EVA     V  G+  + P           ++G  S  +  C  +  +C  RK   G+       +  L +   + G  L +E E    D   L K  
Subjt:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG

Query:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE
        SG CC  +KC P F  CG RKVA+  + L   D     EV  VAN   L KEG            SG CC   +C P+F  CGR+  V    +     + 
Subjt:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE

Query:  VVAADVSDLSKKG-----------SGGCC---------FRC---------------------------------------------TPTFQLCG------
           A+  DL+K+G             GCC         F+                                               P FQ+C       
Subjt:  VVAADVSDLSKKG-----------SGGCC---------FRC---------------------------------------------TPTFQLCG------

Query:  -----------------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVK
                               +++KE              SS  NLH+E+E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + K
Subjt:  -----------------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVK

Query:  VVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKG
        VV+G SD+ E        VAAA   +   E Q             C    W + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG
Subjt:  VVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKG

Query:  VATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGC
         A GV D H+E+G VA        ++QVA  DIP+  KEEK SAGD L+L ++EK ++S                   KEEE ++GS VSD G+EE G C
Subjt:  VATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGC

Query:  SCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVA
          FK  GK+        G  RHRR+SS SRE CWPFQICG+ WL   +FLIC  RKK+S SI +         NG +       V+ V G S     V+A
Subjt:  SCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVA

Query:  VAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR
        VAG +G++A  + S+S      GC CW SK     P R++V+  KE+G S+ +K SKRRRG     GR+E+ G+ER+R
Subjt:  VAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR

A0A6J1E452 uncharacterized protein LOC111430613 isoform X38.9e-10936.03Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG
        E  EVA     V  G+  + P           ++G  S  +  C  +  +C  RK   G+       +  L +   + G  L +E E    D   L K  
Subjt:  EEIEVAT---DVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGD-------LDTLEEGVASGGLNLHKEDEVVVADVSDLSKEG

Query:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE
        SG CC  +KC P F  CG RKVA+  + L   D     EV  VAN   L KEG            SG CC   +C P+F  CGR+  V    +     + 
Subjt:  SGGCCC-FKCTPTFQFCGRRKVASGGLNLHKED-----EV-VVANVSNLSKEG------------SGGCCCV-RCSPTFKLCGRR-KVASGGLNRHKEDE

Query:  VVAADVSDLSKKG-----------SGGCC---FRCTPTFQL-----------------------------CGRRRKE-------------ASSDLNLHKE
           A+  DL+K+G             GCC    +    FQ+                               +++KE              SS  NLH+E
Subjt:  VVAADVSDLSKKG-----------SGGCC---FRCTPTFQL-----------------------------CGRRRKE-------------ASSDLNLHKE

Query:  DEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCW
        +E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + KVV+G SD+ E        VAAA   +   E Q             C    W
Subjt:  DEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAAGVLELRGEVQF------------CVGGCW

Query:  SNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLH
         + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG A GV D H+E+G VA        ++QVA  DIP+  KEEK SAGD L+L 
Subjt:  SNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLH

Query:  KEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLI
        ++EK ++S                   KEEE ++GS VSD G+EE G C  FK  GK+        G  RHRR+SS SRE CWPFQICG+ WL   +FLI
Subjt:  KEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLI

Query:  CRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQ
        C  RKK+S SI +         NG +       V+ V G S     V+AVAG +G++A  + S+S      GC CW SK     P R++V+  KE+G S+
Subjt:  CRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQ

Query:  HAK-SKRRRG---CLGRRERRGRERQR
         +K SKRRRG     GR+E+ G+ER+R
Subjt:  HAK-SKRRRG---CLGRRERRGRERQR

A0A6J1E7K5 uncharacterized protein LOC111430613 isoform X21.6e-10234.64Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVAT---DVVHGKKRVDP---------------------------------IVVN---------------MQEGVSKEDGFRSAQRAG-------CF
        E  EVA     V  G+  + P                                  VVN               +QEG    D     +R+G       C 
Subjt:  EEIEVAT---DVVHGKKRVDP---------------------------------IVVN---------------MQEGVSKEDGFRSAQRAG-------CF

Query:  QAFHIC----AGRKKVVGDLDT---LEEGVASGGLNLHKEDEVVVADVSDLSKEGSGGCC-CFKCTPTFQFCGRR-KVASGGLNLHKEDEVVVANVSNLS
         AF +C    A  K+ + D+D    LE    + G  L KE E  VA      KE SG C   +KC P+F  CGR+  V    + +   +   VAN  +L+
Subjt:  QAFHIC----AGRKKVVGDLDT---LEEGVASGGLNLHKEDEVVVADVSDLSKEGSGGCC-CFKCTPTFQFCGRR-KVASGGLNLHKEDEVVVANVSNLS

Query:  KEG-----------SGGCCCVRCSPTFKLCGRRKVASGGLNRHKEDEVVAADV----SDLSKKGSGGCCFR----CTPTFQLCG----------------
        KEG             GCC  R     +   R    + G      D     +V    +D+  K    C  +      P FQ+C                 
Subjt:  KEG-----------SGGCCCVRCSPTFKLCGRRKVASGGLNRHKEDEVVAADV----SDLSKKGSGGCCFR----CTPTFQLCG----------------

Query:  -------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE
                     +++KE              SS  NLH+E+E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + KVV+G SD+ E
Subjt:  -------------RRRKE-------------ASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE

Query:  ------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHK
                VAAA   +   E Q             C    W + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG A GV D H+
Subjt:  ------KKVAAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHK

Query:  ENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK-
        E+G VA        ++QVA  DIP+  KEEK SAGD L+L ++EK ++S                   KEEE ++GS VSD G+EE G C  FK  GK+ 
Subjt:  ENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK-

Query:  -------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAA
               G  RHRR+SS SRE CWPFQICG+ WL   +FLIC  RKK+S SI +         NG +       V+ V G S     V+AVAG +G++A 
Subjt:  -------GGRRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAA

Query:  ASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR
         + S+S      GC CW SK     P R++V+  KE+G S+ +K SKRRRG     GR+E+ G+ER+R
Subjt:  ASHSES----TEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR

A0A6J1E9I0 uncharacterized protein LOC111430613 isoform X42.2e-10737.45Show/hide
Query:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG
        T KQ+E+   +K  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  V+R P   RDS    GG VA DVS     E   +A+ +S  ARG
Subjt:  THKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGG-VAADVSSLPQGEGERIAMADSCFARG

Query:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH
        GCCCWQR     R             +VV N + EGVAAGV D+REE            GC C       C PTF ICGRRKV P GV  L ++ SGRR 
Subjt:  GCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRRH

Query:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP
        S++ +RR LP        K              +VA   PDL KEG AI  P   K  S  CC++W+CLP+F +C RK      +V  +EEV+   PN P
Subjt:  SQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRK-----RKVAADEEVTADVPNVP

Query:  EEIEVATDVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGDLDTLEEGVASGGLN---LHKEDEVVVADVSDLSKEGSGGCCCF
        E  EVA    +G + V      +  G  KE+      +  C  +F  C  +  VV +  T+++       N   L KE E  VA   D  KEGS GCC  
Subjt:  EEIEVATDVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGDLDTLEEGVASGGLN---LHKEDEVVVADVSDLSKEGSGGCCCF

Query:  KCTPTFQFCGRR-KVASG--GLNLHKEDEVVVANVSNLSKEGSGGCCCVRCS----PTFKLCGRRKVASG------GLNRHKEDEVVAADVSDLSKKGSG
        +     +F  R  KVA+G   + +   + + V  V+N         C  +      P F++C   KVA+G       +    E E VA DV    KK   
Subjt:  KCTPTFQFCGRR-KVASG--GLNLHKEDEVVVANVSNLSKEGSGGCCCVRCS----PTFKLCGRRKVASG------GLNRHKEDEVVAADVSDLSKKGSG

Query:  GCCFRCTPTFQLCGRRRKEASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAA
        G      P     G      SS  NLH+E+E + A V +L K GS GC   +   +F +CG   KV SD+P+P+ + KVV+G SD+ E        VAAA
Subjt:  GCCFRCTPTFQLCGRRRKEASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPE------KKVAAA

Query:  GVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDH
           +   E Q             C    W + LLC     V+ASNHQ +E+  P+    S  +++V  V+   H EKG A GV D H+E+G VA      
Subjt:  GVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDH

Query:  RKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRH
          ++QVA  DIP+  KEEK SAGD L+L ++EK ++S                   KEEE ++GS VSD G+EE G C  FK  GK+        G  RH
Subjt:  RKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKK--------GGRRH

Query:  RRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TE
        RR+SS SRE CWPFQICG+ WL   +FLIC  RKK+S SI +         NG +       V+ V G S     V+AVAG +G++A  + S+S      
Subjt:  RRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVAVAGATGVVAAASHSES----TE

Query:  GCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR
        GC CW SK     P R++V+  KE+G S+ +K SKRRRG     GR+E+ G+ER+R
Subjt:  GCSCWQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRG---CLGRRERRGRERQR

A0A6J1JCE8 uncharacterized protein LOC1114832012.4e-9030.43Show/hide
Query:  MATHKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGGVAADVSSLPQGEGERIAMADSCFAR
        MAT KQ+E+   RK  HR+NL +EE PTF KW+ P  R    +D SSKS KR + Y  ++R P   RDS    GGV A    +P+ +G  +A+ +S  AR
Subjt:  MATHKQREESCTRKPRHRKNLIMEELPTFNKWV-PSNRPVPPEDPSSKS-KRVLGYGSVQRKPVVLRDSPEEAGGVAADVSSLPQGEGERIAMADSCFAR

Query:  GGCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRR
        GGCCCWQR     R             +VV N + E VAAGV D+REE            GC C       C PTF ICGRRKV P GVP   ++ SGRR
Subjt:  GGCCCWQRGDGKSRR------------RVVDNALHEGVAAGVLDMREE----------GRGCWCCRCAGRGCLPTFEICGRRKV-PAGVPNLADEGSGRR

Query:  HSQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRKRKV-----AADEEVTADVPNV
        +S++ +RRC P        K              +VA   PDL KEG AI  P   K  SG CC++W+CLP+F +C RK  V       +EEV+ D PN 
Subjt:  HSQTAKRRCLPTFHICGRRK--------------KVAAGVPDLHKEGVAIDVPDLRKGRSGGCCARWECLPAFQICRRKRKV-----AADEEVTADVPNV

Query:  PEEIEVATD---VVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVG-------DLDTLEEGVASGGLNLHKE--------------
        PE  EVA     V  G+  V P +         ++G  S  +  C Q+  +C  RK   G       D+  L E   + G  L KE              
Subjt:  PEEIEVATD---VVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVG-------DLDTLEEGVASGGLNLHKE--------------

Query:  -----------------------DEVVVADV------------SDLSKEG------------SGGCCC-FKCTPTFQFCGRRKVASG-------------
                               +E+ V DV             +L KEG            SG CC  +KC P F  CG RKVA+G             
Subjt:  -----------------------DEVVVADV------------SDLSKEG------------SGGCCC-FKCTPTFQFCGRRKVASG-------------

Query:  -------GLNLHKEDEVV--------------------------------------------------VANVSNLSKEG------------SGGCCCV-R
               G  L KE E                                                    VAN   L KEG            SG CC   +
Subjt:  -------GLNLHKEDEVV--------------------------------------------------VANVSNLSKEG------------SGGCCCV-R

Query:  CSPTFKLCGRR--------------------KVASG-GLNRHKEDEV------------------------------------------------VAADV
        C P F +CGR+                    +VA+G GL +  E  V                                                V A+ 
Subjt:  CSPTFKLCGRR--------------------KVASG-GLNRHKEDEV------------------------------------------------VAADV

Query:  SDLSKKG------------SGGCC--FRCTPTFQLCGRR------------------------------------RKEAS--------------------
         +L K+G            SG C   ++C P+F +CGR+                                    RKE S                    
Subjt:  SDLSKKG------------SGGCC--FRCTPTFQLCGRR------------------------------------RKEAS--------------------

Query:  ----------------------------------------------SDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPAT
                                                      S  N +KE+E + A V +L K+GS GCC  +    F +CG   KV SD+P+P+ 
Subjt:  ----------------------------------------------SDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPAT

Query:  DVKVVSGVSDLPEKKV-----AAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLE
        + KVV+G SD+ ++ V     AAA   +   EVQ             C    W + LLC     V+ASNHQ +E+  P+G      E++V  V+   H E
Subjt:  DVKVVSGVSDLPEKKV-----AAAGVLELRGEVQF------------CVGGCWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLE

Query:  KGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEG
        KG A  V D H+E+G VA        +EQVA+ DIP+  KEE+ SAGD  +L             I++E+ V++G     KEEE ++GS VSD G+EE G
Subjt:  KGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTSSVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEG

Query:  GCSCFKPAGKKGG------RRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVA
         C C K  GK+ G       R RR+SS SRE CWPFQICG+ WL   +FLIC  RK++S SI +  +  G+ V+          V+ V G S     V+A
Subjt:  GCSCFKPAGKKGG------RRHRRRSSRSRERCWPFQICGRKWL---SFLICRGRKKVSVSIPNLREEEGLVVNGAASPLREKEVLVVNGASDVHDEVVA

Query:  VAGATGVVAAASHSESTEGCSC-WQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRGCL---GRRERRGRERQR
        +AG +G +A    S+S     C W SK     P R++V+  KE+G S+ +K SKRRRG     GR+E+ G+ER+R
Subjt:  VAGATGVVAAASHSESTEGCSC-WQSKRHRRWPPRKTVATGKEDGGSQHAK-SKRRRGCL---GRRERRGRERQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACTCATAAGCAAAGGGAGGAATCATGTACACGCAAGCCCCGCCATCGGAAAAACTTGATCATGGAGGAATTGCCCACCTTCAACAAATGGGTACCCTCCAACCG
TCCAGTTCCACCCGAGGATCCCAGTTCCAAATCTAAGCGGGTTCTTGGCTATGGCTCTGTTCAGAGGAAGCCTGTCGTCCTTCGAGATTCGCCGGAGGAAGCTGGGGGGG
TTGCTGCCGATGTTTCTAGTTTGCCGCAGGGGGAGGGGGAGCGGATCGCCATGGCCGATTCTTGTTTTGCCAGAGGGGGTTGCTGTTGCTGGCAACGTGGAGATGGGAAA
TCAAGGAGAAGGGTGGTTGACAATGCTCTACACGAGGGGGTTGCTGCCGGCGTTTTGGATATGCGGGAGGAGGGGCGTGGTTGCTGGTGTTGCAGATGTGCAGGTCGAGG
ATGTTTGCCGACGTTCGAGATCTGCGGGAGGAGGAAGGTGCCTGCCGGAGTTCCAAATTTGGCCGACGAGGGGAGTGGCCGCCGACATTCCCAAACTGCAAAACGAAGAT
GTCTGCCGACATTTCATATCTGTGGGAGGAGGAAAAAGGTGGCCGCCGGCGTTCCGGATCTGCACAAAGAGGGAGTGGCTATCGACGTCCCGGATCTGCGCAAGGGACGG
AGTGGTGGTTGTTGTGCACGATGGGAATGTTTGCCCGCGTTTCAGATCTGCAGGAGGAAGAGGAAAGTCGCTGCCGACGAGGAGGTGACTGCCGACGTTCCAAATGTGCC
GGAGGAAATAGAAGTGGCCACCGACGTCGTACATGGGAAGAAGAGAGTAGATCCCATTGTTGTAAATATGCAAGAGGGCGTTTCAAAGGAGGATGGTTTTAGATCTGCGC
AACGAGCGGGATGTTTCCAAGCATTCCACATCTGCGCAGGGAGGAAGAAAGTCGTCGGCGATCTAGATACGTTGGAGGAGGGGGTGGCTTCCGGCGGTTTAAATCTTCAC
AAAGAGGACGAGGTGGTTGTTGCCGACGTCTCAGATCTGAGCAAAGAAGGGAGTGGTGGCTGCTGCTGTTTTAAATGCACGCCGACATTCCAATTTTGTGGGAGGAGGAA
GGTGGCTTCCGGCGGTTTAAATCTTCACAAAGAGGATGAGGTGGTTGTTGCCAACGTTTCAAATCTGAGCAAAGAAGGGAGTGGTGGCTGCTGTTGTGTTAGATGCTCGC
CGACATTCAAACTTTGTGGGAGGAGGAAGGTGGCTTCCGGCGGTTTAAATCGTCATAAAGAGGACGAGGTGGTTGCTGCCGACGTTTCAGATCTGAGCAAAAAAGGGAGT
GGTGGCTGTTGTTTTAGATGCACACCGACATTCCAACTTTGTGGGAGGAGGAGGAAGGAGGCTTCCAGCGATTTAAATCTTCACAAAGAGGACGAGGTGGTTGTCGCCGA
CGTTTCAGATCTGAGCAAAGAAGGGAGTGGTGGCTGCTGTTGTTGTAGATGCACGCTGACATTCCAACTTTGTGGGAGGAGGAGGAAGGTGGCTTCCGACATTCCAAGTC
CGGCAACGGATGTGAAAGTGGTCTCCGGCGTTTCAGATCTGCCGGAGAAGAAGGTGGCTGCTGCCGGTGTTCTGGAATTGCGTGGGGAAGTCCAATTCTGTGTGGGAGGG
TGTTGGTCCAATTTTCTTCTATGCGGGAGAAGGACGGCCGTCGATGCCTCGAATCATCAAGCTAAAGAAGAAAGAACGCCTGTCGGTGTTCAGAATTCGCATAAGGAGGA
CGAGGTTGCCACCGTCATTCCAAATACGCACCTGGAGAAGGGAGTGGCTACCGGCGTTTCGGATCCTCACAAGGAGAACGGTGTAGTTGCTGGCGGCATTCCAGATCATC
GAAAGGAGGAGCAAGTGGCTTCCGGCGACATTCCGATTCTTACTAAGGAGGAGAAGGTGTCTGCCGGCGACGCTCTAAATCTTCACAAGGAAGAGAAGGTGTCTACCAGC
AGCGTTCCAGATATTCGCGAGGAGGAAGCGGTGGCTACCGGCGTTCACGATCTTCAAAAGGAAGAGGAGGCAAGTCTGGGTTCTGGCGTTTCAGATTTGGGCGAGGAGGA
AGAGGGTGGCTGCAGCTGTTTCAAACCCGCGGGAAAAAAGGGTGGTCGACGTCACCGCCGCCGGAGTTCTAGATCTAGGGAACGATGTTGGCCATTCCAGATTTGTGGGA
GGAAGTGGCTATCATTCCTGATTTGCAGAGGGAGAAAAAAGGTGTCTGTAAGCATTCCAAATTTGCGGGAAGAAGAGGGGTTGGTTGTCAATGGCGCCGCCAGTCCTTTG
CGGGAAAAAGAGGTGTTGGTTGTCAATGGCGCTTCAGATGTTCATGACGAGGTGGTGGCGGTCGCCGGCGCTACGGGCGTGGTGGCTGCTGCCAGCCATTCTGAATCTAC
AGAAGGGTGTAGTTGTTGGCAGTCGAAACGGCATCGGAGATGGCCGCCGAGGAAGACGGTGGCTACAGGTAAGGAAGACGGTGGCAGTCAGCATGCCAAATCTAAGCGTA
GAAGAGGATGTCTAGGACGACGGGAAAGGAGAGGAAGAGAGAGACAAAGAAATTCCAATTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACTCATAAGCAAAGGGAGGAATCATGTACACGCAAGCCCCGCCATCGGAAAAACTTGATCATGGAGGAATTGCCCACCTTCAACAAATGGGTACCCTCCAACCG
TCCAGTTCCACCCGAGGATCCCAGTTCCAAATCTAAGCGGGTTCTTGGCTATGGCTCTGTTCAGAGGAAGCCTGTCGTCCTTCGAGATTCGCCGGAGGAAGCTGGGGGGG
TTGCTGCCGATGTTTCTAGTTTGCCGCAGGGGGAGGGGGAGCGGATCGCCATGGCCGATTCTTGTTTTGCCAGAGGGGGTTGCTGTTGCTGGCAACGTGGAGATGGGAAA
TCAAGGAGAAGGGTGGTTGACAATGCTCTACACGAGGGGGTTGCTGCCGGCGTTTTGGATATGCGGGAGGAGGGGCGTGGTTGCTGGTGTTGCAGATGTGCAGGTCGAGG
ATGTTTGCCGACGTTCGAGATCTGCGGGAGGAGGAAGGTGCCTGCCGGAGTTCCAAATTTGGCCGACGAGGGGAGTGGCCGCCGACATTCCCAAACTGCAAAACGAAGAT
GTCTGCCGACATTTCATATCTGTGGGAGGAGGAAAAAGGTGGCCGCCGGCGTTCCGGATCTGCACAAAGAGGGAGTGGCTATCGACGTCCCGGATCTGCGCAAGGGACGG
AGTGGTGGTTGTTGTGCACGATGGGAATGTTTGCCCGCGTTTCAGATCTGCAGGAGGAAGAGGAAAGTCGCTGCCGACGAGGAGGTGACTGCCGACGTTCCAAATGTGCC
GGAGGAAATAGAAGTGGCCACCGACGTCGTACATGGGAAGAAGAGAGTAGATCCCATTGTTGTAAATATGCAAGAGGGCGTTTCAAAGGAGGATGGTTTTAGATCTGCGC
AACGAGCGGGATGTTTCCAAGCATTCCACATCTGCGCAGGGAGGAAGAAAGTCGTCGGCGATCTAGATACGTTGGAGGAGGGGGTGGCTTCCGGCGGTTTAAATCTTCAC
AAAGAGGACGAGGTGGTTGTTGCCGACGTCTCAGATCTGAGCAAAGAAGGGAGTGGTGGCTGCTGCTGTTTTAAATGCACGCCGACATTCCAATTTTGTGGGAGGAGGAA
GGTGGCTTCCGGCGGTTTAAATCTTCACAAAGAGGATGAGGTGGTTGTTGCCAACGTTTCAAATCTGAGCAAAGAAGGGAGTGGTGGCTGCTGTTGTGTTAGATGCTCGC
CGACATTCAAACTTTGTGGGAGGAGGAAGGTGGCTTCCGGCGGTTTAAATCGTCATAAAGAGGACGAGGTGGTTGCTGCCGACGTTTCAGATCTGAGCAAAAAAGGGAGT
GGTGGCTGTTGTTTTAGATGCACACCGACATTCCAACTTTGTGGGAGGAGGAGGAAGGAGGCTTCCAGCGATTTAAATCTTCACAAAGAGGACGAGGTGGTTGTCGCCGA
CGTTTCAGATCTGAGCAAAGAAGGGAGTGGTGGCTGCTGTTGTTGTAGATGCACGCTGACATTCCAACTTTGTGGGAGGAGGAGGAAGGTGGCTTCCGACATTCCAAGTC
CGGCAACGGATGTGAAAGTGGTCTCCGGCGTTTCAGATCTGCCGGAGAAGAAGGTGGCTGCTGCCGGTGTTCTGGAATTGCGTGGGGAAGTCCAATTCTGTGTGGGAGGG
TGTTGGTCCAATTTTCTTCTATGCGGGAGAAGGACGGCCGTCGATGCCTCGAATCATCAAGCTAAAGAAGAAAGAACGCCTGTCGGTGTTCAGAATTCGCATAAGGAGGA
CGAGGTTGCCACCGTCATTCCAAATACGCACCTGGAGAAGGGAGTGGCTACCGGCGTTTCGGATCCTCACAAGGAGAACGGTGTAGTTGCTGGCGGCATTCCAGATCATC
GAAAGGAGGAGCAAGTGGCTTCCGGCGACATTCCGATTCTTACTAAGGAGGAGAAGGTGTCTGCCGGCGACGCTCTAAATCTTCACAAGGAAGAGAAGGTGTCTACCAGC
AGCGTTCCAGATATTCGCGAGGAGGAAGCGGTGGCTACCGGCGTTCACGATCTTCAAAAGGAAGAGGAGGCAAGTCTGGGTTCTGGCGTTTCAGATTTGGGCGAGGAGGA
AGAGGGTGGCTGCAGCTGTTTCAAACCCGCGGGAAAAAAGGGTGGTCGACGTCACCGCCGCCGGAGTTCTAGATCTAGGGAACGATGTTGGCCATTCCAGATTTGTGGGA
GGAAGTGGCTATCATTCCTGATTTGCAGAGGGAGAAAAAAGGTGTCTGTAAGCATTCCAAATTTGCGGGAAGAAGAGGGGTTGGTTGTCAATGGCGCCGCCAGTCCTTTG
CGGGAAAAAGAGGTGTTGGTTGTCAATGGCGCTTCAGATGTTCATGACGAGGTGGTGGCGGTCGCCGGCGCTACGGGCGTGGTGGCTGCTGCCAGCCATTCTGAATCTAC
AGAAGGGTGTAGTTGTTGGCAGTCGAAACGGCATCGGAGATGGCCGCCGAGGAAGACGGTGGCTACAGGTAAGGAAGACGGTGGCAGTCAGCATGCCAAATCTAAGCGTA
GAAGAGGATGTCTAGGACGACGGGAAAGGAGAGGAAGAGAGAGACAAAGAAATTCCAATTCTTAA
Protein sequenceShow/hide protein sequence
MATHKQREESCTRKPRHRKNLIMEELPTFNKWVPSNRPVPPEDPSSKSKRVLGYGSVQRKPVVLRDSPEEAGGVAADVSSLPQGEGERIAMADSCFARGGCCCWQRGDGK
SRRRVVDNALHEGVAAGVLDMREEGRGCWCCRCAGRGCLPTFEICGRRKVPAGVPNLADEGSGRRHSQTAKRRCLPTFHICGRRKKVAAGVPDLHKEGVAIDVPDLRKGR
SGGCCARWECLPAFQICRRKRKVAADEEVTADVPNVPEEIEVATDVVHGKKRVDPIVVNMQEGVSKEDGFRSAQRAGCFQAFHICAGRKKVVGDLDTLEEGVASGGLNLH
KEDEVVVADVSDLSKEGSGGCCCFKCTPTFQFCGRRKVASGGLNLHKEDEVVVANVSNLSKEGSGGCCCVRCSPTFKLCGRRKVASGGLNRHKEDEVVAADVSDLSKKGS
GGCCFRCTPTFQLCGRRRKEASSDLNLHKEDEVVVADVSDLSKEGSGGCCCCRCTLTFQLCGRRRKVASDIPSPATDVKVVSGVSDLPEKKVAAAGVLELRGEVQFCVGG
CWSNFLLCGRRTAVDASNHQAKEERTPVGVQNSHKEDEVATVIPNTHLEKGVATGVSDPHKENGVVAGGIPDHRKEEQVASGDIPILTKEEKVSAGDALNLHKEEKVSTS
SVPDIREEEAVATGVHDLQKEEEASLGSGVSDLGEEEEGGCSCFKPAGKKGGRRHRRRSSRSRERCWPFQICGRKWLSFLICRGRKKVSVSIPNLREEEGLVVNGAASPL
REKEVLVVNGASDVHDEVVAVAGATGVVAAASHSESTEGCSCWQSKRHRRWPPRKTVATGKEDGGSQHAKSKRRRGCLGRRERRGRERQRNSNS