; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g28720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g28720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr10:21601826..21615118
RNA-Seq ExpressionMoc10g28720
SyntenyMoc10g28720
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]1.0e-12441.14Show/hide
Query:  VASVAMKRG-KKYYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTG------------------WQADGGIHTGYYNMFCRCFIQTNPSTPPN
        V S ++K+G +KYYG    +SVYN+S++  QSSSSNIWI+GGP  +  V++TG                  W ADGG  TG YNM+C+ F+Q NPS    
Subjt:  VASVAMKRG-KKYYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTG------------------WQADGGIHTGYYNMFCRCFIQTNPSTPPN

Query:  IPLYPSSTYQGKQYDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINE
         PL+P+STYQG+QYDY FT+ Q    G+WW+ V ++   +GYWPKEL  +L DGAEQ+AWGGIAKPS +GMSP LG+GHKPN    D+  Y   L ++  
Subjt:  IPLYPSSTYQGKQYDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINE

Query:  NNESEVSAIENTASYVKDFCPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLSRLKGGVPELTSVGGSLAFDSNLSREEELELEMQLKLLNRP
                               F+ +F   S+P                                              NLSREEELE+E QLKLLN+P
Subjt:  NNESEVSAIENTASYVKDFCPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLSRLKGGVPELTSVGGSLAFDSNLSREEELELEMQLKLLNRP

Query:  FITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFI----NNNNRACPAGYVPIRRTIKKDLIRIRSLSS---KEPTG
        FI T++T+EGDIIDCVDINKQPALDHP LK+HK+QT PS +   L K+ S S+  + I    NNN   CP G+VPIRRT+K+DLIR++SLSS   K+ + 
Subjt:  FITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFI----NNNNRACPAGYVPIRRTIKKDLIRIRSLSS---KEPTG

Query:  IKTS-------IKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVA-QDQSSSSNIWIIGGPPQAPNVILAG------------W-----------
        +K             V FPY+Q+VVS ++ K   Y+GA   ++VYN+S++ ++QSSS+NIW++GGP ++ NV++A             W           
Subjt:  IKTS-------IKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVA-QDQSSSSNIWIIGGPPQAPNVILAG------------W-----------

Query:  -----------QDRPT-------------------------GNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPS-PNGMSPPLGNGHKPNYS
                    D P                          G+WW+ VG+    +GYWP ELF +L  G EQVAWGG A+PS  +  SPPLG+GHKPN  
Subjt:  -----------QDRPT-------------------------GNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPS-PNGMSPPLGNGHKPNYS

Query:  KYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCS
        + D+AC+ R + Y+  N     P  +NT NY+S++SCY L + E C  + F YC TFGGPGG +C+
Subjt:  KYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCS

XP_022145286.1 uncharacterized protein LOC111014774 [Momordica charantia]1.5e-9957.93Show/hide
Query:  SLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIRR
        S A  SNLSREEELELE QLKLLNRP ITTF+T+EG+IIDCVDI+KQPALDHPSLK+HK+Q RPSTYPFGLSKDS+SS++                    
Subjt:  SLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIRR

Query:  TIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNV-----------ILAGWQ
                                          +VVS+ +K+GI+YYG  G  SVYNLSVAQDQSSSSNIWI+GGPP+  NV           +   W 
Subjt:  TIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNV-----------ILAGWQ

Query:  DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCY
        DR TG+WWLAV +S  TIGYWPKELFGHLNDG EQVAWGGIAKPSPNGMSPPLGNGHKPN  KY++ACYF+ +NY+D NN G  PA EN  +++SN+ CY
Subjt:  DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCY

Query:  AL-DNRETCGGEFFYYCITFGGPGGNNC
         L D   TC  +  Y+C TFGGPGGNNC
Subjt:  AL-DNRETCGGEFFYYCITFGGPGGNNC

XP_022145287.1 uncharacterized protein LOC111014775 [Momordica charantia]1.1e-13676.25Show/hide
Query:  MASKAIMWLMIVLLLHLNCKGSLAFDSNLSMEEELEFEMQLKLLNKPSITTF-----------------------------QVASVAMKRGKKYYGDARS
        MASKAIMWLMIVLLLHLNCKGSLAFDSNLSMEEELEFEMQLKLLNKPSITTF                             QVASVAMKRGKKYYGDARS
Subjt:  MASKAIMWLMIVLLLHLNCKGSLAFDSNLSMEEELEFEMQLKLLNKPSITTF-----------------------------QVASVAMKRGKKYYGDARS

Query:  VSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQ------------------ADGGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQYDYTFT
        VSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQ                  ADGGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQYDYTFT
Subjt:  VSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQ------------------ADGGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQYDYTFT

Query:  VFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINENNESEVSAIENTASYVKDF
        VFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINENNESEVSAIENTASY+   
Subjt:  VFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINENNESEVSAIENTASYVKDF

Query:  CPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLS
                F L SSP                V T +RLRL+
Subjt:  CPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLS

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]5.1e-18594.66Show/hide
Query:  GSLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIR
        GSLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIR
Subjt:  GSLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIR

Query:  RTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQ----------
        RTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQ          
Subjt:  RTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQ----------

Query:  --------DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTAN
                DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTAN
Subjt:  --------DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTAN

Query:  YLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCSP
        YLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCSP
Subjt:  YLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCSP

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]3.5e-7742.53Show/hide
Query:  AFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFI---NNNNRACPAGYVPIR
        A + NLSREE+LE+E QLKLLN+PFI T++T+EGDIIDCVDINKQPALDHP LK+HK+QT PS +   L K+ SS  +   +   NNN   CP G+VPIR
Subjt:  AFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFI---NNNNRACPAGYVPIR

Query:  RTIKKDLIRIRSLSSKEPTGIKT-----------SIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQD-QSSSSNIWIIGGPPQAPNVILAG
        RT+K+DLIR++SLSS       +           S    V FPY Q+VVS ++ K   Y+GA   ++V+N+S++ + QSSS+NIW++GG   + NV++AG
Subjt:  RTIKKDLIRIRSLSSKEPTGIKT-----------SIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQD-QSSSSNIWIIGGPPQAPNVILAG

Query:  WQDRPT-----------------------------------------------------------------GNWWLAVGESHKTIGYWPKELFGHLNDGT
        WQ  P                                                                  G+WW+ VG++   +GYWP ELF +L  G 
Subjt:  WQDRPT-----------------------------------------------------------------GNWWLAVGESHKTIGYWPKELFGHLNDGT

Query:  EQVAWGGIAKPSPNG-MSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNC
        +QVAWGG A+P+  G  SPPLG+GHKPN  K D+A + R + Y+  N     P   NT NY+SN+SCY L + E C  + F YC TFGGPGG+ C
Subjt:  EQVAWGGIAKPSPNG-MSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNC

TrEMBL top hitse value%identityAlignment
A0A5A7UEV4 Uncharacterized protein1.7e-6931.53Show/hide
Query:  VASVAMKRG-KKYYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTG------------------WQADGGIHTGYYNMFCRCFIQTNPSTPPN
        V S ++K+G +KYYG    +SVYN+S++  QSSSSNIWI+GGP  +  V++TG                  W ADGG  TG YNM+C+ F+Q NPS    
Subjt:  VASVAMKRG-KKYYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTG------------------WQADGGIHTGYYNMFCRCFIQTNPSTPPN

Query:  IPLYPSSTYQGKQYDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPN-YGKHDDACYFRTLNYIN
         PL+P+STYQG+QYDY FT+ Q    G+WW+ V ++   +GYWPKEL  +L DGAEQ+AWGGIAKPS +GMSP LG+GHKPN  G +++ CY R +  I+
Subjt:  IPLYPSSTYQGKQYDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPN-YGKHDDACYFRTLNYIN

Query:  ENNESEVSAIENTASYVKDFCPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLSRLKGGVPELTSVGGSLAFDSNLSREEELELEMQLKLLNR
                A  NT                                                       +L +   +L++ SN S                
Subjt:  ENNESEVSAIENTASYVKDFCPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLSRLKGGVPELTSVGGSLAFDSNLSREEELELEMQLKLLNR

Query:  PFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIRIRSLSSKEPTGIKTSIK
                       C D+N         ++          + FG     +   D+            G+V +                           
Subjt:  PFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIRIRSLSSKEPTGIKTSIK

Query:  GGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQDRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQV
           D P    ++  ++ +G +Y         Y  S+ Q                              G+WW+ VG+    +GYWP ELF +L  G EQV
Subjt:  GGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQDRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQV

Query:  AWGGIAKPS-PNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCS
        AWGG A+PS  +  SPPLG+GHKPN  + D+AC+ R + Y+  N     P  +NT NY+S++SCY L + E C  + F YC TFGGPGG +C+
Subjt:  AWGGIAKPS-PNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCS

A0A5D3CJM0 Neprosin 24.9e-12541.14Show/hide
Query:  VASVAMKRG-KKYYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTG------------------WQADGGIHTGYYNMFCRCFIQTNPSTPPN
        V S ++K+G +KYYG    +SVYN+S++  QSSSSNIWI+GGP  +  V++TG                  W ADGG  TG YNM+C+ F+Q NPS    
Subjt:  VASVAMKRG-KKYYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTG------------------WQADGGIHTGYYNMFCRCFIQTNPSTPPN

Query:  IPLYPSSTYQGKQYDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINE
         PL+P+STYQG+QYDY FT+ Q    G+WW+ V ++   +GYWPKEL  +L DGAEQ+AWGGIAKPS +GMSP LG+GHKPN    D+  Y   L ++  
Subjt:  IPLYPSSTYQGKQYDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINE

Query:  NNESEVSAIENTASYVKDFCPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLSRLKGGVPELTSVGGSLAFDSNLSREEELELEMQLKLLNRP
                               F+ +F   S+P                                              NLSREEELE+E QLKLLN+P
Subjt:  NNESEVSAIENTASYVKDFCPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLSRLKGGVPELTSVGGSLAFDSNLSREEELELEMQLKLLNRP

Query:  FITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFI----NNNNRACPAGYVPIRRTIKKDLIRIRSLSS---KEPTG
        FI T++T+EGDIIDCVDINKQPALDHP LK+HK+QT PS +   L K+ S S+  + I    NNN   CP G+VPIRRT+K+DLIR++SLSS   K+ + 
Subjt:  FITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFI----NNNNRACPAGYVPIRRTIKKDLIRIRSLSS---KEPTG

Query:  IKTS-------IKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVA-QDQSSSSNIWIIGGPPQAPNVILAG------------W-----------
        +K             V FPY+Q+VVS ++ K   Y+GA   ++VYN+S++ ++QSSS+NIW++GGP ++ NV++A             W           
Subjt:  IKTS-------IKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVA-QDQSSSSNIWIIGGPPQAPNVILAG------------W-----------

Query:  -----------QDRPT-------------------------GNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPS-PNGMSPPLGNGHKPNYS
                    D P                          G+WW+ VG+    +GYWP ELF +L  G EQVAWGG A+PS  +  SPPLG+GHKPN  
Subjt:  -----------QDRPT-------------------------GNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPS-PNGMSPPLGNGHKPNYS

Query:  KYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCS
        + D+AC+ R + Y+  N     P  +NT NY+S++SCY L + E C  + F YC TFGGPGG +C+
Subjt:  KYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCS

A0A6J1CVJ6 uncharacterized protein LOC1110147772.5e-18594.66Show/hide
Query:  GSLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIR
        GSLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIR
Subjt:  GSLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIR

Query:  RTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQ----------
        RTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQ          
Subjt:  RTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQ----------

Query:  --------DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTAN
                DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTAN
Subjt:  --------DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTAN

Query:  YLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCSP
        YLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCSP
Subjt:  YLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCSP

A0A6J1CVW9 uncharacterized protein LOC1110147747.1e-10057.93Show/hide
Query:  SLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIRR
        S A  SNLSREEELELE QLKLLNRP ITTF+T+EG+IIDCVDI+KQPALDHPSLK+HK+Q RPSTYPFGLSKDS+SS++                    
Subjt:  SLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIRR

Query:  TIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNV-----------ILAGWQ
                                          +VVS+ +K+GI+YYG  G  SVYNLSVAQDQSSSSNIWI+GGPP+  NV           +   W 
Subjt:  TIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNV-----------ILAGWQ

Query:  DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCY
        DR TG+WWLAV +S  TIGYWPKELFGHLNDG EQVAWGGIAKPSPNGMSPPLGNGHKPN  KY++ACYF+ +NY+D NN G  PA EN  +++SN+ CY
Subjt:  DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCY

Query:  AL-DNRETCGGEFFYYCITFGGPGGNNC
         L D   TC  +  Y+C TFGGPGGNNC
Subjt:  AL-DNRETCGGEFFYYCITFGGPGGNNC

A0A6J1CW60 uncharacterized protein LOC1110147755.6e-13776.25Show/hide
Query:  MASKAIMWLMIVLLLHLNCKGSLAFDSNLSMEEELEFEMQLKLLNKPSITTF-----------------------------QVASVAMKRGKKYYGDARS
        MASKAIMWLMIVLLLHLNCKGSLAFDSNLSMEEELEFEMQLKLLNKPSITTF                             QVASVAMKRGKKYYGDARS
Subjt:  MASKAIMWLMIVLLLHLNCKGSLAFDSNLSMEEELEFEMQLKLLNKPSITTF-----------------------------QVASVAMKRGKKYYGDARS

Query:  VSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQ------------------ADGGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQYDYTFT
        VSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQ                  ADGGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQYDYTFT
Subjt:  VSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQ------------------ADGGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQYDYTFT

Query:  VFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINENNESEVSAIENTASYVKDF
        VFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINENNESEVSAIENTASY+   
Subjt:  VFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINENNESEVSAIENTASYVKDF

Query:  CPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLS
                F L SSP                V T +RLRL+
Subjt:  CPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G20170.1 Protein of Unknown Function (DUF239)7.7e-3028.46Show/hide
Query:  REEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYP-FGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIR
        + E+ EL+  L  +N+P I +FQT+ G I+DC+DI KQ A DHP LK+H IQ +P+  P +   K++  S    F  + + +CP G V I+RT  +DLI+
Subjt:  REEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYP-FGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIR

Query:  IRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIK-YYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGW-------------------
        I+ L   +  G+K +     DF  N      A+ +  +  YGA+G++++++  V  DQ S ++I++  G   +   I AGW                   
Subjt:  IRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIK-YYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGW-------------------

Query:  -----------------------------------------------QDRPTGNWWLAVGESHKTIGYWPKELF--GHLNDGTEQVAWGG-IAKPSPNGM
                                                       QD  TGNWW  +   ++ IGYWPK LF    L  G  +V WGG +        
Subjt:  -----------------------------------------------QDRPTGNWWLAVGESHKTIGYWPKELF--GHLNDGTEQVAWGG-IAKPSPNGM

Query:  SPPLGNGHKPNYSKYDDACYFRYMNYVD-ENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGG
        SP +G+GH P    +  A +   +  +D E  K + P  ++   + ++  CY ++ + T  GE +   I +GGPGG
Subjt:  SPPLGNGHKPNYSKYDDACYFRYMNYVD-ENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGG

AT2G20170.2 Protein of Unknown Function (DUF239)7.7e-3028.46Show/hide
Query:  REEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYP-FGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIR
        + E+ EL+  L  +N+P I +FQT+ G I+DC+DI KQ A DHP LK+H IQ +P+  P +   K++  S    F  + + +CP G V I+RT  +DLI+
Subjt:  REEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYP-FGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIR

Query:  IRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIK-YYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGW-------------------
        I+ L   +  G+K +     DF  N      A+ +  +  YGA+G++++++  V  DQ S ++I++  G   +   I AGW                   
Subjt:  IRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIK-YYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGW-------------------

Query:  -----------------------------------------------QDRPTGNWWLAVGESHKTIGYWPKELF--GHLNDGTEQVAWGG-IAKPSPNGM
                                                       QD  TGNWW  +   ++ IGYWPK LF    L  G  +V WGG +        
Subjt:  -----------------------------------------------QDRPTGNWWLAVGESHKTIGYWPKELF--GHLNDGTEQVAWGG-IAKPSPNGM

Query:  SPPLGNGHKPNYSKYDDACYFRYMNYVD-ENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGG
        SP +G+GH P    +  A +   +  +D E  K + P  ++   + ++  CY ++ + T  GE +   I +GGPGG
Subjt:  SPPLGNGHKPNYSKYDDACYFRYMNYVD-ENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGG

AT2G20170.3 Protein of Unknown Function (DUF239)8.2e-3229.81Show/hide
Query:  REEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYP-FGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIR
        + E+ EL+  L  +N+P I +FQT+ G I+DC+DI KQ A DHP LK+H IQ +P+  P +   K++  S    F  + + +CP G V I+RT  +DLI+
Subjt:  REEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYP-FGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIR

Query:  IRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIK-YYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGW-------------------
        I+ L   +  G+K +     DF  N      A+ +  +  YGA+G++++++  V  DQ S ++I++  G   +   I AGW                   
Subjt:  IRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIK-YYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGW-------------------

Query:  ------------------------------QDRPTGNWWLAVGESHKTIGYWPKELF--GHLNDGTEQVAWGG-IAKPSPNGMSPPLGNGHKPNYSKYDD
                                      QD  TGNWW  +   ++ IGYWPK LF    L  G  +V WGG +        SP +G+GH P    +  
Subjt:  ------------------------------QDRPTGNWWLAVGESHKTIGYWPKELF--GHLNDGTEQVAWGG-IAKPSPNGMSPPLGNGHKPNYSKYDD

Query:  ACYFRYMNYVD-ENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGG
        A +   +  +D E  K + P  ++   + ++  CY ++ + T  GE +   I +GGPGG
Subjt:  ACYFRYMNYVD-ENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGG

AT4G23360.1 unknown protein3.6e-3523.52Show/hide
Query:  YYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQAD-----------------GGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQ
        + G   +++V+   + +DQ S + I + GG  +    I  GW+ +                  G +TG  +M C  F+Q + + P    + P+S Y+G Q
Subjt:  YYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTGWQAD-----------------GGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQ

Query:  YDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELF--GHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINE----NNESEVS
        Y+   T++QD   GDWW A++D    +GYWP  LF     ++ A   +WGG         SPP+G+GH P+ G    A        + +    N ++   
Subjt:  YDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELF--GHLNDGAEQVAWGGIAKPSPNGMSPPLGNGHKPNYGKHDDACYFRTLNYINE----NNESEVS

Query:  AIENT------ASYVKDFCPSFFSGRFYLHSSPADISGYLHL---------------------RPPPFFSVVTAVRLRLS------------RLKGGVPE
         +  T      A  V +    +    +Y    P    G ++L                         F+       +R S            R+K     
Subjt:  AIENT------ASYVKDFCPSFFSGRFYLHSSPADISGYLHL---------------------RPPPFFSVVTAVRLRLS------------RLKGGVPE

Query:  LTSVGG------------SLAFDSNLSRE--------EELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKD
        + S+              +L   +  ++E        ++ E++ QLK +N+P I +F+TE GDI DC+DI+KQ ALDH  LK+H +Q +P+T P  ++++
Subjt:  LTSVGG------------SLAFDSNLSRE--------EELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKD

Query:  SSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYY-GASGSVSVYNLSVAQDQSSSSNIWI
        + S      +     +CP G V ++RT  +DL+  + L S    G +  +    +   N          G   + G  G ++V+  ++ QDQ S + I +
Subjt:  SSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIRIRSLSSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYY-GASGSVSVYNLSVAQDQSSSSNIWI

Query:  IGGPPQAPNVILAGWQDRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSP----NGMSPPLGNGHKPNY----SKYDDACYFRYMNYV
         GG  +    I  GW+  P+    L  G+  +    W      + N G   ++  G  + S       +  PL     P Y    + Y D     +    
Subjt:  IGGPPQAPNVILAGWQDRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGGIAKPSP----NGMSPPLGNGHKPNY----SKYDDACYFRYMNYV

Query:  DENNKGQFPANENTANYLSNNSCYA
        ++ + G +PA+   +   SN + YA
Subjt:  DENNKGQFPANENTANYLSNNSCYA

AT5G18460.1 Protein of Unknown Function (DUF239)5.0e-2927.72Show/hide
Query:  LEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINN-------NNRACPAGYVPIRRTIKKDLIR
        ++  L  +N+  + T Q+ +GD+IDCV   KQPALDHP LK HKIQ  P   P    KD      ++ +         N   CP G VPIRR    D++R
Subjt:  LEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINN-------NNRACPAGYVPIRRTIKKDLIR

Query:  IRSL--SSKEPTGI----KTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQ-DQSSSSNIWIIGGPPQAP--NVILAGWQ----------
         +SL    K+   I    +T     +    ++  ++   +   + YGA  +++V++  + + ++ S S IWI+ G    P  N I AGWQ          
Subjt:  IRSL--SSKEPTGI----KTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQ-DQSSSSNIWIIGGPPQAP--NVILAGWQ----------

Query:  ---------------------------------------------------------DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGG---
                                                                 D   GNWW+ +G+S   +GYWP ELF HL D    V WGG   
Subjt:  ---------------------------------------------------------DRPTGNWWLAVGESHKTIGYWPKELFGHLNDGTEQVAWGG---

Query:  IAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYAL-DNRETCGGEFFYYCITFGGPGGN
          + S    +  +G+GH P+   +  A YFR +  VD +N    P ++       N  CY +  +     G +FYY    GGPG N
Subjt:  IAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYAL-DNRETCGGEFFYYCITFGGPGGN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCAAAGCAATAATGTGGTTGATGATAGTATTGTTACTCCACTTAAACTGCAAAGGCAGCTTGGCTTTTGATTCAAACCTCTCAATGGAAGAAGAATTG
GAATTCGAAATGCAACTCAAACTTCTCAACAAGCCATCCATTACAACGTTTCAGGTGGCTTCTGTTGCTATGAAGAGAGGTAAGAAATATTATGGAGATGCTAGA
AGTGTGTCGGTATACAATTTGAGTGTGGCTCAAGATCAATCTTCTTCTTCTAACATATGGATAATTGGTGGACCTCCTGAAGCTCCCAATGTAATACTGACCGGC
TGGCAGGCTGATGGAGGTATTCACACCGGATACTACAACATGTTTTGTCGATGTTTTATACAAACAAATCCAAGTACTCCTCCTAATATCCCCCTTTACCCATCG
TCTACCTATCAAGGGAAACAATATGACTATACATTTACGGTTTTTCAGGATCGACCAACGGGGGATTGGTGGCTTGCAGTGAGTGATAGCCAAACAACAATAGGG
TATTGGCCAAAGGAACTGTTTGGACATCTGAATGATGGGGCCGAGCAAGTGGCATGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATGAGCCCTCCATTGGGC
AATGGCCACAAGCCAAATTATGGTAAGCACGACGACGCTTGCTATTTCAGAACGTTGAACTACATAAATGAAAACAACGAAAGCGAAGTTTCTGCTATTGAGAAT
ACAGCGAGTTATGTAAAAGATTTTTGTCCTTCCTTCTTCTCCGGTCGTTTCTACCTGCATTCGTCTCCGGCTGACATCTCCGGCTACCTTCATCTCCGGCCGCCT
CCGTTTTTCTCCGTCGTCACAGCAGTTCGTCTCCGTTTATCCCGATTGAAAGGAGGAGTACCAGAACTGACTTCAGTCGGAGGCAGCTTGGCTTTTGACTCAAAC
CTCTCGAGGGAAGAAGAACTGGAACTCGAAATGCAACTCAAACTTCTCAACAGGCCATTCATTACAACGTTTCAGACGGAAGAGGGAGATATCATTGATTGTGTG
GACATCAATAAACAACCGGCACTAGATCATCCTTCACTAAAAAGTCACAAAATTCAGACTCGACCGAGTACATATCCCTTTGGCTTGTCAAAAGATTCGTCTTCA
TCAAGAGATAAATCATTCATAAACAACAACAATAGAGCTTGTCCAGCTGGATATGTTCCTATCCGAAGAACAATAAAGAAAGATTTGATTAGAATAAGATCTCTA
TCATCAAAGGAACCAACCGGTATCAAAACAAGTATAAAAGGTGGAGTGGACTTCCCATACAATCAAGATGTGGTTTCTGTTGCTATGAAGAAAGGCATTAAATAT
TATGGAGCTAGTGGTAGTGTGTCAGTATACAATTTGAGTGTGGCTCAAGATCAATCTTCTTCTTCTAACATATGGATAATTGGTGGACCTCCCCAAGCTCCTAAT
GTAATACTAGCAGGCTGGCAGGATCGACCAACCGGGAATTGGTGGCTTGCAGTAGGTGAGAGCCATAAAACAATAGGGTATTGGCCAAAGGAACTGTTTGGACAT
CTGAACGACGGGACAGAGCAAGTGGCATGGGGAGGCATCGCAAAGCCTTCACCAAATGGAATGAGCCCTCCATTGGGGAATGGCCACAAGCCAAATTATAGTAAA
TACGACGATGCTTGCTATTTCAGATATATGAACTACGTTGATGAAAACAACAAAGGCCAATTTCCAGCCAATGAGAACACAGCGAATTATTTAAGTAACAACTCT
TGCTATGCTTTGGATAACAGAGAGACGTGTGGAGGAGAGTTTTTTTATTATTGCATCACTTTTGGAGGACCTGGTGGAAATAATTGCAGTCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCAAAGCAATAATGTGGTTGATGATAGTATTGTTACTCCACTTAAACTGCAAAGGCAGCTTGGCTTTTGATTCAAACCTCTCAATGGAAGAAGAATTG
GAATTCGAAATGCAACTCAAACTTCTCAACAAGCCATCCATTACAACGTTTCAGGTGGCTTCTGTTGCTATGAAGAGAGGTAAGAAATATTATGGAGATGCTAGA
AGTGTGTCGGTATACAATTTGAGTGTGGCTCAAGATCAATCTTCTTCTTCTAACATATGGATAATTGGTGGACCTCCTGAAGCTCCCAATGTAATACTGACCGGC
TGGCAGGCTGATGGAGGTATTCACACCGGATACTACAACATGTTTTGTCGATGTTTTATACAAACAAATCCAAGTACTCCTCCTAATATCCCCCTTTACCCATCG
TCTACCTATCAAGGGAAACAATATGACTATACATTTACGGTTTTTCAGGATCGACCAACGGGGGATTGGTGGCTTGCAGTGAGTGATAGCCAAACAACAATAGGG
TATTGGCCAAAGGAACTGTTTGGACATCTGAATGATGGGGCCGAGCAAGTGGCATGGGGAGGCATTGCAAAGCCTTCACCAAATGGAATGAGCCCTCCATTGGGC
AATGGCCACAAGCCAAATTATGGTAAGCACGACGACGCTTGCTATTTCAGAACGTTGAACTACATAAATGAAAACAACGAAAGCGAAGTTTCTGCTATTGAGAAT
ACAGCGAGTTATGTAAAAGATTTTTGTCCTTCCTTCTTCTCCGGTCGTTTCTACCTGCATTCGTCTCCGGCTGACATCTCCGGCTACCTTCATCTCCGGCCGCCT
CCGTTTTTCTCCGTCGTCACAGCAGTTCGTCTCCGTTTATCCCGATTGAAAGGAGGAGTACCAGAACTGACTTCAGTCGGAGGCAGCTTGGCTTTTGACTCAAAC
CTCTCGAGGGAAGAAGAACTGGAACTCGAAATGCAACTCAAACTTCTCAACAGGCCATTCATTACAACGTTTCAGACGGAAGAGGGAGATATCATTGATTGTGTG
GACATCAATAAACAACCGGCACTAGATCATCCTTCACTAAAAAGTCACAAAATTCAGACTCGACCGAGTACATATCCCTTTGGCTTGTCAAAAGATTCGTCTTCA
TCAAGAGATAAATCATTCATAAACAACAACAATAGAGCTTGTCCAGCTGGATATGTTCCTATCCGAAGAACAATAAAGAAAGATTTGATTAGAATAAGATCTCTA
TCATCAAAGGAACCAACCGGTATCAAAACAAGTATAAAAGGTGGAGTGGACTTCCCATACAATCAAGATGTGGTTTCTGTTGCTATGAAGAAAGGCATTAAATAT
TATGGAGCTAGTGGTAGTGTGTCAGTATACAATTTGAGTGTGGCTCAAGATCAATCTTCTTCTTCTAACATATGGATAATTGGTGGACCTCCCCAAGCTCCTAAT
GTAATACTAGCAGGCTGGCAGGATCGACCAACCGGGAATTGGTGGCTTGCAGTAGGTGAGAGCCATAAAACAATAGGGTATTGGCCAAAGGAACTGTTTGGACAT
CTGAACGACGGGACAGAGCAAGTGGCATGGGGAGGCATCGCAAAGCCTTCACCAAATGGAATGAGCCCTCCATTGGGGAATGGCCACAAGCCAAATTATAGTAAA
TACGACGATGCTTGCTATTTCAGATATATGAACTACGTTGATGAAAACAACAAAGGCCAATTTCCAGCCAATGAGAACACAGCGAATTATTTAAGTAACAACTCT
TGCTATGCTTTGGATAACAGAGAGACGTGTGGAGGAGAGTTTTTTTATTATTGCATCACTTTTGGAGGACCTGGTGGAAATAATTGCAGTCCCTAA
Protein sequenceShow/hide protein sequence
MASKAIMWLMIVLLLHLNCKGSLAFDSNLSMEEELEFEMQLKLLNKPSITTFQVASVAMKRGKKYYGDARSVSVYNLSVAQDQSSSSNIWIIGGPPEAPNVILTG
WQADGGIHTGYYNMFCRCFIQTNPSTPPNIPLYPSSTYQGKQYDYTFTVFQDRPTGDWWLAVSDSQTTIGYWPKELFGHLNDGAEQVAWGGIAKPSPNGMSPPLG
NGHKPNYGKHDDACYFRTLNYINENNESEVSAIENTASYVKDFCPSFFSGRFYLHSSPADISGYLHLRPPPFFSVVTAVRLRLSRLKGGVPELTSVGGSLAFDSN
LSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQPALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFINNNNRACPAGYVPIRRTIKKDLIRIRSL
SSKEPTGIKTSIKGGVDFPYNQDVVSVAMKKGIKYYGASGSVSVYNLSVAQDQSSSSNIWIIGGPPQAPNVILAGWQDRPTGNWWLAVGESHKTIGYWPKELFGH
LNDGTEQVAWGGIAKPSPNGMSPPLGNGHKPNYSKYDDACYFRYMNYVDENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCSP