; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008664 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008664
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationscaffold4:2912577..2915039
RNA-Seq ExpressionMS008664
SyntenyMS008664
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN55313.1 hypothetical protein Csa_012040 [Cucumis sativus]1.7e-17370.84Show/hide
Query:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQN-SMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFES
        MGK+N  +++ +A+    +       A +  N+   E N S+  I+ KLKLLNKP+IKTI+SEDGDI +C+D+YKQPAFDHP LKNHTI QMKP+   + 
Subjt:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQN-SMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFES

Query:  NVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFT
         +S+ QNESF  R  PFQTWQ+SGSCP+GTIPIR+V  +DLLR NS+  FGKKF YG SKLG E NRSTAIL+T G+NYIGASG+INVWNPKVDLPNDFT
Subjt:  NVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFT

Query:  ASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM
        AS++WLKNGPSEKF+SVEAGW+VNP+LYGD KTR SLYWT DSY +TGCFDLTCSGFVQTNP VAIG  IEPLSS  GQQ+ ISIGIF DP S NWWLK 
Subjt:  ASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM

Query:  QGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTE
        QG PVGYWP  LFGYL HSATL EWGGEVFSSN+K  PHTGT MGSGDYAS  Y  +SFVK PRI+DYSLQLKYP RVG WADEPSCYS +NYQ++YTTE
Subjt:  QGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTE

Query:  PVFFYGGPGLSRDCH
        PVF++GGPGLSRDCH
Subjt:  PVFFYGGPGLSRDCH

XP_004141419.2 uncharacterized protein LOC101213587 [Cucumis sativus]1.5e-17272.75Show/hide
Query:  VAAILDDAAAAANMGRNSEGREQN-SMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNESF--RSI
        V A++      A +  N+   E N S+  I+ KLKLLNKP+IKTI+SEDGDI +C+D+YKQPAFDHP LKNHTI QMKP+   +  +S+ QNESF  R  
Subjt:  VAAILDDAAAAANMGRNSEGREQN-SMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNESF--RSI

Query:  PFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKFD
        PFQTWQ+SGSCP+GTIPIR+V  +DLLR NS+  FGKKF YG SKLG E NRSTAIL+T G+NYIGASG+INVWNPKVDLPNDFTAS++WLKNGPSEKF+
Subjt:  PFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKFD

Query:  SVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFGY
        SVEAGW+VNP+LYGD KTR SLYWT DSY +TGCFDLTCSGFVQTNP VAIG  IEPLSS  GQQ+ ISIGIF DP S NWWLK QG PVGYWP  LFGY
Subjt:  SVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFGY

Query:  LSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDCH
        L HSATL EWGGEVFSSN+K  PHTGT MGSGDYAS  Y  +SFVK PRI+DYSLQLKYP RVG WADEPSCYS +NYQ++YTTEPVF++GGPGLSRDCH
Subjt:  LSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDCH

XP_016901292.1 PREDICTED: uncharacterized protein LOC103493897 [Cucumis melo]2.2e-17370.26Show/hide
Query:  MGKKNG---VILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDF
        MGK+N    ++L  VAL V        A  N+   +   + +++  I+ KLKLLNKP+IKTI+SEDGD+  C+DIYKQPAFDHP LKNHTI QMKP+ D 
Subjt:  MGKKNG---VILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDF

Query:  ESNVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPND
        +  +S  QN+SF   S PFQ WQ+SGSCP+GTIPIR+V+ +DLLR NS+  FGKKF YG SKLG E NRSTAIL+T G+NYIGASG+INVWNPKVDLPND
Subjt:  ESNVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPND

Query:  FTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWL
        FTAS+IWLKNGPSEKF+SVEAGW+VNP+LYGD KTRFSLYWT DSY STGCFDLTCSGFVQTNP VAIG  I+PLSS  GQQ+ I +GIF DP+S NWWL
Subjt:  FTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWL

Query:  KMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYT
        K Q +PVGYWPP LFGYL HSATL EWGGEVFSSN+K  PHTGT MGSGDYAS  Y  +SFVK PRI+DYS+QLKYP +VG WADEPSCYS +NYQRTYT
Subjt:  KMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYT

Query:  TEPVFFYGGPGLSRDCH
        +EPVF++GGPGLSRDCH
Subjt:  TEPVFFYGGPGLSRDCH

XP_022139554.1 uncharacterized protein LOC111010434 [Momordica charantia]3.1e-23998.3Show/hide
Query:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESN
        MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIY QPAFDHPALKNHTI QMKPNFDFES+
Subjt:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESN

Query:  VSTAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI
        +STAQNESFRSIPFQTWQRSGSC EGTIPIR+VQTQDLLRTNSIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI
Subjt:  VSTAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI

Query:  WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKP
        WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDP+SSNWWLKMQGKP
Subjt:  WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKP

Query:  VGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFF
        VGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFF
Subjt:  VGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFF

Query:  YGGPGLSRDCH
        YGGPGLSRDCH
Subjt:  YGGPGLSRDCH

XP_023523394.1 uncharacterized protein LOC111787609 [Cucurbita pepo subsp. pepo]2.3e-17070.63Show/hide
Query:  GKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNV
        GK + ++  A+   VAAIL   AA   M ++ +   Q     I KKLKLLNKPA+ TI+S+DGDI DC+DIYKQPAFDHPALKNHTI QMKPN+  +  +
Subjt:  GKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNV

Query:  STAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAY-GSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI
         TAQNESFRS  FQ WQ+SG CP+GTIPIR+V+ +DLLR NS  +FGKKF   GSKLG E+NRSTAILVT G NYIGA+G+INVWNP VDLPNDF+AS++
Subjt:  STAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAY-GSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI

Query:  WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSS-RGGQQFFISIGIFMDPESSNWWLKMQGK
        WLKNGPSE FDSVE GW VNPRLYGDT+TR ++YWT DSY STGCFDLTCSGFVQTNP V +GG IEP+SS  GGQQF I++GIF DP+S NWWLK+Q +
Subjt:  WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSS-RGGQQFFISIGIFMDPESSNWWLKMQGK

Query:  PVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVF
        PVGYWPP LFGYLSHSATL EWGGEVFSS LKK PHTGT MGSG YA AR   +SFV  PRI+DYSLQLKYP+RVG WADEP+CYS +N+Q TYT+EPVF
Subjt:  PVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVF

Query:  FYGGPGLSRDCH
        FYGGPG SRDCH
Subjt:  FYGGPGLSRDCH

TrEMBL top hitse value%identityAlignment
A0A0A0L0M3 Uncharacterized protein8.3e-17470.84Show/hide
Query:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQN-SMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFES
        MGK+N  +++ +A+    +       A +  N+   E N S+  I+ KLKLLNKP+IKTI+SEDGDI +C+D+YKQPAFDHP LKNHTI QMKP+   + 
Subjt:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQN-SMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFES

Query:  NVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFT
         +S+ QNESF  R  PFQTWQ+SGSCP+GTIPIR+V  +DLLR NS+  FGKKF YG SKLG E NRSTAIL+T G+NYIGASG+INVWNPKVDLPNDFT
Subjt:  NVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFT

Query:  ASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM
        AS++WLKNGPSEKF+SVEAGW+VNP+LYGD KTR SLYWT DSY +TGCFDLTCSGFVQTNP VAIG  IEPLSS  GQQ+ ISIGIF DP S NWWLK 
Subjt:  ASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM

Query:  QGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTE
        QG PVGYWP  LFGYL HSATL EWGGEVFSSN+K  PHTGT MGSGDYAS  Y  +SFVK PRI+DYSLQLKYP RVG WADEPSCYS +NYQ++YTTE
Subjt:  QGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTE

Query:  PVFFYGGPGLSRDCH
        PVF++GGPGLSRDCH
Subjt:  PVFFYGGPGLSRDCH

A0A1S4DZ87 uncharacterized protein LOC1034938971.1e-17370.26Show/hide
Query:  MGKKNG---VILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDF
        MGK+N    ++L  VAL V        A  N+   +   + +++  I+ KLKLLNKP+IKTI+SEDGD+  C+DIYKQPAFDHP LKNHTI QMKP+ D 
Subjt:  MGKKNG---VILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDF

Query:  ESNVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPND
        +  +S  QN+SF   S PFQ WQ+SGSCP+GTIPIR+V+ +DLLR NS+  FGKKF YG SKLG E NRSTAIL+T G+NYIGASG+INVWNPKVDLPND
Subjt:  ESNVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPND

Query:  FTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWL
        FTAS+IWLKNGPSEKF+SVEAGW+VNP+LYGD KTRFSLYWT DSY STGCFDLTCSGFVQTNP VAIG  I+PLSS  GQQ+ I +GIF DP+S NWWL
Subjt:  FTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWL

Query:  KMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYT
        K Q +PVGYWPP LFGYL HSATL EWGGEVFSSN+K  PHTGT MGSGDYAS  Y  +SFVK PRI+DYS+QLKYP +VG WADEPSCYS +NYQRTYT
Subjt:  KMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYT

Query:  TEPVFFYGGPGLSRDCH
        +EPVF++GGPGLSRDCH
Subjt:  TEPVFFYGGPGLSRDCH

A0A5A7V8M6 Uncharacterized protein1.1e-17370.26Show/hide
Query:  MGKKNG---VILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDF
        MGK+N    ++L  VAL V        A  N+   +   + +++  I+ KLKLLNKP+IKTI+SEDGD+  C+DIYKQPAFDHP LKNHTI QMKP+ D 
Subjt:  MGKKNG---VILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDF

Query:  ESNVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPND
        +  +S  QN+SF   S PFQ WQ+SGSCP+GTIPIR+V+ +DLLR NS+  FGKKF YG SKLG E NRSTAIL+T G+NYIGASG+INVWNPKVDLPND
Subjt:  ESNVSTAQNESF--RSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPND

Query:  FTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWL
        FTAS+IWLKNGPSEKF+SVEAGW+VNP+LYGD KTRFSLYWT DSY STGCFDLTCSGFVQTNP VAIG  I+PLSS  GQQ+ I +GIF DP+S NWWL
Subjt:  FTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWL

Query:  KMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYT
        K Q +PVGYWPP LFGYL HSATL EWGGEVFSSN+K  PHTGT MGSGDYAS  Y  +SFVK PRI+DYS+QLKYP +VG WADEPSCYS +NYQRTYT
Subjt:  KMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYT

Query:  TEPVFFYGGPGLSRDCH
        +EPVF++GGPGLSRDCH
Subjt:  TEPVFFYGGPGLSRDCH

A0A6J1CDC2 uncharacterized protein LOC1110104341.5e-23998.3Show/hide
Query:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESN
        MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIY QPAFDHPALKNHTI QMKPNFDFES+
Subjt:  MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESN

Query:  VSTAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI
        +STAQNESFRSIPFQTWQRSGSC EGTIPIR+VQTQDLLRTNSIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI
Subjt:  VSTAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQI

Query:  WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKP
        WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDP+SSNWWLKMQGKP
Subjt:  WLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKP

Query:  VGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFF
        VGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFF
Subjt:  VGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFF

Query:  YGGPGLSRDCH
        YGGPGLSRDCH
Subjt:  YGGPGLSRDCH

A0A6J1FT01 uncharacterized protein LOC1114466011.6e-16972.57Show/hide
Query:  VALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNESFRS
        VALA+AA +  A AA     NS   +Q S++ IQKKLKLLNKPA+ TI+SEDGDI DC+DIYKQPAFDHPALKNHTI QM+P++  +  +S  QNE    
Subjt:  VALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNESFRS

Query:  IPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKF
         PFQ WQRSG CPEGTIPIR+V+ QDLLR NS+D FGK F YG SKLGAE+NRST+IL TAG NYIGASG INVWNPKVDLPNDFTAS+IWLKNGPSE F
Subjt:  IPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYG-SKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKF

Query:  DSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFG
        +SVEAGW+VN RLYGDTKTRFS++WT DSY STGCFDLTCSGFVQTNP + +G  I+P+S+RGGQQF IS+G+F DP S NWWL +QG PVGYWPP LFG
Subjt:  DSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFG

Query:  YLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC
        YL HSATL EWGGEVFSS++KK PHT T MGSGDYA   Y  +S+V  PRI+D SLQLKYP+RVG WA+EP CYSA+NYQRTY TEPVFFYGGPG SRDC
Subjt:  YLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC

Query:  H
        H
Subjt:  H

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)2.0e-8743.6Show/hide
Query:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFD----FESNVSTA--QNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQD
        ++K L  LNKPA+K+I S DGD+ DC+ I KQPAFDHP LK+H I QMKPN+     F+ N  +A   NE    IP Q W R G C EGTIP+R+ +  D
Subjt:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFD----FESNVSTA--QNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQD

Query:  LLRTNSIDDFGKKFAYGSKL--GAE---INRS---TAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGP-SEKFDSVEAGWVVNPRLYGDTK
        +LR +S+  +GKK      L   AE   IN+S    AI    G  Y GA   INVW PK+   N+F+ SQIWL  G   +  +S+EAGW V+P LYGD  
Subjt:  LLRTNSIDDFGKKFAYGSKL--GAE---INRS---TAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGP-SEKFDSVEAGWVVNPRLYGDTK

Query:  TRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM-QGKPVGYWPPALFGYLSHSATLAEWGGEVFS
        TR   YWT+D+Y +TGC++L CSGF+Q N  +A+G +I P+S     Q+ ISI I+ DP+  +WW++   G  +GYWP  LF YL+ SA++ EWGGEV +
Subjt:  TRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM-QGKPVGYWPPALFGYLSHSATLAEWGGEVFS

Query:  SNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC
        S      HT T MGSG +    +  +S+ ++ +++D S  LK P+ +G + ++ +CY  +           F+YGGPG ++ C
Subjt:  SNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC

AT2G44210.1 Protein of Unknown Function (DUF239)6.9e-8841.62Show/hide
Query:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQ---QMKPNFDF-ESNVSTAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLL
        I+  LK LNKPA+K+I S DGD+ DC+ I  QPAF HP L NHT+Q    + P   F ES VS+       +   Q W  +G CP+ TIPIR+ + QDL 
Subjt:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQ---QMKPNFDF-ESNVSTAQNESFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLL

Query:  RTNSIDDFG---------KKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGP-SEKFDSVEAGWVVNPRLYGDTKT
        R +S++++G          K +    +  +     AI+      + GA   INVW P V++PN+F+ +QIW+  G  +   +S+EAGW V+P+LYGD +T
Subjt:  RTNSIDDFG---------KKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGP-SEKFDSVEAGWVVNPRLYGDTKT

Query:  RFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGK-PVGYWPPALFGYLSHSATLAEWGGEVFSS
        R   YWT+D+Y  TGC++L CSGFVQ N  +A+GG+I PLS+ G  Q+ I+I I+ DP+  +WWL+   K  +GYWP +LF YLS SA++ EWGGEV +S
Subjt:  RFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGK-PVGYWPPALFGYLSHSATLAEWGGEVFSS

Query:  NLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC
          ++  HT T MGSG +A   +G +S+ K+ +++D S +L+ P+ + V+ D+ +CY+ ++     +    F+YGGPG + +C
Subjt:  NLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC

AT3G13510.1 Protein of Unknown Function (DUF239)1.9e-8540.89Show/hide
Query:  ILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFD----FESNVSTAQNESFRSIPF
        +L  + AAA+ G + +  E      ++K L  LNKP +KTI S DGDI DCI I KQPAFDHP LK+H I QM+P++     F+ N  +A+ +   +   
Subjt:  ILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFD----FESNVSTAQNESFRSIPF

Query:  QTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYGSKL--GAE---INRS---TAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGP
        Q W R G C EGTIP+R+ +  D+LR +S+  +GKK      +   AE   IN++    AI    G  Y GA   +NVW PK+   N+F+ SQIWL  G 
Subjt:  QTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYGSKL--GAE---INRS---TAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGP

Query:  -SEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM-QGKPVGYW
          +  +S+EAGW V+P LYGD  TR   YWT+D+Y +TGC++L CSGF+Q N  +A+G +I P+S     Q+ ISI I+ DP+  +WW++   G  +GYW
Subjt:  -SEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKM-QGKPVGYW

Query:  PPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGP
        P  LF YL+ SA++ EWGGEV +S   +  HT T MGSG +    +  +S+ ++ +++D S  LK P+ +G + ++ +CY  +           F+YGGP
Subjt:  PPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGP

Query:  GLSRDC
        G +++C
Subjt:  GLSRDC

AT5G25950.1 Protein of Unknown Function (DUF239)1.2e-10849.1Show/hide
Query:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNE-SFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTN
        I  KLK LNKPA+KTI SEDGDI DCIDIYKQ AFDHPALKNH I QMKP+  F +  +T  N  S   I  Q W +SG CP GTIP+R+V  +D+ R +
Subjt:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNE-SFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTN

Query:  SIDDFGKKFAYGSKL--------------GAEIN------RSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKFDSVEAGWVVNPR
        S   FG+K  +                   A+IN      RS A +V  G N++GA  DIN+WNP      D++ +QIWL  G SE F+SVE GW+VNP 
Subjt:  SIDDFGKKFAYGSKL--------------GAEIN------RSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKFDSVEAGWVVNPR

Query:  LYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFGYLSHSATLAEWG
        ++GD++TR  + WT D Y+ TGC +L C+GFVQT+   A+G  +EP+SS    Q+ I++ IF+DP S NWWL  +   +GYWP  LF YL HSAT  +WG
Subjt:  LYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFGYLSHSATLAEWG

Query:  GEVFSSN-LKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC
        GEV S N + K PHT T MGSG +AS  + ++ F  + RI DYS+QLKYPQ +  +ADE +CYS   +++TY +EP F++GGPG +  C
Subjt:  GEVFSSN-LKKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC

AT5G25960.1 Protein of Unknown Function (DUF239)3.3e-9847.83Show/hide
Query:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNE-SFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTN
        I  KLK LNKP++KTI SEDGDI DCIDIYKQ AFDHPAL+NH I QMKP+ DF +  +T  N  S   I  Q W +SG+CP+GTIP             
Subjt:  IQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNE-SFRSIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTN

Query:  SIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSS
                               A+L+  G N+IGA  DINVWNP     +D++++QIWL  G S+ F+S+EAGW VNP ++GD++TR   YWT D YS 
Subjt:  SIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKFDSVEAGWVVNPRLYGDTKTRFSLYWTADSYSS

Query:  TGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGS
        TGC +L C+GFVQT    A+G AIEP+S+   +Q FI+     D  S NWWL      +GYWP  LF YL HSAT  + GGEV S N+ K PHT T MGS
Subjt:  TGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNLKKTPHTGTDMGS

Query:  GDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC
        G +AS  +  + +  + RI DYSLQ+KYP+ +  +ADE  CYS + +++TY +EP F++GGPG +  C
Subjt:  GDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAAAAATGGAGTGATTTTGGTGGCAGTGGCATTGGCTGTTGCTGCCATTTTAGACGATGCTGCAGCAGCAGCCAATATGGGGAGGAATTCAGAAGGAAGAGA
ACAAAATTCTATGGAGATTATTCAGAAGAAGTTGAAACTTCTGAATAAGCCTGCCATCAAGACCATTCATAGTGAAGATGGAGATATTTTCGACTGCATTGACATTTACA
AGCAGCCTGCCTTTGACCATCCAGCTCTAAAAAACCATACCATTCAGCAGATGAAACCCAATTTCGATTTTGAGTCGAACGTGTCGACCGCACAAAACGAGTCATTTCGA
TCCATTCCATTTCAAACATGGCAAAGAAGTGGGAGTTGTCCCGAAGGAACCATTCCCATCCGCAAGGTTCAAACACAAGACTTATTGAGAACCAATTCTATCGATGATTT
TGGGAAGAAATTTGCTTATGGGAGCAAACTCGGGGCAGAAATCAATCGTTCTACGGCGATCTTAGTTACAGCAGGAATGAACTACATCGGAGCATCAGGAGATATTAACG
TTTGGAACCCTAAAGTTGATTTGCCAAACGACTTCACGGCCTCTCAAATATGGCTCAAAAATGGACCTTCTGAAAAATTTGACAGCGTTGAAGCTGGCTGGGTGGTGAAT
CCCAGGTTATACGGAGATACAAAAACTCGATTTAGTTTGTATTGGACAGCGGACTCGTACAGCTCGACGGGGTGCTTTGATCTAACTTGCAGTGGATTCGTCCAAACTAA
CCCGGGTGTGGCCATTGGTGGAGCCATCGAGCCATTGTCCAGCCGCGGTGGACAACAATTCTTCATCTCTATTGGCATCTTCATGGATCCAGAGTCAAGCAACTGGTGGC
TAAAGATGCAAGGAAAGCCGGTGGGGTATTGGCCGCCGGCGCTTTTCGGGTATCTGAGCCACAGCGCGACGCTGGCGGAATGGGGCGGCGAAGTTTTCAGCTCAAACCTG
AAGAAAACGCCGCACACCGGCACGGACATGGGCAGTGGCGATTACGCCTCCGCCCGCTACGGCGACTCCAGCTTTGTGAAACACCCCAGAATCCTTGATTATTCGCTGCA
GTTGAAGTACCCCCAGCGAGTCGGAGTTTGGGCCGACGAACCGTCTTGCTACTCTGCTGAAAATTACCAACGAACATACACCACCGAACCGGTCTTTTTCTACGGTGGCC
CGGGTCTCAGCCGCGACTGCCAC
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGAAAAATGGAGTGATTTTGGTGGCAGTGGCATTGGCTGTTGCTGCCATTTTAGACGATGCTGCAGCAGCAGCCAATATGGGGAGGAATTCAGAAGGAAGAGA
ACAAAATTCTATGGAGATTATTCAGAAGAAGTTGAAACTTCTGAATAAGCCTGCCATCAAGACCATTCATAGTGAAGATGGAGATATTTTCGACTGCATTGACATTTACA
AGCAGCCTGCCTTTGACCATCCAGCTCTAAAAAACCATACCATTCAGCAGATGAAACCCAATTTCGATTTTGAGTCGAACGTGTCGACCGCACAAAACGAGTCATTTCGA
TCCATTCCATTTCAAACATGGCAAAGAAGTGGGAGTTGTCCCGAAGGAACCATTCCCATCCGCAAGGTTCAAACACAAGACTTATTGAGAACCAATTCTATCGATGATTT
TGGGAAGAAATTTGCTTATGGGAGCAAACTCGGGGCAGAAATCAATCGTTCTACGGCGATCTTAGTTACAGCAGGAATGAACTACATCGGAGCATCAGGAGATATTAACG
TTTGGAACCCTAAAGTTGATTTGCCAAACGACTTCACGGCCTCTCAAATATGGCTCAAAAATGGACCTTCTGAAAAATTTGACAGCGTTGAAGCTGGCTGGGTGGTGAAT
CCCAGGTTATACGGAGATACAAAAACTCGATTTAGTTTGTATTGGACAGCGGACTCGTACAGCTCGACGGGGTGCTTTGATCTAACTTGCAGTGGATTCGTCCAAACTAA
CCCGGGTGTGGCCATTGGTGGAGCCATCGAGCCATTGTCCAGCCGCGGTGGACAACAATTCTTCATCTCTATTGGCATCTTCATGGATCCAGAGTCAAGCAACTGGTGGC
TAAAGATGCAAGGAAAGCCGGTGGGGTATTGGCCGCCGGCGCTTTTCGGGTATCTGAGCCACAGCGCGACGCTGGCGGAATGGGGCGGCGAAGTTTTCAGCTCAAACCTG
AAGAAAACGCCGCACACCGGCACGGACATGGGCAGTGGCGATTACGCCTCCGCCCGCTACGGCGACTCCAGCTTTGTGAAACACCCCAGAATCCTTGATTATTCGCTGCA
GTTGAAGTACCCCCAGCGAGTCGGAGTTTGGGCCGACGAACCGTCTTGCTACTCTGCTGAAAATTACCAACGAACATACACCACCGAACCGGTCTTTTTCTACGGTGGCC
CGGGTCTCAGCCGCGACTGCCAC
Protein sequenceShow/hide protein sequence
MGKKNGVILVAVALAVAAILDDAAAAANMGRNSEGREQNSMEIIQKKLKLLNKPAIKTIHSEDGDIFDCIDIYKQPAFDHPALKNHTIQQMKPNFDFESNVSTAQNESFR
SIPFQTWQRSGSCPEGTIPIRKVQTQDLLRTNSIDDFGKKFAYGSKLGAEINRSTAILVTAGMNYIGASGDINVWNPKVDLPNDFTASQIWLKNGPSEKFDSVEAGWVVN
PRLYGDTKTRFSLYWTADSYSSTGCFDLTCSGFVQTNPGVAIGGAIEPLSSRGGQQFFISIGIFMDPESSNWWLKMQGKPVGYWPPALFGYLSHSATLAEWGGEVFSSNL
KKTPHTGTDMGSGDYASARYGDSSFVKHPRILDYSLQLKYPQRVGVWADEPSCYSAENYQRTYTTEPVFFYGGPGLSRDCH