CuGenDBv2

MC06g1852 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g1852
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: MC06:25880254..25883240
RNA-Seq Expression: MC06g1852
Synteny: MC06g1852
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin
                  IPR025521 - Neprosin activation peptide
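The genome location listed above has the form chromosome:start..end. As a small illustration, the Python sketch below splits such a string and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function name parse_location is hypothetical rather than part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("MC06:25880254..25883240")

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span = end - start + 1
print(chrom, start, end, span)  # MC06 25880254 25883240 2987
```

The genomic span can be longer than the CDS listed under Sequences, since it may include introns and untranslated regions.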


Homology
GenBank top hits (hit, e-value, % identity, alignment)
PON62881.1 hypothetical protein PanWU01x14_135570 [Parasponia andersonii] (e-value: 9.29e-131; identity: 47.52%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+  D K +  +F V     F  +  +  + L +EE +EL+ QLK +NKP ++SF+ E+GD IDCVD+YKQ + DHPLLK+HTIQMKP  IP+   S+ S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG
            +  +++P    CP GSVPIRR T+EDLI A+S K L  +   D+    +T+D  G+H A LN + + YGA+S+IN+W+P     QFS  ++W+  G
Subjt:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG

Query:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK
         ++  N++Q GW V   +  N +RL TYWTA+GYQ TGC+N LCPGFVQV+S I  GL L P ST+NG Q D+ +S++QD  +G+W LMF DKY+GYWPK
Subjt:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK

Query:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP
         ++P L +GA   +WGGEVYSP     PAMG GHFPEEGF K+A++ QI+VV+  T+S+ F DP D  L    ++P C+   N +   G WG ++FFGGP
Subjt:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP

Query:  SGCK
          C 
Subjt:  SGCK

PON84575.1 hypothetical protein TorRG33x02_196420 [Trema orientale] (e-value: 3.51e-133; identity: 48.02%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+  D K +  +F +     F  +  +  +   +EE +EL+RQLK +NKP +KSF+ E+GD IDCVD+YKQ + DHPLLK+HTIQMKP  IP+   S+ S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG
            +  +++P    CP GSVPIRR T+EDLI A+S K L  +   D+   S+T+D  G+H A LN + + YGA+S+IN+W+P     QFS  ++W+  G
Subjt:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG

Query:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK
         ++  N++Q GW V   +  N TRL TYWTA+GYQ TGC+N LCPGFVQV+S I  GL L P ST+NG Q D+ +S++QD  +G+W LMF DKY+GYWPK
Subjt:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK

Query:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP
         ++P L +GA   +WGGEVYSP     PAMG GHFPEEGF+K+A++ QI+VV+  T+S+ F DP D  L    ++P C+   N +   G WG ++FFGGP
Subjt:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP

Query:  SGCK
          C 
Subjt:  SGCK

XP_018852494.1 uncharacterized protein LOC109014471 isoform X1 [Juglans regia] (e-value: 1.04e-133; identity: 49.5%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+ +  K    L  V       +D  +    LS+ E+LEL+RQLK LNKPAI SF+ E+GD +DCVDIYKQ + DHPLLKNHTIQMKP  IP     + S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KVEMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPL---WSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLS
        +       +PN  +CP GSVPIRR+TKEDL+     K L   + H    S      ID +G+H A L ++++ YG K++INVWNP  + DQFS A + + 
Subjt:  KVEMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPL---WSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLS

Query:  RGCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYW
         G  ++ N+IQ GWGV   ++ N +RL T+WTA+GYQ +GCYN LCPGFVQV+S I  GL L P STY GPQYD+ IS++QD  TGNW  MF DKY+GYW
Subjt:  RGCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYW

Query:  PKAVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGG
        PK +   LA  A   +WGG+VYSP  E  P MGSG FPEEG+ KSA++ QI+VV     +GFVDP DS   +  D+P C+  I+    D  WG H++FGG
Subjt:  PKAVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGG

Query:  PSGC
           C
Subjt:  PSGC

XP_022158434.1 uncharacterized protein LOC111024921 [Momordica charantia] (e-value: 7.75e-282; identity: 96.89%)
Query:  DGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEMLLRHLPNINNCPPGSVPIR
        +GFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEMLLRHLPNINNCPPGSVPIR
Subjt:  DGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEMLLRHLPNINNCPPGSVPIR

Query:  RTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQ-------VGWGVQPK
        RTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQ       + + VQPK
Subjt:  RTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQ-------VGWGVQPK

Query:  VFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVVPGLADGAAVAAWGG
        VFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVVPGLADGAAVAAWGG
Subjt:  VFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVVPGLADGAAVAAWGG

Query:  EVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGCK
        EVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGCK
Subjt:  EVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGCK

XP_040986168.1 uncharacterized protein LOC121234329 isoform X1 [Juglans microcarpa x Juglans regia] (e-value: 2.33e-140; identity: 50.62%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+ + LK    L   +      +D  +    LS+EE+LEL+RQLK LNKPAI SF+ E+GD +DCVDIYKQ + DHPLLKNHTIQMKP  IP     + S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KVEMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGC
        + +     +PN  +CP GSVPIRR+TKEDL+     K L  +        ++TID +G+H A L +++  YG K++INVWNP  + DQFS A + +  G 
Subjt:  KVEMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGC

Query:  RDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKA
         ++ N+IQ GWGV   ++ N +RL T+WTA+GYQ +GC+N LCPGFVQV+S I  GL L P STY GPQYD+ IS++QD  TGNW  MF DKY+GYWPKA
Subjt:  RDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKA

Query:  VVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSG
        +   LA+GA   +WGG+VYSP  E  P MGSGHFPEEG+ KSA++NQI+VV     +GFVDP DS+  +  D+P C+  I+    D  WG H++FGG   
Subjt:  VVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSG

Query:  C
        C
Subjt:  C

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A2I4H8L6 uncharacterized protein LOC109014471 isoform X1 (e-value: 5.05e-134; identity: 49.5%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+ +  K    L  V       +D  +    LS+ E+LEL+RQLK LNKPAI SF+ E+GD +DCVDIYKQ + DHPLLKNHTIQMKP  IP     + S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KVEMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPL---WSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLS
        +       +PN  +CP GSVPIRR+TKEDL+     K L   + H    S      ID +G+H A L ++++ YG K++INVWNP  + DQFS A + + 
Subjt:  KVEMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPL---WSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLS

Query:  RGCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYW
         G  ++ N+IQ GWGV   ++ N +RL T+WTA+GYQ +GCYN LCPGFVQV+S I  GL L P STY GPQYD+ IS++QD  TGNW  MF DKY+GYW
Subjt:  RGCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYW

Query:  PKAVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGG
        PK +   LA  A   +WGG+VYSP  E  P MGSG FPEEG+ KSA++ QI+VV     +GFVDP DS   +  D+P C+  I+    D  WG H++FGG
Subjt:  PKAVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGG

Query:  PSGC
           C
Subjt:  PSGC

A0A2P5CPC4 Uncharacterized protein (e-value: 4.50e-131; identity: 47.52%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+  D K +  +F V     F  +  +  + L +EE +EL+ QLK +NKP ++SF+ E+GD IDCVD+YKQ + DHPLLK+HTIQMKP  IP+   S+ S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG
            +  +++P    CP GSVPIRR T+EDLI A+S K L  +   D+    +T+D  G+H A LN + + YGA+S+IN+W+P     QFS  ++W+  G
Subjt:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG

Query:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK
         ++  N++Q GW V   +  N +RL TYWTA+GYQ TGC+N LCPGFVQV+S I  GL L P ST+NG Q D+ +S++QD  +G+W LMF DKY+GYWPK
Subjt:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK

Query:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP
         ++P L +GA   +WGGEVYSP     PAMG GHFPEEGF K+A++ QI+VV+  T+S+ F DP D  L    ++P C+   N +   G WG ++FFGGP
Subjt:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP

Query:  SGCK
          C 
Subjt:  SGCK

A0A2P5EGA7 Uncharacterized protein (e-value: 1.70e-133; identity: 48.02%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+  D K +  +F +     F  +  +  +   +EE +EL+RQLK +NKP +KSF+ E+GD IDCVD+YKQ + DHPLLK+HTIQMKP  IP+   S+ S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG
            +  +++P    CP GSVPIRR T+EDLI A+S K L  +   D+   S+T+D  G+H A LN + + YGA+S+IN+W+P     QFS  ++W+  G
Subjt:  KVEMLL-RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRG

Query:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK
         ++  N++Q GW V   +  N TRL TYWTA+GYQ TGC+N LCPGFVQV+S I  GL L P ST+NG Q D+ +S++QD  +G+W LMF DKY+GYWPK
Subjt:  CRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPK

Query:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP
         ++P L +GA   +WGGEVYSP     PAMG GHFPEEGF+K+A++ QI+VV+  T+S+ F DP D  L    ++P C+   N +   G WG ++FFGGP
Subjt:  AVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKY-TSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGP

Query:  SGCK
          C 
Subjt:  SGCK

A0A6J1DZE3 uncharacterized protein LOC111024921 (e-value: 3.75e-282; identity: 96.89%)
Query:  DGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEMLLRHLPNINNCPPGSVPIR
        +GFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEMLLRHLPNINNCPPGSVPIR
Subjt:  DGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEMLLRHLPNINNCPPGSVPIR

Query:  RTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQ-------VGWGVQPK
        RTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQ       + + VQPK
Subjt:  RTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQ-------VGWGVQPK

Query:  VFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVVPGLADGAAVAAWGG
        VFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVVPGLADGAAVAAWGG
Subjt:  VFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVVPGLADGAAVAAWGG

Query:  EVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGCK
        EVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGCK
Subjt:  EVYSPTSEAGPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGCK

A0A6P6FU00 uncharacterized protein LOC112489991 (e-value: 3.36e-125; identity: 47.52%)
Query:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS
        M+G D K +  +          A+     + LS+EEEL+L+RQLK +NKPAIKSF+ E GD IDCVDIYKQ + DHP+LKNH IQMKPT IPKG+ +  S
Subjt:  MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS

Query:  KV--EMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSR
        +   + L   L NI+ CP GSVPI+R T++DLI A+  K +  +   +++  ++ ID  G+H A + Y+  V G ++ INVWNP   ++Q+S AS+ +S+
Subjt:  KV--EMLLRHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSR

Query:  GCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWP
        G + Q N +QVGWGV   ++ + +RL TYWT +GYQ TGC+N LCPGFVQV++ I  GL L P+STY G Q+ I +S+ QD  TGNW+L F ++Y+GYWP
Subjt:  GCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWP

Query:  KAVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEG-FKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGG
        KA+   +ADGA   +W G  YS  ++  P MGSGHFP+EG +  SAFVN IQ++       FV P    L    D P C+ +I     D NWG +IFFGG
Subjt:  KAVVPGLADGAAVAAWGGEVYSPTSEAGPAMGSGHFPEEG-FKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGG

Query:  PSGC
        P  C
Subjt:  PSGC

SwissProt top hits (hit, e-value, % identity, alignment)
No hits found

Arabidopsis top hits (hit, e-value, % identity, alignment)
AT2G20170.1 Protein of Unknown Function (DUF239) (e-value: 7.7e-84; identity: 42.71%)
Query:  KLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGL-ISDTSKVEMLL
        KL  L F         +  R + + +E++EL+  + L  +NKPAIKSF+ + G  +DC+DI KQ + DHPLLKNH+IQ+KPT+IPK     +T K   L 
Subjt:  KLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGL-ISDTSKVEMLL

Query:  RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNT
               +CP G+V I+RTT EDLI  +  K L       +S+    ++  G H A   Y    YGA   IN+W+P  + DQFS ASI++  G RD   +
Subjt:  RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNT

Query:  IQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAV--VPG
        I  GW V PK+  N + L TYWTA+G++ TGCYN +CPGFVQV+S +  G    P STY+G QY +   ++QD  TGNW  +  ++ IGYWPK++  V G
Subjt:  IQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAV--VPG

Query:  LADGAAVAAWGGEVYSPTSEA-GPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC
        LA GA+   WGGEV+S   ++  P MGSGHFP+EGFKK+AFVN ++V+          P   +L +  + P C+ +  K  V   W   IF+GGP GC
Subjt:  LADGAAVAAWGGEVYSPTSEA-GPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC

AT2G20170.2 Protein of Unknown Function (DUF239) (e-value: 7.7e-84; identity: 42.71%)
Query:  KLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGL-ISDTSKVEMLL
        KL  L F         +  R + + +E++EL+  + L  +NKPAIKSF+ + G  +DC+DI KQ + DHPLLKNH+IQ+KPT+IPK     +T K   L 
Subjt:  KLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGL-ISDTSKVEMLL

Query:  RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNT
               +CP G+V I+RTT EDLI  +  K L       +S+    ++  G H A   Y    YGA   IN+W+P  + DQFS ASI++  G RD   +
Subjt:  RHLPNINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNT

Query:  IQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAV--VPG
        I  GW V PK+  N + L TYWTA+G++ TGCYN +CPGFVQV+S +  G    P STY+G QY +   ++QD  TGNW  +  ++ IGYWPK++  V G
Subjt:  IQVGWGVQPKVFGNVTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAV--VPG

Query:  LADGAAVAAWGGEVYSPTSEA-GPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC
        LA GA+   WGGEV+S   ++  P MGSGHFP+EGFKK+AFVN ++V+          P   +L +  + P C+ +  K  V   W   IF+GGP GC
Subjt:  LADGAAVAAWGGEVYSPTSEA-GPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC

AT4G23370.1 unknown protein (e-value: 2.0e-79; identity: 47.68%)
Query:  EEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEM--LLRHLPNINNCPPGSVPIRRTTKEDLI
        EEEE  L   L  +NK AIKSF+ + GDT+DC+DI+KQ + +HPLL NH+IQ  PT IPK  I++ +  E         +  +CP G+V ++RTT EDLI
Subjt:  EEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEM--LLRHLPNINNCPPGSVPIRRTTKEDLI

Query:  AAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTAN
         A+S K +    +   S  S  ID +GYH A   YK   YGAK  +N+W P  S +QFS ASI +S G  +Q   I+ GW V   +  N +RL TYWTA+
Subjt:  AAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQVGWGVQPKVFGNVTRLTTYWTAN

Query:  GYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLM-FGDKYIGYWPKAVVP--GLADGAAVAAWGGEVYSPTSEAGPA
        G+  TGCYN LCPGFVQV++ I  G  L P+STY G QY++ I+M++D  TGNW L+ F + Y+GYWPK++    GL  G ++A+WGGEVYSP  E  P+
Subjt:  GYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLM-FGDKYIGYWPKAVVP--GLADGAAVAAWGGEVYSPTSEAGPA

Query:  MGSGHFPEE-GFKKSAFVNQIQV
        MGSGHFP++  + K A++N   V
Subjt:  MGSGHFPEE-GFKKSAFVNQIQV

AT4G23380.1 Protein of Unknown Function (DUF239) (e-value: 2.0e-76; identity: 41.49%)
Query:  SEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLI---SDTSKVEMLLRHLPNINNCPPGSVPIRRTTKED
        SEEE+ E+ RQLK +NKPAIKSFK E  +  DC+DI+KQ + DH LL+NH++++KPT +PK  I   +   KV  +   L  I +CP G+V ++RTT +D
Subjt:  SEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLI---SDTSKVEMLLRHLPNINNCPPGSVPIRRTTKED

Query:  LIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYK-TRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCR-DQQNTIQVGWGVQPKVFGNVTRLTTY
        LI ++  K +  +         + ID +G+H AT +Y    V G    IN+W+P  S DQ S A++ ++ G + +Q  +I VGW V P ++ +   L TY
Subjt:  LIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYK-TRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCR-DQQNTIQVGWGVQPKVFGNVTRLTTY

Query:  WTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKA--VVPGLADGAAVAAWGGEVYSPTSEA
        WTA+GY  TGCY+  CPGFVQV+  I  G+ L PIS YNG Q ++ +S+HQ   +           +GYWP++  +  GL  GA +A+WGG+VYSP +E 
Subjt:  WTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKA--VVPGLADGAAVAAWGGEVYSPTSEA

Query:  GPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC
         P MGSGHFP+EGF K+AFVN I ++     +  + P    +      P C+        D  W R ++FGGP GC
Subjt:  GPAMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC

AT4G23390.1 Protein of Unknown Function (DUF239) (e-value: 3.6e-81; identity: 43.85%)
Query:  SEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS--KVEMLLRHLPNINNCPPGSVPIRRTTKEDL
        S+EE+ E+ + L  LNKPA+KSF+ E G   DC+DI KQ + DHPLLKNH+I++KPT IPK    + +  K   L     +I +CP G+V ++R   EDL
Subjt:  SEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTS--KVEMLLRHLPNINNCPPGSVPIRRTTKEDL

Query:  IAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQVGWGVQPKV-FGNVTRLTTYWT
        I A+  + L  +     S     ID  G+H AT++YK   YGAK  INVWNP  S DQFS A++ +S G +  Q +I  GW V P +   N + L TYWT
Subjt:  IAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQVGWGVQPKV-FGNVTRLTTYWT

Query:  ANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVV--PGLADGAAVAAWGGEVYSPTSEAGP
        A+G   T CYN L PGFV V++    G+   P+S Y+G QY + +S++QD  T +W  +  ++ IGYWPK++    GLADGA+   WGGEVYS   E  P
Subjt:  ANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVV--PGLADGAAVAAWGGEVYSPTSEAGP

Query:  AMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC
        +MGSGHFP+EGFKK+A+VN ++++   +      P  S L      P C+ +     V   W R I FGGP GC
Subjt:  AMGSGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGC


Sequences
CDS sequence
ATGTCTGGTGAAGATTTGAAAAAACTGAGGTTTCTGTTTTTTGTTGTGTCTGTTTTTGCTTTTGGTGCAGATGGTTTCAGACTACTACAAATGTTGTCTGAAGAAGAAGA
GTTGGAACTCAACAGACAGCTCAAACAACTCAACAAACCTGCAATCAAGAGTTTTAAGAATGAATTTGGTGATACCATAGATTGTGTTGACATCTACAAGCAACCTTCAC
TTGATCATCCTCTGCTCAAGAACCATACAATCCAGATGAAACCGACGATGATCCCAAAAGGGCTGATAAGCGATACATCGAAAGTCGAGATGCTACTGCGGCATCTTCCA
AACATTAACAACTGCCCACCAGGATCAGTGCCAATCAGAAGGACTACAAAGGAAGATCTTATAGCAGCTAAAAGTTTCAAGCCATTGTGGTCTCATCAGGCAGCGGATAG
TTCCCGGCCGAGCACTACGATCGACGCCAATGGCTATCATCTTGCAACACTCAACTACAAGACCAGAGTTTATGGAGCAAAATCACAAATCAATGTGTGGAACCCAATTC
CATCAATGGATCAATTTAGTTCTGCCAGTATATGGCTTTCTCGAGGCTGTAGAGATCAACAGAATACCATACAAGTTGGTTGGGGAGTTCAACCAAAAGTGTTTGGGAAC
GTTACCAGATTAACTACCTACTGGACAGCAAATGGTTACCAGAGCACTGGATGCTACAACCAACTCTGTCCCGGGTTCGTACAAGTCAACTCCGGGATCATGCCCGGCCT
CCCTCTAAGCCCAATCTCCACCTACAATGGACCCCAATACGATATTCACATCAGCATGCATCAGGATACGCACACGGGGAATTGGATGTTGATGTTTGGGGACAAGTACA
TAGGGTACTGGCCAAAGGCAGTGGTGCCAGGGTTGGCAGATGGGGCAGCAGTTGCAGCATGGGGAGGGGAAGTTTACAGTCCTACATCAGAGGCAGGGCCAGCCATGGGA
AGTGGCCATTTCCCTGAAGAGGGCTTCAAAAAAAGTGCTTTTGTGAACCAGATTCAGGTGGTGAAGTATACAAGTTCAAGTGGTTTTGTTGATCCAGATGATTCAGAGCT
GAGTGTTGTTCTGGACAGACCTATCTGTTTTGGGCTCATTAATAAGTTCACTGTGGATGGGAATTGGGGGCGCCATATCTTCTTTGGAGGGCCAAGTGGGTGTAAG
mRNA sequence
ATGTCTGGTGAAGATTTGAAAAAACTGAGGTTTCTGTTTTTTGTTGTGTCTGTTTTTGCTTTTGGTGCAGATGGTTTCAGACTACTACAAATGTTGTCTGAAGAAGAAGA
GTTGGAACTCAACAGACAGCTCAAACAACTCAACAAACCTGCAATCAAGAGTTTTAAGAATGAATTTGGTGATACCATAGATTGTGTTGACATCTACAAGCAACCTTCAC
TTGATCATCCTCTGCTCAAGAACCATACAATCCAGATGAAACCGACGATGATCCCAAAAGGGCTGATAAGCGATACATCGAAAGTCGAGATGCTACTGCGGCATCTTCCA
AACATTAACAACTGCCCACCAGGATCAGTGCCAATCAGAAGGACTACAAAGGAAGATCTTATAGCAGCTAAAAGTTTCAAGCCATTGTGGTCTCATCAGGCAGCGGATAG
TTCCCGGCCGAGCACTACGATCGACGCCAATGGCTATCATCTTGCAACACTCAACTACAAGACCAGAGTTTATGGAGCAAAATCACAAATCAATGTGTGGAACCCAATTC
CATCAATGGATCAATTTAGTTCTGCCAGTATATGGCTTTCTCGAGGCTGTAGAGATCAACAGAATACCATACAAGTTGGTTGGGGAGTTCAACCAAAAGTGTTTGGGAAC
GTTACCAGATTAACTACCTACTGGACAGCAAATGGTTACCAGAGCACTGGATGCTACAACCAACTCTGTCCCGGGTTCGTACAAGTCAACTCCGGGATCATGCCCGGCCT
CCCTCTAAGCCCAATCTCCACCTACAATGGACCCCAATACGATATTCACATCAGCATGCATCAGGATACGCACACGGGGAATTGGATGTTGATGTTTGGGGACAAGTACA
TAGGGTACTGGCCAAAGGCAGTGGTGCCAGGGTTGGCAGATGGGGCAGCAGTTGCAGCATGGGGAGGGGAAGTTTACAGTCCTACATCAGAGGCAGGGCCAGCCATGGGA
AGTGGCCATTTCCCTGAAGAGGGCTTCAAAAAAAGTGCTTTTGTGAACCAGATTCAGGTGGTGAAGTATACAAGTTCAAGTGGTTTTGTTGATCCAGATGATTCAGAGCT
GAGTGTTGTTCTGGACAGACCTATCTGTTTTGGGCTCATTAATAAGTTCACTGTGGATGGGAATTGGGGGCGCCATATCTTCTTTGGAGGGCCAAGTGGGTGTAAG
Protein sequence
MSGEDLKKLRFLFFVVSVFAFGADGFRLLQMLSEEEELELNRQLKQLNKPAIKSFKNEFGDTIDCVDIYKQPSLDHPLLKNHTIQMKPTMIPKGLISDTSKVEMLLRHLP
NINNCPPGSVPIRRTTKEDLIAAKSFKPLWSHQAADSSRPSTTIDANGYHLATLNYKTRVYGAKSQINVWNPIPSMDQFSSASIWLSRGCRDQQNTIQVGWGVQPKVFGN
VTRLTTYWTANGYQSTGCYNQLCPGFVQVNSGIMPGLPLSPISTYNGPQYDIHISMHQDTHTGNWMLMFGDKYIGYWPKAVVPGLADGAAVAAWGGEVYSPTSEAGPAMG
SGHFPEEGFKKSAFVNQIQVVKYTSSSGFVDPDDSELSVVLDRPICFGLINKFTVDGNWGRHIFFGGPSGCK
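
One quick consistency check on the sequences above is to translate the CDS and compare the result with the listed protein. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. The helper name translate is hypothetical, and only the first 18 and last 6 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGTCTGGTGAAGATTTG"  # first 18 nt of the CDS listed above
cds_end = "TGTAAG"                # last 6 nt of the CDS listed above

print(translate(cds_start))  # MSGEDL
print(translate(cds_end))    # CK
```

Both fragments agree with the start (MSGEDL) and end (CK) of the protein sequence; running the same function over the full CDS, with line breaks removed, would extend the check to the whole record.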