; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g25040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g25040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDUF1985 domain-containing protein
Genome locationchr3:18003842..18009631
RNA-Seq ExpressionMoc03g25040
SyntenyMoc03g25040
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060374.1 uncharacterized protein E6C27_scaffold22G001730 [Cucumis melo var. makuwa]9.8e-4746.73Show/hide
Query:  DGINFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEY
        D + FNFE  +  FG+++FEAIT L C  LP VD  K+ G+FL+KYF N   P+ R+ +S LFN    +K++D++KMA+++ L NFLLGKQ   G + E+
Subjt:  DGINFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEY

Query:  IKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFEND
        IKLLDD + F+ YPWGR+ YN  IDSIKK+IKNP A  VG+SG   +L++W Y+C+ LL       A+ I  +   ++NW     PEW+++A RVF ++
Subjt:  IKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFEND

XP_022132727.1 uncharacterized protein LOC111005524 [Momordica charantia]2.0e-4436.31Show/hide
Query:  KRKEIEKDEVPTEEPHIEEITEEEVILDSVDRRRKRKEREESEDDDIPIIERLKKLKKG-----------------------VP--------RFKVKKKE
        KRK+    E  TEE  + + T+++     V  R+KRK  EE E +D    +R ++ K                         VP        R  +  K 
Subjt:  KRKEIEKDEVPTEEPHIEEITEEEVILDSVDRRRKRKEREESEDDDIPIIERLKKLKKG-----------------------VP--------RFKVKKKE

Query:  DVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKY
        DV++  KN    R+ D  R+  +  GNF+              LI   C     N   FN E +V  FG+++F  IT + C ELP++D  K+   + +K 
Subjt:  DVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKY

Query:  FNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPL
        +      +KR  +  +F  MD  + KD VKMA+L++L  FLLGKQ   GI  EY  L+DD EQFE YPWGRV Y  TID +KKAIK+ DAS +G+ G P 
Subjt:  FNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPL

Query:  ALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFE
        AL++WAYE I LLS     +A  IS+ +P M NW A   PEWRD++ ++F +D F+ +
Subjt:  ALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFE

XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]1.1e-4238.52Show/hide
Query:  RFKVKKKEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL
        R  +  K DV++  KN    R+  + +K  +  G+F+              L+   C     N   FN E ++  FG++EF  IT L C ELP +D  K+
Subjt:  RFKVKKKEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL

Query:  -SGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDAS
          G+F  +YF      +KR  +  +F  MD  + KD VKMA+L++L  F+LGKQ   GI  EY  L+DD EQF+ YPWGR+ Y  TID +KKAIK+ DAS
Subjt:  -SGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDAS

Query:  VVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENML
         +G+ G   AL++WAYE I LL       A  +S   P M NW A   PEW+D++ +VF++D F+ + ++
Subjt:  VVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENML

XP_038883717.1 uncharacterized protein LOC120074618 isoform X3 [Benincasa hispida]1.1e-4238.52Show/hide
Query:  RFKVKKKEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL
        R  +  K DV++  KN    R+  + +K  +  G+F+              L+   C     N   FN E ++  FG++EF  IT L C ELP +D  K+
Subjt:  RFKVKKKEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL

Query:  -SGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDAS
          G+F  +YF      +KR  +  +F  MD  + KD VKMA+L++L  F+LGKQ   GI  EY  L+DD EQF+ YPWGR+ Y  TID +KKAIK+ DAS
Subjt:  -SGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDAS

Query:  VVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENML
         +G+ G   AL++WAYE I LL       A  +S   P M NW A   PEW+D++ +VF++D F+ + ++
Subjt:  VVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENML

XP_038883720.1 uncharacterized protein LOC120074618 isoform X6 [Benincasa hispida]1.1e-4238.52Show/hide
Query:  RFKVKKKEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL
        R  +  K DV++  KN    R+  + +K  +  G+F+              L+   C     N   FN E ++  FG++EF  IT L C ELP +D  K+
Subjt:  RFKVKKKEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL

Query:  -SGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDAS
          G+F  +YF      +KR  +  +F  MD  + KD VKMA+L++L  F+LGKQ   GI  EY  L+DD EQF+ YPWGR+ Y  TID +KKAIK+ DAS
Subjt:  -SGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDAS

Query:  VVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENML
         +G+ G   AL++WAYE I LL       A  +S   P M NW A   PEW+D++ +VF++D F+ + ++
Subjt:  VVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENML

TrEMBL top hitse value%identityAlignment
A0A0A0KI50 TF-B3 domain-containing protein7.3e-4031.09Show/hide
Query:  VEDQRGKRKEIEKDEVP---TEEPHIEEITEEEVILDSVDRRRKRKER---------EESEDDDIPIIERLKKLK-----KGVPRFKVKKKEDVMTTKKN
        V++Q+ +  EI+ D+      E    ++I E+        +R+KR +R             DD++ + +    L          R  +  K DV++  KN
Subjt:  VEDQRGKRKEIEKDEVP---TEEPHIEEITEEEVILDSVDRRRKRKER---------EESEDDDIPIIERLKKLK-----KGVPRFKVKKKEDVMTTKKN

Query:  VKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL-SGRFLTKYFNNSTIP
            R+  + +K  +  GNF+              LI   C     N   FN E ++  FG+++F  IT L C ELP++D  K+  G+F  +YF      
Subjt:  VKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL-SGRFLTKYFNNSTIP

Query:  VKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAY
        ++RA +  +F  MD  + KD VKMA+L++L  F+LGKQ   GI  EY  L+DD +QF++YPWGR+ Y  T+D +KK+IK+ DAS +G+ G P AL++WAY
Subjt:  VKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAY

Query:  ECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENMLYEDENAQITGKKRVLEEGEPSNRNESYITSLFEEFKEMVLSSLDSM
        E I LL+      A  IS   P M NW A   PEW+D++ +VF+++ F+ + ++      ++     ++  G     NE  I+ + +E      +S +  
Subjt:  ECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENMLYEDENAQITGKKRVLEEGEPSNRNESYITSLFEEFKEMVLSSLDSM

Query:  NC
        +C
Subjt:  NC

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X51.3e-3931.76Show/hide
Query:  VEDQRGKRKEIEKDEVP---TEEPHIEEITEEEVILDSVDRRRKRKER---------EESEDDDIPIIERLKKLK-----KGVPRFKVKKKEDVMTTKKN
        V++Q  +  EI+ D+      E    ++I E+        +R+KR ++             DD++ + +    L          R  +  K DV++  KN
Subjt:  VEDQRGKRKEIEKDEVP---TEEPHIEEITEEEVILDSVDRRRKRKER---------EESEDDDIPIIERLKKLK-----KGVPRFKVKKKEDVMTTKKN

Query:  VKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDG---INFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL-SGRFLTKYFNNSTIP
            R+  + +K  +  G F+              LI   C       + FN E ++  FG+++F  IT L C ELP++D  K+  G+F  +YF      
Subjt:  VKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDG---INFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL-SGRFLTKYFNNSTIP

Query:  VKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAY
        ++R  +  +F  MD  + KD VKMA+L++L  F+LGKQ   GI  EY  L+DD EQF++YPWGR+ Y  TID +KKAIK+ DAS +G+ G P AL +WAY
Subjt:  VKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAY

Query:  ECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENMLYEDENAQITGKKRVLEEGEPSNRNESY
        E I LL+     +A  IS   P M NW A   PEW+D++ +VF+++ F+ + ++  +   +++        G+PSN    +
Subjt:  ECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENMLYEDENAQITGKKRVLEEGEPSNRNESY

A0A1S3B1B6 uncharacterized protein LOC103484737 isoform X24.3e-4030.95Show/hide
Query:  QRGKRKEIEKDEVPTEEPHIEEITEEEVILDSVDRRRKRKEREESE-------------------------DDDIPIIERLKKLK-----KGVPRFKVKK
        ++ K+++    EV  +     EI  ++     V+ R+K+K  E+S+                         DD++ + +    L          R  +  
Subjt:  QRGKRKEIEKDEVPTEEPHIEEITEEEVILDSVDRRRKRKEREESE-------------------------DDDIPIIERLKKLK-----KGVPRFKVKK

Query:  KEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDG---INFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL-SGRFL
        K DV++  KN    R+  + +K  +  G F+              LI   C       + FN E ++  FG+++F  IT L C ELP++D  K+  G+F 
Subjt:  KEDVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDG---INFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKL-SGRFL

Query:  TKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSG
         +YF      ++R  +  +F  MD  + KD VKMA+L++L  F+LGKQ   GI  EY  L+DD EQF++YPWGR+ Y  TID +KKAIK+ DAS +G+ G
Subjt:  TKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSG

Query:  MPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENMLYEDENAQITGKKRVLEEGEPSNRNESY
         P AL +WAYE I LL+     +A  IS   P M NW A   PEW+D++ +VF+++ F+ + ++  +   +++        G+PSN    +
Subjt:  MPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFENMLYEDENAQITGKKRVLEEGEPSNRNESY

A0A5A7UZA2 DUF1985 domain-containing protein4.7e-4746.73Show/hide
Query:  DGINFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEY
        D + FNFE  +  FG+++FEAIT L C  LP VD  K+ G+FL+KYF N   P+ R+ +S LFN    +K++D++KMA+++ L NFLLGKQ   G + E+
Subjt:  DGINFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEY

Query:  IKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFEND
        IKLLDD + F+ YPWGR+ YN  IDSIKK+IKNP A  VG+SG   +L++W Y+C+ LL       A+ I  +   ++NW     PEW+++A RVF ++
Subjt:  IKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFEND

A0A6J1BX50 uncharacterized protein LOC1110055249.9e-4536.31Show/hide
Query:  KRKEIEKDEVPTEEPHIEEITEEEVILDSVDRRRKRKEREESEDDDIPIIERLKKLKKG-----------------------VP--------RFKVKKKE
        KRK+    E  TEE  + + T+++     V  R+KRK  EE E +D    +R ++ K                         VP        R  +  K 
Subjt:  KRKEIEKDEVPTEEPHIEEITEEEVILDSVDRRRKRKEREESEDDDIPIIERLKKLKKG-----------------------VP--------RFKVKKKE

Query:  DVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKY
        DV++  KN    R+ D  R+  +  GNF+              LI   C     N   FN E +V  FG+++F  IT + C ELP++D  K+   + +K 
Subjt:  DVMTTKKNVKKTRRKDRRRKGKYDKGNFIGQNKEDVYGDC---LITSPCPGDGIN---FNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKY

Query:  FNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPL
        +      +KR  +  +F  MD  + KD VKMA+L++L  FLLGKQ   GI  EY  L+DD EQFE YPWGRV Y  TID +KKAIK+ DAS +G+ G P 
Subjt:  FNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPL

Query:  ALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFE
        AL++WAYE I LLS     +A  IS+ +P M NW A   PEWRD++ ++F +D F+ +
Subjt:  ALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEATAQPEWRDIAIRVFENDEFEFE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31910.1 Domain of unknown function (DUF1985)6.8e-0633.04Show/hide
Query:  KDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHIS
        K+R+ + +L LLS  + G      I +   + + D   FE YPWGRV + S I+S+K    + D+ V+       ALVIW YE +  L        + I 
Subjt:  KDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHIS

Query:  TDIPLMVNWEAT
        T IPL+ +W ++
Subjt:  TDIPLMVNWEAT

AT3G32960.1 Domain of unknown function (DUF1985)1.8e-0631.25Show/hide
Query:  DGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAI-KNPDASVVGLSGMPLALVIWAYECISLLSS
        D    ++R  +A L L+ +  L   +     +E ++   D E+  NYPWG   +N  + SIKK +  N       + G PLAL IW  E I +L +
Subjt:  DGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAI-KNPDASVVGLSGMPLALVIWAYECISLLSS

AT5G28810.1 Domain of unknown function (DUF1985)1.9e-0827.59Show/hide
Query:  KVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQ
        K + F L EFE IT L CD                  F+ S   ++R     +F        + R+ + +L LLS  + G      + +   K + D   
Subjt:  KVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQ

Query:  FENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEAT
        FE YPWGRV ++S + S+K    + D+ V+   G   AL++W YE +  +  A   + K   T +PL+ +W ++
Subjt:  FENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEAT

AT5G45570.1 Ulp1 protease family protein6.6e-0925.56Show/hide
Query:  ESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKK----KDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKL
        + + + F L EFE IT L CD     D  +   +    ++N   + +    + T    +  + K    + R+ + +L LLS  + G      + +   K 
Subjt:  ESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDGVKK----KDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKL

Query:  LDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEAT
        + D   FE YPWGRV ++S   S+K    + D+ V+   G    L++W YE +  +  A   + K   T +PL+ +W ++
Subjt:  LDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTDIPLMVNWEAT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGCGGGGCCCCTTGTTCAAGTCCCGGAGTCAGCATTTAAGGGAACACTCATCTACTCCCCGAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTAT
GTTCACAGCTCCTCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGTCGGCGACTCGGGCCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGC
AGGAAACAAGATGAAAAGTTTAAGACGGAAGTTACCATATGTGATCCTATAGTTGAGTTTCCAAATGAAGATGAAGTTGTTGGGACTCAAATTGTTTCTTTTGCG
GTACGTGCGGATCAACCTCTTATTGAGGAAATAATAGAAGAAGTAATGGTGGATAATGTTGAGGATCAAAGAGGGAAAAGGAAAGAGATAGAGAAAGATGAGGTC
CCTACCGAGGAACCTCATATTGAAGAAATAACAGAAGAAGAAGTAATACTCGATAGTGTTGATCGAAGAAGGAAAAGGAAAGAGAGGGAGGAATCTGAGGATGAT
GATATCCCAATAATTGAAAGGTTGAAAAAATTAAAAAAGGGTGTGCCAAGATTTAAGGTGAAGAAGAAAGAAGATGTTATGACAACCAAAAAGAATGTTAAAAAA
ACAAGACGCAAAGATAGGAGGAGAAAAGGGAAATATGATAAAGGCAATTTTATTGGTCAGAACAAGGAAGATGTGTATGGGGATTGTCTTATCACAAGCCCTTGC
CCAGGAGATGGTATTAATTTCAACTTTGAGAGCAAAGTGGTTCATTTTGGATTACAAGAATTTGAGGCTATTACTAGACTTAAGTGCGATGAGCTACCTTCTGTG
GATTTCAAAAAACTAAGTGGGAGATTCTTGACAAAATATTTCAACAATTCAACAATTCCAGTGAAGAGGGCAACGATAAGTACCTTGTTTAATTCAATGGATGGA
GTGAAGAAGAAGGACAGGGTTAAGATGGCTCAGTTATTTTTGTTGTCAAATTTCCTGTTGGGAAAACAACGAGCAGTGGGAATTGAGGTGGAATATATCAAGCTT
CTCGATGACACCGAACAATTTGAGAACTACCCGTGGGGGCGTGTTCGCTATAACTCAACAATCGACTCTATAAAGAAAGCGATTAAAAATCCTGATGCATCGGTT
GTTGGACTCTCTGGAATGCCATTAGCATTGGTTATTTGGGCATACGAATGCATATCATTGCTTTCTTCAGCGGAAACAAAATATGCCAAGCACATATCTACTGAT
ATACCATTAATGGTAAATTGGGAGGCTACAGCACAACCGGAATGGCGGGATATAGCGATTAGAGTTTTCGAGAATGATGAGTTCGAATTTGAAAATATGTTATAT
GAAGATGAGAATGCACAGATTACAGGTAAAAAAAGAGTTTTGGAGGAAGGTGAACCATCGAATAGGAATGAAAGCTATATAACTTCACTTTTTGAAGAGTTCAAA
GAAATGGTGTTAAGTAGCCTCGATAGTATGAATTGCAAAATCAATACACTGTTTTCTGAGATTGACAGTGTTAAACGACTGGTGAATGATAAATTAAATGTTATC
AGAAAGGAGAATGGGAATGGTGATGGGAGTGAGGGTGGAAGGGATGGTAACAACGGTGGTAATGAGGGCGATGATAAACATGAAGAAAAATCGGAAGAAAATCAA
CCAAAGGACGATGGATCAAGGGACAGTTCAGCTACAAATAAGAGTGATAATGATGGAGATCGTAATAACGACATGTATTTACTATGTTTTACATTTTGCAAGGAA
CTCGAAGTTAAAGAAACCGAAGGAGATTTAATGCAAGATGTGTTCGTATGCACAACATTTTTAAATGAGGTTGATAGGATAGAAGAGGCTTGCCAAAAAAAGGAC
AAGGAGAAGATCGACAAAAATGATATTAATTCTGAGGGAGACATGTACATCCATGCTTTAAGAGGAACATGTCCGTTCAATAAAAGGGGGAATGGAGATATGAAG
ACATTAAAGAAAGATGCAGAACTCGAACAATTGAGACGGATAAAACGTAGAGTGGTGATGCCTTCTAAGGTGAATAGATCACCGTACACTACAAAGTTCGGTTCA
GCAGAAGATAGTAAAAATAAGATACAAAGCACTGAAGACTTTGAGGCCCTAACTTTTAACTTGTTATCACAACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGCGGGGCCCCTTGTTCAAGTCCCGGAGTCAGCATTTAAGGGAACACTCATCTACTCCCCGAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTAT
GTTCACAGCTCCTCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGTCGGCGACTCGGGCCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGC
AGGAAACAAGATGAAAAGTTTAAGACGGAAGTTACCATATGTGATCCTATAGTTGAGTTTCCAAATGAAGATGAAGTTGTTGGGACTCAAATTGTTTCTTTTGCG
GTACGTGCGGATCAACCTCTTATTGAGGAAATAATAGAAGAAGTAATGGTGGATAATGTTGAGGATCAAAGAGGGAAAAGGAAAGAGATAGAGAAAGATGAGGTC
CCTACCGAGGAACCTCATATTGAAGAAATAACAGAAGAAGAAGTAATACTCGATAGTGTTGATCGAAGAAGGAAAAGGAAAGAGAGGGAGGAATCTGAGGATGAT
GATATCCCAATAATTGAAAGGTTGAAAAAATTAAAAAAGGGTGTGCCAAGATTTAAGGTGAAGAAGAAAGAAGATGTTATGACAACCAAAAAGAATGTTAAAAAA
ACAAGACGCAAAGATAGGAGGAGAAAAGGGAAATATGATAAAGGCAATTTTATTGGTCAGAACAAGGAAGATGTGTATGGGGATTGTCTTATCACAAGCCCTTGC
CCAGGAGATGGTATTAATTTCAACTTTGAGAGCAAAGTGGTTCATTTTGGATTACAAGAATTTGAGGCTATTACTAGACTTAAGTGCGATGAGCTACCTTCTGTG
GATTTCAAAAAACTAAGTGGGAGATTCTTGACAAAATATTTCAACAATTCAACAATTCCAGTGAAGAGGGCAACGATAAGTACCTTGTTTAATTCAATGGATGGA
GTGAAGAAGAAGGACAGGGTTAAGATGGCTCAGTTATTTTTGTTGTCAAATTTCCTGTTGGGAAAACAACGAGCAGTGGGAATTGAGGTGGAATATATCAAGCTT
CTCGATGACACCGAACAATTTGAGAACTACCCGTGGGGGCGTGTTCGCTATAACTCAACAATCGACTCTATAAAGAAAGCGATTAAAAATCCTGATGCATCGGTT
GTTGGACTCTCTGGAATGCCATTAGCATTGGTTATTTGGGCATACGAATGCATATCATTGCTTTCTTCAGCGGAAACAAAATATGCCAAGCACATATCTACTGAT
ATACCATTAATGGTAAATTGGGAGGCTACAGCACAACCGGAATGGCGGGATATAGCGATTAGAGTTTTCGAGAATGATGAGTTCGAATTTGAAAATATGTTATAT
GAAGATGAGAATGCACAGATTACAGGTAAAAAAAGAGTTTTGGAGGAAGGTGAACCATCGAATAGGAATGAAAGCTATATAACTTCACTTTTTGAAGAGTTCAAA
GAAATGGTGTTAAGTAGCCTCGATAGTATGAATTGCAAAATCAATACACTGTTTTCTGAGATTGACAGTGTTAAACGACTGGTGAATGATAAATTAAATGTTATC
AGAAAGGAGAATGGGAATGGTGATGGGAGTGAGGGTGGAAGGGATGGTAACAACGGTGGTAATGAGGGCGATGATAAACATGAAGAAAAATCGGAAGAAAATCAA
CCAAAGGACGATGGATCAAGGGACAGTTCAGCTACAAATAAGAGTGATAATGATGGAGATCGTAATAACGACATGTATTTACTATGTTTTACATTTTGCAAGGAA
CTCGAAGTTAAAGAAACCGAAGGAGATTTAATGCAAGATGTGTTCGTATGCACAACATTTTTAAATGAGGTTGATAGGATAGAAGAGGCTTGCCAAAAAAAGGAC
AAGGAGAAGATCGACAAAAATGATATTAATTCTGAGGGAGACATGTACATCCATGCTTTAAGAGGAACATGTCCGTTCAATAAAAGGGGGAATGGAGATATGAAG
ACATTAAAGAAAGATGCAGAACTCGAACAATTGAGACGGATAAAACGTAGAGTGGTGATGCCTTCTAAGGTGAATAGATCACCGTACACTACAAAGTTCGGTTCA
GCAGAAGATAGTAAAAATAAGATACAAAGCACTGAAGACTTTGAGGCCCTAACTTTTAACTTGTTATCACAACTTTAA
Protein sequenceShow/hide protein sequence
MRGRGPLFKSRSQHLREHSSTPRKSGRSEFHLVKYVHSSSLGLVPKMVGMLSRRLGPLSPIQIKGRALTGRKQDEKFKTEVTICDPIVEFPNEDEVVGTQIVSFA
VRADQPLIEEIIEEVMVDNVEDQRGKRKEIEKDEVPTEEPHIEEITEEEVILDSVDRRRKRKEREESEDDDIPIIERLKKLKKGVPRFKVKKKEDVMTTKKNVKK
TRRKDRRRKGKYDKGNFIGQNKEDVYGDCLITSPCPGDGINFNFESKVVHFGLQEFEAITRLKCDELPSVDFKKLSGRFLTKYFNNSTIPVKRATISTLFNSMDG
VKKKDRVKMAQLFLLSNFLLGKQRAVGIEVEYIKLLDDTEQFENYPWGRVRYNSTIDSIKKAIKNPDASVVGLSGMPLALVIWAYECISLLSSAETKYAKHISTD
IPLMVNWEATAQPEWRDIAIRVFENDEFEFENMLYEDENAQITGKKRVLEEGEPSNRNESYITSLFEEFKEMVLSSLDSMNCKINTLFSEIDSVKRLVNDKLNVI
RKENGNGDGSEGGRDGNNGGNEGDDKHEEKSEENQPKDDGSRDSSATNKSDNDGDRNNDMYLLCFTFCKELEVKETEGDLMQDVFVCTTFLNEVDRIEEACQKKD
KEKIDKNDINSEGDMYIHALRGTCPFNKRGNGDMKTLKKDAELEQLRRIKRRVVMPSKVNRSPYTTKFGSAEDSKNKIQSTEDFEALTFNLLSQL