; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1695 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1695
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationMC04:24444825..24446888
RNA-Seq ExpressionMC04g1695
SyntenyMC04g1695
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016902409.1 PREDICTED: uncharacterized protein LOC103498480 [Cucumis melo]1.41e-14054.46Show/hide
Query:  SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEEN----LIELLQ----KVKGHGPG---LPQIPEESEYT
        S+PSIP PDF LH  RE  EN   N+ +CT+D  + E                 ++KFV+EE++    LI+ LQ    K K +      LP IPEE++Y+
Subjt:  SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEEN----LIELLQ----KVKGHGPG---LPQIPEESEYT

Query:  IPMEE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQ
        I   E D KPW+ K E+F+ R+ TKELHK HK Y E+M+KYD LN QK YAK+LKMMQSK+ + SVL+K  C C+ +K     G +R IDG++E+VYV Q
Subjt:  IPMEE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQ

Query:  LWDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDN-
        LW SWEF+V +YKKALE+  R   GSCRFN VA KF+HF+V+I+RFMENE  EEGSRVECY R+RL RRKLLQVP++KEDEVK+        +G ++DN 
Subjt:  LWDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDN-

Query:  ESAIKIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTIEDEQ--DDEIYFF
        E+A+ IDR I ILQES+RILWQFIR DKL HIST LK H++A+QE  SP+HF+I  Q+  DLQ+K RKLKK++ RK     N ++ +E+++  DDE+ F 
Subjt:  ESAIKIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTIEDEQ--DDEIYFF

Query:  TLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
        T+VDLKL+SR+LNM++ITR QLSWCHHKLSCI+F +G IK+ PSS  F
Subjt:  TLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

XP_022133624.1 uncharacterized protein LOC111006162 [Momordica charantia]2.04e-27188.91Show/hide
Query:  DTSNENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTIPME
        DTSNENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTIPME
Subjt:  DTSNENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTIPME

Query:  EDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWDSWE
        EDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWDSWE
Subjt:  EDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWDSWE

Query:  FLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKID
        FLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVK+NDKKEIIGDGADDDNESAIKID
Subjt:  FLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKID

Query:  RFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQ
        RFIEILQES                                               KVRKLKKMITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQ
Subjt:  RFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQ

Query:  RITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
        RITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
Subjt:  RITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

XP_031741130.1 uncharacterized protein LOC105435713 isoform X1 [Cucumis sativus]3.67e-13353.6Show/hide
Query:  ENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTD-DFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGPGLPQIPEESEYTIPM
        ++ S+PS P P+  LH  RE  EN   N L+CT  DF +     K E  KE D          SE+++ I+ LQ    K K     LP IPEE++Y+I  
Subjt:  ENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTD-DFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGPGLPQIPEESEYTIPM

Query:  EE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWD
         E D KPW+ K E+F+ R+ TKELH+ HK Y E+M+KYD LN QK YAK+LKMMQSK+ + SV +K  C C   KG+    + + IDG++E+VYV QLW 
Subjt:  EE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWD

Query:  SWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAI
        SWEF+V +YKKALE+  R D GSCRFNEVA KF+HF+V+IQRFMENE  EEGSRVECY ++RL RRK LQVP++KEDEVK+   KE       D+ E+A+
Subjt:  SWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAI

Query:  KIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITR-----KNMRDTIEDEQ--DDEIYFFTLVD
         IDR I+ILQES+RILWQFIR DKL HIST L  H++ +QE  SP+H N+  Q+  DLQ+K RKLKK++ R     KN ++ +E+++  DDE+ F T+VD
Subjt:  KIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITR-----KNMRDTIEDEQ--DDEIYFFTLVD

Query:  LKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
        LKL+SR+LNM++ITR QLSWC HKLSCI+F +GKIK+ PSS  F
Subjt:  LKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

XP_031741131.1 uncharacterized protein LOC105435713 isoform X2 [Cucumis sativus]9.12e-13153.38Show/hide
Query:  ENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTD-DFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGPGLPQIPEESEYTIPM
        ++ S+PS P P+  LH  RE  EN   N L+CT  DF +     K E  KE D          SE+++ I+ LQ    K K     LP IPEE++Y+I  
Subjt:  ENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTD-DFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGPGLPQIPEESEYTIPM

Query:  EE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWD
         E D KPW+ K E+F+ R+ TKELH+ HK Y E+M+KYD LN QK YAK+LKMMQSK+ + SV +K  C C   KG+    + + IDG++E+VYV QLW 
Subjt:  EE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWD

Query:  SWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAI
        SWEF+V +YKKALE+  R D GSCRFNEVA KF+HF+V+IQRFMENE  EEGSRVECY ++RL RRK LQVP++K DEVK+   KE       D+ E+A+
Subjt:  SWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAI

Query:  KIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITR-----KNMRDTIEDEQ--DDEIYFFTLVD
         IDR I+ILQES+RILWQFIR DKL HIST L  H++ +QE  SP+H N+  Q+  DLQ+K RKLKK++ R     KN ++ +E+++  DDE+ F T+VD
Subjt:  KIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITR-----KNMRDTIEDEQ--DDEIYFFTLVD

Query:  LKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
        LKL+SR+LNM++ITR QLSWC HKLSCI+F +GKIK+ PSS  F
Subjt:  LKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

XP_038889168.1 uncharacterized protein LOC120079052 isoform X1 [Benincasa hispida]1.20e-14955.72Show/hide
Query:  DTSNENG------SVPSIPDPDFNLHHPREDPENFPQNNLNCT------DDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGP
        DTSNE+       S+PS PDP+F+LH P+E  EN   N L+CT      DD   +E D   E  K     IEDS +  SE++N I+ LQ    K K    
Subjt:  DTSNENG------SVPSIPDPDFNLHHPREDPENFPQNNLNCT------DDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGP

Query:  G---LPQIPEESEYTIPMEEDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKR
            LP IPEESEY I  E D KPW+ +E+F+ R+  KELH  HK+Y E+M+KYDTLNHQ  YAK+L+ MQSK+ + SVLS+  C C+P K     G+ R
Subjt:  G---LPQIPEESEYTIPMEEDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKR

Query:  EIDGDLEVVYVGQLWDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIK----EDEVK
         ID +LE+VYV QLW SWEF+V QYKK LE+  R   G C FNEVA KF+HFQV+IQRFMENE+F+EGSRVECY RNRL RRKLLQVP++K    EDEVK
Subjt:  EIDGDLEVVYVGQLWDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIK----EDEVK

Query:  DNDKKEIIGDGADDDNESAIKIDRFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRD
        +N  K   G   +DD E+A+KIDR I+I QES+RILWQFI ADKL HISTLK  ++A+QEC SP+H  I  QI   LQ+K RKL+K++ RK     N ++
Subjt:  DNDKKEIIGDGADDDNESAIKIDRFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRD

Query:  TIE--DEQDDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
         IE  D  DDE+ F T+VDLKL+SR+LNM+RITR+QLSWCHHKLSCI+F +GKI++ PSS  F
Subjt:  TIE--DEQDDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

TrEMBL top hitse value%identityAlignment
A0A0A0KS72 Uncharacterized protein3.38e-12750.21Show/hide
Query:  ENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTD-DFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGPGLPQIPEESEYTIPM
        ++ S+PS P P+  LH  RE  EN   N L+CT  DF +     K E  KE D          SE+++ I+ LQ    K K     LP IPEE++Y+I  
Subjt:  ENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTD-DFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQ----KVKGHGPGLPQIPEESEYTIPM

Query:  EE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWD
         E D KPW+ K E+F+ R+ TKELH+ HK Y E+M+KYD LN QK YAK+LKMMQSK+ + SV +K  C C   KG+    + + IDG++E+VYV QLW 
Subjt:  EE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWD

Query:  SWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAI
        SWEF+V +YKKALE+  R D GSCRFNEVA KF+HF+V+IQRFMENE  EEGSRVECY ++RL RRK LQVP++KEDEVK+   KE       D+ E+A+
Subjt:  SWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAI

Query:  KIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQ--------------------------------QKVRKLKKM
         IDR I+ILQES+RILWQFIR DKL HIST L  H++ +QE  SP+H N+  Q+  DLQ                                QK RKLKK+
Subjt:  KIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQ--------------------------------QKVRKLKKM

Query:  ITR-----KNMRDTIEDEQ--DDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
        + R     KN ++ +E+++  DDE+ F T+VDLKL+SR+LNM++ITR QLSWC HKLSCI+F +GKIK+ PSS  F
Subjt:  ITR-----KNMRDTIEDEQ--DDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

A0A1S4E2E7 uncharacterized protein LOC1034984806.82e-14154.46Show/hide
Query:  SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEEN----LIELLQ----KVKGHGPG---LPQIPEESEYT
        S+PSIP PDF LH  RE  EN   N+ +CT+D  + E                 ++KFV+EE++    LI+ LQ    K K +      LP IPEE++Y+
Subjt:  SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEEN----LIELLQ----KVKGHGPG---LPQIPEESEYT

Query:  IPMEE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQ
        I   E D KPW+ K E+F+ R+ TKELHK HK Y E+M+KYD LN QK YAK+LKMMQSK+ + SVL+K  C C+ +K     G +R IDG++E+VYV Q
Subjt:  IPMEE-DPKPWRTK-ENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQ

Query:  LWDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDN-
        LW SWEF+V +YKKALE+  R   GSCRFN VA KF+HF+V+I+RFMENE  EEGSRVECY R+RL RRKLLQVP++KEDEVK+        +G ++DN 
Subjt:  LWDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDN-

Query:  ESAIKIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTIEDEQ--DDEIYFF
        E+A+ IDR I ILQES+RILWQFIR DKL HIST LK H++A+QE  SP+HF+I  Q+  DLQ+K RKLKK++ RK     N ++ +E+++  DDE+ F 
Subjt:  ESAIKIDRFIEILQESVRILWQFIRADKLAHIST-LKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTIEDEQ--DDEIYFF

Query:  TLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
        T+VDLKL+SR+LNM++ITR QLSWCHHKLSCI+F +G IK+ PSS  F
Subjt:  TLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

A0A6J1BWI3 uncharacterized protein LOC1110061629.88e-27288.91Show/hide
Query:  DTSNENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTIPME
        DTSNENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTIPME
Subjt:  DTSNENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTIPME

Query:  EDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWDSWE
        EDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWDSWE
Subjt:  EDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWDSWE

Query:  FLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKID
        FLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVK+NDKKEIIGDGADDDNESAIKID
Subjt:  FLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKID

Query:  RFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQ
        RFIEILQES                                               KVRKLKKMITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQ
Subjt:  RFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQ

Query:  RITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
        RITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF
Subjt:  RITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF

A0A6J1H9K8 uncharacterized protein LOC1114608612.05e-12853.85Show/hide
Query:  DTSNENG---SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTI
        +TSNEN    S  S  D D +L  PREDPE F  N L  ++D   ++ D + + P E      D D F+ E +N I+ ++       GLP IPEESEY I
Subjt:  DTSNENG---SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTI

Query:  PMEEDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRP-KKGDLGAGKKREIDGDLEVVYVGQLW
         MEED K  + +E  + +   KELH+ HK Y E+M+KYDTLNHQ   AK+ KM QSK  + SV SK LC CRP KKG+   G+ R ++G+LEVVYV Q+W
Subjt:  PMEEDPKPWRTKENFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRP-KKGDLGAGKKREIDGDLEVVYVGQLW

Query:  DSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESA
         SWEF+V QYKKALE+      GS RFNEVA KF+HF+V+IQRFMENE  EEGSRVE YA +R+G+RKLLQVP++KEDE KDN K          + E  
Subjt:  DSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESA

Query:  IKIDRFIEILQESVRILWQFIRADKLAHIS-TLKFHV-KAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTI-EDEQDDEIYFFTLV
        +K+DR IEILQE +R+ WQFIRADKLAHIS T K H+ + +QE  SP + NILNQI  DL +K RKLKK+I RK     N +D   ED+ DDE YF  +V
Subjt:  IKIDRFIEILQESVRILWQFIRADKLAHIS-TLKFHV-KAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTI-EDEQDDEIYFFTLV

Query:  DLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSS
        DLKL+ RVLNM  I+ DQL WC HKLSCI+F +GKI+++PSS
Subjt:  DLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSS

A0A6J1JR14 uncharacterized protein LOC1114875634.19e-12854.85Show/hide
Query:  TSNENG---SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEEN-LIELLQKVKGHGPGLPQIPEESEYTI
        TSNEN    S  S  D D +L  PREDPE F  N L C++D   ++ D + + P E      D D F+ E +N +++   K K    GLP IPEESEY I
Subjt:  TSNENG---SVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEEN-LIELLQKVKGHGPGLPQIPEESEYTI

Query:  PMEEDPKPWRTKEN-FDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRP-KKGDLGAGKKREIDGDLEVVYVGQL
         MEED K  + +E   + +   KELH+ HK Y E+M+KYDTLNHQ   AK+ KM QSK  + SV SK LC CRP KKG+    + R I+G+LEVVYV Q+
Subjt:  PMEEDPKPWRTKEN-FDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRP-KKGDLGAGKKREIDGDLEVVYVGQL

Query:  WDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNES
        W SWEF+V QYKKALE+  R   GS RFNEVA KF+HF+V IQRFME E  EEGSRVE YAR+R+GRRKLLQVP+++EDE KD  K   I      + E 
Subjt:  WDSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNES

Query:  AIKIDRFIEILQESVRILWQFIRADKLAHIS-TLKFHV-KAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTI-EDEQDDEIYFFTL
         +K+DR IEILQE +R+ WQFIRADKLAHIS T K H+ + +QE  SP + NILNQI  DLQ+K RKLKKMI RK     N +D   ED+ DDE YF  +
Subjt:  AIKIDRFIEILQESVRILWQFIRADKLAHIS-TLKFHV-KAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRK-----NMRDTI-EDEQDDEIYFFTL

Query:  VDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSS
        VDL L+ RVLNMQ I+ DQL WC HKL+CISF +GKI+++PSS
Subjt:  VDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)2.4e-3830.68Show/hide
Query:  NLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKV--KGHGPGLPQIPEESEYTI----PMEEDPKPWRTKEN
        N  +  E+ E F          F  +E    E   + +D    D D+F  E  ++IE L+         GL  I EESE  +    P++ +PKP + K+ 
Subjt:  NLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKV--KGHGPGLPQIPEESEYTI----PMEEDPKPWRTKEN

Query:  FDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQ---------SKDPMGSV------LSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLW
                E+HK +K Y  +M+K D ++ Q +++  L  ++          K P  S+        K    C P +  +     +E   D E VYVGQ+ 
Subjt:  FDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQ---------SKDPMGSV------LSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLW

Query:  DSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESA
         SWE L  QY K LE        + ++N VAG+FQ FQV++QRF+ENE F+  SRVE Y +NR   +  LQ+P++++D    + KK+   +G     E A
Subjt:  DSWEFLVRQYKKALEVCDRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESA

Query:  IKIDRFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKM------ITRKNMRDTIEDE--QDDEIYFFTLV
        +K +   EI++ES+ + W+F+ ADK    S +K   + +   +      +L  I + LQ+K +KLK++      I +K  ++  +      DE+     +
Subjt:  IKIDRFIEILQESVRILWQFIRADKLAHISTLKFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKM------ITRKNMRDTIEDE--QDDEIYFFTLV

Query:  DLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEP
        +L+L+SRV+ M ++T ++L WC  KL  ISF   KI +EP
Subjt:  DLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEP

AT1G73850.1 Protein of unknown function (DUF1666)5.9e-2127.12Show/hide
Query:  WRTKENFDFRNFTKELHKCHK--------IYRERMQKYDTLNHQKLY-AKDLK--MMQSKDPMGSVLSKSLCACRPKK-----GDLGAGKKREIDGDLEV
        WR     D    T     C K         Y E M     ++ QKL+ A+ LK  M++ +     ++ K       KK     G  G+     +  +LE 
Subjt:  WRTKENFDFRNFTKELHKCHK--------IYRERMQKYDTLNHQKLY-AKDLK--MMQSKDPMGSVLSKSLCACRPKK-----GDLGAGKKREIDGDLEV

Query:  VYVGQLWDSWEFLVRQYKKALEVCDRGDCGSCRFNEV------AGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKE
         YV Q+  +WE L   YK       +       FN+V      A +F+ F +++QR++ENE +E G R E YAR R    KLL VP     E +D +++E
Subjt:  VYVGQLWDSWEFLVRQYKKALEVCDRGDCGSCRFNEV------AGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKE

Query:  IIGDGADDDNESAIKIDRFIEILQESVRILWQFIRADKLAHISTL--KFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLK---KMITRKNMRDTIEDEQ
           D  ++   S I    F+ I++E +R    F++ADK      +   F  ++++    PT  +++ ++ +  + K+++++   K + +K M  +IE+E 
Subjt:  IIGDGADDDNESAIKIDRFIEILQESVRILWQFIRADKLAHISTL--KFHVKAEQECKSPTHFNILNQIFSDLQQKVRKLK---KMITRKNMRDTIEDEQ

Query:  DDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLP
        +       L+DLK++SRVL M  +  + L WC  K+S +  + G   ++  S P
Subjt:  DDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLP

AT3G20260.1 Protein of unknown function (DUF1666)1.4e-2528.26Show/hide
Query:  ENDRKIEDSDKFVSEE--ENLIELLQKVKGHGPGLPQIPEESEYTIPMEEDPKPWRTKENFDFRNFTKE-------LHKCHKIYRERMQKYDTLNHQKLY
        + D   +D D F++ E    L EL  +       +P+  EE E    ++ED      K + ++R+   E           ++ Y ERM  +D L+ Q+L 
Subjt:  ENDRKIEDSDKFVSEE--ENLIELLQKVKGHGPGLPQIPEESEYTIPMEEDPKPWRTKENFDFRNFTKE-------LHKCHKIYRERMQKYDTLNHQKLY

Query:  AKDLKMMQS-KDPMGSVLSKSL-----CACRPK----KGDLGAGKKREIDG---DLEVVYVGQLWDSWEFLVRQYKKALEVCD-RGDCGSCRFNEVAGKF
           + +  S   P     SK L     C    K    + D+   +  E+D    DLE  YV QL  +WE L  QY +   +   + +  +C +N  A  F
Subjt:  AKDLKMMQS-KDPMGSVLSKSL-----CACRPK----KGDLGAGKKREIDG---DLEVVYVGQLWDSWEFLVRQYKKALEVCD-RGDCGSCRFNEVAGKF

Query:  QHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKIDRFIEILQESVRILWQFIRADKLAHISTLK-
        Q F V++QR++ENE FE+GSR E YAR R    KLLQ P     +++ +DKKE+     + D    +  D  I++++ S+     F++ DK      +  
Subjt:  QHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKIDRFIEILQESVRILWQFIRADKLAHISTLK-

Query:  FHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKM---ITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKV
        F         S T   ++       + K ++L K    + +K+   T E  Q      F  +D+KL +RVL M +I+++QL WC  K+  ++F +GK++ 
Subjt:  FHVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKM---ITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKV

Query:  EPSSLPF
         PS + F
Subjt:  EPSSLPF

AT5G39785.1 Protein of unknown function (DUF1666)1.2e-6138.6Show/hide
Query:  NEELDIKEETPKENDRKIEDSDKFVS--EEENLIELL----QKVKGHGPGLPQIPEESEY---TIPMEEDPKPWRTKENFDFRNFTK--ELHKCHKIYRE
        N + D         + + ED++ F S  E ++LIE L    +KVK  G GL  I EE E       + ED KPWR +E   F++     E+HK H+ YRE
Subjt:  NEELDIKEETPKENDRKIEDSDKFVS--EEENLIELL----QKVKGHGPGLPQIPEESEY---TIPMEEDPKPWRTKENFDFRNFTK--ELHKCHKIYRE

Query:  RMQKYDTLNHQKLYAKDLKMMQSKDP---------------MGSVLSKSLCACRPKKGDLGAGKK--REIDGDLEVVYVGQLWDSWEFLVRQYKKALEVC
        RM+K D L+ QK YA  L ++QSK P                 SV S ++   + KK ++    +  +EI G+LE VYVGQ+  SWE L  QY+KA+E+ 
Subjt:  RMQKYDTLNHQKLYAKDLKMMQSKDP---------------MGSVLSKSLCACRPKKGDLGAGKK--REIDGDLEVVYVGQLWDSWEFLVRQYKKALEVC

Query:  DRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKIDRFIEILQESVRIL
        +    GS R+NEVAG+FQ FQV++QRF+ENE FEE  RV+ Y + R   R LLQ+P+I+ED  K  DKK       +++N+  IK D+ +EI++E++R+ 
Subjt:  DRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKIDRFIEILQESVRIL

Query:  WQFIRADKL-AHISTLKFHVKAEQECKSPTH------FNILNQIFSDLQQKVRKLK------KMITRKNMRDTIEDEQDDEI-YFFTLVDLKLLSRVLNM
        W+F+R DKL + I   K   K++ E   P H        +  ++ S LQ K ++L+      + I R+  +   ED  +D++ +FF+ VD+KL++RVLNM
Subjt:  WQFIRADKL-AHISTLKFHVKAEQECKSPTH------FNILNQIFSDLQQKVRKLK------KMITRKNMRDTIEDEQDDEI-YFFTLVDLKLLSRVLNM

Query:  QRITRDQLSWCHHKLSCISFLSGKIKVEPS
         ++TRD L WCH+KL+ I+F++ ++ ++PS
Subjt:  QRITRDQLSWCHHKLSCISFLSGKIKVEPS

AT5G39785.2 Protein of unknown function (DUF1666)9.9e-6138.52Show/hide
Query:  NEELDIKEETPKENDRKIEDSDKFVS--EEENLIELL----QKVKGHGPGLPQIPEESEY---TIPMEEDPKPWRTKENFDFRNFTK--ELHKCHKIYRE
        N + D         + + ED++ F S  E ++LIE L    +KVK  G GL  I EE E       + ED KPWR +E   F++     E+HK H+ YRE
Subjt:  NEELDIKEETPKENDRKIEDSDKFVS--EEENLIELL----QKVKGHGPGLPQIPEESEY---TIPMEEDPKPWRTKENFDFRNFTK--ELHKCHKIYRE

Query:  RMQKYDTLNHQKLYAKDLKMMQSKDP---------------MGSVLSKSLCACRPKKGDLGAGKK--REIDGDLEVVYVGQLWDSWEFLVRQYKKALEVC
        RM+K D L+ QK YA  L ++QSK P                 SV S ++   + KK ++    +  +EI G+LE VYVGQ+  SWE L  QY+KA+E+ 
Subjt:  RMQKYDTLNHQKLYAKDLKMMQSKDP---------------MGSVLSKSLCACRPKKGDLGAGKK--REIDGDLEVVYVGQLWDSWEFLVRQYKKALEVC

Query:  DRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKIDRFIEILQESVRIL
        +    GS R+NEVAG+FQ FQV++QRF+ENE FEE  RV+ Y + R   R LLQ+P+I+ED  K  DKK       +++N+  IK D+ +EI++E++R+ 
Subjt:  DRGDCGSCRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKIDRFIEILQESVRIL

Query:  WQFIRADKL-AHISTLKFHVKAEQECKSPTH------FNILNQIFSDLQ-------QKVRKLKKMITRKNMRDTIEDEQDDEI-YFFTLVDLKLLSRVLN
        W+F+R DKL + I   K   K++ E   P H        +  ++ S LQ       + V K ++ I R+  +   ED  +D++ +FF+ VD+KL++RVLN
Subjt:  WQFIRADKL-AHISTLKFHVKAEQECKSPTH------FNILNQIFSDLQ-------QKVRKLKKMITRKNMRDTIEDEQDDEI-YFFTLVDLKLLSRVLN

Query:  MQRITRDQLSWCHHKLSCISFLSGKIKVEPS
        M ++TRD L WCH+KL+ I+F++ ++ ++PS
Subjt:  MQRITRDQLSWCHHKLSCISFLSGKIKVEPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATACTTCAAACGAGAATGGTTCGGTTCCTTCAATTCCTGATCCGGATTTCAACCTTCATCATCCCAGAGAAGACCCAGAAAATTTTCCCCAAAATAATTTAAATTGCAC
CGATGATTTCAGCAACGAAGAGTTGGACATCAAAGAAGAAACCCCGAAAGAGAATGATCGGAAAATTGAAGATTCCGATAAATTTGTGTCGGAAGAGGAGAACCTGATAG
AATTGCTGCAAAAGGTCAAAGGCCACGGTCCTGGGCTGCCCCAAATTCCAGAAGAATCCGAGTACACAATACCAATGGAAGAAGATCCAAAGCCATGGAGGACGAAAGAA
AACTTCGATTTCAGAAACTTTACGAAAGAACTCCACAAATGCCATAAAATTTACAGAGAAAGGATGCAAAAATACGACACTTTAAATCACCAAAAGTTGTACGCAAAAGA
TTTGAAGATGATGCAGTCGAAAGATCCAATGGGCTCTGTTTTATCGAAAAGCTTGTGCGCGTGCAGGCCAAAGAAGGGTGATTTGGGGGCAGGGAAGAAGAGAGAAATTG
ACGGGGATTTGGAAGTGGTGTACGTGGGGCAATTGTGGGATTCTTGGGAGTTTCTTGTGCGGCAGTATAAGAAAGCCTTGGAAGTGTGCGATAGAGGGGATTGTGGAAGC
TGCAGATTCAATGAAGTTGCAGGAAAATTTCAACATTTTCAAGTGGTGATTCAGAGATTTATGGAGAATGAAGCGTTTGAAGAAGGCTCCAGAGTTGAATGTTACGCTAG
AAATCGACTTGGCAGGAGAAAGCTTCTTCAAGTTCCTATTATCAAAGAAGATGAAGTAAAAGATAATGACAAAAAAGAAATAATAGGGGATGGGGCCGACGATGACAATG
AAAGTGCAATTAAAATTGATAGGTTTATTGAGATCTTGCAAGAATCGGTAAGAATCCTTTGGCAATTCATTCGAGCCGATAAACTTGCCCACATATCGACTCTAAAGTTC
CATGTAAAGGCAGAACAAGAATGCAAAAGTCCAACACATTTTAATATTCTAAACCAAATTTTTTCAGATCTACAACAGAAAGTGAGAAAGTTGAAGAAGATGATTACGAG
GAAAAATATGAGAGATACGATAGAAGATGAACAAGATGATGAAATTTATTTCTTTACTTTAGTGGATTTGAAGTTGCTGTCAAGGGTACTGAATATGCAAAGGATTACAA
GAGATCAATTGAGTTGGTGTCACCATAAATTGAGTTGCATTAGTTTTCTAAGTGGGAAGATCAAAGTAGAGCCTTCATCCCTCCCCTTT
mRNA sequenceShow/hide mRNA sequence
GATACTTCAAACGAGAATGGTTCGGTTCCTTCAATTCCTGATCCGGATTTCAACCTTCATCATCCCAGAGAAGACCCAGAAAATTTTCCCCAAAATAATTTAAATTGCAC
CGATGATTTCAGCAACGAAGAGTTGGACATCAAAGAAGAAACCCCGAAAGAGAATGATCGGAAAATTGAAGATTCCGATAAATTTGTGTCGGAAGAGGAGAACCTGATAG
AATTGCTGCAAAAGGTCAAAGGCCACGGTCCTGGGCTGCCCCAAATTCCAGAAGAATCCGAGTACACAATACCAATGGAAGAAGATCCAAAGCCATGGAGGACGAAAGAA
AACTTCGATTTCAGAAACTTTACGAAAGAACTCCACAAATGCCATAAAATTTACAGAGAAAGGATGCAAAAATACGACACTTTAAATCACCAAAAGTTGTACGCAAAAGA
TTTGAAGATGATGCAGTCGAAAGATCCAATGGGCTCTGTTTTATCGAAAAGCTTGTGCGCGTGCAGGCCAAAGAAGGGTGATTTGGGGGCAGGGAAGAAGAGAGAAATTG
ACGGGGATTTGGAAGTGGTGTACGTGGGGCAATTGTGGGATTCTTGGGAGTTTCTTGTGCGGCAGTATAAGAAAGCCTTGGAAGTGTGCGATAGAGGGGATTGTGGAAGC
TGCAGATTCAATGAAGTTGCAGGAAAATTTCAACATTTTCAAGTGGTGATTCAGAGATTTATGGAGAATGAAGCGTTTGAAGAAGGCTCCAGAGTTGAATGTTACGCTAG
AAATCGACTTGGCAGGAGAAAGCTTCTTCAAGTTCCTATTATCAAAGAAGATGAAGTAAAAGATAATGACAAAAAAGAAATAATAGGGGATGGGGCCGACGATGACAATG
AAAGTGCAATTAAAATTGATAGGTTTATTGAGATCTTGCAAGAATCGGTAAGAATCCTTTGGCAATTCATTCGAGCCGATAAACTTGCCCACATATCGACTCTAAAGTTC
CATGTAAAGGCAGAACAAGAATGCAAAAGTCCAACACATTTTAATATTCTAAACCAAATTTTTTCAGATCTACAACAGAAAGTGAGAAAGTTGAAGAAGATGATTACGAG
GAAAAATATGAGAGATACGATAGAAGATGAACAAGATGATGAAATTTATTTCTTTACTTTAGTGGATTTGAAGTTGCTGTCAAGGGTACTGAATATGCAAAGGATTACAA
GAGATCAATTGAGTTGGTGTCACCATAAATTGAGTTGCATTAGTTTTCTAAGTGGGAAGATCAAAGTAGAGCCTTCATCCCTCCCCTTT
Protein sequenceShow/hide protein sequence
DTSNENGSVPSIPDPDFNLHHPREDPENFPQNNLNCTDDFSNEELDIKEETPKENDRKIEDSDKFVSEEENLIELLQKVKGHGPGLPQIPEESEYTIPMEEDPKPWRTKE
NFDFRNFTKELHKCHKIYRERMQKYDTLNHQKLYAKDLKMMQSKDPMGSVLSKSLCACRPKKGDLGAGKKREIDGDLEVVYVGQLWDSWEFLVRQYKKALEVCDRGDCGS
CRFNEVAGKFQHFQVVIQRFMENEAFEEGSRVECYARNRLGRRKLLQVPIIKEDEVKDNDKKEIIGDGADDDNESAIKIDRFIEILQESVRILWQFIRADKLAHISTLKF
HVKAEQECKSPTHFNILNQIFSDLQQKVRKLKKMITRKNMRDTIEDEQDDEIYFFTLVDLKLLSRVLNMQRITRDQLSWCHHKLSCISFLSGKIKVEPSSLPF