; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g1167 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g1167
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF3754)
Genome locationMC08:9893123..9897848
RNA-Seq ExpressionMC08g1167
SyntenyMC08g1167
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022227 - Protein of unknown function DUF3754


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148089.1 uncharacterized protein LOC111016855 isoform X1 [Momordica charantia]0.099.61Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG

Query:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
        TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
Subjt:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK

Query:  EADASAT
        EADASAT
Subjt:  EADASAT

XP_022148090.1 uncharacterized protein LOC111016855 isoform X2 [Momordica charantia]0.099.8Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ

Query:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
        NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
Subjt:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK

Query:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFV
        VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFV
Subjt:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFV

Query:  KHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGT
        KHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGT
Subjt:  KHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGT

Query:  LLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        LLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
Subjt:  LLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE

Query:  ADASAT
        ADASAT
Subjt:  ADASAT

XP_022148091.1 uncharacterized protein LOC111016855 isoform X3 [Momordica charantia]0.093.9Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSGR
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLV   G    P +   V  +     S+ +    L  FQSNLVSYQNLITRCVYDKQLDSGR
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSGR

Query:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
        GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
Subjt:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA

Query:  KEADASAT
        KEADASAT
Subjt:  KEADASAT

XP_022148092.1 uncharacterized protein LOC111016855 isoform X4 [Momordica charantia]1.55e-30599.54Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG

Query:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
        TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
Subjt:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ

XP_038892952.1 uncharacterized protein LOC120081846 [Benincasa hispida]2.31e-30282.66Show/hide
Query:  FVCHELAIQQL-----ISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYAL
        F+CHEL  Q       ISVYG  F FGGFRTM+KK+ EVIRLERESVIPILKP LI+ LS+HL D  DR EF+  CQRVEYSIRAWYLL FDDLLHLY+L
Subjt:  FVCHELAIQQL-----ISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYAL

Query:  FDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGI
        F+PIHGA KLE++NLS EE DV+EQKFLG LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGI
Subjt:  FDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGI

Query:  GIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRP
        GIDQM D+FY TKVN IIMRIW FFLK++GL  L+  GASRS +SQVF+KQIDIST+SEDDGLYVERIRVENM  GISMLL++ITIQEPTFDRIIV+YRP
Subjt:  GIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRP

Query:  ANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITR
        AN   EMERGIFVKHFKNIPMADLEIVLPEK NP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y VKTYLSFQ NLVSYQ+LIT 
Subjt:  ANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITR

Query:  CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKII
        CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+AT QELD RCEELI+G+F QSCNFDVDDAVHKL+KLGI+VR ADGAYSCVDLRSANKII
Subjt:  CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKII

Query:  GTTTEEIISKAKEADASAT
        G TTEEI+SKAKE DAS T
Subjt:  GTTTEEIISKAKEADASAT

TrEMBL top hitse value%identityAlignment
A0A6J1D1Z1 uncharacterized protein LOC111016855 isoform X20.099.8Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ

Query:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
        NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
Subjt:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK

Query:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFV
        VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFV
Subjt:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFV

Query:  KHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGT
        KHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGT
Subjt:  KHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGT

Query:  LLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        LLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
Subjt:  LLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE

Query:  ADASAT
        ADASAT
Subjt:  ADASAT

A0A6J1D2Z1 uncharacterized protein LOC111016855 isoform X10.099.61Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG

Query:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
        TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
Subjt:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK

Query:  EADASAT
        EADASAT
Subjt:  EADASAT

A0A6J1D449 uncharacterized protein LOC111016855 isoform X47.51e-30699.54Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG

Query:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
        TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
Subjt:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ

A0A6J1D4B7 uncharacterized protein LOC111016855 isoform X30.093.9Show/hide
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSGR
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLV   G    P +   V  +     S+ +    L  FQSNLVSYQNLITRCVYDKQLDSGR
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSGR

Query:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
        GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
Subjt:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA

Query:  KEADASAT
        KEADASAT
Subjt:  KEADASAT

A0A6J1F2F4 uncharacterized protein LOC1114415295.57e-28984.77Show/hide
Query:  FRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFL
        FRTM+KKK EVIRLERESVIPILKPRLIS LS+ L D SDR+EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDPIHGA KLEQQNLS EETD LEQKFL
Subjt:  FRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFL

Query:  GHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKI
        G LFQVM+KSNF++TTD+EIAVALS QYRLNLPISVDESKLD KLLT YF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN II RIW FFL I
Subjt:  GHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKI

Query:  SGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEME-RGIFVKHFKNIPMADLEIV
         GL RL+   ASRSH+SQVF+KQIDISTDS+DDGLYVERIRVENM LG SML ++ITIQEPTFDRIIVVYRPA+ N E+E RGIF+KHFKNIPMADLEIV
Subjt:  SGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEME-RGIFVKHFKNIPMADLEIV

Query:  LPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVK
        LPEKK+P LTPMDWV FLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y VKTYLSFQ NLVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEVK
Subjt:  LPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVK

Query:  EVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        EVIISFYILMKQG+AT QELD+RCEELIQ QF QSCNF+VDDAVHKLEKLGI++RDADGAYSCVDLRSAN IIG TTEEI++KAKE
Subjt:  EVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46915.1 Protein of unknown function (DUF3754)2.1e-1728Show/hide
Query:  ENMKLGISMLLSEITIQEPTFDRIIVVY------RPANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSL-----
        + +K  IS+LLS  T+QEP F+ +I++Y      +      E    + ++ F+ IP+ DL ++ P KK      +D V+  +++ +GL     +      
Subjt:  ENMKLGISMLLSEITIQEPTFDRIIVVY------RPANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSL-----

Query:  -SVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATIQELDKRCEELI
         S P+A    + A V+A+ +Y+ +  L ++     YQ L+ + +Y+K L SG G++  L D   QQ+ KE I+++ I+++ GK    + + +  RCE  +
Subjt:  -SVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATIQELDKRCEELI

Query:  QGQFGQSCNFDVDDAVHKLEKLGIV
           F       V+ A+  L +LG+V
Subjt:  QGQFGQSCNFDVDDAVHKLEKLGIV

AT3G19340.1 Protein of unknown function (DUF3754)1.0e-17662.94Show/hide
Query:  MSKKKN-EVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGH
        M+++ N EVIRLE ESVIPILKP+LI TL+  +E  +DR EF+KLC+R+EY++RAWYLL F+DL+ LY+LFDP+HGA K++QQNL+++E DVLEQ FL +
Subjt:  MSKKKN-EVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGH

Query:  LFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISG
        LFQVM+KSNF+IT+++E+ VA S QY LNLPI VDESKLDKKLL +YF E+PH+N+P F+DKY+IFRRGIG+D+ TD+F+  K++ II R W+F ++I+ 
Subjt:  LFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISG

Query:  LNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFVKHFKNIPMADLEIVLPE
        L +L +   S S   +   K  + + D+++D LYVERIR+EN KL     LS++TIQEPTFDR+IVVYR A+  + +ERGI+VKHFKNIPMAD+EIVLPE
Subjt:  LNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFVKHFKNIPMADLEIVLPE

Query:  KKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI
        K+NP LTPMDWVKFL+SA +GLV V+ S+ +P +D  VI AI+S V  Y  KTY +FQ N+ +YQNLIT+ +YDKQLDSGRGTLLHLCD+VIQQEVKEV+
Subjt:  KKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVI

Query:  ISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        I FYILM+QGKAT+++LD RCEELI+ +FG  CNFDV+DAV KLEKLGIV RD  G Y C+ L+ AN+IIGTTTEE++ KAK+
Subjt:  ISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE

AT5G13940.1 aminopeptidases2.8e-16360.96Show/hide
Query:  NEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMK
        N V +++ + VI  +K         +++D  +R EF++ CQRVE +IRAWY LHF+DL+ LY+LF+P+ GA +L QQNLST E D LE +FL HLFQVM+
Subjt:  NEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMK

Query:  KSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQ
        KSNF++ T++EI VALSAQYRLNLPI V+E+KLD KLLT+YF++ P D+LP+FADKYIIFRRG GID M  +F+  K++TI++RIW F L I+ L RL+ 
Subjt:  KSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQ

Query:  CGASRSHRSQV-FAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPS
               ++ V  ++QIDIS ++E D LY+ERIR+E +KL +S L+ +ITIQEPTF+RIIVVYR  +   E ER I+VKHFK IPMAD+EIVLPEKKNP 
Subjt:  CGASRSHRSQV-FAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPS

Query:  LTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI
        LTP+DWVKFLVSAAIGLVTV+ S+S+  ADIRVI AI+S V  Y VKTY +FQ NLV YQ+LITR VYDKQLDSGRGTLLHLCDEVIQQEVKEVIISF++
Subjt:  LTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYI

Query:  LMKQGKATI-QELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        L+K+G  T  +ELD + E  I+ +F +SCNFDVDDA+ KLEKLG+V RD++  Y CV+++ AN+I+GTTTEE++ KA++
Subjt:  LMKQGKATI-QELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCCATGGCCATGGATTCGTCTGTCACGAACTGGCAATCCAACAATTGATTTCTGTTTACGGTACTCTCTTTGATTTTGGAGGTTTCCGAACAATGTCCAAGAA
GAAGAATGAAGTCATACGCTTGGAGAGGGAGTCGGTTATTCCCATCCTCAAGCCCAGGCTTATCAGCACCTTGTCCGCCCATCTCGAGGACGATTCGGACCGGAATGAGT
TTATAAAGCTTTGCCAGAGAGTTGAATACTCGATTCGAGCTTGGTATCTTCTGCATTTTGATGATCTTTTGCATTTATATGCGTTATTCGATCCTATACACGGGGCTCTG
AAATTGGAGCAGCAAAATCTCTCTACCGAGGAAACTGATGTTTTGGAACAAAAATTTCTGGGACACCTGTTTCAGGTGATGAAGAAGAGCAATTTTAGAATTACGACAGA
TGATGAAATCGCGGTTGCACTTTCTGCACAATATCGTTTAAATCTTCCTATCTCTGTGGACGAATCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCACGGAGAATC
CTCACGACAATCTTCCATATTTTGCTGATAAGTACATCATTTTCCGCCGTGGTATTGGGATTGACCAAATGACCGACCACTTCTACGACACAAAAGTAAATACCATAATT
ATGCGAATATGGACGTTCTTTCTCAAAATCTCAGGGTTAAATAGACTTATTCAATGTGGAGCGTCAAGAAGTCACCGAAGTCAGGTCTTTGCAAAACAAATTGACATCAG
TACAGATTCAGAGGATGATGGCTTGTATGTTGAGCGTATCCGCGTTGAGAACATGAAACTTGGGATCTCTATGCTATTGAGCGAGATTACGATCCAAGAACCCACGTTTG
ATAGAATTATCGTTGTTTACAGGCCGGCAAATATGAATAGTGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCAATGGCAGATCTTGAGATTGTGCTC
CCTGAAAAGAAAAATCCAAGTTTAACTCCAATGGACTGGGTGAAATTCCTCGTGTCTGCTGCAATTGGGCTGGTTACTGTTATTGGCTCGCTTAGTGTCCCTACAGCAGA
TATCAGAGTCATTTTTGCTATCGTCTCTGCAGTCAGTGTTTACTCTGTGAAAACATATCTCTCGTTTCAGAGTAATTTAGTGTCATATCAGAACCTAATCACAAGGTGCG
TGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTATATATTGATGAAA
CAGGGAAAGGCTACAATACAGGAGCTCGACAAGCGGTGCGAGGAGCTGATTCAAGGACAGTTTGGGCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGA
GAAGTTAGGGATCGTTGTCCGGGATGCAGATGGAGCATATTCCTGTGTAGATTTGAGGAGTGCCAATAAGATCATAGGCACCACCACAGAGGAGATTATTTCCAAGGCTA
AAGAGGCTGATGCCTCCGCTACT
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCCATGGCCATGGATTCGTCTGTCACGAACTGGCAATCCAACAATTGATTTCTGTTTACGGTACTCTCTTTGATTTTGGAGGTTTCCGAACAATGTCCAAGAA
GAAGAATGAAGTCATACGCTTGGAGAGGGAGTCGGTTATTCCCATCCTCAAGCCCAGGCTTATCAGCACCTTGTCCGCCCATCTCGAGGACGATTCGGACCGGAATGAGT
TTATAAAGCTTTGCCAGAGAGTTGAATACTCGATTCGAGCTTGGTATCTTCTGCATTTTGATGATCTTTTGCATTTATATGCGTTATTCGATCCTATACACGGGGCTCTG
AAATTGGAGCAGCAAAATCTCTCTACCGAGGAAACTGATGTTTTGGAACAAAAATTTCTGGGACACCTGTTTCAGGTGATGAAGAAGAGCAATTTTAGAATTACGACAGA
TGATGAAATCGCGGTTGCACTTTCTGCACAATATCGTTTAAATCTTCCTATCTCTGTGGACGAATCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCACGGAGAATC
CTCACGACAATCTTCCATATTTTGCTGATAAGTACATCATTTTCCGCCGTGGTATTGGGATTGACCAAATGACCGACCACTTCTACGACACAAAAGTAAATACCATAATT
ATGCGAATATGGACGTTCTTTCTCAAAATCTCAGGGTTAAATAGACTTATTCAATGTGGAGCGTCAAGAAGTCACCGAAGTCAGGTCTTTGCAAAACAAATTGACATCAG
TACAGATTCAGAGGATGATGGCTTGTATGTTGAGCGTATCCGCGTTGAGAACATGAAACTTGGGATCTCTATGCTATTGAGCGAGATTACGATCCAAGAACCCACGTTTG
ATAGAATTATCGTTGTTTACAGGCCGGCAAATATGAATAGTGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCAATGGCAGATCTTGAGATTGTGCTC
CCTGAAAAGAAAAATCCAAGTTTAACTCCAATGGACTGGGTGAAATTCCTCGTGTCTGCTGCAATTGGGCTGGTTACTGTTATTGGCTCGCTTAGTGTCCCTACAGCAGA
TATCAGAGTCATTTTTGCTATCGTCTCTGCAGTCAGTGTTTACTCTGTGAAAACATATCTCTCGTTTCAGAGTAATTTAGTGTCATATCAGAACCTAATCACAAGGTGCG
TGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTATATATTGATGAAA
CAGGGAAAGGCTACAATACAGGAGCTCGACAAGCGGTGCGAGGAGCTGATTCAAGGACAGTTTGGGCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTTAGA
GAAGTTAGGGATCGTTGTCCGGGATGCAGATGGAGCATATTCCTGTGTAGATTTGAGGAGTGCCAATAAGATCATAGGCACCACCACAGAGGAGATTATTTCCAAGGCTA
AAGAGGCTGATGCCTCCGCTACT
Protein sequenceShow/hide protein sequence
MGSHGHGFVCHELAIQQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGAL
KLEQQNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTII
MRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYRPANMNSEMERGIFVKHFKNIPMADLEIVL
PEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
QGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKEADASAT