CuGenDBv2

MS013286 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013286
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF3754)
Genome location: scaffold459:1893994..1898855
RNA-Seq Expression: MS013286
Synteny: MS013286
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR022227 - Protein of unknown function DUF3754
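
The genome location above follows the common seqid:start..end layout. A minimal Python sketch for splitting it into its parts is shown here; the names GenomeLocation and parse_location are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


# seqid is everything before the colon; start and end are separated by '..'
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'scaffold459:1893994..1898855'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomeLocation(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("scaffold459:1893994..1898855")
    print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 4862 bp on scaffold459.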


Homology
GenBank top hits (e value, % identity, alignment)
XP_022148089.1 uncharacterized protein LOC111016855 isoform X1 [Momordica charantia] (e value: 1.4e-281; % identity: 99.41)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGI
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI

Query:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
        FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
Subjt:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR

Query:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
        GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
Subjt:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA

Query:  KEADASAT
        KEADASAT
Subjt:  KEADASAT

XP_022148090.1 uncharacterized protein LOC111016855 isoform X2 [Momordica charantia] (e value: 5.8e-283; % identity: 99.61)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ

Query:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
        NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
Subjt:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK

Query:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIF
        VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGIF
Subjt:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG

Query:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
        TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
Subjt:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK

Query:  EADASAT
        EADASAT
Subjt:  EADASAT

XP_022148091.1 uncharacterized protein LOC111016855 isoform X3 [Momordica charantia] (e value: 2.4e-260; % identity: 93.71)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGI
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI

Query:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSG
        FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLV   G    P +   V  +     S+ +    L  FQSNLVSYQNLITRCVYDKQLDSG
Subjt:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSG

Query:  RGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISK
        RGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISK
Subjt:  RGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISK

Query:  AKEADASAT
        AKEADASAT
Subjt:  AKEADASAT

XP_022148092.1 uncharacterized protein LOC111016855 isoform X4 [Momordica charantia] (e value: 1.9e-238; % identity: 99.31)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGI
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI

Query:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
        FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
Subjt:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR

Query:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
        GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
Subjt:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ

XP_038892952.1 uncharacterized protein LOC120081846 [Benincasa hispida] (e value: 2.8e-237; % identity: 82.5)
Query:  FVCHELAIQQL-----ISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYAL
        F+CHEL  Q       ISVYG  F FGGFRTM+KK+ EVIRLERESVIPILKP LI+ LS+HL D  DR EF+  CQRVEYSIRAWYLL FDDLLHLY+L
Subjt:  FVCHELAIQQL-----ISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYAL

Query:  FDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGI
        F+PIHGA KLE++NLS EE DV+EQKFLG LFQVM+KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGI
Subjt:  FDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGI

Query:  GIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSR
        GIDQM D+FY TKVN IIMRIW FFLK++GL  L+  GASRS +SQVF+KQIDIST+SEDDGLYVERIRVENM  GISMLL++ITIQEPTFDRIIV+Y R
Subjt:  GIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSR

Query:  PANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLIT
        PAN   EMERGIFVKHFKNIPMADLEIVLPEK NP LTPMDWVKFLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y VKTYLSFQ NLVSYQ+LIT
Subjt:  PANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLIT

Query:  RCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKI
         CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+AT QELD RCEELI+G+F QSCNFDVDDAVHKL+KLGI+VR ADGAYSCVDLRSANKI
Subjt:  RCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKI

Query:  IGTTTEEIISKAKEADASAT
        IG TTEEI+SKAKE DAS T
Subjt:  IGTTTEEIISKAKEADASAT

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D1Z1 uncharacterized protein LOC111016855 isoform X2 (e value: 2.8e-283; % identity: 99.61)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQ

Query:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
        NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK
Subjt:  NLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTK

Query:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIF
        VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGIF
Subjt:  VNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIF

Query:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
        VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG
Subjt:  VKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRG

Query:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
        TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK
Subjt:  TLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAK

Query:  EADASAT
        EADASAT
Subjt:  EADASAT

A0A6J1D2Z1 uncharacterized protein LOC111016855 isoform X1 (e value: 6.9e-282; % identity: 99.41)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGI
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI

Query:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
        FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
Subjt:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR

Query:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
        GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA
Subjt:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKA

Query:  KEADASAT
        KEADASAT
Subjt:  KEADASAT

A0A6J1D449 uncharacterized protein LOC111016855 isoform X4 (e value: 9.4e-239; % identity: 99.31)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGI
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI

Query:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
        FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR
Subjt:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGR

Query:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
        GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ
Subjt:  GTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQ

A0A6J1D4B7 uncharacterized protein LOC111016855 isoform X3 (e value: 1.1e-260; % identity: 93.71)
Query:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
        +QLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ
Subjt:  QQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHL-EDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQ

Query:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
        QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT
Subjt:  QNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDT

Query:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI
        KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVY RPANMNSEMERGI
Subjt:  KVNTIIMRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGI

Query:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSG
        FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLV   G    P +   V  +     S+ +    L  FQSNLVSYQNLITRCVYDKQLDSG
Subjt:  FVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLS-FQSNLVSYQNLITRCVYDKQLDSG

Query:  RGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISK
        RGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISK
Subjt:  RGTLLHLCDEVIQQEVKEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISK

Query:  AKEADASAT
        AKEADASAT
Subjt:  AKEADASAT

A0A6J1F2F4 uncharacterized protein LOC111441529 (e value: 2.4e-226; % identity: 84.6)
Query:  FRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFL
        FRTM+KKK EVIRLERESVIPILKPRLIS LS+ L D SDR+EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDPIHGA KLEQQNLS EETD LEQKFL
Subjt:  FRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFL

Query:  GHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKI
        G LFQVM+KSNF++TTD+EIAVALS QYRLNLPISVDESKLD KLLT YF ENPHDNLPYFADKYIIFRRGIGIDQM DHFY TKVN II RIW FFL I
Subjt:  GHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKI

Query:  SGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEM-ERGIFVKHFKNIPMADLEI
         GL RL+   ASRSH+SQVF+KQIDISTDS+DDGLYVERIRVENM LG SML ++ITIQEPTFDRIIVVY RPA+ N E+ ERGIF+KHFKNIPMADLEI
Subjt:  SGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEM-ERGIFVKHFKNIPMADLEI

Query:  VLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEV
        VLPEKK+P LTPMDWV FLVSAAIGLVTVIGSLSVP AD++VIFAI+SAV  Y VKTYLSFQ NLVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEV
Subjt:  VLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEV

Query:  KEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        KEVIISFYILMKQG+AT QELD+RCEELIQ QF QSCNF+VDDAVHKLEKLGI++RDADGAYSCVDLRSAN IIG TTEEI++KAKE
Subjt:  KEVIISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G46915.1 Protein of unknown function (DUF3754) (e value: 1.9e-18; % identity: 28.44)
Query:  ENMKLGISMLLSEITIQEPTFDRIIVVYSRPAN-----MNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSL-----
        + +K  IS+LLS  T+QEP F+ +I++Y++ A+        E    + ++ F+ IP+ DL ++ P KK      +D V+  +++ +GL     +      
Subjt:  ENMKLGISMLLSEITIQEPTFDRIIVVYSRPAN-----MNSEMERGIFVKHFKNIPMADLEIVLPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSL-----

Query:  -SVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATIQELDKRCEELI
         S P+A    + A V+A+ +Y+ +  L ++     YQ L+ + +Y+K L SG G++  L D   QQ+ KE I+++ I+++ GK    + + +  RCE  +
Subjt:  -SVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATIQELDKRCEELI

Query:  QGQFGQSCNFDVDDAVHKLEKLGIV
           F       V+ A+  L +LG+V
Subjt:  QGQFGQSCNFDVDDAVHKLEKLGIV

AT3G19340.1 Protein of unknown function (DUF3754) (e value: 1.9e-175; % identity: 62.81)
Query:  MSKKKN-EVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGH
        M+++ N EVIRLE ESVIPILKP+LI TL+  +E  +DR EF+KLC+R+EY++RAWYLL F+DL+ LY+LFDP+HGA K++QQNL+++E DVLEQ FL +
Subjt:  MSKKKN-EVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGH

Query:  LFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISG
        LFQVM+KSNF+IT+++E+ VA S QY LNLPI VDESKLDKKLL +YF E+PH+N+P F+DKY+IFRRGIG+D+ TD+F+  K++ II R W+F ++I+ 
Subjt:  LFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISG

Query:  LNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIFVKHFKNIPMADLEIVLP
        L +L +   S S   +   K  + + D+++D LYVERIR+EN KL     LS++TIQEPTFDR+IVVY R A+  + +ERGI+VKHFKNIPMAD+EIVLP
Subjt:  LNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIFVKHFKNIPMADLEIVLP

Query:  EKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEV
        EK+NP LTPMDWVKFL+SA +GLV V+ S+ +P +D  VI AI+S V  Y  KTY +FQ N+ +YQNLIT+ +YDKQLDSGRGTLLHLCD+VIQQEVKEV
Subjt:  EKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEV

Query:  IISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        +I FYILM+QGKAT+++LD RCEELI+ +FG  CNFDV+DAV KLEKLGIV RD  G Y C+ L+ AN+IIGTTTEE++ KAK+
Subjt:  IISFYILMKQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE

AT5G13940.1 aminopeptidases (e value: 5.4e-162; % identity: 60.83)
Query:  NEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMK
        N V +++ + VI  +K         +++D  +R EF++ CQRVE +IRAWY LHF+DL+ LY+LF+P+ GA +L QQNLST E D LE +FL HLFQVM+
Subjt:  NEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGALKLEQQNLSTEETDVLEQKFLGHLFQVMK

Query:  KSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQ
        KSNF++ T++EI VALSAQYRLNLPI V+E+KLD KLLT+YF++ P D+LP+FADKYIIFRRG GID M  +F+  K++TI++RIW F L I+ L RL+ 
Subjt:  KSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTIIMRIWTFFLKISGLNRLIQ

Query:  CGASRSHRSQV-FAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNP
               ++ V  ++QIDIS ++E D LY+ERIR+E +KL +S L+ +ITIQEPTF+RIIVVY R +    E ER I+VKHFK IPMAD+EIVLPEKKNP
Subjt:  CGASRSHRSQV-FAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIFVKHFKNIPMADLEIVLPEKKNP

Query:  SLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY
         LTP+DWVKFLVSAAIGLVTV+ S+S+  ADIRVI AI+S V  Y VKTY +FQ NLV YQ+LITR VYDKQLDSGRGTLLHLCDEVIQQEVKEVIISF+
Subjt:  SLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY

Query:  ILMKQGKATI-QELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
        +L+K+G  T  +ELD + E  I+ +F +SCNFDVDDA+ KLEKLG+V RD++  Y CV+++ AN+I+GTTTEE++ KA++
Subjt:  ILMKQGKATI-QELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKE
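
In the alignment blocks above, the unlabeled middle row follows the usual BLAST convention: a residue letter marks an identical position, '+' marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below computes percent identity from such a middle row. The function name midline_identity is hypothetical, and the result covers only the columns passed in, so it need not match the full-alignment % identity reported for a hit.

```python
def midline_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a residue letter.

    The midline must be passed at its full alignment width, including any
    leading or trailing blanks, because blanks count as non-identical columns.
    """
    if not midline:
        raise ValueError("midline is empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


if __name__ == "__main__":
    # Final rows of the Benincasa hispida and Momordica charantia hits above.
    for row in ("IG TTEEI+SKAKE DAS T", "KEADASAT"):
        print(f"{row!r}: {midline_identity(row):.2f}% identical")
```

To score a whole hit, concatenate the middle rows of all its blocks (after removing the fixed left margin) before calling the function.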


Sequences
CDS sequence
ATGGGAAGCCATGGCCATGGATTCGTCTGTCACGAACTGGCAATCCAACAATTGATTTCTGTTTACGGTACTCTCTTTGATTTTGGAGGTTTCCGAACAATGTCCAAGAA
GAAGAATGAAGTCATACGCTTGGAGAGGGAGTCGGTTATTCCCATCCTCAAGCCCAGACTTATCAGCACCTTGTCCGCCCATCTCGAGGACGATTCGGACCGGAATGAGT
TTATAAAGCTTTGCCAGAGAGTTGAATACTCGATTCGAGCTTGGTATCTTCTGCATTTTGATGATCTTTTGCATTTATATGCGTTATTCGATCCTATACACGGGGCTCTG
AAATTGGAGCAGCAAAATCTCTCTACCGAGGAAACTGATGTTTTGGAACAAAAATTTCTGGGACACCTGTTTCAGGTGATGAAGAAGAGCAATTTTAGAATTACGACAGA
TGATGAAATCGCGGTTGCACTTTCTGCACAATATCGTTTAAATCTTCCTATCTCTGTGGATGAATCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCACGGAGAATC
CTCACGACAATCTTCCATATTTTGCTGATAAGTACATCATTTTCCGCCGTGGTATTGGGATTGACCAAATGACCGACCACTTCTACGACACAAAAGTAAATACCATAATT
ATGCGAATATGGACGTTCTTTCTCAAAATCTCAGGGTTAAATAGACTTATTCAATGTGGAGCGTCAAGAAGTCACCGAAGTCAGGTCTTTGCAAAACAAATTGACATCAG
TACAGATTCAGAGGATGATGGCTTGTATGTTGAGCGTATCCGCGTCGAGAACATGAAACTTGGGATCTCTATGCTATTGAGCGAGATTACGATCCAAGAACCCACGTTTG
ATAGAATTATCGTTGTTTACAGCAGGCCGGCAAATATGAATAGTGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCAATGGCAGATCTTGAGATTGTG
CTCCCTGAAAAGAAAAATCCAAGTTTAACTCCAATGGACTGGGTGAAATTCCTCGTGTCTGCTGCAATTGGGCTGGTTACTGTTATTGGCTCGCTTAGTGTCCCTACAGC
AGATATCAGAGTCATTTTTGCTATCGTCTCTGCAGTCAGTGTTTACTCTGTGAAAACATATCTCTCGTTTCAGAGTAATTTAGTGTCATATCAGAACCTAATCACAAGGT
GCGTGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTATATATTGATG
AAACAGGGAAAGGCTACAATACAGGAGCTCGACAAGCGGTGCGAGGAGCTGATTCAAGGACAGTTTGGGCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTT
AGAGAAGTTAGGGATCGTTGTCCGGGATGCAGATGGAGCATATTCCTGTGTAGATTTGAGGAGTGCCAATAAGATCATAGGCACCACCACAGAGGAGATTATTTCCAAGG
CTAAAGAGGCTGATGCCTCCGCTACT
mRNA sequence
ATGGGAAGCCATGGCCATGGATTCGTCTGTCACGAACTGGCAATCCAACAATTGATTTCTGTTTACGGTACTCTCTTTGATTTTGGAGGTTTCCGAACAATGTCCAAGAA
GAAGAATGAAGTCATACGCTTGGAGAGGGAGTCGGTTATTCCCATCCTCAAGCCCAGACTTATCAGCACCTTGTCCGCCCATCTCGAGGACGATTCGGACCGGAATGAGT
TTATAAAGCTTTGCCAGAGAGTTGAATACTCGATTCGAGCTTGGTATCTTCTGCATTTTGATGATCTTTTGCATTTATATGCGTTATTCGATCCTATACACGGGGCTCTG
AAATTGGAGCAGCAAAATCTCTCTACCGAGGAAACTGATGTTTTGGAACAAAAATTTCTGGGACACCTGTTTCAGGTGATGAAGAAGAGCAATTTTAGAATTACGACAGA
TGATGAAATCGCGGTTGCACTTTCTGCACAATATCGTTTAAATCTTCCTATCTCTGTGGATGAATCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCACGGAGAATC
CTCACGACAATCTTCCATATTTTGCTGATAAGTACATCATTTTCCGCCGTGGTATTGGGATTGACCAAATGACCGACCACTTCTACGACACAAAAGTAAATACCATAATT
ATGCGAATATGGACGTTCTTTCTCAAAATCTCAGGGTTAAATAGACTTATTCAATGTGGAGCGTCAAGAAGTCACCGAAGTCAGGTCTTTGCAAAACAAATTGACATCAG
TACAGATTCAGAGGATGATGGCTTGTATGTTGAGCGTATCCGCGTCGAGAACATGAAACTTGGGATCTCTATGCTATTGAGCGAGATTACGATCCAAGAACCCACGTTTG
ATAGAATTATCGTTGTTTACAGCAGGCCGGCAAATATGAATAGTGAAATGGAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCAATGGCAGATCTTGAGATTGTG
CTCCCTGAAAAGAAAAATCCAAGTTTAACTCCAATGGACTGGGTGAAATTCCTCGTGTCTGCTGCAATTGGGCTGGTTACTGTTATTGGCTCGCTTAGTGTCCCTACAGC
AGATATCAGAGTCATTTTTGCTATCGTCTCTGCAGTCAGTGTTTACTCTGTGAAAACATATCTCTCGTTTCAGAGTAATTTAGTGTCATATCAGAACCTAATCACAAGGT
GCGTGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTCACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTATATATTGATG
AAACAGGGAAAGGCTACAATACAGGAGCTCGACAAGCGGTGCGAGGAGCTGATTCAAGGACAGTTTGGGCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCACAAGTT
AGAGAAGTTAGGGATCGTTGTCCGGGATGCAGATGGAGCATATTCCTGTGTAGATTTGAGGAGTGCCAATAAGATCATAGGCACCACCACAGAGGAGATTATTTCCAAGG
CTAAAGAGGCTGATGCCTCCGCTACT
Protein sequence
MGSHGHGFVCHELAIQQLISVYGTLFDFGGFRTMSKKKNEVIRLERESVIPILKPRLISTLSAHLEDDSDRNEFIKLCQRVEYSIRAWYLLHFDDLLHLYALFDPIHGAL
KLEQQNLSTEETDVLEQKFLGHLFQVMKKSNFRITTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFTENPHDNLPYFADKYIIFRRGIGIDQMTDHFYDTKVNTII
MRIWTFFLKISGLNRLIQCGASRSHRSQVFAKQIDISTDSEDDGLYVERIRVENMKLGISMLLSEITIQEPTFDRIIVVYSRPANMNSEMERGIFVKHFKNIPMADLEIV
LPEKKNPSLTPMDWVKFLVSAAIGLVTVIGSLSVPTADIRVIFAIVSAVSVYSVKTYLSFQSNLVSYQNLITRCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILM
KQGKATIQELDKRCEELIQGQFGQSCNFDVDDAVHKLEKLGIVVRDADGAYSCVDLRSANKIIGTTTEEIISKAKEADASAT
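
The CDS and protein sequences listed above can be checked against each other by translating the CDS codon by codon. The sketch below does this with the standard genetic code, which is an assumption here since the record does not say which translation table was used; the names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record was translated with this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; line breaks and spaces are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError(f"sequence length {len(seq)} is not a multiple of 3")
    # Codons with ambiguous bases (for example N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First 24 bases of the CDS sequence listed above.
    print(translate("ATGGGAAGCCATGGCCATGGATTC"))
```

The first eight codons of the CDS translate to MGSHGHGF, matching the start of the protein sequence. Pasting the full multi-line CDS into translate works as is, because whitespace is stripped before translation.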