; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023504 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023504
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationtig00000892:3868015..3874537
RNA-Seq ExpressionSgr023504
SyntenySgr023504
Gene Ontology termsNA
InterPro domainsIPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605382.1 Telomeric repeat-binding factor 1, partial [Cucurbita argyrosperma subsp. sororia]4.7e-19265.03Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        MD+D+ RW++EFILRSSM+DHLLKR LAV+P PD D+RLKKTVLLRAIESE SEA +TEK+L IFEMIEQLDKTEG+A+++SMK+AYCAVAVECTVK+L 
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
        V G+   GKYFD V RIWRG+V EL RSG+SELVSREL+ WKDEVEAA  DK V KKL NMN+R +AL+L+ +Y+GE W +LGP FL+LSASL+D R  N
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES
        EM     +  IDKTA+ASED+GGS GI++PSQ  N+AR E   SV VL+ A+T R ++L++++D GV++ S++ + V +NTE VQ L T+    T+ QES
Subjt:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES

Query:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD
        +E    VLQD SP+  +NLK S++PR KSLA+H+RVRGGAKI + ++LEN +SS K +CL +PE  +V+EALKTSSLELQA+VKDPLP ALR++E++  D
Subjt:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD

Query:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE
        LA+KNK  EHSL ++ND  A +P+ NK + PLQS    F NP HGH+ I P+PS+MERNSSA TYEWNDSIDGSPE +   RLHL SPKRKVISPLKKYE
Subjt:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE

Query:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        E +   RRK KRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFD+RTE
Subjt:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

KAG7035336.1 hypothetical protein SDJN02_02131, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-19164.85Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        MD+D+ RW++EFILRSSM+DHLLKR LAV+P PD D+RLKKTVLLRAIESE SEA +TEK+L IFEMIEQLDKTEG+A+++SMK+AYCAVAVECTVK+L 
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
        V G+   GKYFD V RIWRG++ EL RSG+SELVSREL+ WKDEVEAA  DK V KKL NMN+R +AL+L+ +Y+GE W +LGP FL+LSASL+D R  N
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES
        EM     +  IDKTA+ASED+GGS GI++PSQ  N+AR E   SV VL+ A+T R ++L++++D GV++ S++ + V +NTE VQ L T+    T+ QES
Subjt:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES

Query:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD
        +E    VLQD SP+  +NLK S++PR KSLA+H+RVRGGAKI + ++LEN +SS K +CL +PE  +V+EALKTSSLELQA+VKDPLP ALR++E++  D
Subjt:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD

Query:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE
        LA+KNK  EHSL ++ND  A +P+ NK + PLQS    F NP HGH+ I P+PS+MERNSSA TYEWNDSIDGSPE +   RLHL SPKRKVISPLKKYE
Subjt:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE

Query:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        E +   RRK KRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFD+RTE
Subjt:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

XP_022149751.1 uncharacterized protein LOC111018108 [Momordica charantia]2.4e-20470.96Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        MDED+ RWV+EFILRSSMDD LLKR+LAVIPIPDKD+RLKKTVLLRAIES+T +AEITEK LEIFEMIEQLDKTEG+A++ESMKAAYCAVAVECTVK+LV
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
         GG+    +Y DAVRRIWRGKVTEL RSGKSELVSRELK WKDEVE A  DKNVRK+LANMN+RN+AL+LV  Y+GE WA+LGPPFLQLSASL+D+ MTN
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMPD-------SLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKA
        EM            D+TAI  +D+GGS  I++P Q  N  R ER E+VQVLSEA T R+N+LNI++DLG++DGS Q + VAV+TEEVQELAT EAA  + 
Subjt:  EMPD-------SLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKA

Query:  QESVEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVY
            +G V    + S  EN K SV+PR KSLA+HRRVRGGAKISN D+LEN TS  K SCL +PEVN+V+EALK+S LELQA+VKDPLP ALR +EA+  
Subjt:  QESVEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVY

Query:  DLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQP-SLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKK
        DLA KN+NHEHSL  RND  AT+P+TNK DEPLQSG AD ENP H  +II  +P S+MERNS+ARTYEW+DSIDG+PEG    RLHL SPKRK ISPLKK
Subjt:  DLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQP-SLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKK

Query:  YEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        YEE KF RRRKSKRWSLLEEDTLR AV RFGKGNWKLIL++YRDIF+ERTE
Subjt:  YEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

XP_023007149.1 uncharacterized protein LOC111499729 isoform X1 [Cucurbita maxima]2.1e-19265.21Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        MD+D+ RW++EFILRSSM+D LLKR LA++P PD D+RLKKTVLLRAIESE SEA +TEK+L IFEMIEQLDKTEG+A+++SMK+AYCAVAVECTVK+L 
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
        V G+   GKYFD V RIWRG+V EL RSG+SELVSREL+ WKD+VEAA  DK V KKL NMNSR +AL+L+ +Y+GE W +LGP FL+LSASL+D R  N
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES
        EM     +  I KTA+ SED+GGS GI++PSQ  N+A+ E   SV VL+ A+T R+++L+I++D GV+D S++ + V +NTE VQEL T+    T+ +ES
Subjt:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES

Query:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD
        VE    VLQDPSP+  +NLK SV+PR KSLA+H+RVRGGAKI + ++LEN +SS K +CL +PE ++V+EA KTSSLELQA+VKDPLP ALR++E++  D
Subjt:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD

Query:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE
        LA+KNK  EHSL ++ND  A +P+ NK + PLQS    F NP HGH+ I P+PS+MERNSSA TYEWNDSIDGSPE +   RLHL SPKRKVISPLKKYE
Subjt:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE

Query:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        ET+   RRK KRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFDERTE
Subjt:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

XP_023532585.1 uncharacterized protein LOC111794704 [Cucurbita pepo subsp. pepo]3.9e-19464.85Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        MD+D+ RW++EFILRSSM+DHLLKR LAV+P PD D+RLKKTVLLRAIESE SEA +TEK+L IFEMIEQLDKTEG+A+++SMK+AYCAVAVECTVK+L 
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
        V G+   GKYFD V RIWRG+V EL +SG+SELVSREL+ WKD+VE A  DK V KKL NMN+R +AL+L+ +Y+GE W +LGP FL+LSASL+D R  N
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES
        EMP    +  IDKTA+ SED+GGS GI++PS+  N+AR +   SV VL+ A+T R+++L++++D GV+D S++ + V +NTE VQEL T+    T+ QES
Subjt:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES

Query:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD
        +E    VLQDPSP+  +NLK SV+PR KSLA+H+RVRGGAKI + ++LEN +SS K +CL +PE ++V+EALKTSSLELQA+VKDPLP ALR+++++  D
Subjt:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD

Query:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE
        LA+KNK  EHSL ++ND  A +P+ NK + PLQS    F NPCHGH+ I P+PS+MERNSSA TYEWNDSIDGSPE +   RLHL SPKRKVISPLKKYE
Subjt:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE

Query:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        E +   RRK KRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFD+RTE
Subjt:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

TrEMBL top hitse value%identityAlignment
A0A6J1D9B9 uncharacterized protein LOC1110181081.2e-20470.96Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        MDED+ RWV+EFILRSSMDD LLKR+LAVIPIPDKD+RLKKTVLLRAIES+T +AEITEK LEIFEMIEQLDKTEG+A++ESMKAAYCAVAVECTVK+LV
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
         GG+    +Y DAVRRIWRGKVTEL RSGKSELVSRELK WKDEVE A  DKNVRK+LANMN+RN+AL+LV  Y+GE WA+LGPPFLQLSASL+D+ MTN
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMPD-------SLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKA
        EM            D+TAI  +D+GGS  I++P Q  N  R ER E+VQVLSEA T R+N+LNI++DLG++DGS Q + VAV+TEEVQELAT EAA  + 
Subjt:  EMPD-------SLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKA

Query:  QESVEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVY
            +G V    + S  EN K SV+PR KSLA+HRRVRGGAKISN D+LEN TS  K SCL +PEVN+V+EALK+S LELQA+VKDPLP ALR +EA+  
Subjt:  QESVEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVY

Query:  DLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQP-SLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKK
        DLA KN+NHEHSL  RND  AT+P+TNK DEPLQSG AD ENP H  +II  +P S+MERNS+ARTYEW+DSIDG+PEG    RLHL SPKRK ISPLKK
Subjt:  DLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQP-SLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKK

Query:  YEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        YEE KF RRRKSKRWSLLEEDTLR AV RFGKGNWKLIL++YRDIF+ERTE
Subjt:  YEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

A0A6J1FVK5 uncharacterized protein LOC111448860 isoform X13.2e-18664.79Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        M+EDI RW+ EFILRSSMDDHLLKR+LAVIP+ DKD+RLKKTVLLRAIESE SEA ITEKLLEIFEMIEQL+K EG+ I+ESMKAAYCAVAVECTVK+L+
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
        V G+ K G+YFDAVRRIWRG+VT      K+ELVS E K WKDEVEA+  D N+RKKL +MN+R DAL+L+ +Y+GE WA +GP FLQLSASL+D++M N
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMPDSLIDK----TAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES
        EM    +++     AI S D+GGS GI++PSQR N  R+E   S +VLS+ ++ R+++L+  +DLG +DGS+Q A+ A+NTE VQELAT+ A   ++ E 
Subjt:  EMPDSLIDK----TAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES

Query:  VEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNL----ENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIV
         E  VLQ+ S S  E LK SV+PR KSLA HRRVRGG KIS+ ++L    E+ +SS++ +CL +PEVN+V+EALKTSSLELQA+VKDPLP ALR++E++ 
Subjt:  VEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNL----ENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIV

Query:  YDLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKK
         DLAEKNK  E+SL +RND G  +P+ NK   PLQ   A+ ++P HGHK + P+PS+MERNS+A TYEWNDSID  PEG    RLHLHSPKRK ISPLKK
Subjt:  YDLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKK

Query:  YEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        YEETK   RR+ K+WSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
Subjt:  YEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

A0A6J1G5R7 uncharacterized protein LOC1114511331.5e-18364.66Show/hide
Query:  MDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLVVGGIGKRGKYFDAVRRI
        M+DHLLKR LAV+P PD D+RLKKTVLLRAIESE SEA +TEK+L IFEMIEQLDKTEG+A+++SMK+AYCAVAVECTVK+L V G+   GKYFD V RI
Subjt:  MDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLVVGGIGKRGKYFDAVRRI

Query:  WRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTNEMP----DSLIDKTAIA
        WRG+V EL RSG+SELVSRE + WKDEVEAA  DK V KKL NMN+R +AL+L+ +Y+GE W +LGP FL+LSASL+D R  NEM     +  IDKTA+ 
Subjt:  WRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTNEMP----DSLIDKTAIA

Query:  SEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQESVEG--VVLQDPSPSLSE
        SED+GGS GI+ PS+  N+AR E   SV VL+ A+T R+++L++++D GV+D S++ + V +NTE VQEL T+    T+ QES+E    VLQD SP+  +
Subjt:  SEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQESVEG--VVLQDPSPSLSE

Query:  NLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYDLAEKNKNHEHSLGNRND
        NLK S++PR KSLA+H+RVRGGAKI + + LEN +SS K +CL +PE ++V+EALKTSSLELQA+VKDPLP ALR++E++  DLA+KNK  EHSL ++ND
Subjt:  NLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYDLAEKNKNHEHSLGNRND

Query:  VGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYEETKFSRRRKSKRWSLLE
          A +P+ NK + PLQS    F NP HGH+ I P+PS+MERNSSA TYEWNDSIDGSPE +   RLHL SPKRKVISPLKKYEE +   RRK KRWSLLE
Subjt:  VGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYEETKFSRRRKSKRWSLLE

Query:  EDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        EDTLRTAVQRFGKGNWKLILNSYR+IFD+RTE
Subjt:  EDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

A0A6J1L270 uncharacterized protein LOC111499729 isoform X22.3e-18465.04Show/hide
Query:  MDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLVVGGIGKRGKYFDAVRRI
        M+D LLKR LA++P PD D+RLKKTVLLRAIESE SEA +TEK+L IFEMIEQLDKTEG+A+++SMK+AYCAVAVECTVK+L V G+   GKYFD V RI
Subjt:  MDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLVVGGIGKRGKYFDAVRRI

Query:  WRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTNEMP----DSLIDKTAIA
        WRG+V EL RSG+SELVSREL+ WKD+VEAA  DK V KKL NMNSR +AL+L+ +Y+GE W +LGP FL+LSASL+D R  NEM     +  I KTA+ 
Subjt:  WRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTNEMP----DSLIDKTAIA

Query:  SEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQESVEG--VVLQDPSPSLSE
        SED+GGS GI++PSQ  N+A+ E   SV VL+ A+T R+++L+I++D GV+D S++ + V +NTE VQEL T+    T+ +ESVE    VLQDPSP+  +
Subjt:  SEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQESVEG--VVLQDPSPSLSE

Query:  NLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYDLAEKNKNHEHSLGNRND
        NLK SV+PR KSLA+H+RVRGGAKI + ++LEN +SS K +CL +PE ++V+EA KTSSLELQA+VKDPLP ALR++E++  DLA+KNK  EHSL ++ND
Subjt:  NLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYDLAEKNKNHEHSLGNRND

Query:  VGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYEETKFSRRRKSKRWSLLE
          A +P+ NK + PLQS    F NP HGH+ I P+PS+MERNSSA TYEWNDSIDGSPE +   RLHL SPKRKVISPLKKYEET+   RRK KRWSLLE
Subjt:  VGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYEETKFSRRRKSKRWSLLE

Query:  EDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        EDTLRTAVQRFGKGNWKLILNSYR+IFDERTE
Subjt:  EDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

A0A6J1L461 uncharacterized protein LOC111499729 isoform X11.0e-19265.21Show/hide
Query:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV
        MD+D+ RW++EFILRSSM+D LLKR LA++P PD D+RLKKTVLLRAIESE SEA +TEK+L IFEMIEQLDKTEG+A+++SMK+AYCAVAVECTVK+L 
Subjt:  MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLV

Query:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN
        V G+   GKYFD V RIWRG+V EL RSG+SELVSREL+ WKD+VEAA  DK V KKL NMNSR +AL+L+ +Y+GE W +LGP FL+LSASL+D R  N
Subjt:  VGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTN

Query:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES
        EM     +  I KTA+ SED+GGS GI++PSQ  N+A+ E   SV VL+ A+T R+++L+I++D GV+D S++ + V +NTE VQEL T+    T+ +ES
Subjt:  EMP----DSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQES

Query:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD
        VE    VLQDPSP+  +NLK SV+PR KSLA+H+RVRGGAKI + ++LEN +SS K +CL +PE ++V+EA KTSSLELQA+VKDPLP ALR++E++  D
Subjt:  VEG--VVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYD

Query:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE
        LA+KNK  EHSL ++ND  A +P+ NK + PLQS    F NP HGH+ I P+PS+MERNSSA TYEWNDSIDGSPE +   RLHL SPKRKVISPLKKYE
Subjt:  LAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGHKIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYE

Query:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE
        ET+   RRK KRWSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFDERTE
Subjt:  ETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06910.1 TRF-like 71.2e-1224.54Show/hide
Query:  KTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLVVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKG
        K  +L  I  E  +  + EK LE  E + ++   EG  + +S+  AYC VAVECTVK L      K+  Y +A++ IW G++  L     S LV+ +L  
Subjt:  KTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLVVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKG

Query:  WKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTNEMPDSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHES
            +  A  D    K L + ++R+ AL  +R+ +           L L+ +L+   +  +  D +                            SE  E 
Subjt:  WKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTNEMPDSLIDKTAIASEDLGGSCGIQVPSQRANYARSERHES

Query:  VQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQESVEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAK-ISNP
         + + EA     N                           Q     EA     QES+    L+ P+                         GG+K +  P
Subjt:  VQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQESVEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRVRGGAK-ISNP

Query:  DNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYDLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHG
             + S+  D  L           L+ S +EL   ++   P                N N+E      NDV A   +TN                   
Subjt:  DNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYDLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHG

Query:  HKIIAPQPSLMERNSSARTYEWNDSIDGS--PEGHHVGRLHLHSPKRKVISPLKKYEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLI
            AP+PSLME  S+A TYEWNDSID S    G  + R++    KR V+SPLK+   ++ +RR K   WS  E   +    +++G  NWK I
Subjt:  HKIIAPQPSLMERNSSARTYEWNDSIDGS--PEGHHVGRLHLHSPKRKVISPLKKYEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLI

AT1G15720.1 TRF-like 52.5e-2132.29Show/hide
Query:  EDISRWVLEFILRSSMDD--HLLKRILAVIPIPDKD-YRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFL
        E + +WV EF LR  ++   +    + A+ P+   D  +LK T +LR I +   +  + E +L++ E++E+L   E   I+ S+K+AYC  AVECT++F+
Subjt:  EDISRWVLEFILRSSMDD--HLLKRILAVIPIPDKD-YRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFL

Query:  VVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSA
                G + DA+ RIWR ++  L +  +S+LV+REL  W+ ++  AF +  + +K+   N R +A+  + + + E+WA+LG   L+  A
Subjt:  VVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSA

AT5G58340.1 myb-like HTH transcriptional regulator family protein2.0e-1527.21Show/hide
Query:  EDISRWVLE-FILRSSMDDHLLKRILAVIPIPDKD--YRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFL
        E I +WV E F+LR          +++ + + D     +LK + +LR I +      I E +L++ E++E+L   +   +++S K+AYC  A ECT++F+
Subjt:  EDISRWVLE-FILRSSMDDHLLKRILAVIPIPDKD--YRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFL

Query:  VVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSAS--LIDRR
                G + DA+ RIW  ++  L  SG S+LV+ +L  W+ +++ A  D  + +++   N R  A+  + + + E+WA+LG   L+  A    + R+
Subjt:  VVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSAS--LIDRR

Query:  MTN---EMPDSLIDKTAI-ASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDD
          N   ++ D+  D++ +  S    GS  I +    AN AR ER +   +  +   +   M  +  D G+D+
Subjt:  MTN---EMPDSLIDKTAI-ASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDD

AT5G58340.2 myb-like HTH transcriptional regulator family protein2.0e-1527.21Show/hide
Query:  EDISRWVLE-FILRSSMDDHLLKRILAVIPIPDKD--YRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFL
        E I +WV E F+LR          +++ + + D     +LK + +LR I +      I E +L++ E++E+L   +   +++S K+AYC  A ECT++F+
Subjt:  EDISRWVLE-FILRSSMDDHLLKRILAVIPIPDKD--YRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFL

Query:  VVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSAS--LIDRR
                G + DA+ RIW  ++  L  SG S+LV+ +L  W+ +++ A  D  + +++   N R  A+  + + + E+WA+LG   L+  A    + R+
Subjt:  VVGGIGKRGKYFDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSAS--LIDRR

Query:  MTN---EMPDSLIDKTAI-ASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDD
          N   ++ D+  D++ +  S    GS  I +    AN AR ER +   +  +   +   M  +  D G+D+
Subjt:  MTN---EMPDSLIDKTAI-ASEDLGGSCGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGAAGACATTTCTCGTTGGGTCCTCGAATTTATTCTCCGAAGTTCAATGGACGATCACTTGCTGAAGAGAATTCTTGCAGTTATTCCCATCCCGGACAAGGACTA
TCGCCTGAAGAAAACGGTGCTTTTACGAGCCATTGAGAGCGAAACATCTGAGGCTGAGATCACGGAGAAGTTGCTCGAAATTTTCGAGATGATTGAACAGTTGGATAAAA
CCGAAGGCGTTGCGATCATAGAATCAATGAAAGCTGCATATTGTGCTGTGGCAGTGGAGTGCACGGTGAAGTTCTTGGTGGTAGGAGGCATTGGCAAGCGCGGGAAATAC
TTCGACGCGGTGAGGAGGATCTGGAGAGGTAAGGTGACGGAATTAGTGAGGTCGGGAAAGAGTGAATTGGTTTCCCGTGAATTGAAGGGATGGAAGGACGAGGTGGAAGC
TGCGTTTCGGGATAAGAATGTTAGGAAGAAATTGGCTAACATGAATTCTAGAAATGATGCTCTAAGATTAGTAAGAGAGTACATGGGCGAGGAGTGGGCGATTCTTGGTC
CGCCATTCCTTCAATTGTCTGCTTCGTTGATTGATAGAAGGATGACGAATGAGATGCCTGATTCTTTGATTGACAAAACTGCCATTGCGAGTGAGGATTTAGGTGGCAGT
TGTGGGATTCAAGTGCCTTCTCAAAGGGCAAACTATGCGAGATCAGAGAGGCATGAAAGCGTGCAAGTCCTTAGCGAAGCTGATACGAACAGATCTAATATGCTAAATAT
TAGTCGGGATTTGGGCGTTGATGATGGTTCTAGACAATTGGCCATTGTTGCAGTGAACACTGAGGAAGTCCAGGAGTTGGCTACAAAAGAAGCAGCCCATACAAAAGCGC
AAGAATCAGTTGAGGGAGTAGTTTTGCAGGACCCAAGTCCAAGCCTGAGTGAAAATTTGAAGGTTTCTGTCATGCCTAGACGCAAATCCCTCGCAACTCATAGACGTGTT
AGAGGAGGGGCTAAAATTAGTAATCCCGACAACTTGGAGAATGTCACTTCATCTGATAAAGATAGTTGCCTACATAGTCCCGAAGTTAACAAGGTGCAAGAAGCGCTTAA
AACCAGCTCTTTAGAACTGCAAGCGATGGTGAAGGATCCTCTTCCACATGCATTACGTGTATCAGAAGCTATAGTATATGATCTTGCTGAAAAGAATAAAAATCATGAGC
ATTCTTTGGGTAACCGAAATGATGTAGGTGCCACTAGTCCATCCACTAACAAGTCTGATGAGCCTCTTCAATCTGGGAAAGCAGATTTTGAAAATCCATGCCATGGCCAT
AAGATAATTGCTCCTCAGCCTAGCTTAATGGAACGCAATAGTAGTGCTCGTACGTATGAGTGGAATGATTCAATAGATGGTTCACCTGAAGGCCATCATGTTGGTAGACT
TCATCTTCATAGTCCCAAGAGAAAGGTAATTTCTCCCTTGAAGAAGTATGAAGAAACCAAATTTAGCCGAAGAAGGAAGAGTAAGAGGTGGAGCCTGCTTGAAGAAGACA
CCTTAAGGACTGCCGTGCAGAGGTTTGGTAAAGGAAATTGGAAGCTCATCTTAAACAGTTATCGTGATATATTTGATGAGAGAACCGAGAGGGTGGCTAGACATATTAGA
TGGGGAATGGCTATATGGTGGCATTTGCATATTCTGGGGGCCTGCAGATCCCGGTTGCTGCAGCATTGGAATGAAAGACGACGGGGCAGACCTTTTGGCGCGCTTGGGGT
CACTGGTCCAGAGCTCAGCCAACTTATCAGGAGGCATAGCTTTCTTGGCTTCCATTATCTCCCCAAACATGCTGGACGACGTCGTCCCGTCCACCGAAACGCTATGCCGA
TGCCTTGGCTTCGAAGTCTTCTCACCCTCACTACCGCCGGCACCTTCACTTCCGCCATTGGCATTATGATCCGCGTAATTCCCTCCGCCATTACCTCCAAGCTTCTTGAC
ATCAATGTAGGTAGAGAAGAGATCATCTTCCGATCCGATCTCCTCCAGACTGGCGGTGGAGGAACCGCCGTTAAACGGATCCGACGCGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGACGAAGACATTTCTCGTTGGGTCCTCGAATTTATTCTCCGAAGTTCAATGGACGATCACTTGCTGAAGAGAATTCTTGCAGTTATTCCCATCCCGGACAAGGACTA
TCGCCTGAAGAAAACGGTGCTTTTACGAGCCATTGAGAGCGAAACATCTGAGGCTGAGATCACGGAGAAGTTGCTCGAAATTTTCGAGATGATTGAACAGTTGGATAAAA
CCGAAGGCGTTGCGATCATAGAATCAATGAAAGCTGCATATTGTGCTGTGGCAGTGGAGTGCACGGTGAAGTTCTTGGTGGTAGGAGGCATTGGCAAGCGCGGGAAATAC
TTCGACGCGGTGAGGAGGATCTGGAGAGGTAAGGTGACGGAATTAGTGAGGTCGGGAAAGAGTGAATTGGTTTCCCGTGAATTGAAGGGATGGAAGGACGAGGTGGAAGC
TGCGTTTCGGGATAAGAATGTTAGGAAGAAATTGGCTAACATGAATTCTAGAAATGATGCTCTAAGATTAGTAAGAGAGTACATGGGCGAGGAGTGGGCGATTCTTGGTC
CGCCATTCCTTCAATTGTCTGCTTCGTTGATTGATAGAAGGATGACGAATGAGATGCCTGATTCTTTGATTGACAAAACTGCCATTGCGAGTGAGGATTTAGGTGGCAGT
TGTGGGATTCAAGTGCCTTCTCAAAGGGCAAACTATGCGAGATCAGAGAGGCATGAAAGCGTGCAAGTCCTTAGCGAAGCTGATACGAACAGATCTAATATGCTAAATAT
TAGTCGGGATTTGGGCGTTGATGATGGTTCTAGACAATTGGCCATTGTTGCAGTGAACACTGAGGAAGTCCAGGAGTTGGCTACAAAAGAAGCAGCCCATACAAAAGCGC
AAGAATCAGTTGAGGGAGTAGTTTTGCAGGACCCAAGTCCAAGCCTGAGTGAAAATTTGAAGGTTTCTGTCATGCCTAGACGCAAATCCCTCGCAACTCATAGACGTGTT
AGAGGAGGGGCTAAAATTAGTAATCCCGACAACTTGGAGAATGTCACTTCATCTGATAAAGATAGTTGCCTACATAGTCCCGAAGTTAACAAGGTGCAAGAAGCGCTTAA
AACCAGCTCTTTAGAACTGCAAGCGATGGTGAAGGATCCTCTTCCACATGCATTACGTGTATCAGAAGCTATAGTATATGATCTTGCTGAAAAGAATAAAAATCATGAGC
ATTCTTTGGGTAACCGAAATGATGTAGGTGCCACTAGTCCATCCACTAACAAGTCTGATGAGCCTCTTCAATCTGGGAAAGCAGATTTTGAAAATCCATGCCATGGCCAT
AAGATAATTGCTCCTCAGCCTAGCTTAATGGAACGCAATAGTAGTGCTCGTACGTATGAGTGGAATGATTCAATAGATGGTTCACCTGAAGGCCATCATGTTGGTAGACT
TCATCTTCATAGTCCCAAGAGAAAGGTAATTTCTCCCTTGAAGAAGTATGAAGAAACCAAATTTAGCCGAAGAAGGAAGAGTAAGAGGTGGAGCCTGCTTGAAGAAGACA
CCTTAAGGACTGCCGTGCAGAGGTTTGGTAAAGGAAATTGGAAGCTCATCTTAAACAGTTATCGTGATATATTTGATGAGAGAACCGAGAGGGTGGCTAGACATATTAGA
TGGGGAATGGCTATATGGTGGCATTTGCATATTCTGGGGGCCTGCAGATCCCGGTTGCTGCAGCATTGGAATGAAAGACGACGGGGCAGACCTTTTGGCGCGCTTGGGGT
CACTGGTCCAGAGCTCAGCCAACTTATCAGGAGGCATAGCTTTCTTGGCTTCCATTATCTCCCCAAACATGCTGGACGACGTCGTCCCGTCCACCGAAACGCTATGCCGA
TGCCTTGGCTTCGAAGTCTTCTCACCCTCACTACCGCCGGCACCTTCACTTCCGCCATTGGCATTATGATCCGCGTAATTCCCTCCGCCATTACCTCCAAGCTTCTTGAC
ATCAATGTAGGTAGAGAAGAGATCATCTTCCGATCCGATCTCCTCCAGACTGGCGGTGGAGGAACCGCCGTTAAACGGATCCGACGCGGATAG
Protein sequenceShow/hide protein sequence
MDEDISRWVLEFILRSSMDDHLLKRILAVIPIPDKDYRLKKTVLLRAIESETSEAEITEKLLEIFEMIEQLDKTEGVAIIESMKAAYCAVAVECTVKFLVVGGIGKRGKY
FDAVRRIWRGKVTELVRSGKSELVSRELKGWKDEVEAAFRDKNVRKKLANMNSRNDALRLVREYMGEEWAILGPPFLQLSASLIDRRMTNEMPDSLIDKTAIASEDLGGS
CGIQVPSQRANYARSERHESVQVLSEADTNRSNMLNISRDLGVDDGSRQLAIVAVNTEEVQELATKEAAHTKAQESVEGVVLQDPSPSLSENLKVSVMPRRKSLATHRRV
RGGAKISNPDNLENVTSSDKDSCLHSPEVNKVQEALKTSSLELQAMVKDPLPHALRVSEAIVYDLAEKNKNHEHSLGNRNDVGATSPSTNKSDEPLQSGKADFENPCHGH
KIIAPQPSLMERNSSARTYEWNDSIDGSPEGHHVGRLHLHSPKRKVISPLKKYEETKFSRRRKSKRWSLLEEDTLRTAVQRFGKGNWKLILNSYRDIFDERTERVARHIR
WGMAIWWHLHILGACRSRLLQHWNERRRGRPFGALGVTGPELSQLIRRHSFLGFHYLPKHAGRRRPVHRNAMPMPWLRSLLTLTTAGTFTSAIGIMIRVIPSAITSKLLD
INVGREEIIFRSDLLQTGGGGTAVKRIRRG