; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012060 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012060
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153204:181908..195089
RNA-Seq ExpressionSgr012060
SyntenySgr012060
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR001878 - Zinc finger, CCHC-type
IPR016177 - DNA-binding domain superfamily
IPR025724 - GAG-pre-integrase domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8529702.1 hypothetical protein F0562_034198 [Nyssa sinensis]1.1e-11850.51Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD
        GH+DG+ PAPTD  +L QWK+KDAR                                                      G+LS+Q+Y+ GFQNLWAEFSD
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD

Query:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS
        IV A VS ESL+ V A+HE SKRDQFLMKLR +FE+ RSNLM+R PSP+LDVCF  LLREEQ LLTQ++L QE +    +A+ A GK +G+DM  VQC+S
Subjt:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS

Query:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQST-VLTPEMVQQMIVSAFSALKLHGNGNSLSK
        CK+YGHIA +C +KFCNYCK+ GHIIK+C  RPQ R   A+ ATA T  T         +++V+ +  +T  LT EMVQQMIVSAFSAL L G G   S+
Subjt:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQST-VLTPEMVQQMIVSAFSALKLHGNGNSLSK

Query:  SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPK
         WLVDSAASNHMTSS  +L NVR Y G  NIQVAN + L                                              V D  SG  IAKGPK
Subjt:  SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPK

Query:  VGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKF--SGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD
        VGRLFPL+ SIPS +SLAC+ V N  E+WHKRLGHPN+A+LS+LL SG LGK +      LSFDCS CK+GKSKILPFP +GSRA  CFD++HSD
Subjt:  VGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKF--SGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD

KAG6501099.1 hypothetical protein ZIOFF_040967 [Zingiber officinale]9.2e-11046.14Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD
        GHIDG+  AP +A  L QW+ KDAR                                                      G LSI+ YYSGF NLW E+S+
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD

Query:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTL--EQEKMTTTQMAFLAHGKSKGKDMSKVQC
        I+ + V KE+L  + AIHE+SKRDQFLMKLRSDF+ AR+ L+NR+P P+LD+C  ELLREEQ L TQ  L    EK T   +A+ A G+++GKD  ++QC
Subjt:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTL--EQEKMTTTQMAFLAHGKSKGKDMSKVQC

Query:  FSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGP-PTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSL
        +SCK +GHIA NC++KFCNYCK+HGHIIK+C  RP+NR+  AFQAT        + IGP  T++  N+    +VLTPEMVQQMI++AFS L L G G ++
Subjt:  FSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGP-PTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSL

Query:  SKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKG
        S SW+VDS ASNHMT S + L NVR Y+G +NIQ+ANG+ L                                             +V DQ SG VIAKG
Subjt:  SKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKG

Query:  PKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKN-NKFSG-LSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFS
        PKVGRLFPL  S+P N+S +  V  N+ ++WHKRLGHPN+ ILS L+ +G LG N   F   L   C+TCKLGKSK+LPFP  G RA  CF+IIHSD + 
Subjt:  PKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKN-NKFSG-LSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFS

Query:  DIAIL
           +L
Subjt:  DIAIL

KAG6536639.1 hypothetical protein ZIOFF_001697 [Zingiber officinale]5.9e-10948.22Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR-----------------------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHE
        GHIDG+  AP +A  L QW+ KDAR                                   G LSI+ YYSGF NLW E+S+I+ + V KE+L  + AIHE
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR-----------------------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHE

Query:  ISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTL--EQEKMTTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCN
        +SK DQFLMKLRSDF+ AR+ L+NR+  P+LD+C  ELLREEQ L TQ  L    EK T   +A+ A G+++GKD  ++QC+SCK +GHIA NC++KFCN
Subjt:  ISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTL--EQEKMTTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCN

Query:  YCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSANL
        YCK+HGHIIK+C  RP+NR+  AFQAT        + IGP   S V    QS VLTPEMVQQMI++AFS L L G G ++S SW+VDS ASNH+T S +L
Subjt:  YCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSANL

Query:  LQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLA
        L NVR Y+G +NIQ+AN + L                                             +V DQ SG VIAKGPKVGRLFPL  S+P N+S +
Subjt:  LQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLA

Query:  CSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKN-NKFSG-LSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD
          V  N+ ++WHKRLGHPN+ ILS L+ +G L  N   F   L   C+TCKLGKSK+LPFP  G RA  CF+IIHSD
Subjt:  CSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKN-NKFSG-LSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD

TXG67369.1 hypothetical protein EZV62_008644 [Acer yangbiense]2.1e-11449.61Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD
        GHIDG+ PAPT+  +LA WK+KDAR                                                      G+LSIQDY+S FQNLW EFSD
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD

Query:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS
        +V A V   SL+ V A+HE SKRDQFLMKLR +FE  RSNLMNR PSP+LDVCF ELLREEQ LLTQ   +Q+      +A+ A+GK KG+DM KVQCFS
Subjt:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS

Query:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS
        CK YGHIAANC +K CNYCKK GH IK+C  +PQNRQ TA+QA  +TS        P   S  +     + LTPEMVQQMI+SAFSAL L GN  +LSKS
Subjt:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS

Query:  WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLT
        WL+DSAASNHMT S++ L N                     DQ SG ++AKGPKVGRLFPLH SIPS +SLAC  V +QNE+WHKRLGHPNS +LS++L 
Subjt:  WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLT

Query:  SGLLG-KNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDPSPDPAPTLLRR
        SGLLG K   +  LSFDC  CKLGKSK L FP  GSRA                                            P+ +P P P+P P   RR
Subjt:  SGLLG-KNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDPSPDPAPTLLRR

Query:  STRVSRPP
        STRVSRPP
Subjt:  STRVSRPP

XP_021654098.1 uncharacterized protein LOC110645300 [Hevea brasiliensis]1.5e-11540.99Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDARGSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCF
        GH DG+ PAPTD+ +L QW +KD RG+LSIQ+Y+SGFQNLWAEF+D+V A V  ESL+ + AIHE SKRDQFLMKLRSDFE  RSNLM+R PSP+LDVCF
Subjt:  GHIDGTTPAPTDATQLAQWKIKDARGSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCF

Query:  SELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSH
         ELLREEQ  LT++T +QE   T  +AF+A GK KG+DM+ + C+SCK YGHIAANC +KF NY K+ GHIIK+C  RPQNR+  A  A  ++S    +H
Subjt:  SELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSH

Query:  IGPPTMSAV-NENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGP
           P +S   +  A+  VLTPEMVQQMIVSAFSAL L  N  + S+ WLVDSAASNHMT+S+++L+NV  YHG   IQ+AN + + +      T   K  
Subjt:  IGPPTMSAV-NENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGP

Query:  KVG-RLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDI
         +  +L    IS+   V   C V  +            N  ++   ++  ++ K  K S                                      S I
Subjt:  KVG-RLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDI

Query:  AILPSFDETSSSPERFKPGYVYEQRH--------SPPPLPTPDPSPD------PAPTLLRRSTRVSRPPNWYGSYHTSFSAALSSFSVPSSYSQA-----
         ILP+F++  S P  FKPG+VYE+R          PP  PT +P+ +      P   +LRRSTRVSR PNWYG     FS  LS  SV S YSQA     
Subjt:  AILPSFDETSSSPERFKPGYVYEQRH--------SPPPLPTPDPSPD------PAPTLLRRSTRVSRPPNWYGSYHTSFSAALSSFSVPSSYSQA-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------DSSLVDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRR
                                      DSSLVDTPLE+NVKY  +EG LL DP+L+RQLVG+LNYLTI RP+ISF  QQVSQFMH+PRHLHL AVR 
Subjt:  ------------------------------DSSLVDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRR

Query:  IIKYL
        II+YL
Subjt:  IIKYL

TrEMBL top hitse value%identityAlignment
A0A2N9GB15 Uncharacterized protein3.9e-10632.41Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKD-----------------------------------ARGSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHE
        GHIDG++   +     A W  KD                                    +G LS+QDYYSGF  LW ++SD+V A VS E L  V  +H 
Subjt:  GHIDGTTPAPTDATQLAQWKIKD-----------------------------------ARGSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHE

Query:  ISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKM--TTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCN
         S+RDQFLMKLR +FE+ R++L+NR P PTL+ CF ELLREEQ L TQ  +EQ ++   T  +A+ AHGK KG+DMS  QC+SCK YGHIA NC QKFCN
Subjt:  ISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKM--TTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCN

Query:  YCKKHGHIIKDCFIRPQNRQNTAFQA--TASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSA
        YCK+ GHIIK+C IRP +R   A+ A  T  + P   +       SAV     +  LT EMVQ+MIVSAFSAL   G G+  S SW++DS ASNHMT+S 
Subjt:  YCKKHGHIIKDCFIRPQNRQNTAFQA--TASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSA

Query:  NLLQNVRPYHGLENIQVANGNQLLVLDQD----------------SGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQN----ELWHKRLGHPNSAIL
        + L NVR Y G  +IQ AN +   ++DQD                SG VIAKGPK GRLF L I  P N+    S++ N++    E+WHKRLGHPNS IL
Subjt:  NLLQNVRPYHGLENIQVANGNQLLVLDQD----------------SGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQN----ELWHKRLGHPNSAIL

Query:  SYLLTSGLLGKNNKFSGLSF-DCSTCKLGKSKILPFPLAGSRANKCFDIIHSD-----------------------------------------------
        SYLL SGLL     FS   F DC+TCKLGKSKILPFP  GSRA   F+IIHSD                                               
Subjt:  SYLLTSGLLGKNNKFSGLSF-DCSTCKLGKSKILPFPLAGSRANKCFDIIHSD-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------SFSDIAILPSFDETSSSP
                                                                                          S S +A LPSFD+T   P
Subjt:  ----------------------------------------------------------------------------------SFSDIAILPSFDETSSSP

Query:  ----ERFKPGYVYEQRHSPPPLPTPDPSPDPAPTLLRRSTRVSRPPNWY-----GSYHTSFSAALSSFSVPSSYSQA-----------------------
            ERF+PG VY++R   P LP   P   P P  LRRS+RVS PP+ Y     G+  ++ SA LSS +VP+SYSQA                       
Subjt:  ----ERFKPGYVYEQRHSPPPLPTPDPSPDPAPTLLRRSTRVSRPPNWY-----GSYHTSFSAALSSFSVPSSYSQA-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------DSSLVDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKY
                                  DS+ +DTP+E+N+K   +EG LLSDP  YR LVGSL YLTITRPDIS+AVQQVSQFM SPRHLH+AAVRRII+Y
Subjt:  --------------------------DSSLVDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKY

Query:  LRAVGEDPSGMSGPS
        +   G    G+S P+
Subjt:  LRAVGEDPSGMSGPS

A0A5C7IEK2 Uncharacterized protein3.4e-10239.84Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD
        GH+DG++ APTD  +L+ W+ KDA+                                                      G+LSI+ +YSGF NLW++++ 
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD

Query:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS
        +V + V KE+L  + A+H  S+RDQFLMKLR +FE+AR+ L+NR P P+LDVC  ELLREEQ L +Q  + Q+   T  +        KG+  S  QC+S
Subjt:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS

Query:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS
        CK  GHIA +C +KFCNYCKK GHIIKDC +R QNR   AF     +S    S   P T+   + N     +TPE VQQMIVSA  AL L G    LS  
Subjt:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS

Query:  WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPKV
        WL+DSAASNHMT S+  LQ+VR Y G ++IQ+A+GN L                                              V DQ SG  IAKGPKV
Subjt:  WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPKV

Query:  GRLFPLH-ISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSFDCSTCKLGKSKIL-------PFPLAGSRANKC-------
        GRLFPL   SIP ++S+  S + N +  WHK+LGHPNS IL++L+  G L   N FS LSFDC+ CKL +   L        F    + ANK        
Subjt:  GRLFPLH-ISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSFDCSTCKLGKSKIL-------PFPLAGSRANKC-------

Query:  ------FDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDP-----------SPDPAPTLL---RRSTRVSRPPNWYGSYHTSFSAA
              F    +++ S   +LP FD+ SS+P RF+PG VY QR  P PL + +P           S  P+  ++   RRSTRVSRPP+WYG   ++F A 
Subjt:  ------FDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDP-----------SPDPAPTLL---RRSTRVSRPPNWYGSYHTSFSAA

Query:  LSSFSVPSSYSQADS
        L +  VP SYSQA +
Subjt:  LSSFSVPSSYSQADS

A0A5J5AIJ4 Uncharacterized protein5.2e-11950.51Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD
        GH+DG+ PAPTD  +L QWK+KDAR                                                      G+LS+Q+Y+ GFQNLWAEFSD
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD

Query:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS
        IV A VS ESL+ V A+HE SKRDQFLMKLR +FE+ RSNLM+R PSP+LDVCF  LLREEQ LLTQ++L QE +    +A+ A GK +G+DM  VQC+S
Subjt:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS

Query:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQST-VLTPEMVQQMIVSAFSALKLHGNGNSLSK
        CK+YGHIA +C +KFCNYCK+ GHIIK+C  RPQ R   A+ ATA T  T         +++V+ +  +T  LT EMVQQMIVSAFSAL L G G   S+
Subjt:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQST-VLTPEMVQQMIVSAFSALKLHGNGNSLSK

Query:  SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPK
         WLVDSAASNHMTSS  +L NVR Y G  NIQVAN + L                                              V D  SG  IAKGPK
Subjt:  SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------------------------------LVLDQDSGTVIAKGPK

Query:  VGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKF--SGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD
        VGRLFPL+ SIPS +SLAC+ V N  E+WHKRLGHPN+A+LS+LL SG LGK +      LSFDCS CK+GKSKILPFP +GSRA  CFD++HSD
Subjt:  VGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKF--SGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD

A0A5J5AIJ4 Uncharacterized protein9.5e-0449.18Show/hide
Query:  IAILPSFDETSSSPERFKPGYVYEQRHSPPPLP----TPDPSPDPAPTLLRRSTRVSRPPN
        +++LP FD+    PERFK G+VYE+R    PLP     PDP PDP     RRS+R S PP+
Subjt:  IAILPSFDETSSSPERFKPGYVYEQRHSPPPLP----TPDPSPDPAPTLLRRSTRVSRPPN

A0A5J5AIJ4 Uncharacterized protein1.0e-11449.61Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD
        GHIDG+ PAPT+  +LA WK+KDAR                                                      G+LSIQDY+S FQNLW EFSD
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD

Query:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS
        +V A V   SL+ V A+HE SKRDQFLMKLR +FE  RSNLMNR PSP+LDVCF ELLREEQ LLTQ   +Q+      +A+ A+GK KG+DM KVQCFS
Subjt:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS

Query:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS
        CK YGHIAANC +K CNYCKK GH IK+C  +PQNRQ TA+QA  +TS        P   S  +     + LTPEMVQQMI+SAFSAL L GN  +LSKS
Subjt:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS

Query:  WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLT
        WL+DSAASNHMT S++ L N                     DQ SG ++AKGPKVGRLFPLH SIPS +SLAC  V +QNE+WHKRLGHPNS +LS++L 
Subjt:  WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLT

Query:  SGLLG-KNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDPSPDPAPTLLRR
        SGLLG K   +  LSFDC  CKLGKSK L FP  GSRA                                            P+ +P P P+P P   RR
Subjt:  SGLLG-KNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDPSPDPAPTLLRR

Query:  STRVSRPP
        STRVSRPP
Subjt:  STRVSRPP

A5B7U3 Uncharacterized protein7.6e-10243.69Show/hide
Query:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD
        GHIDGT+    D   L  W+ KDAR                                                      G+LSI+ YYSGF NLW E+S 
Subjt:  GHIDGTTPAPTDATQLAQWKIKDAR------------------------------------------------------GSLSIQDYYSGFQNLWAEFSD

Query:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS
        I+ A V KE L+ +  ++E S+ DQFLMKLR+++E  ++ L+ R+P PTLD+C  ELLREEQ L TQ  + QE++ +  +      + +G++  ++QC+S
Subjt:  IVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFS

Query:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQ-SHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSK
        CK +GHIA +CT+ +CNYC+K GHIIK+C IRPQNRQ  AFQA    +P  Q S I  PT++ V  +    VLTPEMVQQM +SAFS   L GNG ++S 
Subjt:  CKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQ-SHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSK

Query:  SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLV-----------------------------LDQDSGTVIAK----GPKVGRLFPLHISIP
         W VDS ASNHMT  +  L NV+ Y+G + IQ+ NG+ L +                             +D +     +     GPKVG+LFPL  SIP
Subjt:  SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLV-----------------------------LDQDSGTVIAK----GPKVGRLFPLHISIP

Query:  SNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFS--GLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD
        S +SLACS V NQ+E+WHKRLGHPNS IL +L   G LG  ++FS  GLS DC++CKLGK+KILPFP+ GS A KCF++IHSD
Subjt:  SNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFS--GLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSD

SwissProt top hitse value%identityAlignment
Q6J9Q2 Ethylene-responsive transcription factor ERF0863.0e-1554.32Show/hide
Query:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSSTFHSLLTALDVQALLPSDSPHSKQHSPLATKTPHFL
        DPTTKER WLGTFDTAH  ALAYDRA LSM+GT ARTNF+Y+ +   H++LT  ++ +L+   SP++   S L   +P F+
Subjt:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSSTFHSLLTALDVQALLPSDSPHSKQHSPLATKTPHFL

Q8H3Q1 Ethylene-responsive transcription factor FZP3.5e-1173.33Show/hide
Query:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSS
        DPTTKER WLGTFDTA   ALAYDRA LSMKG  ARTNF+Y+ ++
Subjt:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.3e-1352.11Show/hide
Query:  VDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKYL
        V TP+  + K     G  L+DP+ YR +VGSL YL  TRPDIS+AV ++SQFMH P   HL A++RI++YL
Subjt:  VDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKYL

Q9M644 Ethylene-responsive transcription factor LEP5.4e-1258.57Show/hide
Query:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD---SSTFHSLLTALDVQALLPSDSPHS
        DPTTKER WLGTFDTA   ALAYDRA  SM+GT ARTNF+YSD   SS+  S+++  D     P  +P S
Subjt:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD---SSTFHSLLTALDVQALLPSDSPHS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.5e-1146.48Show/hide
Query:  VDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKYL
        V TP+  + K     G  L DP+ YR +VGSL YL  TRPD+S+AV ++SQ+MH P   H  A++R+++YL
Subjt:  VDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKYL

Arabidopsis top hitse value%identityAlignment
AT1G24590.1 DORNROSCHEN-like5.2e-1046.15Show/hide
Query:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIY---SDSSTFHSLLTALDVQALLPSDSPHSKQHSPLAT
        DP +KER+WLGTFDTA   A AYD A  +M+G  ARTNF+Y   S  S  H + ++  +   L  D  +S+  SPL T
Subjt:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIY---SDSSTFHSLLTALDVQALLPSDSPHSKQHSPLAT

AT1G28160.1 Integrase-type DNA-binding superfamily protein1.6e-1155.38Show/hide
Query:  ENISCIGIFDNTAWEMVAGD---PTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD
        E I  +G+     W   A +   PTTKER WLGTFDTA   ALAYDRA  S++G  ARTNF+YSD
Subjt:  ENISCIGIFDNTAWEMVAGD---PTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-0943.48Show/hide
Query:  PLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKYLR
        P++ +V + +  G    D   YR+L+G L YL ITR DISFAV ++SQF  +PR  H  AV +I+ Y++
Subjt:  PLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKYLR

AT5G13910.1 Integrase-type DNA-binding superfamily protein3.8e-1358.57Show/hide
Query:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD---SSTFHSLLTALDVQALLPSDSPHS
        DPTTKER WLGTFDTA   ALAYDRA  SM+GT ARTNF+YSD   SS+  S+++  D     P  +P S
Subjt:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD---SSTFHSLLTALDVQALLPSDSPHS

AT5G18560.1 Integrase-type DNA-binding superfamily protein2.2e-1654.32Show/hide
Query:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSSTFHSLLTALDVQALLPSDSPHSKQHSPLATKTPHFL
        DPTTKER WLGTFDTAH  ALAYDRA LSM+GT ARTNF+Y+ +   H++LT  ++ +L+   SP++   S L   +P F+
Subjt:  DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSSTFHSLLTALDVQALLPSDSPHSKQHSPLATKTPHFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGGGAGTCTTTCTATTCAGGATTACTATTC
TGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTGCAGTATCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAGCAAGCGTGATCAGT
TCTTGATGAAGCTACGATCAGATTTTGAAAATGCTCGTTCTAATTTGATGAATCGTCATCCTTCTCCTACCTTGGATGTATGTTTTAGTGAATTGCTTCGTGAGGAACAA
CACCTTCTTACACAAACCACTCTTGAACAGGAGAAAATGACTACAACACAAATGGCATTTTTGGCTCACGGGAAAAGCAAGGGTAAAGATATGAGTAAGGTTCAGTGCTT
TAGTTGCAAAAATTATGGGCATATTGCAGCAAATTGTACCCAGAAATTTTGCAATTATTGCAAGAAGCATGGGCACATCATCAAAGATTGTTTCATTCGCCCTCAAAATC
GTCAAAACACTGCTTTTCAAGCAACAGCTAGCACTTCTCCTACAGGTCAGTCTCACATTGGGCCCCCAACTATGTCAGCTGTTAATGAAAATGCTCAATCCACTGTTCTG
ACTCCTGAAATGGTGCAACAAATGATTGTTTCAGCCTTCTCAGCTCTGAAACTCCACGGTAATGGTAATTCTTTATCTAAGTCTTGGCTTGTTGATTCTGCTGCATCAAA
TCATATGACTAGTTCTGCCAATTTGTTACAAAATGTCCGACCCTATCATGGTTTGGAAAATATTCAAGTTGCTAATGGAAATCAATTACTGGTGTTGGATCAGGACTCGG
GGACGGTGATCGCGAAGGGGCCTAAAGTCGGACGTTTATTTCCTTTGCATATTTCCATTCCTAGTAATGTGTCTCTAGCATGTTCTGTGGTTATCAATCAAAATGAGTTG
TGGCATAAACGTTTGGGACACCCCAATTCTGCTATATTGTCTTACTTATTGACCTCTGGTTTATTAGGCAAAAATAATAAATTTTCAGGCCTGTCTTTTGATTGTTCAAC
TTGTAAATTGGGCAAAAGTAAAATTCTTCCTTTTCCCCTTGCTGGTAGTCGTGCAAATAAATGCTTTGATATTATTCATAGTGATTCATTCTCCGATATTGCTATTCTTC
CTAGCTTTGATGAAACGTCTTCTTCTCCTGAACGATTCAAGCCTGGATATGTGTATGAACAACGACATTCACCACCACCCCTTCCGACTCCAGATCCGTCACCTGATCCT
GCTCCGACTCTCTTGAGACGGTCCACTAGAGTCTCCCGTCCTCCTAATTGGTATGGCTCCTATCATACATCCTTTAGTGCTGCTTTATCCTCTTTTTCAGTTCCATCTTC
TTACTCACAGGCAGATTCTTCTTTGGTTGATACTCCTCTCGAAGTAAATGTCAAGTATCACTCCGATGAGGGAGCACTCCTTTCTGATCCATCTTTGTATCGTCAATTAG
TGGGTAGCTTAAACTACCTAACTATTACAAGACCTGACATTTCCTTTGCTGTTCAGCAAGTTAGTCAGTTTATGCACTCACCTCGCCATCTTCACTTGGCTGCAGTTCGT
CGTATTATAAAATATCTTCGGGCAGTTGGAGAAGATCCTTCGGGAATGTCAGGTCCTTCATCGGCAATTCAATGGGCGGTCTCAGAGGCGGCAGCAATCTCGCTTCCTGG
GTTGTCGCCGGAACCCTTGCCTACTTCCTCTGGGTCAAGCCCTCCCAAGACCTCAAACGCGAGCAGCAGGTTCGTCTCTCCCGGAAATTATTTCACTCAACTTTCGAAAC
GTTTCTGCTCCGTGCTGATTTCTGATCTTCTTCCCCTTCTTCTCTCCTGGATAGAAAGGGCTGCTCTTGCTGCTTCGGATCCTCATCGGTATATTGAGAAAAGGAAACCC
ATTCCTGATCCCCAGGCTCTCCTACTTCAAATGAAGGGAAATAAGCTTCCAGTTCATTTGATGGCTACCTCTGTTTATATTTGCATCTCTTATAATCTCTTCAATCCCTT
TGGCCATATTGTTTGGGATGTAATATATCATCTTTGGAATGAAGAAAATATTAGTTGCATTGGGATATTTGACAATACAGCGTGGGAAATGGTAGCAGGAGATCCAACAA
CCAAAGAGAGGAAATGGCTTGGCACTTTTGACACTGCCCATGTAACAGCTTTAGCTTATGACAGAGCTACCCTGTCAATGAAGGGCACCCTAGCAAGAACCAACTTCATT
TACTCTGACAGCTCAACTTTCCACTCTCTTCTCACTGCTCTTGATGTCCAAGCTTTGCTTCCTTCTGATTCTCCTCATTCCAAGCAACACTCCCCATTGGCAACCAAAAC
ACCCCATTTTCTCAAGTCAGCCTTCCACTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGGGAGTCTTTCTATTCAGGATTACTATTC
TGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTGCAGTATCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAGCAAGCGTGATCAGT
TCTTGATGAAGCTACGATCAGATTTTGAAAATGCTCGTTCTAATTTGATGAATCGTCATCCTTCTCCTACCTTGGATGTATGTTTTAGTGAATTGCTTCGTGAGGAACAA
CACCTTCTTACACAAACCACTCTTGAACAGGAGAAAATGACTACAACACAAATGGCATTTTTGGCTCACGGGAAAAGCAAGGGTAAAGATATGAGTAAGGTTCAGTGCTT
TAGTTGCAAAAATTATGGGCATATTGCAGCAAATTGTACCCAGAAATTTTGCAATTATTGCAAGAAGCATGGGCACATCATCAAAGATTGTTTCATTCGCCCTCAAAATC
GTCAAAACACTGCTTTTCAAGCAACAGCTAGCACTTCTCCTACAGGTCAGTCTCACATTGGGCCCCCAACTATGTCAGCTGTTAATGAAAATGCTCAATCCACTGTTCTG
ACTCCTGAAATGGTGCAACAAATGATTGTTTCAGCCTTCTCAGCTCTGAAACTCCACGGTAATGGTAATTCTTTATCTAAGTCTTGGCTTGTTGATTCTGCTGCATCAAA
TCATATGACTAGTTCTGCCAATTTGTTACAAAATGTCCGACCCTATCATGGTTTGGAAAATATTCAAGTTGCTAATGGAAATCAATTACTGGTGTTGGATCAGGACTCGG
GGACGGTGATCGCGAAGGGGCCTAAAGTCGGACGTTTATTTCCTTTGCATATTTCCATTCCTAGTAATGTGTCTCTAGCATGTTCTGTGGTTATCAATCAAAATGAGTTG
TGGCATAAACGTTTGGGACACCCCAATTCTGCTATATTGTCTTACTTATTGACCTCTGGTTTATTAGGCAAAAATAATAAATTTTCAGGCCTGTCTTTTGATTGTTCAAC
TTGTAAATTGGGCAAAAGTAAAATTCTTCCTTTTCCCCTTGCTGGTAGTCGTGCAAATAAATGCTTTGATATTATTCATAGTGATTCATTCTCCGATATTGCTATTCTTC
CTAGCTTTGATGAAACGTCTTCTTCTCCTGAACGATTCAAGCCTGGATATGTGTATGAACAACGACATTCACCACCACCCCTTCCGACTCCAGATCCGTCACCTGATCCT
GCTCCGACTCTCTTGAGACGGTCCACTAGAGTCTCCCGTCCTCCTAATTGGTATGGCTCCTATCATACATCCTTTAGTGCTGCTTTATCCTCTTTTTCAGTTCCATCTTC
TTACTCACAGGCAGATTCTTCTTTGGTTGATACTCCTCTCGAAGTAAATGTCAAGTATCACTCCGATGAGGGAGCACTCCTTTCTGATCCATCTTTGTATCGTCAATTAG
TGGGTAGCTTAAACTACCTAACTATTACAAGACCTGACATTTCCTTTGCTGTTCAGCAAGTTAGTCAGTTTATGCACTCACCTCGCCATCTTCACTTGGCTGCAGTTCGT
CGTATTATAAAATATCTTCGGGCAGTTGGAGAAGATCCTTCGGGAATGTCAGGTCCTTCATCGGCAATTCAATGGGCGGTCTCAGAGGCGGCAGCAATCTCGCTTCCTGG
GTTGTCGCCGGAACCCTTGCCTACTTCCTCTGGGTCAAGCCCTCCCAAGACCTCAAACGCGAGCAGCAGGTTCGTCTCTCCCGGAAATTATTTCACTCAACTTTCGAAAC
GTTTCTGCTCCGTGCTGATTTCTGATCTTCTTCCCCTTCTTCTCTCCTGGATAGAAAGGGCTGCTCTTGCTGCTTCGGATCCTCATCGGTATATTGAGAAAAGGAAACCC
ATTCCTGATCCCCAGGCTCTCCTACTTCAAATGAAGGGAAATAAGCTTCCAGTTCATTTGATGGCTACCTCTGTTTATATTTGCATCTCTTATAATCTCTTCAATCCCTT
TGGCCATATTGTTTGGGATGTAATATATCATCTTTGGAATGAAGAAAATATTAGTTGCATTGGGATATTTGACAATACAGCGTGGGAAATGGTAGCAGGAGATCCAACAA
CCAAAGAGAGGAAATGGCTTGGCACTTTTGACACTGCCCATGTAACAGCTTTAGCTTATGACAGAGCTACCCTGTCAATGAAGGGCACCCTAGCAAGAACCAACTTCATT
TACTCTGACAGCTCAACTTTCCACTCTCTTCTCACTGCTCTTGATGTCCAAGCTTTGCTTCCTTCTGATTCTCCTCATTCCAAGCAACACTCCCCATTGGCAACCAAAAC
ACCCCATTTTCTCAAGTCAGCCTTCCACTGCTGA
Protein sequenceShow/hide protein sequence
MAGHIDGTTPAPTDATQLAQWKIKDARGSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQ
HLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVL
TPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQNEL
WHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDPSPDP
APTLLRRSTRVSRPPNWYGSYHTSFSAALSSFSVPSSYSQADSSLVDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVR
RIIKYLRAVGEDPSGMSGPSSAIQWAVSEAAAISLPGLSPEPLPTSSGSSPPKTSNASSRFVSPGNYFTQLSKRFCSVLISDLLPLLLSWIERAALAASDPHRYIEKRKP
IPDPQALLLQMKGNKLPVHLMATSVYICISYNLFNPFGHIVWDVIYHLWNEENISCIGIFDNTAWEMVAGDPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFI
YSDSSTFHSLLTALDVQALLPSDSPHSKQHSPLATKTPHFLKSAFHC