CuGenDBv2

IVF0021163 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0021163
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Non-LTR retroelement reverse transcriptase-like protein
Genome location: chr01:25244860..25248338
RNA-Seq Expression: IVF0021163
Synteny: IVF0021163
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005840 - ribosome (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0061317.1 hypothetical protein E6C27_scaffold6213G00130 [Cucumis melo var. makuwa] (e value: 1.06e-60; % identity: 69.82)
Query:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEG
        MPRTQFSR   K  P       +VLVISLSIF +R  RTSTHT GMLP  F+ GIGREASSLEGELW DPKR KRWP L TSGER SGVRNHWKCTCS  
Subjt:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEG

Query:  NGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTF
        N LSSRTSG  P KTR  M RTQFSRP  KS PP VSSLCSVLVISLSIFG+  PRTS  T  +L + F
Subjt:  NGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTF

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] (e value: 2.98e-262; % identity: 54.62)
Query:  SLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRP
        S  + GS +   + H+    PD FSG IGREA  LE EL                              C EGNGL+SRTS ARPKKTRA MPRTQFS+P
Subjt:  SLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRP

Query:  GVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSG
        G KSG PTVSSLCSVLVISLSIFG+R PRTSTHTPGMLPDT                                                E N L+S TSG
Subjt:  GVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSG

Query:  ARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRN
        ARPKKTRA +P TQFSRP  KSGPPTVSSLCSVLV   +    R  +    T   L D              G+  C  + +   P   T+    +  +N
Subjt:  ARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRN

Query:  HWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRS
                  GL+S TS ARPKKTRA MPRTQF RPG K GPPT+SSLCSVLVISL IF +R  RTSTHTPGM+ DTF GGIG EAS LEGEL       
Subjt:  HWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRS

Query:  KRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGI
                               C E NGLSS+ S ARP KTRA MPRTQFSRPG KSGPPTVSSLCSVLVISLSIFG+R  RTSTHT GMLPDTFSG  
Subjt:  KRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGI

Query:  GREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRT
                                 T+G           LQR KR EL+DFRCSSQENA+  APDS+FSS  K                      R+P  
Subjt:  GREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRT

Query:  STHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSL
                 D F   + REASSLE ELWCD KRSKRWP  PT+GERSSGVRNHWKCTCSEGNGL+SRTSGA PKKTRA MPR+QFSR G KS  PTVSSL
Subjt:  STHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSL

Query:  CSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPR
        C VLVI+LSIF +R PRTSTHT GMLPD FS GIG+EASSLEGELWC+PKRSKRW  LPTS ERSSG+RNH KCT  EGNGLSSRT  A PKKTRA MPR
Subjt:  CSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPR

Query:  TQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGL
        TQFSR G KSGP TVSSLCSVLV      G RA              + G  G   + L        K  + + ++   G+RSSGVRNHWK TCSEGNGL
Subjt:  TQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGL

Query:  SSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLV
         SRT  A PKKTRA MPRTQFS  G KSGPPTVSSLCSVLV
Subjt:  SSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLV

KAA0065516.1 uncharacterized protein E6C27_scaffold638G00090 [Cucumis melo var. makuwa] (e value: 8.25e-106; % identity: 33.6)
Query:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEG
        MP+TQF   G+KSG  TVS  C +LVIS S F + A R+S + P ++P  FS  I +E S L GEL C+ +R KRW  L          +   K   S+G
Subjt:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEG

Query:  NGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGEL---------WCD----
        N LSSRTSG  P+K +A M +TQF   G+KSGP TVS  C VLVIS S F +RAPR S +  G++   F   I RE SSL GEL         WC     
Subjt:  NGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGEL---------WCD----

Query:  ----------PKRSKR--WPK--------------------------------LPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFS
                  P R K+  W                                  LP+ G  +S +        SEGNGLS RTS    +K R  M  TQF 
Subjt:  ----------PKRSKR--WPK--------------------------------LPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFS

Query:  RPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRT
           VKS P TVS    VLVIS SIF +RAPRTS + PG      S  I REA+SL G+L                               S+GNGLSSRT
Subjt:  RPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRT

Query:  SGARPK--------------------------KTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVIS-LSIFGSRAPRTSTHTPGMLPDTFSGG---IGREA
        SG R +                          K R+G     + R  V+ G   +  + + L +S  S FG+RA RTS +  G++P  FS G   +GR  
Subjt:  SGARPK--------------------------KTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVIS-LSIFGSRAPRTSTHTPGMLPDTFSGG---IGREA

Query:  SSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGM----PRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAP
            G +       K    LPTSGERSS     +     E    S   S    +K+   +    PR   +R  +++G   +       + +LS+      
Subjt:  SSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGM----PRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAP

Query:  RTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGK----------PLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWV
          S   P    D        + +++   L    + SKRW        R S            P+E+ L       + D   +S E++  N+  +      
Subjt:  RTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGK----------PLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWV

Query:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWP---------KLPTSGERSSGVRNHWKCTCS----
          G      +C + V+  +        + T+ PG++P TF  GIG++AS LEG+L      S R           ++P +     GVR+ +    S    
Subjt:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWP---------KLPTSGERSSGVRNHWKCTCS----

Query:  -------------------------EGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGI
                                 EGNGLSSRTSG R +K RA MPRT F    VKS P TVS  C V VIS S FG+ APRTS + PG++P  FS GI
Subjt:  -------------------------EGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGI

Query:  GREASSLEGEL--W----CDPKRSKRWPKLPTSGERSSGVRNHWKCT---CSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVIS
        G E SSLEGEL  W    C PK S   P +      S G+    +      S+GNGLSSR S  RP+K RA M RTQF    +KSGP +VS    +LVI+
Subjt:  GREASSLEGEL--W----CDPKRSKRWPKLPTSGERSSGVRNHWKCT---CSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVIS

Query:  LSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPG
                                                      R+PK P             K   SEGN LSSRTSG RP+K RA MPRTQ     
Subjt:  LSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPG

Query:  VKSGPPTVSSLCSVLVISLSIF
        VKSG   VS  C VLVIS S F
Subjt:  VKSGPPTVSSLCSVLVISLSIF

TYK22521.1 uncharacterized protein E5676_scaffold387G00220 [Cucumis melo var. makuwa] (e value: 8.68e-116; % identity: 49.91)
Query:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSL-EGELWCDPKR-------SKRWPKLPTSGERSSGKPLE
        M RT FSR G KSGPPTVSSLCSVLVISLSIFG+R PRTSTH+   L   F     R  S L EG      K        S  W  L   GE    KPLE
Subjt:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSL-EGELWCDPKR-------SKRWPKLPTSGERSSGKPLE

Query:  MHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPP--------TVSSLCSVLV----------------------------------------IS
         HL RRK AELEDF CSSQENAS NAPDS FSSW K   P        T   L S LV                                        + 
Subjt:  MHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPP--------TVSSLCSVLV----------------------------------------IS

Query:  LSIF--------GSRAPRTSTHTPGML--PDTFSG--GIGREASS--LEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKK
        L  F           AP ++  + G    PD F     IG ++ S  + G      K  +RWP LPTSGERSSGVRN WKCTC EGNGLSSRTSGARPKK
Subjt:  LSIF--------GSRAPRTSTHTPGML--PDTFSG--GIGREASS--LEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKK

Query:  TRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCT
        TRA MP+TQFSRPG KS PPTVS+LCSVLV SLSIFG+  PRTSTHTPGMLPDTFS            E  C                            
Subjt:  TRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCT

Query:  CSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPK
                               PR  F   G    P     L  +   S+ + GS  PRTSTHT GMLPDTFSGGIGREASSLEGEL CDPK SKRWP 
Subjt:  CSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPK

Query:  LPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKT
        LPT GERSSG RNHWKCTCSEGNGLSSRTSGARP+KT
Subjt:  LPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKT

TYK23424.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.06e-194; % identity: 51.01)
Query:  ERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGE
        +++ G +   K T +E   L S TSG R KKT    PRTQFSRP                        +R  R STHTPGML D F  GIGR+ASSL+ E
Subjt:  ERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGE

Query:  LWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGML
        L CDPKRSKRWP L   GERS GVRNHWK TC+E N LSSRT   R K T+A                                   R PRTSTHTPGML
Subjt:  LWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGML

Query:  PDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISL
        P+TFS GIGREAS LEGEL      SKR                         N LSSR SG RPKK  +      FS    K+   TVSSL  +LVISL
Subjt:  PDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISL

Query:  SIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV
         IFG+  PRTSTHT  MLPD FS GIGREASSLEGEL CDPK  KRWP L TSGE SSGVRN+WK TCSE N L SRTS   PKKT A MP+TQF R   
Subjt:  SIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV

Query:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQ
                                 PRT THTPGMLPDTFS GIGREASSLEGEL C+ KRS RWP LPTSGE SSG                       
Subjt:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQ

Query:  ENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKC
                                                                     R+ASSLEGEL  DPK SKRWP LPTSGERSSGVRNHWKC
Subjt:  ENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKC

Query:  TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWP
        TCSEGN LSSRT G  PKKT A  P+ QFS  GVKSG PTVSSL  + +I L +FG+  P  STHTP ML  T S GIGREASSLE ELWCDPKRSK W 
Subjt:  TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWP

Query:  KLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRT
         L TSG RSS VRNHWKCTC+E N L SRTSG RP+KT+A +P T
Subjt:  KLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein (e value: 1.7e-224; % identity: 55.16)
Query:  PDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISL
        PD FSG IGREA  LE EL                                EGNGL+SRTS ARPKKTRA MPRTQFS+PG KSG PTVSSLCSVLVISL
Subjt:  PDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISL

Query:  SIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV
        SIFG+R PRTSTHTPGMLPDT                                                E N L+S TSGARPKKTRA +P TQFSRP  
Subjt:  SIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV

Query:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGAR
        KSGPPTVSSLCSVLV       +R  +    T   L D              G+  C  + +   P   T+               ++ NGL+S TS AR
Subjt:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGAR

Query:  PKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHW
        PKKTRA MPRTQF RPG K GPPT+SSLCSVLVISL IF +R  RTSTHTPGM+ DTF GGIG EAS LEGEL                           
Subjt:  PKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHW

Query:  KCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKR
             E NGLSS+ S ARP KTRA MPRTQFSRPG KSGPPTVSSLCSVLVISLSIFG+R  RTSTHT GMLPDTFSG                      
Subjt:  KCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKR

Query:  WPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREA
             T+G           LQR KR EL+DFRCSSQENA+  APDS+FSS  K                                   PD F   + REA
Subjt:  WPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREA

Query:  SSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTST
        SSLE ELWCD KRSKRWP  PT+GERSSGVRNHWKCTCSEGNGL+SRTSGA PKKTRA MPR+QFSR G KS  PTVSSLC VLVI+LSIF +R PRTST
Subjt:  SSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTST

Query:  HTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCS
        HT GMLPD FS GIG+EASSLEGELWC+PKRSKRW  LPTS ERSSG+RNH KCT  EGNGLSSRT  A PKKTRA MPRTQFSR G KSGP TVSSLCS
Subjt:  HTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCS

Query:  VLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQ
        VLV      G RA              + G  G   + L+     D +  +R       G+RSSGVRNHWK TCSEGNGL SRT  A PKKTRA MPRTQ
Subjt:  VLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQ

Query:  FSRPGVKSGPPTVSSLCSVLV
        FS  G KSGPPTVSSLCSVLV
Subjt:  FSRPGVKSGPPTVSSLCSVLV

A0A5A7VGQ5 Uncharacterized protein (e value: 6.5e-91; % identity: 33.75)
Query:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEG
        MP+TQF   G+KSG  TVS  C +LVIS S F + A R+S + P ++P  FS  I +E S L GEL C+ +R KRW  L          +   K   S+G
Subjt:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEG

Query:  NGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGEL---------WC----D
        N LSSRTSG  P+K +A M +TQF   G+KSGP TVS  C VLVIS S F +RAPR S +  G++   F   I RE SSL GEL         WC     
Subjt:  NGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGEL---------WC----D

Query:  PKRSKRWPKLPTSGERS--SG----VRNHWKC----------------------------------TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV
          ++     LP+ G++   +G     R  +KC                                    SEGNGLS RTS    +K R  M  TQF    V
Subjt:  PKRSKRWPKLPTSGERS--SG----VRNHWKC----------------------------------TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV

Query:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGAR
        KS P TVS    VLVIS SIF +RAPRTS + PG      S  I REA+SL G+L                               S+GNGLSSRTSG R
Subjt:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGAR

Query:  PK--------------------------KTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVIS-LSIFGSRAPRTSTHTPGMLPDTFSGG---IGREASSLE
         +                          K R+G     + R  V+ G   +  + + L +S  S FG+RA RTS +  G++P  FS G   +GR      
Subjt:  PK--------------------------KTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVIS-LSIFGSRAPRTSTHTPGMLPDTFSGG---IGREASSLE

Query:  GELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGM----PRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTST
        G +       K    LPTSGERSS     +     E    S   S    +K+   +    PR   +R  +++G   +       + +LS+        S 
Subjt:  GELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGM----PRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTST

Query:  HTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVL
          P    D        + +++   L    + SKRW        R S   ++         E+E     S  +    + +S  S+   +G      +C + 
Subjt:  HTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVL

Query:  VISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRW---------PKLPTSGERSSGVRNHWKCTC-------------------
        V+  +        + T+ PG++P TF  GIG++AS LEG+L      S R           ++P +     GVR+ +                       
Subjt:  VISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRW---------PKLPTSGERSSGVRNHWKCTC-------------------

Query:  ----------SEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGEL--W
                  SEGNGLSSRTSG R +K RA MPRT F    VKS P TVS  C V VIS S FG+ APRTS + PG++P  FS GIG E SSLEGEL  W
Subjt:  ----------SEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGEL--W

Query:  ----CDPKRSKRWPKLPTSGERSSGVRNHWKC---TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTH
            C PK S   P +      S G+    +      S+GNGLSSR S  RP+K RA M RTQF    +KSGP +VS    +LVI+              
Subjt:  ----CDPKRSKRWPKLPTSGERSSGVRNHWKC---TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTH

Query:  TPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSV
                                        R+PK P             K   SEGN LSSRTSG RP+K RA MPRTQ     VKSG   VS  C V
Subjt:  TPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSV

Query:  LVISLSIF
        LVIS S F
Subjt:  LVISLSIF

A0A5D3BG85 Uncharacterized protein (e value: 5.4e-53; % identity: 32.63)
Query:  THTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLC
        T+ PG++P TF  GIGR+AS LEG+L                               S+GNGLSSRTS   P+K R  MPRTQ   PGV+       SL 
Subjt:  THTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLC

Query:  SVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRT
                         S H P  +   +     R  + L GE      R+K +                     SEGNGLSSRTSG R +K RA MPRT
Subjt:  SVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRT

Query:  QFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELE
         F    VKS P TVS  C V VIS S F + APRTS + PG++P  FS GIG E SSLEGEL                                      
Subjt:  QFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELE

Query:  DFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSG
                           S+W                     F + AP+TS + PG++P  FS GIGREASSLEGEL                      
Subjt:  DFRCSSQENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSG

Query:  VRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDP
                 S+GNGLSSR S  RP+K RA MPRTQF    +KSGP +VS    +LVI+                                          
Subjt:  VRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDP

Query:  KRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFS
            R+PK P             K   SEGN LSSRTSG RP+K RA MPRTQ     VKSG   VS  C VLVIS S F   APRT  +   ++P  F 
Subjt:  KRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFS

Query:  GGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMP
                                                      +GNGLSSRTSG RP K     P
Subjt:  GGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMP

A0A5D3DFP2 Uncharacterized protein (e value: 1.0e-99; % identity: 49.91)
Query:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSL-EGELWCDPK-------RSKRWPKLPTSGERSSGKPLE
        M RT FSR G KSGPPTVSSLCSVLVISLSIFG+R PRTSTH+   L   F     R  S L EG      K        S  W  L   GE    KPLE
Subjt:  MPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSL-EGELWCDPK-------RSKRWPKLPTSGERSSGKPLE

Query:  MHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPP--------TVSSLCSVLV----------------------------------------IS
         HL RRK AELEDF CSSQENAS NAPDS FSSW K   P        T   L S LV                                        + 
Subjt:  MHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPP--------TVSSLCSVLV----------------------------------------IS

Query:  LSIF--------GSRAPRTSTHTPG--MLPDTFSG--GIGREASS--LEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKK
        L  F           AP ++  + G    PD F     IG ++ S  + G      K  +RWP LPTSGERSSGVRN WKCTC EGNGLSSRTSGARPKK
Subjt:  LSIF--------GSRAPRTSTHTPG--MLPDTFSG--GIGREASS--LEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKK

Query:  TRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCT
        TRA MP+TQFSRPG KS PPTVS+LCSVLV SLSIFG+  PRTSTHTPGMLPDTFS                               ER           
Subjt:  TRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCT

Query:  CSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPK
                               PR  F   G    P     L  +   S+ + GS  PRTSTHT GMLPDTFSGGIGREASSLEGEL CDPK SKRWP 
Subjt:  CSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPK

Query:  LPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKT
        LPT GERSSG RNHWKCTCSEGNGLSSRTSGARP+KT
Subjt:  LPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKT

A0A5D3DIF8 Gag/pol protein (e value: 4.1e-162; % identity: 50.47)
Query:  ERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGE
        +++ G +   K T +E   L S TSG R KKT    PRTQFSRP                        +R  R STHTPGML D F  GIGR+ASSL+ E
Subjt:  ERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGE

Query:  LWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGML
        L CDPKRSKRWP L   GERS GVRNHWK TC+E N LSSRT   R K T+A                                   R PRTSTHTPGML
Subjt:  LWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGML

Query:  PDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISL
        P+TFS GIGREAS LEGEL                               S+ N LSSR SG RPKK  +            K+   TVSSL  +LVISL
Subjt:  PDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISL

Query:  SIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV
         IFG+  PRTSTHT  MLPD FS GIGREASSLEGEL CDPK  KRWP L TSGE SSGVRN+WK TCSE N L SRTS   PKKT A MP+TQF R   
Subjt:  SIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGV

Query:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQ
                                 PRT THTPGMLPDTFS GIGREASSLEGEL C+ KRS RWP LPTSGE SS                        
Subjt:  KSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQ

Query:  ENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKC
                                                                    GR+ASSLEGEL  DPK SKRWP LPTSGERSSGVRNHWKC
Subjt:  ENASGNAPDSVFSSWVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKC

Query:  TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWP
        TCSEGN LSSRT G  PKKT A  P+ QFS  GVKSG PTVSSL  + +I L +FG+  P  STHTP ML  T S GIGREASSLE ELWCDPKRSK W 
Subjt:  TCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWP

Query:  KLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRT
         L TSG RSS VRNHWKCTC+E N L SRTSG RP+KT+A +P T
Subjt:  KLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRT

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGTGCTCGTCC
CAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGA
TCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGG
TGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAG
CTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCT
CCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCG
AGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCAC
TTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTC
CCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTT
TCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGG
TGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTT
CTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACT
CCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCC
AACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAG
CGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGT
GCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACG
GTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCG
GTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATA
AGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGG
AGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGG
CTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTT
TGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGA
AGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAAT
GCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGC
GGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACAC
GTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGT
CCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAG
TTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCA
CACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGC
TTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACG
CGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTC
CGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAG
mRNA sequence
ATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGTGCTCGTCC
CAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGA
TCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGG
TGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAG
CTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCT
CCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCG
AGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCAC
TTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTC
CCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTT
TCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGG
TGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTT
CTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACT
CCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCC
AACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAG
CGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGT
GCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACG
GTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCG
GTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATA
AGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGG
AGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGG
CTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTT
TGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGA
AGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAAT
GCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGC
GGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACAC
GTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGCTTCCAACGAGTGGCGAACGCTCGT
CCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACGCGAGCGGGAATGCCCCGGACTCAG
TTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTCTCGTGCTCCGAGGACTTCAACCCA
CACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAGGTCGGGAAGCGAGCTCGTTGGAGGGAGAACTGTGGTGCGACCCAAAACGGTCCAAAAGATGGCCGAAGC
TTCCAACGAGTGGCGAACGCTCGTCCGGTGTCAGAAACCACTGGAAATGCACTTGCAGCGAAGGAAACGGGCTGAGCTCGAGGACTTCCGGTGCTCGTCCCAAGAAAACG
CGAGCGGGAATGCCCCGGACTCAGTTTTCTCGTCCTGGGGTAAAAAGCGGTCCCCCGACGGTTTCTTCCCTTTGCTCCGTATTGGTCATAAGTCTTTCGATCTTCGGGTC
CGTGCTCCGAGGACTTCAACCCACACTCCTGGCATGCTCCCTGACACGTTTTCCGGAGGTATAG
Protein sequence
MAEASNEWRTLVRCQKPLEMHLQRRKRAELEDFRARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELW
CDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREA
SSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTF
SGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHT
PGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSR
APRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKSGPPTVSSLCSVLVI
SLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGKPLEMHLQRRKRAELEDFRCSSQENASGNAPDSVFSSWVKSGPPTVSSL
CSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQFSRPGVKS
GPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKTRAGMPRTQ
FSRPGVKSGPPTVSSLCSVLVISLSIFGSRAPRTSTHTPGMLPDTFSGGIGREASSLEGELWCDPKRSKRWPKLPTSGERSSGVRNHWKCTCSEGNGLSSRTSGARPKKT
RAGMPRTQFSRPGVKSGPPTVSSLCSVLVISLSIFGSVLRGLQPTLLACSLTRFPEV