; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026371 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026371
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:35642553..35646809
RNA-Seq ExpressionLag0026371
SyntenyLag0026371
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4263648.1 unnamed protein product [Prunus armeniaca]3.6e-7631.74Show/hide
Query:  NHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMV
        N+T++ LIPK   P                         RLK++L  +I++ QS F+ GR ISDN I+  E+LHF++K+  GR G+ ALKLDMSK YD V
Subjt:  NHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMV

Query:  EWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALL-DSTSRRNLAGISITRSCPKISHLFFVDD
        EW++L  +M  +GF  RW+                              RQ DPL PYLFLLC E FS L+  +     L G+S+ R  P +SHLFF DD
Subjt:  EWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALL-DSTSRRNLAGISITRSCPKISHLFFVDD

Query:  SLVFLKAAAG------SILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMR------------------------------VSESLGLE----
        S +FLKA+         IL+ YE+VSGQ IN++KS V FS N   + +E  +  LG+R                              + E+ GLE    
Subjt:  SLVFLKAAAG------SILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMR------------------------------VSESLGLE----

Query:  -ESFFLSRRKGSVDQKHYTGDPYLC----DGVLLDFERDSIENFGTVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLL
          S++   ++G   + H+     LC    +G  L F      N   V + L WR+  NP   +AR L+ RYY +  +L +T   + S+ W+ L     +L
Subjt:  -ESFFLSRRKGSVDQKHYTGDPYLC----DGVLLDFERDSIENFGTVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLL

Query:  KQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITP-SLQWDCGKLNQHLNQDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSG
        ++G R  +G+G+++ +++D W+    +FKV S P    G+T V+  I P +LQW    L    + ++  +I  +P+S    PD LIWH++R G Y VKSG
Subjt:  KQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITP-SLQWDCGKLNQHLNQDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSG

Query:  YKLCL-------SRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSA
        Y++           + + + +  GI    W  +W++ +P KV+LF+ R+  N +P+ VNL    +   G C    E  ET  H   +           S 
Subjt:  YKLCL-------SRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSA

Query:  FPASSLTAAQHDFREWARGFGATLSSP
        + +     A  D   W +    TLS+P
Subjt:  FPASSLTAAQHDFREWARGFGATLSSP

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.0e-9830.73Show/hide
Query:  LTSPFTEEEVIAAVKDA-SVREWNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLN
        +  P T E  + A+ +   +++WN T + LIPK +QP                         RLK V+  +I + QS F+P R+ISDN+I+GHE LH +N
Subjt:  LTSPFTEEEVIAAVKDA-SVREWNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLN

Query:  KKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERW------------------------------IRQEDPLLPYLFLLCTEGFSALLD-STSR
          K+G  G AALKLD+SK +D VEW+YL  IM K+GF+E W                              IRQ DPL PYLFLLC EG SAL++   + 
Subjt:  KKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERW------------------------------IRQEDPLLPYLFLLCTEGFSALLD-STSR

Query:  RNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGLEESFFLSR
          L GI    +   I+HL F DDSL+FL++      A   +L  Y + SGQCIN +KS + FS N  P+ +++L  IL +++       LGL   F  +R
Subjt:  RNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGLEESFFLSR

Query:  RKGSVDQKHYTG-----DPYLCDGVLLDFERDSIENFG-TVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRK
        R+G   + H+        P  C G  L+F    +E F   +    VWR L +P L +++ L+ +Y+ ++ +L ++ +  SS+FWKG +WG DLL +G+R 
Subjt:  RKGSVDQKHYTG-----DPYLCDGVLLDFERDSIENFG-TVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRK

Query:  NLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPISS-TAPDKLIWHYDRRGEYFVKSGYKLCLSR
         +GNG +I  F DPW+ R  TFK   F    + DT VA FIT    WD   ++     +D ++I  +PISS    D  +WHYD+RG Y V+SGYKL +  
Subjt:  NLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPISS-TAPDKLIWHYDRRGEYFVKSGYKLCLSR

Query:  IREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSAFPASSLTAAQHDFR
            +++      + WN++WKL +P K+K+F+ RS H  IP+  NL                        L R    LPA T+                 
Subjt:  IREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSAFPASSLTAAQHDFR

Query:  EWARGFGATLSSPWISGKQGSDDSLRSSSAGLLRSATASPNVVVFVFRQVRCVRVFRSLAYWRFNRKIGVDLHTPNSKHRSVLKSGRYSGRAREVWDVFF
                                                                                     +  S++ +  +  RAR++W   F
Subjt:  EWARGFGATLSSPWISGKQGSDDSLRSSSAGLLRSATASPNVVVFVFRQVRCVRVFRSLAYWRFNRKIGVDLHTPNSKHRSVLKSGRYSGRAREVWDVFF

Query:  PSVVGNRMGD-MDIKDRWLSM-RRFKGSDLVCICVGAWAIWNDRNNFIHNRPIPIVESRCEWLNSYID
        P +      D +   + W S+  + +  DL    +  W IWNDRN+ IH + +  VE +CEWL  ++D
Subjt:  PSVVGNRMGD-MDIKDRWLSM-RRFKGSDLVCICVGAWAIWNDRNNFIHNRPIPIVESRCEWLNSYID

XP_030477990.1 uncharacterized protein LOC115695032 [Cannabis sativa]5.2e-7532.74Show/hide
Query:  NQMLTSPFTEEEVIAAVKDASVREWNHTNLVLIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMV
        N+ L   FT  +V+ A+K       N  +  LI K    RLK VL  +I E QS F+P R I+DN+++  E++H+L  K  G+ GF+ALKLDMSK +D V
Subjt:  NQMLTSPFTEEEVIAAVKDASVREWNHTNLVLIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMV

Query:  EWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKISHLFFVDD
        EWS++  +M K+GF  RWI                              RQ DPL PYLFL+C+EG S LL    S   L G++++R  P +SHL F DD
Subjt:  EWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKISHLFFVDD

Query:  SLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRVSES----LGLE------------------------------
        SL+F +A      A   +L  Y K SGQ +N +KS++ FS N    +K+   +ILGM + E     LGL                               
Subjt:  SLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRVSES----LGLE------------------------------

Query:  ----------------------------------ESFFLSRRKGSVDQK---HYTGDPYLCDGVL---LDFERDSIENFGTVCQILV----WRILCNPQL
                                          ES   +   GS   K   H+    +LC   L   L F     +NF    Q L+    WR+  NP+ 
Subjt:  ----------------------------------ESFFLSRRKGSVDQK---HYTGDPYLCDGVL---LDFERDSIENFGTVCQILV----WRILCNPQL

Query:  KIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQH
         + R L+GRY+     L++     SS  W+G+ WG +LLK+GIR  +GNG  I    DPWI   +      F   P     VAD+ITP  +W+  KL   
Subjt:  KIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQH

Query:  LNQDDVEVIKGLPISSTA-PDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHG
         +  DVE I  LP+S  A  D  +WH    G+Y VKSGY +      +  AS      SWW + W+L +P KVK+F  ++ HN +P    L         
Subjt:  LNQDDVEVIKGLPISSTA-PDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHG

Query:  MCPVSKEANETTHHALF
         C +   A E+  HA+F
Subjt:  MCPVSKEANETTHHALF

XP_030508858.1 uncharacterized protein LOC115723499 [Cannabis sativa]8.9e-7532.11Show/hide
Query:  LTSPFTEEEVIAAVKD-ASVREWNHTNLVLIPKCRQPRL--------------KLV-----------LNEIIDECQSTFIPGRSISDNMILGHEMLHFLN
        +  P   E V+  + + A+    N T + LIPK ++P+L              KLV           L  +I E QS F+  R I+DN+++  E+LH L 
Subjt:  LTSPFTEEEVIAAVKD-ASVREWNHTNLVLIPKCRQPRL--------------KLV-----------LNEIIDECQSTFIPGRSISDNMILGHEMLHFLN

Query:  KKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGF------------------------------HERWIRQEDPLLPYLFLLCTEGFSALLDSTSR-
         +K G  G+AA+KLDMSK +D VEWSY+ QIM K+GF                                R IRQ DPL PYLFL+C EG S LL S    
Subjt:  KKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGF------------------------------HERWIRQEDPLLPYLFLLCTEGFSALLDSTSR-

Query:  RNLAGISITRSCPKISHLFFVDDSLVFLKAAAGS------ILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGLEESFFLSR
         +L G+ ++R+ P +SHLFF DDS++F +A   S      +L  Y + SGQ IN +K ++ FS N     + F   +L M +     E LGL       +
Subjt:  RNLAGISITRSCPKISHLFFVDDSLVFLKAAAGS------ILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGLEESFFLSR

Query:  RK---GSVDQ-----KHYTGDPYLCDG--VLLDFERDSIENFGTVCQIL----------------------VWRILCNPQLKIARFLRGRYYPNSMVLNS
         K   G  D+       +    +   G  VLL     +I  +   C  L                       WR+L +P   ++R L  RY+ N  VL +
Subjt:  RK---GSVDQ-----KHYTGDPYLCDG--VLLDFERDSIENFGTVCQIL----------------------VWRILCNPQLKIARFLRGRYYPNSMVLNS

Query:  TASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPIS-STA
              S  W+ +VWG +LL +G+R  +G+G  I    DPW+     F   SF +      +VAD I    QWD   ++ +  Q +++ I  +P+S   +
Subjt:  TASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPIS-STA

Query:  PDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALF
         D LIW+    G Y VKSGY    S      A    +  +WW+  WKL +P+K+++FV + FHN IP    L   H+     CP+ K+  ET  HALF
Subjt:  PDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALF

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa]8.9e-7531.38Show/hide
Query:  WNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDM
        +N T + LIPK ++P                         R++  +  +I E QS F+  R I+DN+++  E+LH L  +K GR G+AA+KLDMSK +D 
Subjt:  WNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDM

Query:  VEWSYLNQIMEKLGF------------------------------HERWIRQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKISHLFFVD
        VEW +L Q+M KLGF                               +R IRQ DPL PYLFL+C+EGFS LL    S   L G+ ++RS P I+HL F D
Subjt:  VEWSYLNQIMEKLGF------------------------------HERWIRQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKISHLFFVD

Query:  DSLVFLKAAAGS------ILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGL---------------------------EES
        DS++F +A+  S       L  Y + SGQ +N  KS++ FS N     + F  H+L M +     + LGL                           E+ 
Subjt:  DSLVFLKAAAGS------ILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGL---------------------------EES

Query:  FFLSRRK-------------------------GSVDQK---------------HYTGDPYLCDGVLLDFERDSIENFGTVCQILV----WRILCNPQLKI
        F +  ++                           ++Q                H+    +LC   +         NF    Q L+    WRIL +P   I
Subjt:  FFLSRRK-------------------------GSVDQK---------------HYTGDPYLCDGVLLDFERDSIENFGTVCQILV----WRILCNPQLKI

Query:  ARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLN
        AR L+ RY+   + L +TA    S  WK +VWG +LL +G+R  +G+G  +     PWI   + FK   F  P   D  VADFIT S QWD  KL Q   
Subjt:  ARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLN

Query:  QDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMC
          DV+ I  +P+S     D L+WHY   G Y VKSGYKL  S + +   +      +WW   W L +P+K+++F  R++H  +P+   L   H+     C
Subjt:  QDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMC

Query:  PVSKEANETTHHALF
        P+ +   ET +HA F
Subjt:  PVSKEANETTHHALF

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248749.6e-9930.73Show/hide
Query:  LTSPFTEEEVIAAVKDA-SVREWNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLN
        +  P T E  + A+ +   +++WN T + LIPK +QP                         RLK V+  +I + QS F+P R+ISDN+I+GHE LH +N
Subjt:  LTSPFTEEEVIAAVKDA-SVREWNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLN

Query:  KKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERW------------------------------IRQEDPLLPYLFLLCTEGFSALLD-STSR
          K+G  G AALKLD+SK +D VEW+YL  IM K+GF+E W                              IRQ DPL PYLFLLC EG SAL++   + 
Subjt:  KKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERW------------------------------IRQEDPLLPYLFLLCTEGFSALLD-STSR

Query:  RNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGLEESFFLSR
          L GI    +   I+HL F DDSL+FL++      A   +L  Y + SGQCIN +KS + FS N  P+ +++L  IL +++       LGL   F  +R
Subjt:  RNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRV----SESLGLEESFFLSR

Query:  RKGSVDQKHYTG-----DPYLCDGVLLDFERDSIENFG-TVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRK
        R+G   + H+        P  C G  L+F    +E F   +    VWR L +P L +++ L+ +Y+ ++ +L ++ +  SS+FWKG +WG DLL +G+R 
Subjt:  RKGSVDQKHYTG-----DPYLCDGVLLDFERDSIENFG-TVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRK

Query:  NLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPISS-TAPDKLIWHYDRRGEYFVKSGYKLCLSR
         +GNG +I  F DPW+ R  TFK   F    + DT VA FIT    WD   ++     +D ++I  +PISS    D  +WHYD+RG Y V+SGYKL +  
Subjt:  NLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPISS-TAPDKLIWHYDRRGEYFVKSGYKLCLSR

Query:  IREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSAFPASSLTAAQHDFR
            +++      + WN++WKL +P K+K+F+ RS H  IP+  NL                        L R    LPA T+                 
Subjt:  IREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSAFPASSLTAAQHDFR

Query:  EWARGFGATLSSPWISGKQGSDDSLRSSSAGLLRSATASPNVVVFVFRQVRCVRVFRSLAYWRFNRKIGVDLHTPNSKHRSVLKSGRYSGRAREVWDVFF
                                                                                     +  S++ +  +  RAR++W   F
Subjt:  EWARGFGATLSSPWISGKQGSDDSLRSSSAGLLRSATASPNVVVFVFRQVRCVRVFRSLAYWRFNRKIGVDLHTPNSKHRSVLKSGRYSGRAREVWDVFF

Query:  PSVVGNRMGD-MDIKDRWLSM-RRFKGSDLVCICVGAWAIWNDRNNFIHNRPIPIVESRCEWLNSYID
        P +      D +   + W S+  + +  DL    +  W IWNDRN+ IH + +  VE +CEWL  ++D
Subjt:  PSVVGNRMGD-MDIKDRWLSM-RRFKGSDLVCICVGAWAIWNDRNNFIHNRPIPIVESRCEWLNSYID

A0A6J5TM79 Reverse transcriptase domain-containing protein1.8e-7631.74Show/hide
Query:  NHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMV
        N+T++ LIPK   P                         RLK++L  +I++ QS F+ GR ISDN I+  E+LHF++K+  GR G+ ALKLDMSK YD V
Subjt:  NHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMV

Query:  EWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALL-DSTSRRNLAGISITRSCPKISHLFFVDD
        EW++L  +M  +GF  RW+                              RQ DPL PYLFLLC E FS L+  +     L G+S+ R  P +SHLFF DD
Subjt:  EWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALL-DSTSRRNLAGISITRSCPKISHLFFVDD

Query:  SLVFLKAAAG------SILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMR------------------------------VSESLGLE----
        S +FLKA+         IL+ YE+VSGQ IN++KS V FS N   + +E  +  LG+R                              + E+ GLE    
Subjt:  SLVFLKAAAG------SILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMR------------------------------VSESLGLE----

Query:  -ESFFLSRRKGSVDQKHYTGDPYLC----DGVLLDFERDSIENFGTVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLL
          S++   ++G   + H+     LC    +G  L F      N   V + L WR+  NP   +AR L+ RYY +  +L +T   + S+ W+ L     +L
Subjt:  -ESFFLSRRKGSVDQKHYTGDPYLC----DGVLLDFERDSIENFGTVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLL

Query:  KQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITP-SLQWDCGKLNQHLNQDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSG
        ++G R  +G+G+++ +++D W+    +FKV S P    G+T V+  I P +LQW    L    + ++  +I  +P+S    PD LIWH++R G Y VKSG
Subjt:  KQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITP-SLQWDCGKLNQHLNQDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSG

Query:  YKLCL-------SRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSA
        Y++           + + + +  GI    W  +W++ +P KV+LF+ R+  N +P+ VNL    +   G C    E  ET  H   +           S 
Subjt:  YKLCL-------SRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSA

Query:  FPASSLTAAQHDFREWARGFGATLSSP
        + +     A  D   W +    TLS+P
Subjt:  FPASSLTAAQHDFREWARGFGATLSSP

A0A803Q8I6 Uncharacterized protein5.1e-7634.44Show/hide
Query:  LIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERWI------------
        ++ K    R K VL  +I E QS F+  R I+DN+++  E++H L  K  G   ++ALKLDMSK +D VEW Y+ ++M K+GFH+RWI            
Subjt:  LIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERWI------------

Query:  ------------------RQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCIN
                          RQ DPL PYLFL+ +EG S LL    S +NL G+ +TR  P +S+L F DDSL+F +A      A   +L  Y   SGQ +N
Subjt:  ------------------RQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCIN

Query:  VNKSMVFFSENALPDTKEFLSHILGMRVSESLGLEESFFLSRRKGSVDQKHYTGDPYLCDGVLLDFERDSIENFGTVCQIL---VWRILCNPQLKIARFL
          KS++ FS N     K+F  H LGM +SE     E +         D+K    D        L    D + + G    +L    WRI   P   ++R L
Subjt:  VNKSMVFFSENALPDTKEFLSHILGMRVSESLGLEESFFLSRRKGSVDQKHYTGDPYLCDGVLLDFERDSIENFGTVCQIL---VWRILCNPQLKIARFL

Query:  RGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDV
        + RY+ N+  L +    + S  W+G+ WG +LL +G R  +GNG ++F   D WI     FK  SF         V++FIT   +W+   LNQ+    DV
Subjt:  RGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDV

Query:  EVIKGLPIS-STAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSK
        + I  +P+S   + D+LIWH+   G Y V SG+ L  +       +G    ++WW T W L +P KVK+F  R   N +P   +L    V     C +  
Subjt:  EVIKGLPIS-STAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSK

Query:  EANETTHHALF
         A E+  HALF
Subjt:  EANETTHHALF

A0A803Q8J4 Uncharacterized protein2.5e-7532.42Show/hide
Query:  KDASVREWNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLD
        K  S    N T + LIPK ++P                         R K +L  +I E QS F+P R I+DN+++  E++H L  K  GR GF+ALKLD
Subjt:  KDASVREWNHTNLVLIPKCRQP-------------------------RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLD

Query:  MSKTYDMVEWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKI
        MSK +D VEWS++  +M K+GF  RW+                              RQ DPL PYLFL+C+EG S LL    S   L G++I+R  P I
Subjt:  MSKTYDMVEWSYLNQIMEKLGFHERWI------------------------------RQEDPLLPYLFLLCTEGFSALLD-STSRRNLAGISITRSCPKI

Query:  SHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRVSES----LGL-------EESFFLSRRKGSVDQK
        SHL F DDSL+F +A      A   +L  Y K SGQ +N +KS++ FS N     K    +ILGM + +     LGL       +++ F   ++      
Subjt:  SHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSHILGMRVSES----LGL-------EESFFLSRRKGSVDQK

Query:  HYTGDPYLCDG---VLLDFERDSIENFGTVCQIL-----------------------------VWRILCNPQLK--------------------------
        H   D     G   VLL     SI  +   C  L                              W+ LC+ +++                          
Subjt:  HYTGDPYLCDG---VLLDFERDSIENFGTVCQIL-----------------------------VWRILCNPQLK--------------------------

Query:  ----IARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKL
            + R L+GRY+P +  L++     SS  W+G+ WG +LLK+GIRK +GNG SI    DPWI     F    +   P  ++VVAD+ITP  +W+  KL
Subjt:  ----IARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKL

Query:  NQHLNQDDVEVIKGLPISSTA-PDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVP
        +   +  DV  I  LP+S+ A PD  IWH    GEY VKSGY    S   + + S     T+WW + W+L +P KVK+F  ++ HN +P    L      
Subjt:  NQHLNQDDVEVIKGLPISSTA-PDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVP

Query:  VHGMCPVSKEANETTHHALF
            C +   A E+  HALF
Subjt:  VHGMCPVSKEANETTHHALF

A0A803QH76 Uncharacterized protein3.9e-7632.96Show/hide
Query:  LIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERWI------------
        L+ K    R K VL  +I + QS F+P R I+DN++L  E++H L  KK GR G+AALKLDMSK +D VEW +L+++M K+GFH  W+            
Subjt:  LIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERWI------------

Query:  ------------------RQEDPLLPYLFLLCTEGFSALL-DSTSRRNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCIN
                          RQ DPL PYLFL+C+EG SALL    +   L G+++ RS P ISHL F DDSL+F +A      A   +L  Y + SGQ +N
Subjt:  ------------------RQEDPLLPYLFLLCTEGFSALL-DSTSRRNLAGISITRSCPKISHLFFVDDSLVFLKA------AAGSILKDYEKVSGQCIN

Query:  VNKSMVFFSENALPDTKEFLSHILGMRV------------------SESLGLEESF----------FLSRRKGSVDQKHYTGDPYLCDGVL---LDFERD
          K+++ FS NA    ++    +L M +                       L + F          F      +  + H+    ++C       + F R 
Subjt:  VNKSMVFFSENALPDTKEFLSHILGMRV------------------SESLGLEESF----------FLSRRKGSVDQKHYTGDPYLCDGVL---LDFERD

Query:  SIENFGTVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMG
         I     +     WRIL  P   +AR L+ RY+ N   L +      S  W+ +  G +LL +G+R  +GNG+++   +DPW+    TF    +   P+ 
Subjt:  SIENFGTVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMG

Query:  DTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPISS-TAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVL
           V  +I  + QWD   L QH    DV+ I  +P+S     DK++WH+   G Y V+SGY L +S    D  S       WWN LW L +P KVK+F  
Subjt:  DTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPISS-TAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSWWNTLWKLFMPNKVKLFVL

Query:  RSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFR
        R  ++ +P+ VNL +  +     C + K + E+  HALFR
Subjt:  RSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFR

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein1.7e-0725.34Show/hide
Query:  DMNQMLTSPFTEEEVIAAVKDA------------SVREWNHTNLV-----LIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKT
        D +++LT  F + E+  + + A             ++ W   +L+     ++ K    RLK VL E+I   QS  +PGR+I DN+ L  ++LHF   ++T
Subjt:  DMNQMLTSPFTEEEVIAAVKDA------------SVREWNHTNLV-----LIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKT

Query:  GRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGF------------------------------HERWIRQEDPLLPYLFLLCTEGFSALLDSTSRRNLAG
        G    A L LD  K +D V+  YL   ++   F                                R +RQ  PL   L+ L  E F  LL    R+ L G
Subjt:  GRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGF------------------------------HERWIRQEDPLLPYLFLLCTEGFSALLDSTSRRNLAG

Query:  ISITRSCPKISHLFFVDDSLV
        + +     ++    + DD ++
Subjt:  ISITRSCPKISHLFFVDDSLV

P92555 Uncharacterized mitochondrial protein AtMg012504.1e-0647.27Show/hide
Query:  RWIRQEDPLLPYLFLLCTEGFSALLDSTSRR-NLAGISITRSCPKISHLFFVDDS
        R +RQ DPL PYLF+LCTE  S L      +  L GI ++ + P+I+HL F DD+
Subjt:  RWIRQEDPLLPYLFLLCTEGFSALLDSTSRR-NLAGISITRSCPKISHLFFVDDS

P93295 Uncharacterized mitochondrial protein AtMg003101.8e-0627.91Show/hide
Query:  WRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPM
        +RI+  P   ++R LR RY+P+S ++  +     S+ W+ ++ G +LL +G+ + +G+G    ++ D WI       +   P PP+
Subjt:  WRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPM

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.4e-1726.36Show/hide
Query:  LRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSL--------QWDCGKL
        ++ RY+ +  +L++      S+ W  L+ G+ LLK+G R  +G+G++I +        L+    +  PRP   +    +    +L         WD  K+
Subjt:  LRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSL--------QWDCGKL

Query:  NQHLNQDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGI------ETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNL
        +Q ++Q D   I  + ++ S  PDK+IW+Y+  GEY V+SGY L    +  D ++ +               +W L +  K+K F+ R+    + +   L
Subjt:  NQHLNQDDVEVIKGLPIS-STAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGI------ETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNL

Query:  GNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSAFPASSLTAAQ---HDFRE
            + +   CP     NE+ +HALF    T P  T+      SSL   Q   +DF E
Subjt:  GNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSAFPASSLTAAQ---HDFRE

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.1e-1240Show/hide
Query:  RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERWI
        RLK ++  +I   Q++FIPGR  +DN++   E +H + +KK G  G+  LKLD+ K YD + W YL   +   GF E W+
Subjt:  RLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMVEWSYLNQIMEKLGFHERWI

AT4G29090.1 Ribonuclease H-like superfamily protein2.2e-1825.52Show/hide
Query:  VWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTV-------VA
        +WR+L  P+  +A+  + RY+  S  LN+      SF WK +    ++L+QG R  +GNG  I +++  W+            R P  +         V+
Subjt:  VWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTV-------VA

Query:  DFITPS-LQWDCGKLNQHLNQDDVEVIKGL-PISSTAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSW---WNTLWKLFMPNKVKLFVLR
        D I  S  +W    +     + + ++I  L P      D   W Y   G+Y VKSGY +    I + S+     E S    +  +WK     K++ F+ +
Subjt:  DFITPS-LQWDCGKLNQHLNQDDVEVIKGL-PISSTAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSASGVGIETSW---WNTLWKLFMPNKVKLFVLR

Query:  SFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFR
           N +P    L   H+     C       ET +H LF+
Subjt:  SFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-0727.91Show/hide
Query:  WRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPM
        +RI+  P   ++R LR RY+P+S ++  +     S+ W+ ++ G +LL +G+ + +G+G    ++ D WI       +   P PP+
Subjt:  WRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQGIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPM

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.9e-0747.27Show/hide
Query:  RWIRQEDPLLPYLFLLCTEGFSALLDSTSRR-NLAGISITRSCPKISHLFFVDDS
        R +RQ DPL PYLF+LCTE  S L      +  L GI ++ + P+I+HL F DD+
Subjt:  RWIRQEDPLLPYLFLLCTEGFSALLDSTSRR-NLAGISITRSCPKISHLFFVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGATATGAACCAGATGCTTACCTCTCCTTTTACGGAGGAGGAAGTGATTGCAGCTGTCAAAGATGCATCGGTTCGCGAGTGGAATCATACGAATTTGGTGCTTAT
CCCAAAGTGTCGCCAACCGAGGCTTAAACTAGTGTTGAATGAGATCATTGATGAGTGTCAGTCGACTTTTATTCCTGGTCGGTCGATTTCCGATAATATGATATTGGGTC
ATGAAATGTTACACTTCTTAAATAAAAAGAAAACAGGGAGGACTGGTTTTGCAGCTTTAAAACTAGATATGAGTAAAACGTATGATATGGTGGAATGGTCATATCTGAAT
CAGATCATGGAAAAGCTAGGGTTCCATGAGCGATGGATTAGGCAAGAAGATCCCCTTTTGCCTTATTTATTTCTCTTATGTACAGAGGGATTTTCGGCCCTATTGGATTC
TACTAGTCGGAGGAATTTGGCAGGGATCTCAATTACGAGATCTTGTCCAAAAATCTCTCATTTATTCTTTGTCGATGATAGTTTGGTGTTTCTTAAAGCTGCGGCAGGAT
CCATTCTGAAGGATTATGAGAAGGTTTCGGGTCAGTGTATTAATGTTAACAAATCGATGGTGTTCTTCTCAGAGAATGCTCTCCCAGACACTAAGGAGTTTTTAAGCCAT
ATTTTAGGCATGAGGGTATCAGAGTCCTTGGGGTTGGAAGAGTCATTTTTTCTCTCAAGGAGGAAAGGAAGTGTTGATCAAAAGCATTATACTGGCGATCCCTACCTATG
CGATGGGGTGCTTTTGGATTTCGAAAGGGATTCTATCGAAAATTTCGGCACTGTGTGCCAAATTTTGGTATGGAGGATTTTGTGTAACCCTCAGCTTAAAATTGCAAGGT
TTCTTCGTGGGCGTTATTACCCAAATTCTATGGTTTTAAATTCGACAGCCAGTCTTACCTCATCCTTTTTTTGGAAAGGACTTGTATGGGGAATGGATCTTCTGAAACAA
GGTATCAGGAAAAATTTGGGTAATGGGAGATCAATATTTTTGTTCAAGGATCCTTGGATTGCGAGGCTCAATACTTTCAAGGTAACATCCTTTCCTAGGCCACCCATGGG
AGATACGGTCGTGGCAGATTTCATTACTCCATCTTTGCAATGGGATTGTGGCAAATTAAATCAGCATTTGAACCAAGATGATGTAGAGGTGATAAAGGGTCTACCGATAA
GTAGTACTGCTCCTGATAAATTGATATGGCATTATGATAGAAGAGGAGAGTATTTTGTGAAAAGTGGTTATAAGCTATGTCTAAGTAGGATTCGGGAGGATTCAGCCTCA
GGAGTTGGGATTGAGACTAGCTGGTGGAATACTCTGTGGAAATTGTTTATGCCTAATAAGGTGAAATTGTTTGTTTTGAGGTCTTTCCATAATGTGATTCCATCGATGGT
CAATCTTGGAAACCATCATGTCCCGGTCCATGGAATGTGTCCTGTGTCTAAAGAGGCTAATGAGACTACTCATCATGCTCTTTTCCGTGTCGACGCAACTCTTCCAGCAA
CGACGGTAACTTCGGCGTTTCCGGCGAGCAGTCTCACGGCGGCGCAACACGATTTTCGGGAGTGGGCTCGCGGGTTCGGTGCAACGTTGTCGTCTCCGTGGATTTCCGGC
AAGCAAGGCAGTGATGACAGCTTGCGTAGTTCGTCGGCGGGTCTTCTCCGATCAGCGACAGCAAGTCCAAACGTTGTGGTGTTCGTTTTCCGGCAAGTTCGTTGTGTTCG
CGTATTCCGGTCGTTAGCGTATTGGCGATTCAATAGGAAGATTGGAGTGGATTTGCATACCCCGAACAGCAAGCACCGTTCTGTTTTGAAGTCGGGTCGTTACAGTGGGA
GAGCAAGGGAGGTTTGGGATGTCTTTTTCCCCTCGGTGGTTGGGAATAGGATGGGGGATATGGATATTAAAGATAGATGGCTCAGTATGAGAAGATTCAAGGGGTCGGAT
TTGGTATGCATCTGCGTGGGTGCATGGGCGATATGGAATGATAGGAATAATTTTATTCATAACAGGCCAATCCCTATAGTAGAGTCGCGTTGTGAATGGTTAAATTCTTA
TATAGATGACTTTTGGAAGTTTGATCCGAAGGGTGGTCCATTGGTGCAATCACCGGAGGATATCATGAATATTATTTCAAAAGGGGAAGACTGGATTCTTCATACAGATG
CAGCTTTCATAAGTGGCGCAGGAAAGAGTGGAGTTGACCTCCTTCTGCGGAATAGATTGGGACATCTACAGGCAGCACAATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAGATATGAACCAGATGCTTACCTCTCCTTTTACGGAGGAGGAAGTGATTGCAGCTGTCAAAGATGCATCGGTTCGCGAGTGGAATCATACGAATTTGGTGCTTAT
CCCAAAGTGTCGCCAACCGAGGCTTAAACTAGTGTTGAATGAGATCATTGATGAGTGTCAGTCGACTTTTATTCCTGGTCGGTCGATTTCCGATAATATGATATTGGGTC
ATGAAATGTTACACTTCTTAAATAAAAAGAAAACAGGGAGGACTGGTTTTGCAGCTTTAAAACTAGATATGAGTAAAACGTATGATATGGTGGAATGGTCATATCTGAAT
CAGATCATGGAAAAGCTAGGGTTCCATGAGCGATGGATTAGGCAAGAAGATCCCCTTTTGCCTTATTTATTTCTCTTATGTACAGAGGGATTTTCGGCCCTATTGGATTC
TACTAGTCGGAGGAATTTGGCAGGGATCTCAATTACGAGATCTTGTCCAAAAATCTCTCATTTATTCTTTGTCGATGATAGTTTGGTGTTTCTTAAAGCTGCGGCAGGAT
CCATTCTGAAGGATTATGAGAAGGTTTCGGGTCAGTGTATTAATGTTAACAAATCGATGGTGTTCTTCTCAGAGAATGCTCTCCCAGACACTAAGGAGTTTTTAAGCCAT
ATTTTAGGCATGAGGGTATCAGAGTCCTTGGGGTTGGAAGAGTCATTTTTTCTCTCAAGGAGGAAAGGAAGTGTTGATCAAAAGCATTATACTGGCGATCCCTACCTATG
CGATGGGGTGCTTTTGGATTTCGAAAGGGATTCTATCGAAAATTTCGGCACTGTGTGCCAAATTTTGGTATGGAGGATTTTGTGTAACCCTCAGCTTAAAATTGCAAGGT
TTCTTCGTGGGCGTTATTACCCAAATTCTATGGTTTTAAATTCGACAGCCAGTCTTACCTCATCCTTTTTTTGGAAAGGACTTGTATGGGGAATGGATCTTCTGAAACAA
GGTATCAGGAAAAATTTGGGTAATGGGAGATCAATATTTTTGTTCAAGGATCCTTGGATTGCGAGGCTCAATACTTTCAAGGTAACATCCTTTCCTAGGCCACCCATGGG
AGATACGGTCGTGGCAGATTTCATTACTCCATCTTTGCAATGGGATTGTGGCAAATTAAATCAGCATTTGAACCAAGATGATGTAGAGGTGATAAAGGGTCTACCGATAA
GTAGTACTGCTCCTGATAAATTGATATGGCATTATGATAGAAGAGGAGAGTATTTTGTGAAAAGTGGTTATAAGCTATGTCTAAGTAGGATTCGGGAGGATTCAGCCTCA
GGAGTTGGGATTGAGACTAGCTGGTGGAATACTCTGTGGAAATTGTTTATGCCTAATAAGGTGAAATTGTTTGTTTTGAGGTCTTTCCATAATGTGATTCCATCGATGGT
CAATCTTGGAAACCATCATGTCCCGGTCCATGGAATGTGTCCTGTGTCTAAAGAGGCTAATGAGACTACTCATCATGCTCTTTTCCGTGTCGACGCAACTCTTCCAGCAA
CGACGGTAACTTCGGCGTTTCCGGCGAGCAGTCTCACGGCGGCGCAACACGATTTTCGGGAGTGGGCTCGCGGGTTCGGTGCAACGTTGTCGTCTCCGTGGATTTCCGGC
AAGCAAGGCAGTGATGACAGCTTGCGTAGTTCGTCGGCGGGTCTTCTCCGATCAGCGACAGCAAGTCCAAACGTTGTGGTGTTCGTTTTCCGGCAAGTTCGTTGTGTTCG
CGTATTCCGGTCGTTAGCGTATTGGCGATTCAATAGGAAGATTGGAGTGGATTTGCATACCCCGAACAGCAAGCACCGTTCTGTTTTGAAGTCGGGTCGTTACAGTGGGA
GAGCAAGGGAGGTTTGGGATGTCTTTTTCCCCTCGGTGGTTGGGAATAGGATGGGGGATATGGATATTAAAGATAGATGGCTCAGTATGAGAAGATTCAAGGGGTCGGAT
TTGGTATGCATCTGCGTGGGTGCATGGGCGATATGGAATGATAGGAATAATTTTATTCATAACAGGCCAATCCCTATAGTAGAGTCGCGTTGTGAATGGTTAAATTCTTA
TATAGATGACTTTTGGAAGTTTGATCCGAAGGGTGGTCCATTGGTGCAATCACCGGAGGATATCATGAATATTATTTCAAAAGGGGAAGACTGGATTCTTCATACAGATG
CAGCTTTCATAAGTGGCGCAGGAAAGAGTGGAGTTGACCTCCTTCTGCGGAATAGATTGGGACATCTACAGGCAGCACAATCTTAG
Protein sequenceShow/hide protein sequence
MTDMNQMLTSPFTEEEVIAAVKDASVREWNHTNLVLIPKCRQPRLKLVLNEIIDECQSTFIPGRSISDNMILGHEMLHFLNKKKTGRTGFAALKLDMSKTYDMVEWSYLN
QIMEKLGFHERWIRQEDPLLPYLFLLCTEGFSALLDSTSRRNLAGISITRSCPKISHLFFVDDSLVFLKAAAGSILKDYEKVSGQCINVNKSMVFFSENALPDTKEFLSH
ILGMRVSESLGLEESFFLSRRKGSVDQKHYTGDPYLCDGVLLDFERDSIENFGTVCQILVWRILCNPQLKIARFLRGRYYPNSMVLNSTASLTSSFFWKGLVWGMDLLKQ
GIRKNLGNGRSIFLFKDPWIARLNTFKVTSFPRPPMGDTVVADFITPSLQWDCGKLNQHLNQDDVEVIKGLPISSTAPDKLIWHYDRRGEYFVKSGYKLCLSRIREDSAS
GVGIETSWWNTLWKLFMPNKVKLFVLRSFHNVIPSMVNLGNHHVPVHGMCPVSKEANETTHHALFRVDATLPATTVTSAFPASSLTAAQHDFREWARGFGATLSSPWISG
KQGSDDSLRSSSAGLLRSATASPNVVVFVFRQVRCVRVFRSLAYWRFNRKIGVDLHTPNSKHRSVLKSGRYSGRAREVWDVFFPSVVGNRMGDMDIKDRWLSMRRFKGSD
LVCICVGAWAIWNDRNNFIHNRPIPIVESRCEWLNSYIDDFWKFDPKGGPLVQSPEDIMNIISKGEDWILHTDAAFISGAGKSGVDLLLRNRLGHLQAAQS