CuGenDBv2

Lag0028938 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028938
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr8:33200835..33202876
RNA-Seq Expression: Lag0028938
Synteny: Lag0028938
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
XP_023914298.1 uncharacterized protein LOC112025844 [Quercus suber] | e value: 2.5e-110 | % identity: 36.89
Query:  DLLSFSNNHIDGWITWDAYH--WRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVG
        +++S S NHID  I  +A H  WR +G YG      + +TW+L+  L      PWL  GDFN +L+ HEK G   +  S + AF+ V+D CGL+DLGFVG
Subjt:  DLLSFSNNHIDGWITWDAYH--WRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVG

Query:  NRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVL---SPQP----------------------------------------
        ++FTW  +R  G + ERLDR  +S AW  ++P   V HL+ H SDH+ I + L   +P+P                                        
Subjt:  NRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVL---SPQP----------------------------------------

Query:  ---------------GCWRRS----------------------------KAQLEDVLQEEELYWKQRSR-------------------------------
                       GC R+S                            K++L  +L +E L W+QR+R                               
Subjt:  ---------------GCWRRS----------------------------KAQLEDVLQEEELYWKQRSR-------------------------------

Query:  ---------------------EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQS
                               LF+TS+PS  +  V L  ++ SV  EMN  LL PF +EE+  AL Q     APGPDG+   FY   W+++G  V  +
Subjt:  ---------------------EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQS

Query:  CLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKW
         L  LN+   P  IN T I LIPK+K+P  +SD+RPISLCN  YKL+SKV+ NR K +LP++I  NQSAF  GR + DN+++ +E +H ++    GKS +
Subjt:  CLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKW

Query:  AALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVA
         ALKLDMSKAYDR+EW F+  +M ++ F  +W  LIL C+S+VS+S  +NG     + PSRGLRQGDPLSPYLFL+C+EGL  L++       I G  + 
Subjt:  AALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVA

Query:  RSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE
        +  P ++HLFFA DSL+F RA++ E   IQ LL+ YE
Subjt:  RSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis] | e value: 3.0e-119 | % identity: 41.81
Query:  SSDLLSFSNNHIDGWITWDA--YHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGF
        S  + ++S+ H+D  +  D    +WR TGFYG P    +  +W LL +L G  D PWL+  DFN +L   EK G  D+  +++ AFQ+ +  C L DLGF
Subjt:  SSDLLSFSNNHIDGWITWDA--YHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGF

Query:  VGNRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQPGCWRRSK-----------------------AQLEDVLQEEE
         G  FTWCN RP G ++ERLDR  ++ AW  I+P   V HL    SDH PI + L      W +                         ++++++L+ EE
Subjt:  VGNRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQPGCWRRSK-----------------------AQLEDVLQEEE

Query:  LYWKQRSREQLFSTSEPSDQDFD-----------------------VSLRDLQR---------------------------SVDSEMNMDLLKPFTEEEI
          W QR+R       + +   F                         +L DL+R                            V  E N++L +P+T EE+
Subjt:  LYWKQRSREQLFSTSEPSDQDFD-----------------------VSLRDLQR---------------------------SVDSEMNMDLLKPFTEEEI

Query:  LRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLI
          AL Q HP KAPGPDG    FY+  W IVG  V ++ L VLN G +  ++N+T IVLIPK+K+P+R+S FRPISLCN  YKL+SKV+ NRM+ ILP++I
Subjt:  LRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLI

Query:  LSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGL
          NQSAF+ GR + DN++  FE  H L+ +  GK    ALKLDMSKAYDR+EW FLR VM+RM F Q + D I+ C+SSVS+S  +NG  +    P+RGL
Subjt:  LSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGL

Query:  RQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE
        RQGDPLSPYLF+LCAEGLS+L++  E    ++G  V R SP +SHL FA DSLLF  AN+ E V +QD+L  YE
Subjt:  RQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE

XP_030942013.1 uncharacterized protein LOC115967068 [Quercus lobata] | e value: 6.1e-112 | % identity: 38.14
Query:  ETKLSSSDLLSFSNNHIDGWITWDAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLD
        + K  S +  S   NH +G     A  WR TGFYG P A MR  +W L+  L+   D PW+I GDFN +++  EK G  D+   ++  F++ +  CGL+D
Subjt:  ETKLSSSDLLSFSNNHIDGWITWDAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLD

Query:  LGFVGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQ----PGCWRRS---------------------------
        LGFVG R+TWCN R  E     RLDR  ++  W +++    V H     SDH  + L L  Q      C++ S                           
Subjt:  LGFVGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQ----PGCWRRS---------------------------

Query:  ---------------KAQLEDVLQEEELYWKQRSR----------------------------------------------------EQLFSTSEPSDQD
                       K ++ +V   EE+ W QRSR                                                    +++FSTS P   +
Subjt:  ---------------KAQLEDVLQEEELYWKQRSR----------------------------------------------------EQLFSTSEPSDQD

Query:  FDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDF
        F  SL  ++R V  +MN DLL+ F EEE+ RALKQ HP K+PGP+ +S  F++++W +VGP V+   L  L  G  P  +N+T I LIPK+  P+++S+F
Subjt:  FDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDF

Query:  RPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTD
        RPISLCN  YK++SKV+ NR+K +LP +I   QSAF+PGR + DNV++ FE +H + ++  GK    A+KLDMSKAYDR+EW +L A+M R+ F ++W  
Subjt:  RPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTD

Query:  LILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLI
        L++ CV+SVS+S  LNGE  G + P+RGLRQGDP+SPYLFLLCAEGLS++LR  E + +  G  V R +P +SHL FA D ++F  A+  E   +  +L 
Subjt:  LILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLI

Query:  CYE
         YE
Subjt:  CYE

XP_039834390.1 uncharacterized protein LOC120695147 [Panicum virgatum] | e value: 3.0e-111 | % identity: 43.94
Query:  LLSFSNNHIDGWI-TWDAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNR
        +LS+S  HID  I   D   W++TG YG    + +++TW LL  L+  S  PWL  GDFN +L+  EKEGG  +       F+  ++ C L DLGFVG+ 
Subjt:  LLSFSNNHIDGWI-TWDAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNR

Query:  FTWCNRRPEGTIY--ERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQPGCWRRS--KAQLEDVLQEEELYWKQRSREQLFSTSEPSD--QD
        FTW N       Y  ERLDR  ++ AW   +P   V + D   SDHRPI +    +PG   +S  +  LE + + E  + ++    +    +  +D    
Subjt:  FTWCNRRPEGTIY--ERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQPGCWRRS--KAQLEDVLQEEELYWKQRSREQLFSTSEPSD--QD

Query:  FDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDF
         +  L  LQ+ +  +MN +L  PF+ +E+  ALK     KAPG DG+   FYK  WS+VG  V +  L VLN    P   N+T+IVL+PK K+P ++ D 
Subjt:  FDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDF

Query:  RPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTD
        RPISLCN  YKLISKV+ NR+K +LP +I  +QSAF+PGR + DNV+L +E  H L +R  GK+  AA+KLDMSKAYDR+EW FL  +M ++ FA QW +
Subjt:  RPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTD

Query:  LILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLI
         +++CVS+VS+   +NG+    + P RGLRQG+PLSPYLF+LCAEGLS+LL+  E +  I G RV R +P I+HLFFA DSL+  RAN  +A  ++ +L 
Subjt:  LILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLI

Query:  CYE
         YE
Subjt:  CYE

XP_042962369.1 uncharacterized protein LOC122296631 [Carya illinoinensis] | e value: 2.2e-114 | % identity: 40.13
Query:  KLSSSDLLSFSNNHIDGWIT--WDAY--HWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGL
        K  S  + SFS  HI   +T   DA    W LTGFYG P    R ++WSLL+ L+   D  W+I GDFN ++   EK GGR KP  +L AF++V++ C +
Subjt:  KLSSSDLLSFSNNHIDGWIT--WDAY--HWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGL

Query:  LDLGFVGNRFTWCNRRPEG-TIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQ--------------------------PGCWR-----
         D+GF G+ FTWCN R EG TI ERLDRCFS++ WH  YP  VV H     SDH+PI L LS +                           G W+     
Subjt:  LDLGFVGNRFTWCNRRPEG-TIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQ--------------------------PGCWR-----

Query:  --------------------------------------------------------RSKAQLEDVLQEEELYWKQRSREQLFSTSEPSDQ----------
                                                                +++ +++  LQ  E+ WKQRS+   +  +E  D+          
Subjt:  --------------------------------------------------------RSKAQLEDVLQEEELYWKQRSREQLFSTSEPSDQ----------

Query:  ---------DFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPK
                 DF   L+DL   +D++M   L  PFT +E+ RAL + HP KA GPDG++  FY+  WSIVG  V  + L  LN G  P  IN T I LIPK
Subjt:  ---------DFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPK

Query:  IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMD
         K P  VSD+R ISLCN  YKLISKV+ +R+K + P +I  +Q+AF+PGR + DNV++ +E +H LRR+  GK  + +LKLDMSKAYDRIEW +L  VM 
Subjt:  IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMD

Query:  RMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVV
        +M F  +W  L++ CV++VSFS  +NGE    + P+RGLRQGDP+SPYLFLLC EGL +LL     +  I GF+V R +P ++HL FA DS+LF RAN+ 
Subjt:  RMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVV

Query:  EAVAIQDLLICYE
          + +   L  YE
Subjt:  EAVAIQDLLICYE

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EX83 Reverse transcriptase domain-containing protein | e value: 4.8e-115 | % identity: 40
Query:  SSDLLSFSNNHIDGWITWDAY-HWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFV
        S  + S+S++HID  + +D    WR TGFYG P A  +   W LL  LR     PW  GGDFN LL   EK G   +P  ++  F+ V+D CG +DLGFV
Subjt:  SSDLLSFSNNHIDGWITWDAY-HWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFV

Query:  GNRFTWCNRRP-EGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPI--------------------ELVLSPQPGC-----------------WR
        G+ +TW N++     + ERLDRC ++  W   +PN  V HL    SDH+P+                    E + +   GC                 ++
Subjt:  GNRFTWCNRRP-EGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPI--------------------ELVLSPQPGC-----------------WR

Query:  RSKAQLEDVLQEEELYWKQRSR----------------------------------------------------EQLFSTSEPSDQDFDVSLRDLQRSVD
        +   +L D+  +EE  WKQRSR                                                    + LF+TS+P   +FD  L  + R + 
Subjt:  RSKAQLEDVLQEEELYWKQRSR----------------------------------------------------EQLFSTSEPSDQDFDVSLRDLQRSVD

Query:  SEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLI
         +MN  L   FT  E+  AL Q  P KAPGPDG++  FY+ +W+IVG  V  S L  L  G     IN T I LIPK++ P  V DFRPISLCN  YK+I
Subjt:  SEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLI

Query:  SKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSF
        +KV+ NR+K ILP++I  +QSAF+PGR + DN+++ FE +H ++   G K  + ALKLDMSKAYDR+EW FL  +M  M F++ W  +I+ CV +VS+S 
Subjt:  SKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSF

Query:  NLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE
         +NGE  G   P+RGLRQGDP+SPYLFLLCAEGL++LL        I G  ++R  P +SHLFFA DS+LF RA++ E  AIQD+L  YE
Subjt:  NLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE

A0A2N9GDB5 Reverse transcriptase domain-containing protein | e value: 5.7e-116 | % identity: 39.23
Query:  SFSNNHIDGWITWDAYH-WRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFT
        S+S++HID  +  +    WR TGFYG P    RD++W+LL +L      PW   GDFN L+   EK+G  ++   ++  F++V+D CGL+DLGF G RFT
Subjt:  SFSNNHIDGWITWDAYH-WRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFT

Query:  WCNRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPI-----------------ELVLSPQPGC-------WRRS-----------------
        W N RP    +ERLDR  ++  W  ++P+  V HL+   SDH+PI                 E V +   GC       W+ +                 
Subjt:  WCNRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPI-----------------ELVLSPQPGC-------WRRS-----------------

Query:  -----------KAQLEDVLQEEELYWKQRSR----------------------------------------------------EQLFSTSEPSDQDFDVS
                   K +L ++L +EE  W+QRSR                                                      LF T  P   +  V 
Subjt:  -----------KAQLEDVLQEEELYWKQRSR----------------------------------------------------EQLFSTSEPSDQDFDVS

Query:  LRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPIS
          D+QR V +EMN  L+K FT  E+  ALKQ  P KAPGPDGL   FY+ +W ++G  V  + L  LN G    +IN T I LIPK++ P  V +FRPIS
Subjt:  LRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPIS

Query:  LCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILR
        LCN  YK+ISK++ NR+K ILP+++  +QSAFIPGR + DN+++ FE +H ++ +  GK+   ALKLDMSKAYDR+EW FL+ VM++M F ++W  +++ 
Subjt:  LCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILR

Query:  CVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE
        C+S+VS+S  +NGE  G + PSRGLRQGDPLSPYLFLLCAEGL SL++  +    + G  ++R  P I+HLFFA DSLLF +A   +   IQ +L  YE
Subjt:  CVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE

A0A2N9GJ35 Uncharacterized protein | e value: 1.4e-114 | % identity: 37.79
Query:  DLLSFSNNHIDGWITW-DAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGN
        ++ S+S +HIDG +   D   WRLTGFYG+P A +R ++WSLL  LR  SD PW+I GDFN +    EK G  D+  +++AAF+  +  C L D+GF G 
Subjt:  DLLSFSNNHIDGWITW-DAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGN

Query:  RFTWCNRRPEGTIYE-RLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVL---SPQP----------------------GC----------------
         FTW N R  G +   RLDR  +  AW  ++P+  +NHL    SDH  + L+L   + QP                      GC                
Subjt:  RFTWCNRRPEGTIYE-RLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVL---SPQP----------------------GC----------------

Query:  -----------------WRRS-----------------------------------KAQLEDVLQEEELYWKQRSR------------------------
                         W +S                                   K  L  + ++ E+ W+QRSR                        
Subjt:  -----------------WRRS-----------------------------------KAQLEDVLQEEELYWKQRSR------------------------

Query:  ----------------------------EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIV
                                      LF++S P  +  D  L +++  V   MN  L++PFT+EEI RAL Q HP K+PGPDG+S  F++ +W IV
Subjt:  ----------------------------EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIV

Query:  GPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRR
           V  + L  L +G   GSIN T +VLIPK+ AP  ++ FRPISLCN  YK++SKV+VNRMK ILP +I  +QSAF+PGR + DNVI+ FE IH L+  
Subjt:  GPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRR

Query:  SGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRAL
          G +   A+KLDMSKAYDR+EW +L+A+M ++ F  QW  L++ CV + ++S  +NGE  G +TP RGLRQGDPLSPYLFLLC EGLS++LR  ER +L
Subjt:  SGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRAL

Query:  ISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICY
        + G  + R  P +SHLFFA DS++F RA   + V +Q+LL  Y
Subjt:  ISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICY

A0A2N9HTH6 Reverse transcriptase domain-containing protein | e value: 2.4e-114 | % identity: 40.99
Query:  SFSNNHIDGWITW--DAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRF
        ++S NHID  +        +R TGFYG P    R ++W+LL  LR     PWL  GDFN LL Q+EK G   +P  ++  F+  ++ C L DLGFVGN+F
Subjt:  SFSNNHIDGWITW--DAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRF

Query:  TWC-NRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQPGCWRRSK--------------------------------------
        TW   RR   T  ERLDR  +SV+W   Y   VV HL    SDH P+ L +       RR K                                      
Subjt:  TWC-NRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQPGCWRRSK--------------------------------------

Query:  ---------------------------AQLEDVLQEEELYWKQRSREQ----LFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSH
                                     L+ ++QE+     Q   E+    +F++++P ++  +  L  +   V   MN +L+  FT EE+ +ALKQ +
Subjt:  ---------------------------AQLEDVLQEEELYWKQRSREQ----LFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSH

Query:  PHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFI
        P KAPGPDG+S  FY+++W IVGP V Q+ L +L+ G     IN T IVLIPKIK P +++D+RPI+LCN  YK++SK++ NR+K +LP++I + QSAF+
Subjt:  PHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFI

Query:  PGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSP
        PGR + DNV++ FE +H +  ++ GK    ALKLDMSKAYDR+EW F+ AVM R+ F ++W  LI+ C+S+VS+S  LNG + GN T SRG+RQGDPLSP
Subjt:  PGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSP

Query:  YLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE
        Y+FLLCAEGLSSLL+  ER   I+G   +R  P ++HLFFA DS+LF +A+     A+ ++L  YE
Subjt:  YLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE

A0A803QAN3 Uncharacterized protein | e value: 1.5e-116 | % identity: 42.6
Query:  LLSFSNNHIDGWITW-DAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSD-TPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGN
        L SF+ N  D ++ + +      T FYG P    R  TW+LL +L+  +   PW++ GDFN +LY H K+GG  +  S++  F+ V+D C L +L F G+
Subjt:  LLSFSNNHIDGWITW-DAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSD-TPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGN

Query:  RFTWCNRRPE-GTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIE----LVLSPQPGCWRRSKAQLEDVLQEEELYWKQRSR-------------
         FTW   R +  TI+ERLD CF++ +W+  +   V +HLDY+ SDHR I     L+ S      + ++A L+D+L +EE YW QRSR             
Subjt:  RFTWCNRRPE-GTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIE----LVLSPQPGCWRRSKAQLEDVLQEEELYWKQRSR-------------

Query:  ---------------------------------------EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLS
                                                 LF+T         + L  +  ++ S+MN+ L  PFT EE++ ALK   P K+PG DG+S
Subjt:  ---------------------------------------EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLS

Query:  GSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVIL
          FY+N+W IVG +V Q  L VLN G     +N+++I LIPK+  P  +SD+RPISLCN  YKLISKV+V R + +LP +I   QSAF+  R + DN+++
Subjt:  GSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVIL

Query:  GFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLS
         FE IH LR ++ G+  ++ALKLDMSKA+DR+EW +L AVM +M FA +W  LI+ C+ + SFSF+LNGE +G+V PSRGLRQGDPLSPYLFL+C+EGLS
Subjt:  GFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLS

Query:  SLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICY
         LL+  E    + G R+ R SP +SHL FA DSLLF RAN   A AIQ  L  Y
Subjt:  SLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICY

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 3.7e-27 | % identity: 27.76
Query:  LLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVV
        L +P T  EI+  +      K+PGPDG +  FY+ +   + P +++    +   G  P S  E  I+LIPK  +   +  +FRPISL N   K+++K++ 
Subjt:  LLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVV

Query:  NRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGE
        NR++  +  LI  +Q  FIPG     N+      I  + R          + +D  KA+D+I+ PF+   ++++     +  +I       + +  LNG+
Subjt:  NRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGE

Query:  RLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLL
        +L       G RQG PLSP LF +  E L+   R + +   I G ++ +    +S   FA D +++    +V A  +  L+
Subjt:  RLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLL

P08548 LINE-1 reverse transcriptase homolog | e value: 4.9e-24 | % identity: 28.53
Query:  AQLEDVLQEEELYWKQRSREQLFSTSEPSDQDFDVSLR--DLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQS
        ++++ +L E   Y+K     +L+S    + ++ D  L    L R    E+ M L +P +  EI   ++     K+PGPDG +  FY+     + P ++  
Subjt:  AQLEDVLQEEELYWKQRSREQLFSTSEPSDQDFDVSLR--DLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQS

Query:  CLVVLNHGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSK
           +   G  P +  E  I LIPK  K P R  ++RPISL N   K+++K++ NR++  +  +I  +Q  FIPG     N+      I  + +       
Subjt:  CLVVLNHGCSPGSINETMIVLIPKI-KAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSK

Query:  WAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRV
           L +D  KA+D I+ PF+   + ++     +  LI    S  + +  LNG +L +     G RQG PLSP LF +  E L+  +R  E +A I G  +
Subjt:  WAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRV

Query:  ARSSPPISHLFFAYDSLLF
           S  I    FA D +++
Subjt:  ARSSPPISHLFFAYDSLLF

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 2.5e-28 | % identity: 30.03
Query:  EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMD----LLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINET
        ++L+ST   +  + D   + L R    ++N D    L  P + +EI   +      K+PGPDG S  FY+     + P + +    +   G  P S  E 
Subjt:  EQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMD----LLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINET

Query:  MIVLIPK-IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEW
         I LIPK  K P ++ +FRPISL N   K+++K++ NR++  +  +I  +Q  FIPG     N+      IH + +          + LD  KA+D+I+ 
Subjt:  MIVLIPK-IKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEW

Query:  PFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHL
        PF+  V++R      + ++I    S    +  +NGE+L  +    G RQG PLSPYLF +  E L+   R + ++  I G ++ +    IS L
Subjt:  PFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHL

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 4.8e-27 | % identity: 30.74
Query:  KQRSREQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINE
        + RS  Q   + +P   D    L D    V       L  P T +E+ +AL+    +K+PG DGL+  F++  W  +GP   +        G  P S   
Subjt:  KQRSREQLFSTSEPSDQDFDVSLRDLQRSVDSEMNMDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINE

Query:  TMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEW
         ++ L+PK    R + ++RP+SL +  YK+++K +  R+K +L  +I  +QS  +PGR + DNV L  + +H   RR+G     A L LD  KA+DR++ 
Subjt:  TMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEW

Query:  PFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLR
         +L   +   SF  Q+   +    +S      +N      +   RG+RQG PLS  L+ L  E    LLR
Subjt:  PFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLR

P92555 Uncharacterized mitochondrial protein AtMg01250 | e value: 5.5e-15 | % identity: 57.97
Query:  FNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDS
        F +NG   G VTPSRGLRQGDPLSPYLF+LC E LS L R  + +  + G RV+ +SP I+HL FA D+
Subjt:  FNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDS

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | e value: 1.0e-08 | % identity: 34.44
Query:  TEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLIS
        +++EI  A+     +KAPGPD  +  F+   W +V  S I +       G      N T I LIPK+    ++S FRP+S C   YK+I+
Subjt:  TEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 8.7e-16 | % identity: 33.78
Query:  VVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLN
        +V R+K ++ NLI   Q++FIPGR   DN++   E +H +RR+ G K  W  LKLD+ KAYDRI W +L   +    F + W   I R     +F     
Subjt:  VVNRMKHILPNLILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLN

Query:  GERLGNVTPSR---------GLRQGDPLSPYL--FLLCAEGLSSLLRG
           +G    S+         G R  D  +P+    + CAE L  + RG
Subjt:  GERLGNVTPSR---------GLRQGDPLSPYL--FLLCAEGLSSLLRG

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 3.9e-16 | % identity: 57.97
Query:  FNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDS
        F +NG   G VTPSRGLRQGDPLSPYLF+LC E LS L R  + +  + G RV+ +SP I+HL FA D+
Subjt:  FNLNGERLGNVTPSRGLRQGDPLSPYLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDS


Sequences
CDS sequence
ATGTCTGAAACAAAGTTGTCGTCAAGCGACCTCTTGTCATTTTCGAATAACCACATTGATGGGTGGATCACGTGGGACGCTTATCATTGGCGTCTCACGGGTTTCTATGG
TTTCCCTGCCGCAGATATGCGGGATCAAACGTGGTCCCTTCTCTCTAAGTTAAGGGGGGGTTCTGATACTCCTTGGCTTATAGGAGGGGACTTTAATGCCCTGTTGTATC
AGCATGAGAAGGAGGGTGGCAGAGATAAACCCCTCTCAGAGCTAGCGGCCTTTCAGAATGTGATTGACTCATGTGGGCTTCTTGATTTGGGCTTTGTGGGGAATAGGTTC
ACATGGTGCAACAGGCGGCCGGAAGGAACGATCTATGAGCGCTTGGATAGGTGTTTTAGCTCAGTTGCTTGGCACGATATCTACCCCAACTATGTAGTTAACCATCTTGA
TTACCATCAGTCCGATCACCGACCGATTGAGTTGGTTCTCTCTCCGCAGCCTGGTTGTTGGAGACGCTCGAAGGCCCAGTTGGAAGATGTTTTACAAGAGGAAGAACTTT
ACTGGAAGCAAAGATCCAGAGAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCTCTCAGGGACCTTCAGCGATCTGTGGATAGTGAGATGAAT
ATGGATCTATTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGGTTATCTGGCAGTTTCTATAAGAA
TCACTGGTCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTTTGGTCGTTTTGAATCATGGATGCTCCCCAGGTTCGATTAATGAGACTATGATTGTTCTTATTCCGAAGA
TCAAGGCCCCTCGACGAGTTTCTGATTTTCGTCCCATTTCCTTATGCAATTTTAGCTATAAGCTGATTTCGAAGGTCGTGGTTAATAGGATGAAACATATCCTTCCAAAT
CTTATATTATCCAACCAGAGTGCCTTTATCCCTGGGAGGTGTGTGGTGGATAATGTCATATTGGGGTTTGAATGCATCCATGAGTTAAGGAGACGGTCTGGGGGAAAATC
TAAATGGGCTGCTCTAAAACTTGACATGAGCAAAGCGTACGACAGGATAGAGTGGCCGTTTCTGCGGGCAGTTATGGATAGAATGAGTTTCGCTCAACAGTGGACTGATT
TGATTCTCCGGTGTGTTAGCTCGGTTTCTTTTTCGTTTAACCTGAATGGGGAGAGGTTGGGGAATGTGACTCCTTCCCGTGGGCTCAGACAGGGAGATCCGTTGTCTCCG
TATTTGTTTTTGCTCTGTGCGGAGGGTTTGTCTAGTCTGTTGCGAGGAGTAGAACGTCGAGCTTTGATATCTGGGTTTCGAGTTGCGCGGAGTAGTCCTCCGATTTCTCA
TCTATTTTTTGCATATGATAGCCTCCTTTTCTTCAGAGCAAACGTGGTGGAAGCAGTGGCTATCCAGGATTTGTTGATCTGTTATGAATGA
mRNA sequence
ATGTCTGAAACAAAGTTGTCGTCAAGCGACCTCTTGTCATTTTCGAATAACCACATTGATGGGTGGATCACGTGGGACGCTTATCATTGGCGTCTCACGGGTTTCTATGG
TTTCCCTGCCGCAGATATGCGGGATCAAACGTGGTCCCTTCTCTCTAAGTTAAGGGGGGGTTCTGATACTCCTTGGCTTATAGGAGGGGACTTTAATGCCCTGTTGTATC
AGCATGAGAAGGAGGGTGGCAGAGATAAACCCCTCTCAGAGCTAGCGGCCTTTCAGAATGTGATTGACTCATGTGGGCTTCTTGATTTGGGCTTTGTGGGGAATAGGTTC
ACATGGTGCAACAGGCGGCCGGAAGGAACGATCTATGAGCGCTTGGATAGGTGTTTTAGCTCAGTTGCTTGGCACGATATCTACCCCAACTATGTAGTTAACCATCTTGA
TTACCATCAGTCCGATCACCGACCGATTGAGTTGGTTCTCTCTCCGCAGCCTGGTTGTTGGAGACGCTCGAAGGCCCAGTTGGAAGATGTTTTACAAGAGGAAGAACTTT
ACTGGAAGCAAAGATCCAGAGAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCTCTCAGGGACCTTCAGCGATCTGTGGATAGTGAGATGAAT
ATGGATCTATTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGGTTATCTGGCAGTTTCTATAAGAA
TCACTGGTCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTTTGGTCGTTTTGAATCATGGATGCTCCCCAGGTTCGATTAATGAGACTATGATTGTTCTTATTCCGAAGA
TCAAGGCCCCTCGACGAGTTTCTGATTTTCGTCCCATTTCCTTATGCAATTTTAGCTATAAGCTGATTTCGAAGGTCGTGGTTAATAGGATGAAACATATCCTTCCAAAT
CTTATATTATCCAACCAGAGTGCCTTTATCCCTGGGAGGTGTGTGGTGGATAATGTCATATTGGGGTTTGAATGCATCCATGAGTTAAGGAGACGGTCTGGGGGAAAATC
TAAATGGGCTGCTCTAAAACTTGACATGAGCAAAGCGTACGACAGGATAGAGTGGCCGTTTCTGCGGGCAGTTATGGATAGAATGAGTTTCGCTCAACAGTGGACTGATT
TGATTCTCCGGTGTGTTAGCTCGGTTTCTTTTTCGTTTAACCTGAATGGGGAGAGGTTGGGGAATGTGACTCCTTCCCGTGGGCTCAGACAGGGAGATCCGTTGTCTCCG
TATTTGTTTTTGCTCTGTGCGGAGGGTTTGTCTAGTCTGTTGCGAGGAGTAGAACGTCGAGCTTTGATATCTGGGTTTCGAGTTGCGCGGAGTAGTCCTCCGATTTCTCA
TCTATTTTTTGCATATGATAGCCTCCTTTTCTTCAGAGCAAACGTGGTGGAAGCAGTGGCTATCCAGGATTTGTTGATCTGTTATGAATGA
Protein sequence
MSETKLSSSDLLSFSNNHIDGWITWDAYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRF
TWCNRRPEGTIYERLDRCFSSVAWHDIYPNYVVNHLDYHQSDHRPIELVLSPQPGCWRRSKAQLEDVLQEEELYWKQRSREQLFSTSEPSDQDFDVSLRDLQRSVDSEMN
MDLLKPFTEEEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLVVLNHGCSPGSINETMIVLIPKIKAPRRVSDFRPISLCNFSYKLISKVVVNRMKHILPN
LILSNQSAFIPGRCVVDNVILGFECIHELRRRSGGKSKWAALKLDMSKAYDRIEWPFLRAVMDRMSFAQQWTDLILRCVSSVSFSFNLNGERLGNVTPSRGLRQGDPLSP
YLFLLCAEGLSSLLRGVERRALISGFRVARSSPPISHLFFAYDSLLFFRANVVEAVAIQDLLICYE