; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028149 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028149
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr8:14622546..14624879
RNA-Seq ExpressionLag0028149
SyntenyLag0028149
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]4.4e-8233.48Show/hide
Query:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASS--PHTQ
        K  +S+D+Y+++IK+ ++ LA+VS+L+ DED+LIY LNGLP EYN F TS+RT+++ ++ EE++ +LK EE  IE + K+++    P AM A++  P+  
Subjt:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASS--PHTQ

Query:  SRSTSSNSPFQGRGWSRNNGRGRFPSSSGQGKS--RF-FPN-------TPT---------DNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFF
        S    S S F GRG     GRGRF +  G+  S  RF  PN        PT          NNS  + CQIC   GHS LDCY+ M++ YQG+    Q  
Subjt:  SRSTSSNSPFQGRGWSRNNGRGRFPSSSGQGKS--RF-FPN-------TPT---------DNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFF

Query:  AMVANQNS-----------------------------------------------------SYINCQTH-------------------------------
        AM A  N+                                                     S I+   H                               
Subjt:  AMVANQNS-----------------------------------------------------SYINCQTH-------------------------------

Query:  ----------------------------------------------------------HQPL-----------LLLGWLIQVVM------HTLPQTLAIS
                                                                  H PL             LG  +  V+      H    TL   
Subjt:  ----------------------------------------------------------HQPL-----------LLLGWLIQVVM------HTLPQTLAIS

Query:  LLHQNIMVKRMFLLAVDKCFL-SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFKPLVE--NMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILH
        L   +I   R        C +  +T++PFP S + S +PL+L+HSD+WGPAP  S + F   V   + FS      R+DGGG++     T  L QSGI H
Subjt:  LLHQNIMVKRMFLLAVDKCFL-SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFKPLVE--NMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILH

Query:  QKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKT
        ++SC +TPQQNGIAERKHRHI+E  ++L++R+ LP ++W   F+   +LINR+P+  L + SP+E LF+  PDY  LK FG ACYP L PY+ +K+QPKT
Subjt:  QKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKT

Query:  TQCVFLGYPLEYKGYFCYNMS-NKLLVSQHVFFDESTFPFAE-SPFPTPTFSSTPSPSYYHPPYSLLLSSSPCAENINHLSSTNT-----NVPTDPSNT
        TQC FLGY L YKG++C + S NK+ V++HV FDE+T+PF   +  PT T++     ++  PP SLL S+   A + +H  ST T     +VP  PS+T
Subjt:  TQCVFLGYPLEYKGYFCYNMS-NKLLVSQHVFFDESTFPFAE-SPFPTPTFSSTPSPSYYHPPYSLLLSSSPCAENINHLSSTNT-----NVPTDPSNT

PKU81762.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]5.7e-6631.55Show/hide
Query:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQSR
        K D S+ AY+  IKE  + +A     ++ ED++++TLNGLPT YN+F T++RT  QP+  ++L+ LL SEE  I + A K+            S H    
Subjt:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQSR

Query:  STSSNSPFQGRGWSRNNGRGRFPSSSG-QGKSRFFPNTPTDNNSSR-IPCQICQHTGHSTLDCYNWMNYHY-----------------------------
        +T+  +       SR  GRGRF ++ G +G+     N   D N  R + CQIC  TGHS   C++  + +Y                             
Subjt:  STSSNSPFQGRGWSRNNGRGRFPSSSG-QGKSRFFPNTPTDNNSSR-IPCQICQHTGHSTLDCYNWMNYHY-----------------------------

Query:  ------------------------QGRHHPTQFFA----------MVANQ-------------------NSSYINCQTHH---------QPLLLLG----
                                 GR  P Q             +V NQ                   +++ I C T H           +LL G    
Subjt:  ------------------------QGRHHPTQFFA----------MVANQ-------------------NSSYINCQTHH---------QPLLLLG----

Query:  ------------WLIQVVMHTLPQTLAISLLHQ--NIMVKRMFLLAVDKCFLSL------------TQVPFPKSYSTSMSPLELLHSDVWGPAPEISING
                     L  + ++T+P    + L H   + +       +++ C +               Q+PF  S ST+ SP EL+HSDVWGP P IS  G
Subjt:  ------------WLIQVVMHTLPQTLAISLLHQ--NIMVKRMFLLAVDKCFLSL------------TQVPFPKSYSTSMSPLELLHSDVWGPAPEISING

Query:  FK----------------PLVEN----------------MFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISL
        F+                PL++                  FST +KT+RTDGGG+++N SF SF    GI HQ +C YTP QNG+AERK+RHI+E   SL
Subjt:  FK----------------PLVEN----------------MFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISL

Query:  MTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNMS-NKLLVS
        +     P++ W       I++INRLP+STLQN +PFE L+ K+P + H KIFGC C+P+L PY+  KL P +  CVF+GY  + KGY C + S  ++  S
Subjt:  MTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNMS-NKLLVS

Query:  QHVFFDESTFPFAESPFPTPTFSSTPSPSYYHPPYSLLLSSSPCAENINHLSS
        +HV F E+ FPF  S   T + +S  +P+   PP  LL+  S  + N NH +S
Subjt:  QHVFFDESTFPFAESPFPTPTFSSTPSPSYYHPPYSLLLSSSPCAENINHLSS

PKU84173.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]4.8e-6530.92Show/hide
Query:  YVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQSRSTSSNSPF
        Y+  +K I +K+A     V+ ED+++Y LNGLP  Y  F TS+RT   P++ ++L+ LL SEE  I     +     + MA+FAS               
Subjt:  YVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQSRSTSSNSPF

Query:  QGRGWSRNNGRGRFPSSSGQGKSRFFPNTPTDNNSSRIP--CQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAMVANQNSSYIN------CQTH-----
        +GRG      R RF SS+         N  T   S++IP  CQIC   GH+  DC++ +N  Y  +   T   A+ AN + ++ +        +H     
Subjt:  QGRGWSRNNGRGRFPSSSGQGKSRFFPNTPTDNNSSRIP--CQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAMVANQNSSYIN------CQTH-----

Query:  ----------------------------------------------HQPLLLLGWLIQVVMHTLPQTLAISL----------------------------
                                                      H P L    L+ +   T    LAI+                             
Subjt:  ----------------------------------------------HQPLLLLGWLIQVVMHTLPQTLAISL----------------------------

Query:  -----------LHQNI-----------MVKRMFLLAVD-------------KCFLSLT----QVPFPKSYSTSMSPLELLHSDVWGPAPEISING-----
                   LH  +              +  L ++               C    T    ++PFP S S  ++ L+L+HSDVWGPAP  S+ G     
Subjt:  -----------LHQNI-----------MVKRMFLLAVD-------------KCFLSLT----QVPFPKSYSTSMSPLELLHSDVWGPAPEISING-----

Query:  ---------------------------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSC
                                   FK  +EN+ S +IK++RTDGGG+++++ FT FL  +GI HQ SC +TP+QNG+AERK+RHIIE + +++ R+ 
Subjt:  ---------------------------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSC

Query:  LPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNM-SNKLLVSQHVFF
        LP +FWP      ++LINR+P+ST  N SPFE L N++PDY+HL+IFGCACYP +     HKLQ     CV LGY   YKGY C N+ +NK ++S+HV F
Subjt:  LPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNM-SNKLLVSQHVFF

Query:  DESTFPFAESPFPTPTFSSTPS-----PSYYHPPYSLLLSSSPCAENINHLSSTNTNVPTDPS
        DE  FP+  +    P   ST S     PS  H   S++ S    A   N +SST T   T PS
Subjt:  DESTFPFAESPFPTPTFSSTPS-----PSYYHPPYSLLLSSSPCAENINHLSSTNTNVPTDPS

PRQ17908.1 putative RNA-directed DNA polymerase [Rosa chinensis]5.2e-6730.51Show/hide
Query:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQS
        +K  +SVD Y+ R+K  +++L+++ + + DED++I  L GLP+E++T    ++ +  P+S  EL  LL + ES +E    K   +TS  AM A S    S
Subjt:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQS

Query:  RSTSS--NSPFQ-----------------------------GRGWSRNN-------GRGRFPSSSGQGKSRFFPNTPTDNNSSR----------------
        + +     +P Q                             G G  RNN       G G F ++ G G +  + N   + N  +                
Subjt:  RSTSS--NSPFQ-----------------------------GRGWSRNN-------GRGRFPSSSGQGKSRFFPNTPTDNNSSR----------------

Query:  -----IPCQICQHTGHSTLDCYN-------------------WMNYHYQGRHHPTQFFAMVANQNSSYINC--QTHHQPL--------------------
             I CQIC   GHS   C++                     N H+    +    FA +  Q+++      QTH  P                     
Subjt:  -----IPCQICQHTGHSTLDCYN-------------------WMNYHYQGRHHPTQFFAMVANQNSSYINC--QTHHQPL--------------------

Query:  --------------------LLLGWLIQVVMHTLPQTLAISLLH---------------QNIMVKRM--------------------FLLAVDKCFLSL-
                            +L   L +  ++ +P + + +L +               Q++  KR+                    F+ +   C   L 
Subjt:  --------------------LLLGWLIQVVMHTLPQTLAISLLH---------------QNIMVKRM--------------------FLLAVDKCFLSL-

Query:  ---TQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFK----------------PL----------------VENMFSTRIKTLRTDGGGKFINHSFTS
           T++PFP S + S +P   +HSDVWGP+P +SI GFK                PL                V   FS+ IK L+T+GGG++ +H+F S
Subjt:  ---TQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFK----------------PL----------------VENMFSTRIKTLRTDGGGKFINHSFTS

Query:  FLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPY
        FL ++GI+HQKSC YTPQQNG+AERK+RHI+E +++L+ +S LP +FW    AI +FLINR+P+ TL   SPFE L +K P    LK+FGCAC+P + PY
Subjt:  FLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPY

Query:  NSHKLQPKTTQCVFLGYPLEYKGYFCYNMSN-KLLVSQHVFFDESTFPF---AESPFPTPTFSSTPSPSYY---HPPYSLLLSSSPCAENINHLSSTNTN
        NS+KLQ KT +C+FLGY   YKG+ C+N SN K ++S+HV FDES+FP+   + SP PT   SST  P      HP  S   +SS    +  HL++    
Subjt:  NSHKLQPKTTQCVFLGYPLEYKGYFCYNMSN-KLLVSQHVFFDESTFPF---AESPFPTPTFSSTPSPSYY---HPPYSLLLSSSPCAENINHLSSTNTN

Query:  VPTDPSNT
         P  P++T
Subjt:  VPTDPSNT

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata]7.5e-6629.28Show/hide
Query:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAM---------
        KK ++SV  Y++RIK++++ L+   ++  D+D++I  L GLP+EYNTF T +R R   +S +E    L +EE+ +E  +  +  +T+ +A          
Subjt:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAM---------

Query:  ----FASSPHTQSR-STSSNSP------------------FQGRGWSRNNGRGRF-----PSSSGQG--------------------------------K
             +SS  +QS+  T  ++P                  F+GRG  RNN          PS+S  G                                 
Subjt:  ----FASSPHTQSR-STSSNSP------------------FQGRGWSRNNGRGRF-----PSSSGQG--------------------------------K

Query:  SRFFPNTPTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAM---------------------------VANQN-----------------
          F  ++     SS++ CQIC   GH  + CY+  N+ YQGR  P+   AM                           ++N N                 
Subjt:  SRFFPNTPTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAM---------------------------VANQN-----------------

Query:  ---SSYINCQTHHQP-------------------------------------------------LLLLG----------------WLIQVVMHT------
            S+I   T H P                                                 ++L G                 LIQ   H       
Subjt:  ---SSYINCQTHHQP-------------------------------------------------LLLLG----------------WLIQVVMHT------

Query:  ---LPQTLAISLLHQNI----------MVKRMFL-LAVDK----CFLSL----TQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFK-----------
           L   + I+L HQ +          M+K+  + +++D     C   L     ++PF    + ++ PLE++HSDVWGP+  +SI G+K           
Subjt:  ---LPQTLAISLLHQNI----------MVKRMFL-LAVDK----CFLSL----TQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFK-----------

Query:  -----PLVE----------------NMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPF
             PL+                   FS  +K  ++DGGG++ +H F  +L Q GILHQKSC YTPQQNG+AERKHRHI+E +I+L+  + LP + W  
Subjt:  -----PLVE----------------NMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPF

Query:  VFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTFPFA
          AI ++LINR+   TLQ  SPF+CLF   P  +HLK+FGCAC+P L   NS KLQPKT+QC+F+GY  +YKGY C N ++NK+ VS+HV FDE+TFP++
Subjt:  VFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTFPFA

Query:  ESPFPTPTFSSTPSPSYYHPPYSL
                 S   SPS +  P  L
Subjt:  ESPFPTPTFSSTPSPSYYHPPYSL

TrEMBL top hitse value%identityAlignment
A0A2N9EFT0 Uncharacterized protein6.6e-8434.21Show/hide
Query:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKL--AKKDDLLTSPMAMFASSPHTQ
        K ++ VD +++R+KE ++KL  V + ++DE++L   L GLPTE+++  +++RTR  P+SF+EL VLL +EES+++    A KD  L + ++     P   
Subjt:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKL--AKKDDLLTSPMAMFASSPHTQ

Query:  SRSTSSNSPFQGRGWSRNNGRGRFPSSSGQG---KSRFFPNT--PTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAMVA---------N
        +    +NS  +GRG   NNGRGR   ++G+G      F  NT   + N S R  CQIC   GH  LDCY+ M+Y YQGRH P +  A+ +         N
Subjt:  SRSTSSNSPFQGRGWSRNNGRGRFPSSSGQG---KSRFFPNT--PTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAMVA---------N

Query:  QNSSYINCQTHH-------------------QPL------LLLGWLIQVVMHTL---------------PQTLAISLLHQNIMVKRMFLLAVDK------
        QN+S       +                   QPL          W+       L               P +  +S+  Q+I   R       K      
Subjt:  QNSSYINCQTHH-------------------QPL------LLLGWLIQVVMHTL---------------PQTLAISLLHQNIMVKRMFLLAVDK------

Query:  ------------------------------------------CFL----------------------------SLTQVPFPKSYSTSMSPLELLHSDVWG
                                                  C L                             L Q PFP S  T+ +PLEL+HSDVWG
Subjt:  ------------------------------------------CFL----------------------------SLTQVPFPKSYSTSMSPLELLHSDVWG

Query:  PAPEISING--------------------------------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHR
        PAP  SING                                F   +EN+ +TRIK LRTD GG++ N +F SF +  GILHQ SC +TPQQNG+AERKHR
Subjt:  PAPEISING--------------------------------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHR

Query:  HIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN
        HI+E +++L++ S LP ++WP+ F+  I+LINR+P+  L+  SP++ LF+  PDY+ LK FGC C+P L PYN HKL+P+++ CVFLGY L  KGY C N
Subjt:  HIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN

Query:  M-SNKLLVSQHVFFDESTFPFAESPFP----TPTFSSTPSPSYYHP
        + ++KLL+S+HV F E++FPF     P    TP+ +   S  Y+HP
Subjt:  M-SNKLLVSQHVFFDESTFPFAESPFP----TPTFSSTPSPSYYHP

A0A2N9F9F8 Uncharacterized protein1.0e-8136.38Show/hide
Query:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEE-SAIEKLAKKDDLLTSPMAMFASSPHTQ
        KK  ES+++Y++++K  ++KL  V  L+++E+LL   L GLP EY  F +++RTR +PV+FEE+ VLL++EE SA E      D  + PMAMFAS+P+ +
Subjt:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEE-SAIEKLAKKDDLLTSPMAMFASSPHTQ

Query:  SRSTSS-----NSPFQGRGWSRNNGR----GRFPSS-------SGQGKSRFFPNTPTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAMV
        + ++ S     N+ F+GRG  RNN +    GRF +S       S QG S+F    P     SR  CQIC   GH  LDCY+ M++ YQGRH P +  AM 
Subjt:  SRSTSS-----NSPFQGRGWSRNNGR----GRFPSS-------SGQGKSRFFPNTPTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAMV

Query:  ANQNSS-------------------YINCQT------HHQPL----LLLGWLIQ---VVMHTLPQTLAIS------------------------LLHQNI
        +  N S                     N QT        Q L    +L   L +     +HTLP + ++S                        L H + 
Subjt:  ANQNSS-------------------YINCQT------HHQPL----LLLGWLIQ---VVMHTLPQTLAIS------------------------LLHQNI

Query:  MVKRMFLLAVDKCFL----------------SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------F
         V    + A+  C                   + ++PF  S   S  PLEL+HSDVWGPAP  S NG                                F
Subjt:  MVKRMFLLAVDKCFL----------------SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------F

Query:  KPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNK
        K  VEN  S  IK LRTD GG++ +++FT F +  GI HQ SC +TPQQNG  ERKHRHIIE +++L++ + LP   W +     I LINRLP+  L +K
Subjt:  KPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNK

Query:  SPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTFPFAESPFPTPTFSST----PSPS
        SP+E LF+K PD  HL+ FGC C+P+L PYN HKLQP+TT C+FLGYP   KGY   N  + +  +S+HV F+E+ F       P  T SS     P P 
Subjt:  SPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTFPFAESPFPTPTFSST----PSPS

Query:  YYH----PPYSLLLSSSPCAENINHLSSTNTNVPT
          H    P   + L + P A       S + +VP+
Subjt:  YYH----PPYSLLLSSSPCAENINHLSSTNTNVPT

A0A2N9FT93 Uncharacterized protein2.6e-8034.83Show/hide
Query:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQS
        KK  ES+ ++++++K  ++KL  V  L+++E+LL   L GLP EY  F +++RTR +PV+FEE+ VLL++EE +I + +     L S MAMFAS+  + +
Subjt:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASSPHTQS

Query:  RSTSSNSPF-----QGRGWSRNN---GR-GRFPS----SSGQGKSRFFPNTPTDNNS--------SRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQF
        RS++S S F     Q RG  RNN   GR GRF S    +S Q   ++ P T   NNS        SR  CQIC   GH  LDCY+ M++ YQGRH P + 
Subjt:  RSTSSNSPF-----QGRGWSRNN---GR-GRFPS----SSGQGKSRFFPNTPTDNNS--------SRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQF

Query:  FAMVANQNSSYINCQTHHQ---------PLLLLG-------------WLIQVV-----------------MHTLPQTLAIS-------------------
         AM +  N S    +   Q         P+  +G             +LIQ +                 +HT   + +++                   
Subjt:  FAMVANQNSSYINCQTHHQ---------PLLLLG-------------WLIQVV-----------------MHTLPQTLAIS-------------------

Query:  --LLHQNIMVKRMFLLAVDKCFL----------------SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISING-------------------------
          L H +  V    L ++  C                   + ++PF +S   S  PLEL+HSDVWGPAP  S NG                         
Subjt:  --LLHQNIMVKRMFLLAVDKCFL----------------SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISING-------------------------

Query:  -------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRL
               FK  VEN  ST+IK LRTD GG++ +++FT F +  GI HQ SC +TPQQNGI ERKHRHI+E ++++++ + LP  +W +  +  + LINRL
Subjt:  -------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRL

Query:  PSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTF-------PFAESPFP
        P+  L + SP+E LF+K PD  HLK FGC C+P+L PYN+HKLQP++T C+FLGYP   KGY C + +S+++ +S+H  F+ES F         A +   
Subjt:  PSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTF-------PFAESPFP

Query:  TPTFSSTP------------------SPSYYHPPYSLLLSSSPCAENINHLSSTNTNVPTDPSNTI
        T TF   P                  SPS + P    L  S P    I   S +   +PT PS +I
Subjt:  TPTFSSTP------------------SPSYYHPPYSLLLSSSPCAENINHLSSTNTNVPTDPSNTI

A0A2N9GJ13 Uncharacterized protein1.5e-8337.74Show/hide
Query:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEE-SAIEKLAKKDDLLTSPMAMFASSPHTQ
        KK  ES+ +Y++++K  ++KL  V  L+++E+LL   L GL  EY  F +++RTR +PV+FEE+ VLL++EE SA E      D  + PMAMFAS+P+ +
Subjt:  KKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEE-SAIEKLAKKDDLLTSPMAMFASSPHTQ

Query:  SRSTSS-----NSPFQGRGWSRNN---GRG--------------RFPSSSGQGKSRFFPNTPTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPT
        + ++ S     N+ F+GRG  RNN   GRG              +FP SS QG S+F    P     SR  CQIC   GH  LDCY+ M++ YQGRH P 
Subjt:  SRSTSS-----NSPFQGRGWSRNN---GRG--------------RFPSSSGQGKSRFFPNTPTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPT

Query:  Q-----------FFAMVANQNSSYINCQTHHQPLLLLGWLIQVV--MHTLPQTLAIS----------------------LLH-------QNIMVKRMFLL
        +            +AMV    S+ +        +L  G     +  +HTLP + ++S                      L H         ++V  M  L
Subjt:  Q-----------FFAMVANQNSSYINCQTHHQPLLLLGWLIQVV--MHTLPQTLAIS----------------------LLH-------QNIMVKRMFLL

Query:  A-------------VDKCFLS-LTQVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------FKPLVENMFST
        +                C +S + ++PF  S   S  PLEL+HSDVWGPAP  S NG                                FK  +EN  S 
Subjt:  A-------------VDKCFLS-LTQVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------FKPLVENMFST

Query:  RIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKR
         IK LRTD GG++ +++FT F +  GI HQ SC +TPQQNG  ERKHRHIIE +++L++ + LP   W +   I I LINRLP+  L +KSP+E LF+K 
Subjt:  RIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKR

Query:  PDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTFPFAESPFPTPTFSSTP
        PD  HL+ FGC C+P++ PYN HKLQP+TT C+FLGYP   KGY C N  + +  +S+HV F+E+ F       P  T SS P
Subjt:  PDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDESTFPFAESPFPTPTFSSTP

A0A5J5A1U7 Integrase catalytic domain-containing protein2.1e-8233.48Show/hide
Query:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASS--PHTQ
        K  +S+D+Y+++IK+ ++ LA+VS+L+ DED+LIY LNGLP EYN F TS+RT+++ ++ EE++ +LK EE  IE + K+++    P AM A++  P+  
Subjt:  KPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKKDDLLTSPMAMFASS--PHTQ

Query:  SRSTSSNSPFQGRGWSRNNGRGRFPSSSGQGKS--RF-FPN-------TPT---------DNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFF
        S    S S F GRG     GRGRF +  G+  S  RF  PN        PT          NNS  + CQIC   GHS LDCY+ M++ YQG+    Q  
Subjt:  SRSTSSNSPFQGRGWSRNNGRGRFPSSSGQGKS--RF-FPN-------TPT---------DNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFF

Query:  AMVANQNS-----------------------------------------------------SYINCQTH-------------------------------
        AM A  N+                                                     S I+   H                               
Subjt:  AMVANQNS-----------------------------------------------------SYINCQTH-------------------------------

Query:  ----------------------------------------------------------HQPL-----------LLLGWLIQVVM------HTLPQTLAIS
                                                                  H PL             LG  +  V+      H    TL   
Subjt:  ----------------------------------------------------------HQPL-----------LLLGWLIQVVM------HTLPQTLAIS

Query:  LLHQNIMVKRMFLLAVDKCFL-SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFKPLVE--NMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILH
        L   +I   R        C +  +T++PFP S + S +PL+L+HSD+WGPAP  S + F   V   + FS      R+DGGG++     T  L QSGI H
Subjt:  LLHQNIMVKRMFLLAVDKCFL-SLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFKPLVE--NMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILH

Query:  QKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKT
        ++SC +TPQQNGIAERKHRHI+E  ++L++R+ LP ++W   F+   +LINR+P+  L + SP+E LF+  PDY  LK FG ACYP L PY+ +K+QPKT
Subjt:  QKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKT

Query:  TQCVFLGYPLEYKGYFCYNMS-NKLLVSQHVFFDESTFPFAE-SPFPTPTFSSTPSPSYYHPPYSLLLSSSPCAENINHLSSTNT-----NVPTDPSNT
        TQC FLGY L YKG++C + S NK+ V++HV FDE+T+PF   +  PT T++     ++  PP SLL S+   A + +H  ST T     +VP  PS+T
Subjt:  TQCVFLGYPLEYKGYFCYNMS-NKLLVSQHVFFDESTFPFAE-SPFPTPTFSSTPSPSYYHPPYSLLLSSSPCAENINHLSSTNT-----NVPTDPSNT

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.1e-1726.52Show/hide
Query:  PLELLHSDVWGPAPEISIN--------------------------------GFKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTP
        PL ++HSDV GP   ++++                                 F    E  F+ ++  L  D G +++++    F  + GI +  +  +TP
Subjt:  PLELLHSDVWGPAPEISIN--------------------------------GFKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTP

Query:  QQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTL--QNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFL
        Q NG++ER  R I E + ++++ + L   FW        +LINR+PS  L   +K+P+E   NK+P   HL++FG   Y  +      K   K+ + +F+
Subjt:  QQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTL--QNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFL

Query:  GYPLEYKGYFCYNMSN-KLLVSQHVFFDES
        GY  E  G+  ++  N K +V++ V  DE+
Subjt:  GYPLEYKGYFCYNMSN-KLLVSQHVFFDES

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-2731.54Show/hide
Query:  QVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQ
        +V F  S    ++ L+L++SDV GP    S+ G                                F  LVE     ++K LR+D GG++ +  F  + + 
Subjt:  QVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQ

Query:  SGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHK
         GI H+K+   TPQ NG+AER +R I+E   S++  + LP  FW        +LINR PS  L  + P     NK   Y+HLK+FGC  +  +      K
Subjt:  SGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHK

Query:  LQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDES
        L  K+  C+F+GY  E  GY  ++ +  K++ S+ V F ES
Subjt:  LQPKTTQCVFLGYPLEYKGYFCYN-MSNKLLVSQHVFFDES

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-4942.86Show/hide
Query:  FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQN
        FK L+EN F TRI T  +D GG+F+  +   + +Q GI H  S  +TP+ NG++ERKHRHI+E  ++L++ + +P  +WP+ FA+ ++LINRLP+  LQ 
Subjt:  FKPLVENMFSTRIKTLRTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQN

Query:  KSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNM-SNKLLVSQHVFFDESTFPFAE------------------
        +SPF+ LF   P+Y+ L++FGCACYP+L PYN HKL  K+ QCVFLGY L    Y C ++ +++L +S+HV FDE+ FPF+                   
Subjt:  KSPFECLFNKRPDYNHLKIFGCACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNM-SNKLLVSQHVFFDESTFPFAE------------------

Query:  -SPFPT-PTFSST-PSPSYYHPPYSLLLSSSPCAENIN-HLSSTN
         SP  T PT +   P+PS   P ++    SSP A   N  +SS+N
Subjt:  -SPFPT-PTFSST-PSPSYYHPPYSLLLSSSPCAENIN-HLSSTN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-5439.24Show/hide
Query:  LLAVDKCFLSLT-QVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------FKPLVENMFSTRIKTLRTDGGG
        LL+   CF++ + +VPF  S  TS  PLE ++SDVW  +P +SI+                                 FK LVEN F TRI TL +D GG
Subjt:  LLAVDKCFLSLT-QVPFPKSYSTSMSPLELLHSDVWGPAPEISING--------------------------------FKPLVENMFSTRIKTLRTDGGG

Query:  KFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGC
        +F+      +L+Q GI H  S  +TP+ NG++ERKHRHI+E+ ++L++ + +P  +WP+ F++ ++LINRLP+  LQ +SPF+ LF + P+Y  LK+FGC
Subjt:  KFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGC

Query:  ACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNM-SNKLLVSQHVFFDESTFPFAESPFPTPTFSSTPSPSYYH-------PPYSLLLSSSPCAEN
        ACYP+L PYN HKL+ K+ QC F+GY L    Y C ++ + +L  S+HV FDE  FPF+ + F   T     S S  +       P   L+L + PC   
Subjt:  ACYPFLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNM-SNKLLVSQHVFFDESTFPFAESPFPTPTFSSTPSPSYYH-------PPYSLLLSSSPCAEN

Query:  INHLSSTNTNVPTDPS
          HL  T+   P+ PS
Subjt:  INHLSSTNTNVPTDPS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.8e-0526.97Show/hide
Query:  DESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKK-----DDLLTSPMAMFASSPHT
        D  V  Y +++K++ + L NV + V D +L++Y LNGL  +++     ++ R    SF++   +L+ EE  +++  K      D   +S +   + +P  
Subjt:  DESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESAIEKLAKK-----DDLLTSPMAMFASSPHT

Query:  QS--RSTSSNSPFQGRGWSRNNGRGRFPSSSGQGKSRFFPNTPTDNNSSRIP
         +  RS  +   ++GRG   N  RGR       G    + N PT N+ +R P
Subjt:  QS--RSTSSNSPFQGRGWSRNNGRGRFPSSSGQGKSRFFPNTPTDNNSSRIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGCACAGTGTTCTAGAGTCTATTTTTACTACATTTTTACCCACTCTGGAAACCCAACACCTATATTGTTGGCAGCAAAACATCGAAACAGGAATCCTGTTGCTAA
GAAACCTGATGAATCCGTGGATGCTTATGTGAAACGAATTAAAGAAATCAAAGAAAAATTAGCAAATGTCTCAATCTTGGTTAATGACGAAGATCTCCTCATTTATACTC
TTAATGGCTTACCAACAGAATATAATACATTCTGGACTTCGATGCGTACTCGTGCTCAGCCTGTTTCCTTCGAGGAACTTCATGTCCTTTTGAAGTCTGAAGAATCTGCC
ATAGAAAAATTGGCCAAGAAAGATGATCTTTTAACATCACCAATGGCTATGTTTGCTTCTTCGCCACACACTCAATCTCGCTCTACATCTTCTAATTCTCCCTTCCAAGG
ACGAGGTTGGAGTAGAAACAATGGTCGTGGCAGATTCCCTTCTAGTTCTGGTCAAGGTAAAAGTCGTTTTTTTCCTAATACACCTACTGATAATAATTCTTCGCGCATAC
CTTGTCAGATCTGCCAGCACACTGGGCATTCTACCTTAGACTGTTACAACTGGATGAACTACCATTATCAAGGCCGTCATCATCCTACTCAATTTTTTGCAATGGTGGCT
AACCAAAATTCATCCTACATCAATTGTCAAACTCATCATCAGCCTCTACTACTCCTTGGTTGGTTGATTCAGGTTGTAATGCACACATTACCTCAGACCTTGGCAATCTC
TTTGTTGCATCAGAATATAATGGTGAAGAGGATGTTTCTGTTGGCAGTGGACAAATGCTTCTTATCTCTCACACAGGTGCCATTTCCAAAATCTTATTCTACTTCCATGT
CTCCATTAGAGCTTTTGCACAGTGATGTATGGGGTCCTGCTCCTGAAATATCAATCAATGGTTTTAAACCTTTAGTTGAGAACATGTTTTCTACTCGTATAAAAACCCTT
CGAACTGATGGTGGTGGAAAATTTATAAATCATTCTTTTACCTCATTTTTAAACCAGTCTGGAATCCTTCACCAAAAGTCTTGTGCCTACACACCACAACAAAATGGTAT
AGCTGAGCGAAAGCATCGTCATATTATTGAAGTTTCTATTTCTTTAATGACTCGTTCTTGTCTTCCAACTCGTTTTTGGCCCTTTGTGTTTGCTATAGTCATCTTTCTTA
TTAACCGTTTACCATCTTCCACTCTTCAAAATAAGTCACCATTTGAATGTTTGTTCAATAAACGTCCTGACTATAATCACCTCAAAATATTCGGTTGTGCATGTTATCCC
TTTCTCAATCCTTATAATTCTCATAAATTACAACCTAAAACCACCCAATGTGTATTCCTTGGTTATCCTCTCGAGTACAAAGGGTATTTTTGTTACAACATGAGCAACAA
ACTTCTTGTTTCTCAACATGTCTTTTTCGATGAATCTACTTTTCCTTTTGCTGAATCACCCTTTCCTACTCCTACTTTCTCGTCCACTCCATCTCCTTCGTATTATCATC
CTCCATACTCTTTGCTGCTTTCTTCATCTCCTTGTGCTGAGAATATAAATCATTTGTCTTCTACTAATACAAATGTTCCTACTGATCCTTCTAATACTATTAATGTTGAT
GTTATTGCCAATGTTGATGTGGTTGCTGATGCTACGGCCAATGTTGTTTCTGATACTAATGTGGTTAATACTACTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGCACAGTGTTCTAGAGTCTATTTTTACTACATTTTTACCCACTCTGGAAACCCAACACCTATATTGTTGGCAGCAAAACATCGAAACAGGAATCCTGTTGCTAA
GAAACCTGATGAATCCGTGGATGCTTATGTGAAACGAATTAAAGAAATCAAAGAAAAATTAGCAAATGTCTCAATCTTGGTTAATGACGAAGATCTCCTCATTTATACTC
TTAATGGCTTACCAACAGAATATAATACATTCTGGACTTCGATGCGTACTCGTGCTCAGCCTGTTTCCTTCGAGGAACTTCATGTCCTTTTGAAGTCTGAAGAATCTGCC
ATAGAAAAATTGGCCAAGAAAGATGATCTTTTAACATCACCAATGGCTATGTTTGCTTCTTCGCCACACACTCAATCTCGCTCTACATCTTCTAATTCTCCCTTCCAAGG
ACGAGGTTGGAGTAGAAACAATGGTCGTGGCAGATTCCCTTCTAGTTCTGGTCAAGGTAAAAGTCGTTTTTTTCCTAATACACCTACTGATAATAATTCTTCGCGCATAC
CTTGTCAGATCTGCCAGCACACTGGGCATTCTACCTTAGACTGTTACAACTGGATGAACTACCATTATCAAGGCCGTCATCATCCTACTCAATTTTTTGCAATGGTGGCT
AACCAAAATTCATCCTACATCAATTGTCAAACTCATCATCAGCCTCTACTACTCCTTGGTTGGTTGATTCAGGTTGTAATGCACACATTACCTCAGACCTTGGCAATCTC
TTTGTTGCATCAGAATATAATGGTGAAGAGGATGTTTCTGTTGGCAGTGGACAAATGCTTCTTATCTCTCACACAGGTGCCATTTCCAAAATCTTATTCTACTTCCATGT
CTCCATTAGAGCTTTTGCACAGTGATGTATGGGGTCCTGCTCCTGAAATATCAATCAATGGTTTTAAACCTTTAGTTGAGAACATGTTTTCTACTCGTATAAAAACCCTT
CGAACTGATGGTGGTGGAAAATTTATAAATCATTCTTTTACCTCATTTTTAAACCAGTCTGGAATCCTTCACCAAAAGTCTTGTGCCTACACACCACAACAAAATGGTAT
AGCTGAGCGAAAGCATCGTCATATTATTGAAGTTTCTATTTCTTTAATGACTCGTTCTTGTCTTCCAACTCGTTTTTGGCCCTTTGTGTTTGCTATAGTCATCTTTCTTA
TTAACCGTTTACCATCTTCCACTCTTCAAAATAAGTCACCATTTGAATGTTTGTTCAATAAACGTCCTGACTATAATCACCTCAAAATATTCGGTTGTGCATGTTATCCC
TTTCTCAATCCTTATAATTCTCATAAATTACAACCTAAAACCACCCAATGTGTATTCCTTGGTTATCCTCTCGAGTACAAAGGGTATTTTTGTTACAACATGAGCAACAA
ACTTCTTGTTTCTCAACATGTCTTTTTCGATGAATCTACTTTTCCTTTTGCTGAATCACCCTTTCCTACTCCTACTTTCTCGTCCACTCCATCTCCTTCGTATTATCATC
CTCCATACTCTTTGCTGCTTTCTTCATCTCCTTGTGCTGAGAATATAAATCATTTGTCTTCTACTAATACAAATGTTCCTACTGATCCTTCTAATACTATTAATGTTGAT
GTTATTGCCAATGTTGATGTGGTTGCTGATGCTACGGCCAATGTTGTTTCTGATACTAATGTGGTTAATACTACTTTGTAA
Protein sequenceShow/hide protein sequence
MLAQCSRVYFYYIFTHSGNPTPILLAAKHRNRNPVAKKPDESVDAYVKRIKEIKEKLANVSILVNDEDLLIYTLNGLPTEYNTFWTSMRTRAQPVSFEELHVLLKSEESA
IEKLAKKDDLLTSPMAMFASSPHTQSRSTSSNSPFQGRGWSRNNGRGRFPSSSGQGKSRFFPNTPTDNNSSRIPCQICQHTGHSTLDCYNWMNYHYQGRHHPTQFFAMVA
NQNSSYINCQTHHQPLLLLGWLIQVVMHTLPQTLAISLLHQNIMVKRMFLLAVDKCFLSLTQVPFPKSYSTSMSPLELLHSDVWGPAPEISINGFKPLVENMFSTRIKTL
RTDGGGKFINHSFTSFLNQSGILHQKSCAYTPQQNGIAERKHRHIIEVSISLMTRSCLPTRFWPFVFAIVIFLINRLPSSTLQNKSPFECLFNKRPDYNHLKIFGCACYP
FLNPYNSHKLQPKTTQCVFLGYPLEYKGYFCYNMSNKLLVSQHVFFDESTFPFAESPFPTPTFSSTPSPSYYHPPYSLLLSSSPCAENINHLSSTNTNVPTDPSNTINVD
VIANVDVVADATANVVSDTNVVNTTL