; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035828 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035828
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:31523142..31529870
RNA-Seq ExpressionLag0035828
SyntenyLag0035828
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016646912.1 PREDICTED: uncharacterized protein LOC103318979 [Prunus mume]1.3e-20843.74Show/hide
Query:  PADPQNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDE----
        P    + + RP I A +FE+KP M  MLQ    F GL +EDP++HL  FL + ++    GV  DAIRL  F +SL+D AK WL S    SIRTWD+    
Subjt:  PADPQNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDE----

Query:  ----------------------QLEDETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWS
                              Q + E   EAWERFK+LLRKCPHH LP  IQ++ FYNGL   ++ +VDA+AGGAL+AKT  EA+E+LE +++N+ QW 
Subjt:  ----------------------QLEDETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWS

Query:  DVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKNVTVISHQQP------------PAMEPTAG-------------------------------------
          R  N K   VLEVD ++ +   ++ +   + +++V S                P+ E T G                                     
Subjt:  DVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKNVTVISHQQP------------PAMEPTAG-------------------------------------

Query:  ------LAQRP------------------QLRMGRTRKQCASTTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPRME-----------
                QRP                  QL M  T+    + T+    Q+ QAS++ LE+QVGQLAN +  R QG  PS  E +P+ +           
Subjt:  ------LAQRP------------------QLRMGRTRKQCASTTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPRME-----------

Query:  --------------GKEQGAGGSKKDVGASGSIP------------NVEPPYVSPPPYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQM
                       KE+ A     ++G + S P            ++  P +   PYVP +PFPQR +    DGQF KFLE+ ++L INIP  E  EQM
Subjt:  --------------GKEQGAGGSKKDVGASGSIP------------NVEPPYVSPPPYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQM

Query:  LNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA--
         +YAKF+KDIL+KK++ GE E +  TEECSAIL+  LPPK KD GSF I   IG     +ALCDLG+SINL+PLSV +K+GIGE +PTTV+LQ+ADR+  
Subjt:  LNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA--

Query:  -----------------------------------------------LIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRII------------
                                                       LIDV++  LT+RV NE+  F VF+A+K+P E EDC  I +I            
Subjt:  -----------------------------------------------LIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRII------------

Query:  KSTVIETTIQDSASKHSE-----EHGESLDLDQ------------RKATPIKS--SLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALI
         S  +E+T+  +A+   +     E+   LD  Q              A P K+  S+I APTL LKPLP HL+Y YLG  ETLP+I+A++L    EE ++
Subjt:  KSTVIETTIQDSASKHSE-----EHGESLDLDQ------------RKATPIKS--SLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALI

Query:  KLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPT
        ++L++++ AIGWT+A I+GISPS CMH+I +EE    S+E QRRLNP MK+VV+ EV+KLLDAGIIYPI+D++WVSP Q VPKKGG+TVV N++NEL+PT
Subjt:  KLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPT

Query:  RTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYM
        RTVTGWRVC+DYR+LN ATRKDHFPLPFIDQML+RLAG AYYCFLDGYSGYNQI IAPEDQEKTTFTCP+GTFA+RRMPFGLCNAP TFQRCM +IFS M
Subjt:  RTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYM

Query:  IESTVE
        +E  +E
Subjt:  IESTVE

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]1.6e-20642.5Show/hide
Query:  LDPEIERTFRIRRREQRRNKMENVSRLPQGPEGPADPQ----------------NGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLG
        LD EIE+T +  RR+ + ++  + +    G +    P+                + +ARP I A +F +     Q ++   +F+GLS+EDP+ HL++FL 
Subjt:  LDPEIERTFRIRRREQRRNKMENVSRLPQGPEGPADPQ----------------NGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLG

Query:  VSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWD--------------------------EQLEDETFSEAWERFKELLRKCPHHGLPHC
        + ++F + GVP + IRL  F  SL   A+ WL+S    SI TW+                          +Q + E+  EA+ERFK+LLRKCPHHGL   
Subjt:  VSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWD--------------------------EQLEDETFSEAWERFKELLRKCPHHGLPHC

Query:  IQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVD---------------------GVSTIRPDLAMIAN
         ++  FYNGL       VD +A G+L  +  ++A+EILE I++N+C+  D R ++KKV  V EVD                      +S  R      A+
Subjt:  IQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVD---------------------GVSTIRPDLAMIAN

Query:  ALKNVTVI--------------------------------SHQQ----------------------------PPAMEPTAGLAQRPQLRMGRTRKQCAS-
         ++N+ V                                 S+QQ                            P       G  Q P+ +    +      
Subjt:  ALKNVTVI--------------------------------SHQQ----------------------------PPAMEPTAGLAQRPQLRMGRTRKQCAS-

Query:  -----TTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPRMEGKE-----------QGAGGSKK--DVGASGSIPNVEPPYVSPPP----
                DA IQS  ASMRALEMQVGQLA+ +  RP G LPS+TE +P+ + +E           +  G +KK  D G    + N EP  +S P     
Subjt:  -----TTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPRMEGKE-----------QGAGGSKK--DVGASGSIPNVEPPYVSPPP----

Query:  -----YVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVP
               PP PFPQR + + QD QF+KF+++ K+L INIP  E  EQM +Y KF+KDIL++K+RL EFETV+ TEECSAIL+  LPPK KD GSFTI   
Subjt:  -----YVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVP

Query:  IGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA-------------------------------------------------LIDVQ
        IG +  GKALCDLGAS+NLMPLS++ KLG+GE +PT+V LQLADR+                                                 LIDVQ
Subjt:  IGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA-------------------------------------------------LIDVQ

Query:  KRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRII-------------KSTVIETTIQDSASKHSEEHGE------SLDLDQR------------KATP
        K ELTMRV +E+V FNVF AMK+  + E C  +                 +  +E +++++  + +EE  E      +L   +R            K+  
Subjt:  KRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRII-------------KSTVIETTIQDSASKHSEEHGE------SLDLDQR------------KATP

Query:  IKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDV
         K S+ E P L+LK LP HLKY +LGE  TLP+I++S L +EHEE L+++L++Y++AIGW +A I+GISPSFCMHKI++E+    +IE QRRLNP MK+V
Subjt:  IKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDV

Query:  VKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN
        VKKE+IK LDAGIIYPI+D++WVSP+QCVPKKGG+TVV+N+ NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGY+
Subjt:  VKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN

Query:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        QI IAPEDQEKTTFTCP+GTFAFR++ FGLCNAP+TFQRCM AIFS MIE  VE
Subjt:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]2.1e-20350.32Show/hide
Query:  MEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRSVLEVDGVSTIRPDLAMIANALKNVTVISHQQPPAMEPTAGL
        ME FYNGL   T+ +VDASA GA+L+KT++EAYEILERI++N+CQW+DVR    +K R VLEVD +S+I   LA + N L+N+ +       A   TA +
Subjt:  MEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRSVLEVDGVSTIRPDLAMIANALKNVTVISHQQPPAMEPTAGL

Query:  AQRPQLRMGRTRKQCASTTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTEHPRMEGKEQ-------------GAGGSKKDVGASGSIPNVE-
          +         K+  +   DA IQS QAS+R LE+QVGQLANEL+ RP  KLP+DTE P+ EG EQ               G   K+ G S S    + 
Subjt:  AQRPQLRMGRTRKQCASTTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTEHPRMEGKEQ-------------GAGGSKKDVGASGSIPNVE-

Query:  ------------------------------PPYVSPPP-------------YVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKF
                                         V PP              Y P  PFPQR K + ++  F+KF++I K++HINIPLVE  +QM NY KF
Subjt:  ------------------------------PPYVSPPP-------------YVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKF

Query:  LKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA--------
        LKD+LT +++  EF+ V   EECSAILKN +P K KD GSFTI + IGGK+LG+ALCDLG+SINLMPLS+Y+KLGIGEARPTTVTLQLADR+        
Subjt:  LKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA--------

Query:  -----------------------------------------LIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTVIETTIQDSASKHSE
                                                 L+DV K  +T+R+ +++V+FN+  +MKYP   E+CS +  +          D  S   E
Subjt:  -----------------------------------------LIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTVIETTIQDSASKHSE

Query:  EHG------------------ESLDLDQRKATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQG
        +                    ESL+ + RK++P++ S+ EAP LDLKPLP +LKY YLG+ +TLPII+++ L S  E+ L++ L++++ AIGWTLA I+G
Subjt:  EHG------------------ESLDLDQRKATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQG

Query:  ISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT
        ISPS CMHKI LEEG  +SIEQQRRLNP MK+VV+KE++K LDAGIIYPIA+++ VSP+QCVPKKGG+TV++N++NELI TR V GWR+CMDYRRLNKAT
Subjt:  ISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT

Query:  RKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        RKDHFPLPFIDQMLDRLAG+++YCFLDGYSGYNQITI+PEDQEKTTFTCPYG FAFRRMPFGLCNAP TFQRCM AIF+ M+E+ +E
Subjt:  RKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]9.0e-20242.25Show/hide
Query:  QNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWD---------
        Q  + RP + A +FE+KP + QM+Q   QF G  SEDPH HL +FL + ++  + GV  DAIRL  F +SL+D AKAWLNS AP S  TW+         
Subjt:  QNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWD---------

Query:  -----------------EQLEDETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRG
                          Q + E+  EAWERFK+L RKCPHHGLP  + ++ FYNGL    +  +DA+AGG L++K+ +EAYE+LE +++N+ QWS+ R 
Subjt:  -----------------EQLEDETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRG

Query:  TNKKVRSVLEVDGVSTIR---PDLAMIANALKNVTVIS------------------------------HQQ-----------------------------
          KKV  + +VDG++ +      L  + + L NV  +S                               QQ                             
Subjt:  TNKKVRSVLEVDGVSTIR---PDLAMIANALKNVTVIS------------------------------HQQ-----------------------------

Query:  -----PPAMEPTAGLAQRPQLRMGRTRKQCASTTK-----DATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPR--------MEGKEQGAGG
             PP  +P     +  Q       K   ++++     +A +    +S R +E+Q+GQLAN + +R QG LPS TE +P+          GK+ G   
Subjt:  -----PPAMEPTAGLAQRPQLRMGRTRKQCASTTK-----DATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPR--------MEGKEQGAGG

Query:  SKKDVGASGSIPNV------------EPPYVSPP--PYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEF
         +  VG       V            + P   PP  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +   Q+  Y KFLK+I++KK++L +F
Subjt:  SKKDVGASGSIPNV------------EPPYVSPP--PYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEF

Query:  ETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADR-----------------------
        ET++ TEECSAI++N LPPK +D GSF+I   IG  +  +ALCDLGAS++LMPLSV RKLG+ E +PTT++LQLADR                       
Subjt:  ETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADR-----------------------

Query:  --------------------------ALIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTVIE-----------TTIQDSASKHSEEHG
                                  A+IDV+   LT++V  EEV+FN+F+A KYP   +    + ++  +  E            T   SA    +++ 
Subjt:  --------------------------ALIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTVIE-----------TTIQDSASKHSEEHG

Query:  ESL---------------------DLDQRKATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQG
        E                       D+ + K  P  S+ ++AP L+LKPLP HL Y +LGE  TLP+IV+  L +E  + LI++L+  +KAIGWT++ ++G
Subjt:  ESL---------------------DLDQRKATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQG

Query:  ISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT
        ISPS CMH+I +E+     +E QRRLNP MK+VV+ EV+K LDAGIIYPI+D+ W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  T
Subjt:  ISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT

Query:  RKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        RKDHFPLPF+DQ+L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM AIFS  +E  +E
Subjt:  RKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]1.8e-20242.45Show/hide
Query:  QNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWD---------
        Q  + RP + A +FE+KP + QM+Q   QF G  SEDPH HL +FL + ++  + GV  DAIRL  F +SL+D AKAWLNS AP S  TW+         
Subjt:  QNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWD---------

Query:  -----------------EQLEDETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRG
                          Q + E+  EAWERFK+L RKCPHHGLP  + ++ FYNGL    +  +DA+AGG L++K+ +EAYE+LE +++N+ QWS+ R 
Subjt:  -----------------EQLEDETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRG

Query:  TNKKVRSVLEVDGVSTIR---PDLAMIANALKNVTVIS------------------------------HQQ-----------------------------
          KKV  + +VDG++ +      L  +   L NV  +S                               QQ                             
Subjt:  TNKKVRSVLEVDGVSTIR---PDLAMIANALKNVTVIS------------------------------HQQ-----------------------------

Query:  -----PPAMEPTAGLAQRPQLRMGRTRKQCASTTK-----DATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPR--------MEGKEQGAGG
             PP  +P     +  Q       K   ++++     +A +    +S R +EMQ+GQLAN + +R QG LPS TE +P+          GK+ G   
Subjt:  -----PPAMEPTAGLAQRPQLRMGRTRKQCASTTK-----DATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPR--------MEGKEQGAGG

Query:  SKKDVGASGSIPNV------------EPPYVSPP--PYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEF
        S+  VG       V            + P   PP  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +   Q+  Y KFLK+I++KK++L +F
Subjt:  SKKDVGASGSIPNV------------EPPYVSPP--PYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEF

Query:  ETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADR-----------------------
        ET++ TEECSAI++N LPPK +D GSF+I   IG  +  +ALCDLGAS++LMPLSV RKLG+ E +PTT++LQLADR                       
Subjt:  ETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADR-----------------------

Query:  --------------------------ALIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTVIE-----------TTIQDSASKHSEEHG
                                  A+IDV+   LT++V  EEV+FN+F+A KYP   +    + ++  +  E            T   SA    +++ 
Subjt:  --------------------------ALIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTVIE-----------TTIQDSASKHSEEHG

Query:  ESL---------------------DLDQRKATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQG
        E                       D+ + K  P  S+ ++AP L+LKPLP HL Y +LGE  TLP+IV+  L  E  + LI++L+  +KAIGWT++ ++G
Subjt:  ESL---------------------DLDQRKATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQG

Query:  ISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT
        ISPS CMH+I +E+     +E QRRLNP MK+VV+ EV+K LDAGIIYPI+D+ W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  T
Subjt:  ISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT

Query:  RKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        RKDHFPLPF+DQ+L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM AIFS  +E  +E
Subjt:  RKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

TrEMBL top hitse value%identityAlignment
A0A2G9HWF8 Reverse transcriptase3.1e-19241.87Show/hide
Query:  DPEIERTFRIRRRE-----QRRNKMENVSRLPQGPEGPADPQNGMARPQIQ-----------AADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV
        DPEIERTFR RRR+     ++  K+E    +         P   +A P+ +            A  +++  M QM+Q   QF GLS E+P+ H+ +FL +
Subjt:  DPEIERTFRIRRRE-----QRRNKMENVSRLPQGPEGPADPQNGMARPQIQ-----------AADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV

Query:  SNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDEQLE--------------------------DETFSEAWERFKELLRKCPHHGLPHCI
         ++   +GV +DA+RL  F +SL   A  W  S    SI TW +  E                           ET  EAW RF+++LR CP+H +P  I
Subjt:  SNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDEQLE--------------------------DETFSEAWERFKELLRKCPHHGLPHCI

Query:  QMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKN------------------
        Q+  FY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN                  
Subjt:  QMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKN------------------

Query:  VTVISHQQPPAMEP-----TAGLAQRPQLRMGRTRKQCAS---------TTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTEHPRMEGKEQG
        +  +S+ + P   P       G  Q P       + Q ++          TK    Q  +A  +A+ ++ G+   E+   P           + + KE  
Subjt:  VTVISHQQPPAMEP-----TAGLAQRPQLRMGRTRKQCAS---------TTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTEHPRMEGKEQG

Query:  AGGSKKDVGASGSIPNVEPPYVSPPPYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSA
        +   +K+V A        P  VS P  + P PFPQR + +    QF KFLE+ K+LHINIP  E  EQM +Y KF+KDIL+KK+RLG++ETV+ TEECSA
Subjt:  AGGSKKDVGASGSIPNVEPPYVSPPPYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSA

Query:  ILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA---------------------------------
        I++N LPPK KD G              +ALCDLGASINLMP S+YR LG+ EA+PT++TLQLADR+                                 
Subjt:  ILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA---------------------------------

Query:  ----------------LIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQDSASKHSEEHGESL-DLDQRK
                        LIDVQK ELTMRV ++++ FNVFKAMK+P E ++C  + +  +              +E  + D   + +EE  E +  LD  K
Subjt:  ----------------LIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQDSASKHSEEHGESL-DLDQRK

Query:  ---------------ATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEG
                       +  +K S+ E PTL+LKPLP HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+A I+GISPSFCMHKI LE+ 
Subjt:  ---------------ATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEG

Query:  SFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLD
           S+E QRRLNP MK+VVKKE+IK LDAGIIYPI+D +W+SPVQCVPKKGG+TVV N  NE IPT+TVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLD
Subjt:  SFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLD

Query:  RLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        RLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRR+PF LCNAP TFQRCM AIF+ M+E+ +E
Subjt:  RLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

A0A2G9HYA0 Reverse transcriptase3.4e-19145.68Show/hide
Query:  ETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLA
        ET  EAW RF+++LR CP+H +P  IQ+  FY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   + 
Subjt:  ETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLA

Query:  MIANALKN-------------------------------VTVISHQQPPAMEP-------------------TAGLAQRPQLRMG---------RTRKQC
         +  ++KN                               +  +S+ + P   P                     G    P+ + G         + +K  
Subjt:  MIANALKN-------------------------------VTVISHQQPPAMEP-------------------TAGLAQRPQLRMG---------RTRKQC

Query:  ASTTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPRMEGKEQ------------------GAGGSKKDVGASGSIPNVEPPY-VSPPPY
           T    + S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR +GK Q                       +K+V +      VE P  VS P  
Subjt:  ASTTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTE-HPRMEGKEQ------------------GAGGSKKDVGASGSIPNVEPPY-VSPPPY

Query:  VPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKEL
        + P PFPQR + +  + QF KFLE+ K+LHINIP  E  EQM +Y KF+KDIL+KK+RLG++ETV+ TEECSAI++N LPPK KD GSFTI   IG    
Subjt:  VPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKEL

Query:  GKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA-------------------------------------------------LIDVQKRELTM
        G+ALCDLGASINLMP S+YR LG+GEA+PT++TLQLADR+                                                 LIDVQK ELTM
Subjt:  GKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRA-------------------------------------------------LIDVQKRELTM

Query:  RVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQDSASKHSEEHGE---SLDLDQ----------RKATP---IKSSLIEAP
        RV ++++ FNVFKAMK+P E ++C  + +                 +E  + D   + +EE  E   +LD  +           + TP   +K S+ + P
Subjt:  RVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQDSASKHSEEHGE---SLDLDQ----------RKATP---IKSSLIEAP

Query:  TLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLL
        TL+LKPLP HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+A I+GISPSFCMHKI LE+    S+E QRRLNP MK+VVKKE+IK L
Subjt:  TLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLL

Query:  DAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQ
        DAGIIYPI+D++WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQ
Subjt:  DAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQ

Query:  EKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        EKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM AIF+ M+E+ +E
Subjt:  EKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

A0A2G9HYD8 Reverse transcriptase1.0e-19046.08Show/hide
Query:  ETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLA
        ET  EAW RF+++LR CP+H +P  IQ+  FY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   + 
Subjt:  ETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLA

Query:  MIANALKN---------------VTVISHQQPPAMEP-------------------TAGLAQRPQLRMG---------RTRKQCASTTKDATIQSNQASM
         +  ++KN               +  +S+ + P   P                     G    P+ + G         + +K     T    + S  A+ 
Subjt:  MIANALKN---------------VTVISHQQPPAMEP-------------------TAGLAQRPQLRMG---------RTRKQCASTTKDATIQSNQASM

Query:  RALEMQVGQLANELKARPQGKLPSDTE-HPRMEGKEQ------------------GAGGSKKDVGASGSIPNVEPPY-VSPPPYVPPLPFPQRQKPRNQD
        + +E Q+GQLAN + +RPQG LPS+TE +PR +GK Q                       +K+V +      VE P  VS P  + P PFPQ+ + +  +
Subjt:  RALEMQVGQLANELKARPQGKLPSDTE-HPRMEGKEQ------------------GAGGSKKDVGASGSIPNVEPPY-VSPPPYVPPLPFPQRQKPRNQD

Query:  GQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPL
         QF KFLE+ K+LHINIP  E  EQM +Y KF+KDIL+KK+RLG++ET + TEEC+AI++N LPPK KD GSFTI   IG    G+ALCDLGASINLMP 
Subjt:  GQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKNGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPL

Query:  SVYRKLGIGEARPTTVTLQLADRA-------------------------------------------------LIDVQKRELTMRVCNEEVKFNVFKAMK
        S+YR LG+GEA+PT++TLQLADR+                                                 LIDVQK ELTMRV ++++ FNVFKAMK
Subjt:  SVYRKLGIGEARPTTVTLQLADRA-------------------------------------------------LIDVQKRELTMRVCNEEVKFNVFKAMK

Query:  YPGEMEDCSFIRIIKSTVIETTIQDSASKHSE-------EHGESLDLD------------------QRKATP---IKSSLIEAPTLDLKPLPDHLKYVYL
        +P E ++C  + +  +     +I +      E       E G   DL+                    + TP   +K S+ + PTL+LKPLP+HL YVYL
Subjt:  YPGEMEDCSFIRIIKSTVIETTIQDSASKHSE-------EHGESLDLD------------------QRKATP---IKSSLIEAPTLDLKPLPDHLKYVYL

Query:  GEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSP
        GE +TLP+I++S L     E L+++L+ ++ AIGWT+A I+GISPSFCMHKI LE+    S+E QRRLN  MK+VVKKE+IK LDAGIIYPI+D++WVSP
Subjt:  GEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSP

Query:  VQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRR
        VQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRR
Subjt:  VQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRR

Query:  MPFGLCNAPTTFQRCMFAIFSYMIESTVE
        MPFGLCNAP TFQRCM AIF+ M+E+ +E
Subjt:  MPFGLCNAPTTFQRCMFAIFSYMIESTVE

A0A6P8DD93 uncharacterized protein LOC1162064533.7e-19340.7Show/hide
Query:  LDPEIERTF-RIRRREQRRNKM-------ENVSRLPQGPEG-------PADPQNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV
        LDPEIERT  R+RR  +RR ++       ++++R  QG          P    + + RP I A +FE+KP + QM+Q+  QF G  +E P  H+  FL  
Subjt:  LDPEIERTF-RIRRREQRRNKM-------ENVSRLPQGPEG-------PADPQNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV

Query:  SNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDE--------------------------QLEDETFSEAWERFKELLRKCPHHGLPHCI
         N+  +  V  D IRL  F +SLRD A+AW NS    SI TW +                          +   E+  EAWERFKE +RKCPHHGLP  +
Subjt:  SNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDE--------------------------QLEDETFSEAWERFKELLRKCPHHGLPHCI

Query:  QMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKNVTVI--------------
         +E+FY  L    + +VDA+AGGAL+ K +DEA  ++E +++++  W + R +  +V SV ++D ++ +   ++ +   +  +T                
Subjt:  QMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKNVTVI--------------

Query:  -------------------------------SHQQP---------------------PAMEPTAGLAQR-------PQLRMGRTRKQCASTTK--DATIQ
                                       S+Q P                      A++P  G  ++       PQ    R  +   S  +  D  +Q
Subjt:  -------------------------------SHQQP---------------------PAMEPTAGLAQR-------PQLRMGRTRKQCASTTK--DATIQ

Query:  SNQASMRALEMQVGQLANELKARPQGKLPSDT-EHPR-------MEGKEQGAGGSK---------KDVGASGSIPNVEPPYVSPPPYVPPLPFPQRQKPR
        + QA++R LE Q+ Q++ +L  RP G LPS+T E+P+         GKE      K         KD G    +       +   PYVPP+PFP+R K +
Subjt:  SNQASMRALEMQVGQLANELKARPQGKLPSDT-EHPR-------MEGKEQGAGGSK---------KDVGASGSIPNVEPPYVSPPPYVPPLPFPQRQKPR

Query:  NQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKN---GLPPKAKDLGSFTITVPIGGKELGKALCDLGAS
          D QF KFL++ K+L INIP  E  +QM +YA+F+KD+LTKK++    E V  T ECS IL+     LP K +D GSFT+   IG       L D GAS
Subjt:  NQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKN---GLPPKAKDLGSFTITVPIGGKELGKALCDLGAS

Query:  INLMPLSVYRKLGIGEARPTTVTLQLADR-------------------------------------------------ALIDVQKRELTMRVCNEEVKFN
        INLMPLS++RKLG+GE + T VTLQLADR                                                 ALIDV++ +LT+RV NE++ FN
Subjt:  INLMPLSVYRKLGIGEARPTTVTLQLADR-------------------------------------------------ALIDVQKRELTMRVCNEEVKFN

Query:  VFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQD-----SASKHSEEHGESL-DLDQR-------KATPIKSSLIEAPTLDLKPLPDHLKY
        V+ A+K   + + C  I II   +            +E+ ++D        +H EE  E + ++  R        AT   SSL ++P L+LKPLP HLKY
Subjt:  VFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQD-----SASKHSEEHGESL-DLDQR-------KATPIKSSLIEAPTLDLKPLPDHLKY

Query:  VYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNW
         YLG  +TLPII++S L  + E+ L+ +L+++++AIGWT+A I+GISP  C H+I LE      ++ QRRLNP +K+VVKKEV+KLLDAGIIYPI+D+ W
Subjt:  VYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNW

Query:  VSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFA
        VSPVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDH PLPFIDQML++LAG  YYCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFA
Subjt:  VSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFA

Query:  FRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        FRRMPFGLCNAP TFQRCM +IFS M+E+ +E
Subjt:  FRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

A0A6P8DKJ2 uncharacterized protein LOC1162042311.7e-19340.87Show/hide
Query:  LDPEIERTF-RIRRREQRRNKM-------ENVSRLPQGPEG-------PADPQNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV
        LDPEIERT  R+RR  +RR ++       ++++R  QG          P    + + RP I A +FE+KP + QM+Q+  QF G  +E P  H+  FL  
Subjt:  LDPEIERTF-RIRRREQRRNKM-------ENVSRLPQGPEG-------PADPQNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV

Query:  SNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDE--------------------------QLEDETFSEAWERFKELLRKCPHHGLPHCI
         N+  +  V  D IRL  F +SLRD A+AW NS    SI TW +                          +   E+  EAWERFKE +RKCPHHGLP  +
Subjt:  SNSFVIQGVPRDAIRLTFFLYSLRDGAKAWLNSFAPGSIRTWDE--------------------------QLEDETFSEAWERFKELLRKCPHHGLPHCI

Query:  QMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKNVTVI--------------
         +E+FY  L    + +VDA+AGGAL+ K +DEA  ++E +++++  W + R +  +V SV ++D ++ +   ++ +   +  +T                
Subjt:  QMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRSVLEVDGVSTIRPDLAMIANALKNVTVI--------------

Query:  -------------------------------SHQQP---------------------PAMEPTAGLAQR-------PQLRMGRTRKQCASTTK--DATIQ
                                       S+Q P                      A++P  G  ++       PQ    R  +   S  +  D  +Q
Subjt:  -------------------------------SHQQP---------------------PAMEPTAGLAQR-------PQLRMGRTRKQCASTTK--DATIQ

Query:  SNQASMRALEMQVGQLANELKARPQGKLPSDT-EHPR-------MEGKEQGAGGSK---------KDVGASGSIPNVEPP---YVSPPPYVPPLPFPQRQ
        + QA++R LE Q+ Q++ +L  RP G LPS+T E+P+         GKE      K         KD G       VE P    +   PYVPP+PFP R 
Subjt:  SNQASMRALEMQVGQLANELKARPQGKLPSDT-EHPR-------MEGKEQGAGGSK---------KDVGASGSIPNVEPP---YVSPPPYVPPLPFPQRQ

Query:  KPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKN---GLPPKAKDLGSFTITVPIGGKELGKALCDL
        K +  D QF KFL++ K+L INIP  E  +QM +YA+F+KD+LTKK++    E V  T ECS IL+     LP K +D GSFT+   IG       L D 
Subjt:  KPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILKN---GLPPKAKDLGSFTITVPIGGKELGKALCDL

Query:  GASINLMPLSVYRKLGIGEARPTTVTLQLADR-------------------------------------------------ALIDVQKRELTMRVCNEEV
        GASINLMPLS++RKLG+GE + T +TLQLADR                                                 ALIDV++ +LT+RV NE++
Subjt:  GASINLMPLSVYRKLGIGEARPTTVTLQLADR-------------------------------------------------ALIDVQKRELTMRVCNEEV

Query:  KFNVFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQD-----SASKHSEEHGESL-DLDQR-------KATPIKSSLIEAPTLDLKPLPDH
         FNV+ A+K   + + C  I II   +            +E+ ++D        +H EE  E + ++  R        AT   SSL ++P L+LKPLP H
Subjt:  KFNVFKAMKYPGEMEDCSFIRIIKSTV------------IETTIQD-----SASKHSEEHGESL-DLDQR-------KATPIKSSLIEAPTLDLKPLPDH

Query:  LKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIAD
        LKY YLG  +TLPII++S L  + E+ L+ +L+++++AIGWT+A I+GISP  C H+I LE      ++ QRRLNP +K+VVKKEV+KLLDAGIIYPI+D
Subjt:  LKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIAD

Query:  NNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYG
        + WVSPVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDHFPLPFIDQML++LAG  YYCFLDGYSGYNQI IAPEDQEKTTFTCPYG
Subjt:  NNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYG

Query:  TFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE
        TFAFRRMPFGLCNAP TFQRCM +IFS M+E+ +E
Subjt:  TFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.0e-1931.86Show/hide
Query:  LMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFC-MHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTV
        L +E ++ L  LLQ+Y       + Y +G   +F    K T+       +  +     A +  V+ ++  +L+ GII   +++ + SP+  VPKK     
Subjt:  LMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFC-MHKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTV

Query:  VSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTF
         S K            +R+ +DYR+LN+ T  D  P+P +D++L +L    Y+  +D   G++QI + PE   KT F+  +G + + RMPFGL NAP TF
Subjt:  VSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTF

Query:  QRCM
        QRCM
Subjt:  QRCM

P10394 Retrovirus-related Pol polyprotein from transposon 4122.0e-1834.07Show/hide
Query:  VKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN
        ++ +V KL+   I+ P + + + SP+  VPKK              P      WR+ +DYR++NK    D FPLP ID +LD+L    Y+  LD  SG++
Subjt:  VKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN

Query:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFS---------YMIESTVEG-----LAKRFLEVVGSLR
        QI +    ++ T+F+   G++ F R+PFGL  AP +FQR M   FS         YM +  V G     + K   EV G  R
Subjt:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFS---------YMIESTVEG-----LAKRFLEVVGSLR

P20825 Retrovirus-related Pol polyprotein from transposon 2971.7e-1736.88Show/hide
Query:  VKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN
        V+ +V ++L+ G+I   +++ + SP   VPKK   +  +NK            +RV +DYR+LN+ T  D +P+P +D++L +L    Y+  +D   G++
Subjt:  VKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN

Query:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCM
        QI +  E   KT F+   G + + RMPFGL NAP TFQRCM
Subjt:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCM

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.9e-2338.6Show/hide
Query:  HKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPL
        H I ++ G+     Q   +    +  + K V KLLD   I P + +   SPV  VPKK G                   +R+C+DYR LNKAT  D FPL
Subjt:  HKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPL

Query:  PFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIF
        P ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F
Subjt:  PFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIF

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.9e-2338.6Show/hide
Query:  HKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPL
        H I ++ G+     Q   +    +  + K V KLLD   I P + +   SPV  VPKK G                   +R+C+DYR LNKAT  D FPL
Subjt:  HKITLEEGSFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPL

Query:  PFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIF
        P ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F
Subjt:  PFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIF

Arabidopsis top hitse value%identityAlignment
AT1G60030.1 nucleobase-ascorbate transporter 72.8e-0747.22Show/hide
Query:  LGTTVLIPSSLVPQMGGG--------------------------------FGGSYTFVPTTVSIILAGRFSD
        LGTTVLIP+ LVPQMGGG                                 GGSYT+VPTT+SIILAGR+SD
Subjt:  LGTTVLIPSSLVPQMGGG--------------------------------FGGSYTFVPTTVSIILAGRFSD

AT5G62890.1 Xanthine/uracil permease family protein7.7e-1048.72Show/hide
Query:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID
        LGTTVLIP++LVPQMGGG+                                G SYTFVPTT+SIIL+GRFSDT+N ID
Subjt:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID

AT5G62890.2 Xanthine/uracil permease family protein7.7e-1048.72Show/hide
Query:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID
        LGTTVLIP++LVPQMGGG+                                G SYTFVPTT+SIIL+GRFSDT+N ID
Subjt:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID

AT5G62890.3 Xanthine/uracil permease family protein7.7e-1048.72Show/hide
Query:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID
        LGTTVLIP++LVPQMGGG+                                G SYTFVPTT+SIIL+GRFSDT+N ID
Subjt:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID

AT5G62890.4 Xanthine/uracil permease family protein7.7e-1048.72Show/hide
Query:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID
        LGTTVLIP++LVPQMGGG+                                G SYTFVPTT+SIIL+GRFSDT+N ID
Subjt:  LGTTVLIPSSLVPQMGGGF--------------------------------GGSYTFVPTTVSIILAGRFSDTTNLID


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTAGATCCAGAAATTGAGAGGACATTCAGGATTAGAAGGAGAGAGCAGCGCAGAAACAAGATGGAGAACGTGTCGCGTCT
TCCACAGGGTCCTGAAGGTCCAGCAGACCCCCAAAATGGGATGGCACGTCCCCAAATTCAAGCGGCGGATTTTGAAATGAAACCGGTAATGTTTCAAATGTTGCAAACCG
TGGGGCAATTCCATGGTTTGTCATCTGAAGATCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGCAATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTATTAGA
TTAACTTTTTTCTTGTATTCTCTTAGAGATGGAGCAAAGGCGTGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGGATGAGCAACTTGAGGATGAGACTTTTAG
TGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAATATTTTACAATGGTTTAATTGGAGTAACCCAAG
GTATGGTTGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAGACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTACTAATAGTTGTCAGTGGTCGGATGTTAGA
GGCACAAATAAAAAGGTTAGGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGCCTGATCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCA
TCAACAGCCACCAGCTATGGAGCCTACTGCAGGGTTGGCGCAACGACCCCAACTTCGCATGGGGAGGACAAGGAAGCAGTGCGCAAGCACAACAAAAGATGCCACAATTC
AAAGTAATCAAGCTTCGATGAGAGCCCTAGAAATGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTCGA
ATGGAAGGTAAGGAGCAGGGTGCTGGAGGCAGCAAGAAAGATGTTGGAGCATCTGGTTCTATTCCAAATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACC
TCTACCTTTTCCACAAAGGCAAAAGCCTAGGAATCAGGACGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGATACTG
AGCAAATGCTTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCGTACTGAGGAGTGTAGTGCTATTCTTAAG
AATGGGCTACCACCCAAGGCTAAGGATCTAGGGTCATTTACTATAACTGTGCCTATAGGTGGAAAAGAGTTAGGTAAAGCACTCTGTGATTTAGGTGCAAGCATTAACCT
TATGCCTCTTTCGGTCTATCGAAAGCTTGGTATTGGTGAAGCTAGGCCCACCACAGTCACACTCCAACTAGCTGATAGGGCATTAATAGATGTTCAGAAAAGGGAGTTAA
CGATGAGAGTTTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGGCGAAATGGAGGATTGCTCCTTCATTAGGATTATAAAGAGCACAGTTATT
GAGACAACAATACAGGATTCAGCTAGTAAGCATTCAGAAGAGCATGGAGAGTCTTTAGATTTAGATCAAAGGAAGGCTACTCCTATTAAGTCATCCCTGATTGAGGCACC
TACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAGGAGG
CCTTAATAAAATTGCTGCAGCAATACCGCAAGGCTATAGGTTGGACATTGGCTTACATTCAGGGAATTAGCCCGTCTTTTTGCATGCACAAAATCACTCTAGAGGAGGGA
TCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGATGTTGTTAAAAAGGAGGTGATTAAATTGTTGGATGCTGGGATCATTTATCCAATTGCAGA
CAACAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTAATCCCAACCAGGACAGTAACTGGCTGGAGGG
TTTGCATGGACTACAGGAGGCTTAATAAAGCTACCCGTAAGGACCATTTCCCTCTACCATTTATTGACCAGATGTTGGATAGATTGGCTGGTCAGGCCTACTACTGTTTC
TTAGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTTAGGCGAATGCCTTTTGG
CCTTTGCAATGCTCCAACAACATTTCAGCGGTGTATGTTTGCAATTTTTTCTTATATGATTGAGTCCACTGTTGAGGGATTGGCCAAAAGGTTCCTTGAGGTTGTTGGAT
CTTTGAGACATGGCTATGTTTATTTTGTATGTGGTAATCTAAGGAGTGTAATGGAAATGACAAGACTTATTGATATTATTGATATTAAAGTAGTTTTAAAAGGTGGGCAA
AACTTGTATGTTCGAATTCTTGCTTCTGAGTTAGGCACGACAGTTCTGATTCCATCTTCTCTTGTTCCTCAAATGGGTGGTGGATTTGGAGGGTCGTATACCTTTGTACC
AACCACTGTTTCAATCATCCTTGCTGGTCGATTTTCTGATACGACAAATCTTATTGATGTAACGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTAGATCCAGAAATTGAGAGGACATTCAGGATTAGAAGGAGAGAGCAGCGCAGAAACAAGATGGAGAACGTGTCGCGTCT
TCCACAGGGTCCTGAAGGTCCAGCAGACCCCCAAAATGGGATGGCACGTCCCCAAATTCAAGCGGCGGATTTTGAAATGAAACCGGTAATGTTTCAAATGTTGCAAACCG
TGGGGCAATTCCATGGTTTGTCATCTGAAGATCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGCAATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTATTAGA
TTAACTTTTTTCTTGTATTCTCTTAGAGATGGAGCAAAGGCGTGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGGATGAGCAACTTGAGGATGAGACTTTTAG
TGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAATATTTTACAATGGTTTAATTGGAGTAACCCAAG
GTATGGTTGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAGACTTTTGATGAAGCCTATGAAATTTTAGAAAGAATATCTACTAATAGTTGTCAGTGGTCGGATGTTAGA
GGCACAAATAAAAAGGTTAGGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGCCTGATCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCA
TCAACAGCCACCAGCTATGGAGCCTACTGCAGGGTTGGCGCAACGACCCCAACTTCGCATGGGGAGGACAAGGAAGCAGTGCGCAAGCACAACAAAAGATGCCACAATTC
AAAGTAATCAAGCTTCGATGAGAGCCCTAGAAATGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTCGA
ATGGAAGGTAAGGAGCAGGGTGCTGGAGGCAGCAAGAAAGATGTTGGAGCATCTGGTTCTATTCCAAATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACC
TCTACCTTTTCCACAAAGGCAAAAGCCTAGGAATCAGGACGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGATACTG
AGCAAATGCTTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGAAACTGTATCTCGTACTGAGGAGTGTAGTGCTATTCTTAAG
AATGGGCTACCACCCAAGGCTAAGGATCTAGGGTCATTTACTATAACTGTGCCTATAGGTGGAAAAGAGTTAGGTAAAGCACTCTGTGATTTAGGTGCAAGCATTAACCT
TATGCCTCTTTCGGTCTATCGAAAGCTTGGTATTGGTGAAGCTAGGCCCACCACAGTCACACTCCAACTAGCTGATAGGGCATTAATAGATGTTCAGAAAAGGGAGTTAA
CGATGAGAGTTTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGGCGAAATGGAGGATTGCTCCTTCATTAGGATTATAAAGAGCACAGTTATT
GAGACAACAATACAGGATTCAGCTAGTAAGCATTCAGAAGAGCATGGAGAGTCTTTAGATTTAGATCAAAGGAAGGCTACTCCTATTAAGTCATCCCTGATTGAGGCACC
TACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAGGAGG
CCTTAATAAAATTGCTGCAGCAATACCGCAAGGCTATAGGTTGGACATTGGCTTACATTCAGGGAATTAGCCCGTCTTTTTGCATGCACAAAATCACTCTAGAGGAGGGA
TCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGATGTTGTTAAAAAGGAGGTGATTAAATTGTTGGATGCTGGGATCATTTATCCAATTGCAGA
CAACAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTAATCCCAACCAGGACAGTAACTGGCTGGAGGG
TTTGCATGGACTACAGGAGGCTTAATAAAGCTACCCGTAAGGACCATTTCCCTCTACCATTTATTGACCAGATGTTGGATAGATTGGCTGGTCAGGCCTACTACTGTTTC
TTAGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTTAGGCGAATGCCTTTTGG
CCTTTGCAATGCTCCAACAACATTTCAGCGGTGTATGTTTGCAATTTTTTCTTATATGATTGAGTCCACTGTTGAGGGATTGGCCAAAAGGTTCCTTGAGGTTGTTGGAT
CTTTGAGACATGGCTATGTTTATTTTGTATGTGGTAATCTAAGGAGTGTAATGGAAATGACAAGACTTATTGATATTATTGATATTAAAGTAGTTTTAAAAGGTGGGCAA
AACTTGTATGTTCGAATTCTTGCTTCTGAGTTAGGCACGACAGTTCTGATTCCATCTTCTCTTGTTCCTCAAATGGGTGGTGGATTTGGAGGGTCGTATACCTTTGTACC
AACCACTGTTTCAATCATCCTTGCTGGTCGATTTTCTGATACGACAAATCTTATTGATGTAACGTAA
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDPEIERTFRIRRREQRRNKMENVSRLPQGPEGPADPQNGMARPQIQAADFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSNSFVIQGVPRDAIR
LTFFLYSLRDGAKAWLNSFAPGSIRTWDEQLEDETFSEAWERFKELLRKCPHHGLPHCIQMEIFYNGLIGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVR
GTNKKVRSVLEVDGVSTIRPDLAMIANALKNVTVISHQQPPAMEPTAGLAQRPQLRMGRTRKQCASTTKDATIQSNQASMRALEMQVGQLANELKARPQGKLPSDTEHPR
MEGKEQGAGGSKKDVGASGSIPNVEPPYVSPPPYVPPLPFPQRQKPRNQDGQFKKFLEILKQLHINIPLVEDTEQMLNYAKFLKDILTKKKRLGEFETVSRTEECSAILK
NGLPPKAKDLGSFTITVPIGGKELGKALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRALIDVQKRELTMRVCNEEVKFNVFKAMKYPGEMEDCSFIRIIKSTVI
ETTIQDSASKHSEEHGESLDLDQRKATPIKSSLIEAPTLDLKPLPDHLKYVYLGEGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLAYIQGISPSFCMHKITLEEG
SFRSIEQQRRLNPAMKDVVKKEVIKLLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCF
LDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMFAIFSYMIESTVEGLAKRFLEVVGSLRHGYVYFVCGNLRSVMEMTRLIDIIDIKVVLKGGQ
NLYVRILASELGTTVLIPSSLVPQMGGGFGGSYTFVPTTVSIILAGRFSDTTNLIDVT