; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001143 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001143
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:25264267..25268861
RNA-Seq ExpressionLag0001143
SyntenyLag0001143
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN05661.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]6.0e-14447.13Show/hide
Query:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--Q
        VNQV     +C  CGE H  + C +++ S+ F                   WR HPNF+W    GQGS  + Q               Q  +Q ++Q  Q
Subjt:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--Q

Query:  NSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVE
          + +LE  + ++M+       S  A+ + +E Q+G LAN + +RPQG LPS+TE +PR++GK Q QAVTLR+G+ L+E       ++  K+ +K V+ E
Subjt:  NSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVE

Query:  KEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTV
        ++ K                   +VE P     P  L  PFPQR + +  + QF KFLE+ K+LHINIP  EA++QMP+Y KF+KDIL+KK+RLG+++TV
Subjt:  KEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTV

Query:  SLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFI
        +LTEEC AI++N LP K KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF+
Subjt:  SLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFI

Query:  ILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------
        +L  E D +VPI LGR FLATGRTLIDVQKGELTMRV  +++ FNVFK MK+ +E ++C  + + +N              +E A+ DL ++        
Subjt:  ILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------

Query:  LKDLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCM
        +K LD             ER AP  ++K S+ E PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ DI+GI+ SFCM
Subjt:  LKDLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCM

Query:  HKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIAD
        HKI LE+    S+E Q+RLNP+MKEVVKKE+IKWLD GIIYPI+D
Subjt:  HKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIAD

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.8e-14647.53Show/hide
Query:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQQNS
        VNQV     +C  CGE H  + C +++ S+ F                   WR HPNF+W    GQGS  + Q    Q G  ++Q   Q  K +L     
Subjt:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQQNS

Query:  ESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKE
        E TL   M            S  A+ + +E Q+G LAN + +RPQG LPS+TE +PR++GK Q QAVTLR+G+ L+E       ++  K+ +K V+ E++
Subjt:  ESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKE

Query:  FKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSL
         K                   +VE P     P  L  PFPQR + +  + QF KFLE+ K+LHINIP  EA++QMP+Y KF+KDIL+KK+RLG+++TV+L
Subjt:  FKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSL

Query:  TEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
        TEEC AI++N LP K KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  YYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------LK
          E D +VPI LGR FLATGRTLIDVQKGELTMRV  +++ FNVFK MK+ +E ++C  + + +               +E A+ DL ++        +K
Subjt:  YYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------LK

Query:  DLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHK
         LD             ER  P  ++K S+ + PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GI+ SFCMHK
Subjt:  DLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHK

Query:  ITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN
        I LE+    S+E Q+RLNP+MKEVVKKE+IKWLD GIIYPI+DS+WV+
Subjt:  ITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN

XP_017227899.1 PREDICTED: uncharacterized protein LOC108203467 [Daucus carota subsp. sativus]8.6e-15155.54Show/hide
Query:  MMKEYM-------SRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTEHPRREGKEQIQAVTLRSGKPLEERKEASKTQD-VEKNSDKNVVVEK
        M+KEY+       S+T A + S  AS+R LE QVG LAN L+ RP G L SDTE P+  G E  +A+TL+SGK L      +K  D VE + ++ +  +K
Subjt:  MMKEYM-------SRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTEHPRREGKEQIQAVTLRSGKPLEERKEASKTQD-VEKNSDKNVVVEK

Query:  EFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVS
        E ++ K           S   + +E  ++ P P     PFPQR + + QD QF+KFL++LKQLHINIPLVEA++QMPNY KF+KDILTKK+RLGEF+TV+
Subjt:  EFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVS

Query:  LTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFII
        LT+EC + L++ LPTK KDPGSFTIP +IG+   G ALCDLGASINLMP+SV+RKLGIGE RPTTVTLQLADRS+ +PEGKIEDVLVKVDKFIFP DFI+
Subjt:  LTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFII

Query:  LYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRI-----------------LENTIVETAMQDLANKHL--
        L YEAD++VPI LGR FLATGRTLIDVQ GELTMRV  E+V FNVFK MKY D+VEDCS I I                 LE  I+  + ++  N  L  
Subjt:  LYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRI-----------------LENTIVETAMQDLANKHL--

Query:  -----------------KDLDERKAPLIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFC
                          DL  R+    KSS+ E P L+LK L  HLKY YLG   TLP+I++++L   +EE L++LL+ +RKAIGW++ADI+GI+ S C
Subjt:  -----------------KDLDERKAPLIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFC

Query:  MHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN
        MHKI LE+G+  SIE Q+RLNP+MKEVVKKEVIKWLD GIIYPI+DS WV+
Subjt:  MHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]9.9e-14752.55Show/hide
Query:  PQHNKQALRQQNSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTEHPRREGKEQIQAVTLRSGKPLEERKE--------
        P H    + Q  +ES   +++KEYM++ +AAIQS QAS+R LE+QVG LAN L+ RP  KLP+DTE P+REG EQ QA+ LRSGK +  R E        
Subjt:  PQHNKQALRQQNSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTEHPRREGKEQIQAVTLRSGKPLEERKE--------

Query:  -ASKTQDVEKNSDKNVVVEKEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSP--------YVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEA
         + +T D ++  +K VV E+  K+   A      +     I  V+PP    S         Y    PFPQR + K ++  F KF++I K++HINIPLVEA
Subjt:  -ASKTQDVEKNSDKNVVVEKEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSP--------YVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEA

Query:  IKQMPNYAKFLKDILTKKKRLGEFDTVSLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLAD
        +KQMPNY KFLKD+LT +++  EF  V L EEC AILKN +P K KDPGSFTIP+SIG K+LGRALCDLG+SINLMPLS+Y+KLGIGEARPTTVTLQLAD
Subjt:  IKQMPNYAKFLKDILTKKKRLGEFDTVSLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLAD

Query:  RSITYPEGKIEDVLVKVDKFIFPVDFIILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRIL--------
        RS TYPEGKIED+L++VDKFIFP DFIIL YEAD DVPI LGR FL TGRTL+DV KG +T+R+  ++V+FN+   MKY    E+CS +  L        
Subjt:  RSITYPEGKIEDVLVKVDKFIFPVDFIILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRIL--------

Query:  --------------ENTIVETAMQDLANKHLKDL--DERKAPLIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKA
                       + I + A+    ++  + L  + RK+  ++ S+ EAP LDLKPL  +LKY YLG+ +TLPII+++ L    E+ L++ L++++ A
Subjt:  --------------ENTIVETAMQDLANKHLKDL--DERKAPLIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKA

Query:  IGWTLADIQGINSSFCMHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVNLI
        IGWTLADI+GI+ S CMHKI LEEG  +SIEQQ+RLNP+MKEVV+KE++KWLD GIIYPIA+S+ V+ I
Subjt:  IGWTLADIQGINSSFCMHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVNLI

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]3.3e-15049.14Show/hide
Query:  SCVYCGEEHNYEFCHNNLASVFFC-----------------WRNHPNFAWGGQGSNVQ--AQQKVNQSGFAKLQILPQ----HNKQALRQQNSESTLEAM
        +CVYCG EH++E C +N  SV +                  W+ HPNF+W  Q +N      +     GF + Q   Q     + Q    Q S + +EA+
Subjt:  SCVYCGEEHNYEFCHNNLASVFFC-----------------WRNHPNFAWGGQGSNVQ--AQQKVNQSGFAKLQILPQ----HNKQALRQQNSESTLEAM

Query:  MKEYMSRTN-------AAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTEHPRREG----KEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVV
        +KEYM+R +       A +QS  AS+R LE QVG LAN L  RPQG LPSDT++PRR+G    KE  +A+TL++G+ +E+      T+        ++  
Subjt:  MKEYMSRTN-------AAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTEHPRREG----KEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVV

Query:  EKEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDT
        ++  +    +  + +D  A+  +   +P    P       PFPQR + + QD QFR+FL++LKQLHINIPLVEA++QMP+Y KF+KDILTKK+RLGEF+T
Subjt:  EKEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDT

Query:  VSLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF
        V+LTEEC AILKN LP K KDPGSFTIP SIG++ +G+ALCDLGASINLMP+S++RKLGIGE  PTTVTLQLADRS  +PEGKIEDVLV+VDKFIFP DF
Subjt:  VSLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDF

Query:  IILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRIL--------ENTIVETAM--QDL------------
        I+L YEADK+VPI LGR FLATG+TLIDVQKGELTMRV+ ++V FNVFK M++ DEVE+CS + +L        E T  E  M  +DL            
Subjt:  IILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRIL--------ENTIVETAM--QDL------------

Query:  -------------ANKHLKDLDERKAPL--IKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINS
                     + +H + LD    PL   K S+ E P L+L+PL  HL+Y YLG+ +TLP+I+AS L    E  L+++L+++++AIGWT+ADI+GI+ 
Subjt:  -------------ANKHLKDLDERKAPL--IKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINS

Query:  SFCMHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWL
        S CMHKI L+E    S+EQQ+RLNP+MKEVVKKEVI ++
Subjt:  SFCMHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWL

TrEMBL top hitse value%identityAlignment
A0A2G9GK35 Reverse transcriptase2.9e-14447.13Show/hide
Query:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--Q
        VNQV     +C  CGE H  + C +++ S+ F                   WR HPNF+W    GQGS  + Q               Q  +Q ++Q  Q
Subjt:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--Q

Query:  NSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVE
          + +LE  + ++M+       S  A+ + +E Q+G LAN + +RPQG LPS+TE +PR++GK Q QAVTLR+G+ L+E       ++  K+ +K V+ E
Subjt:  NSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVE

Query:  KEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTV
        ++ K                   +VE P     P  L  PFPQR + +  + QF KFLE+ K+LHINIP  EA++QMP+Y KF+KDIL+KK+RLG+++TV
Subjt:  KEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTV

Query:  SLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFI
        +LTEEC AI++N LP K KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF+
Subjt:  SLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFI

Query:  ILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------
        +L  E D +VPI LGR FLATGRTLIDVQKGELTMRV  +++ FNVFK MK+ +E ++C  + + +N              +E A+ DL ++        
Subjt:  ILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------

Query:  LKDLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCM
        +K LD             ER AP  ++K S+ E PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ DI+GI+ SFCM
Subjt:  LKDLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCM

Query:  HKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIAD
        HKI LE+    S+E Q+RLNP+MKEVVKKE+IKWLD GIIYPI+D
Subjt:  HKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIAD

A0A2G9H2I8 DNA-directed DNA polymerase2.9e-13645.22Show/hide
Query:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFF------------------CWRNHPNFAWGGQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--QNSE
        VNQV     +C  CGE H  + C +++ S+ F                   WR +PNF+W             N  G        Q  +Q ++Q  Q  +
Subjt:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFF------------------CWRNHPNFAWGGQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--QNSE

Query:  STLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEE-RKEASKTQDVEKNSDKNVVVEKE
         +LE  + ++M+       S  A+ + +E Q+G LAN + ++PQG LPS+TE +PR+ GK Q QAVTLR+G+ L+E  KE +K+++ E            
Subjt:  STLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEE-RKEASKTQDVEKNSDKNVVVEKE

Query:  FKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSL
                              VE P     P  L   FPQ  + +  + QF KFLE+ K+LHINIP  EA++QMP+Y KF+KDIL+KK+RLG+++TV+L
Subjt:  FKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSL

Query:  TEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
        TEEC AI++N LP K KDPGSF IP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  YYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDL-------------
          E D +VPI LGR FLATGRTLIDVQKGELTMRV  +++ FNVF+ MK+ +E ++C  + + +N              +E A+ DL             
Subjt:  YYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDL-------------

Query:  ---ANKHLKDLD----ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHK
           A+K+ K       ER AP  ++K S+ E PTL+LKPL +HL Y YLGE +TLP+I++  L     E L+++L+ ++ AIGWT+ADI+GI+ SFC+HK
Subjt:  ---ANKHLKDLD----ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHK

Query:  ITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN
        I LE+    S+E Q+RLNP+M EVVKKE+IKWLD GII+PI  S+WV+
Subjt:  ITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN

A0A2G9HYA0 Reverse transcriptase1.8e-14647.53Show/hide
Query:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQQNS
        VNQV     +C  CGE H  + C +++ S+ F                   WR HPNF+W    GQGS  + Q    Q G  ++Q   Q  K +L     
Subjt:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQQNS

Query:  ESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKE
        E TL   M            S  A+ + +E Q+G LAN + +RPQG LPS+TE +PR++GK Q QAVTLR+G+ L+E       ++  K+ +K V+ E++
Subjt:  ESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKE

Query:  FKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSL
         K                   +VE P     P  L  PFPQR + +  + QF KFLE+ K+LHINIP  EA++QMP+Y KF+KDIL+KK+RLG+++TV+L
Subjt:  FKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSL

Query:  TEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL
        TEEC AI++N LP K KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L
Subjt:  TEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIIL

Query:  YYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------LK
          E D +VPI LGR FLATGRTLIDVQKGELTMRV  +++ FNVFK MK+ +E ++C  + + +               +E A+ DL ++        +K
Subjt:  YYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI------------VETAMQDLANKH-------LK

Query:  DLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHK
         LD             ER  P  ++K S+ + PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GI+ SFCMHK
Subjt:  DLD-------------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHK

Query:  ITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN
        I LE+    S+E Q+RLNP+MKEVVKKE+IKWLD GIIYPI+DS+WV+
Subjt:  ITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN

A0A2G9HYD8 Reverse transcriptase2.3e-14148.41Show/hide
Query:  WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--QNSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLP
        WR HPNF+W    GQGS  + Q               Q  +Q ++Q  Q  + +LE  + ++M+       S  A+ + +E Q+G LAN + +RPQG LP
Subjt:  WRNHPNFAWG---GQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQ--QNSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLP

Query:  SDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQD
        S+TE +PR++GK Q QAVTLR+G+ L+E       +   K+ +K V+ +++ K                   +VE P     P  L  PFPQ+ + +  +
Subjt:  SDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQD

Query:  GQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPL
         QF KFLE+ K+LHINIP  EA++QMP+Y KF+KDIL+KK+RLG+++T +LTEEC AI++N LP K KDPGSFTIP +IG    GRALCDLGASINLMP 
Subjt:  GQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSLTEEC-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPL

Query:  SVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMK
        S+YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++L  E D +VPI LGR FLATGRTLIDVQKGELTMRV  +++ FNVFK MK
Subjt:  SVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILYYEADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMK

Query:  YLDEVEDCSFIRILENTI------------VETAMQDL----------------ANKHLKDLD----ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYL
        + +E ++C  + + +N              +E A+ DL                A+K  K       ER  P  ++K S+ + PTL+LKPL +HL Y YL
Subjt:  YLDEVEDCSFIRILENTI------------VETAMQDL----------------ANKHLKDLD----ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYL

Query:  GEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN
        GE +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GI+ SFCMHKI LE+    S+E Q+RLN +MKEVVKKE+IKWLD GIIYPI+DS+WV+
Subjt:  GEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN

A0A2G9IA86 DNA-directed DNA polymerase6.3e-13946.8Show/hide
Query:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWGGQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQQNSEST
        VNQV     +C  CGE H    C N++ S+ F                   WR HPNF+W        A  +  QSG  ++Q   Q  K +L     E T
Subjt:  VNQV--AVESCVYCGEEHNYEFCHNNLASVFFC------------------WRNHPNFAWGGQGSNVQAQQKVNQSGFAKLQILPQHNKQALRQQNSEST

Query:  LEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKEFKS
        L   M            S   +++ +E Q+G LAN + +RPQG L S+TE +PR++GK Q QAVTLR+G+ L+E       ++  K+  K V+ EKE K 
Subjt:  LEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTE-HPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNSDKNVVVEKEFKS

Query:  GKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSLTEE
                          +VE           PL   Q+Q+ K    QF KFLE+ K+LHIN P  EA++QMP+Y KF+K IL+KK+RLG+++TV+LTEE
Subjt:  GKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSLTEE

Query:  C-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILYYE
        C AI++N LP K KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLA+RS+TYP+G IED+LVKVDKFIFP DF++L  E
Subjt:  C-AILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILYYE

Query:  ADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI--------VETAMQDLANKH-------LKDLD----
         D +VPI LGR FLATGRTLIDVQKG+LTMRV  +++ FNVFK MK+ +E ++C  + + +N          +E A+ DL ++        +K LD    
Subjt:  ADKDVPISLGRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTI--------VETAMQDLANKH-------LKDLD----

Query:  ---------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHKITLEEGS
                 ER AP  ++K S+ E+PTL+LKPL  HL Y YLGE +TLP+I++S L     E L+++ + ++ AIGWT+ADI+GI+ SFCMHKI LE+  
Subjt:  ---------ERKAP--LIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLPIIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHKITLEEGS

Query:  FRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN
          S+E Q+RLNP+MKEVVKKE+IKW+D GIIYPI+DS+WV+
Subjt:  FRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAGAGTGGCCAATTCGCCGACTCAATAAGCCTACCATTTCAAGGGCAAGACCGAATGGGGGCTGGGAACATAACAGTACAAGATGGAATTCACTTCTTCCCTTA
TTGGGTGGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCATGCTGCAGTGGTGAACCAAGTTGCAGTGGAATCATGTGTCTATTGTGGTGAAGAGCACAACTACGAGTTTT
GCCACAACAATCTAGCTTCTGTGTTTTTTTGTTGGCGCAACCACCCGAACTTCGCGTGGGGAGGACAAGGAAGTAATGTGCAAGCACAGCAAAAGGTGAACCAATCGGGA
TTTGCTAAATTGCAGATATTACCCCAACACAATAAACAGGCTTTGCGACAACAAAATTCAGAGAGTACTCTTGAGGCAATGATGAAAGAATATATGTCTCGTACAAATGC
TGCAATTCAAAGTAATCAAGCTTCAATGAGAGCTCTTGAATTGCAAGTGGGTCATCTAGCTAATGGGTTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAGC
ACCCTAGAAGGGAAGGTAAGGAGCAGATACAGGCAGTGACTCTAAGGAGTGGTAAGCCACTAGAAGAAAGAAAAGAGGCTAGTAAAACCCAGGATGTAGAGAAGAATAGT
GATAAAAATGTTGTTGTTGAAAAAGAGTTTAAGTCTGGTAAAGGTGCTGGAGGCAACAATATTGATGCTGGAGCATCTGGTTCTATTTCAGATGTGGAACCACCTTATGT
GTCGCCCTCACCTTATGTACTACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAAGATGGTCAATTTAGGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAA
ATATCCCTTTAGTAGAAGCTATAAAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGACACTGTATCTCTTACT
GAGGAATGTGCTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGATCATTTACTATACCTGTGTCAATAGGTGAAAAAGAGTTAGGTAGGGCACTTTGTGA
TTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAGTTAGCCGATAGGTCTATCACAT
ATCCTGAAGGTAAAATTGAGGATGTCTTAGTGAAAGTAGATAAATTCATATTTCCTGTTGATTTTATTATCCTATATTATGAGGCAGATAAAGATGTCCCAATTAGTCTT
GGTCGTCTGTTTTTGGCTACTGGTAGAACATTGATAGATGTTCAAAAAGGGGAATTAACAATGAGGGTTTATAAAGAGGAAGTGAAGTTTAATGTTTTTAAGGTCATGAA
GTATCTAGACGAAGTGGAAGATTGTTCTTTCATTAGGATTTTGGAGAACACAATTGTTGAGACAGCAATGCAGGATTTGGCAAACAAGCATTTGAAAGATTTGGATGAAA
GGAAAGCTCCTCTTATTAAGTCATCTCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGTCGGATCATCTAAAATATGAGTATCTTGGGGAGGGTGAGACGTTGCCC
ATTATTGTTGCATCAGATTTAATGCCAGAGGATGAAGAAGCCTTAATAAAGTTGCTGCAACAATATCGTAAGGCAATAGGTTGGACATTGGCTGACATTCAGGGAATCAA
CTCATCTTTCTGTATGCATAAAATCACGCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAGAAAAGGCTTAACCCTGTAATGAAAGAGGTTGTTAAGAAGGAGGTTA
TTAAATGGTTGGATGTTGGGATTATTTATCCAATTGCAGATAGCAATTGGGTAAACCTGATAGCGCGCATAGCGAGCCATGTTTCCCGGTACAATTTGGTTAGTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAGAGTGGCCAATTCGCCGACTCAATAAGCCTACCATTTCAAGGGCAAGACCGAATGGGGGCTGGGAACATAACAGTACAAGATGGAATTCACTTCTTCCCTTA
TTGGGTGGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCATGCTGCAGTGGTGAACCAAGTTGCAGTGGAATCATGTGTCTATTGTGGTGAAGAGCACAACTACGAGTTTT
GCCACAACAATCTAGCTTCTGTGTTTTTTTGTTGGCGCAACCACCCGAACTTCGCGTGGGGAGGACAAGGAAGTAATGTGCAAGCACAGCAAAAGGTGAACCAATCGGGA
TTTGCTAAATTGCAGATATTACCCCAACACAATAAACAGGCTTTGCGACAACAAAATTCAGAGAGTACTCTTGAGGCAATGATGAAAGAATATATGTCTCGTACAAATGC
TGCAATTCAAAGTAATCAAGCTTCAATGAGAGCTCTTGAATTGCAAGTGGGTCATCTAGCTAATGGGTTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAGC
ACCCTAGAAGGGAAGGTAAGGAGCAGATACAGGCAGTGACTCTAAGGAGTGGTAAGCCACTAGAAGAAAGAAAAGAGGCTAGTAAAACCCAGGATGTAGAGAAGAATAGT
GATAAAAATGTTGTTGTTGAAAAAGAGTTTAAGTCTGGTAAAGGTGCTGGAGGCAACAATATTGATGCTGGAGCATCTGGTTCTATTTCAGATGTGGAACCACCTTATGT
GTCGCCCTCACCTTATGTACTACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAAGATGGTCAATTTAGGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAA
ATATCCCTTTAGTAGAAGCTATAAAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGGTTAGGTGAGTTTGACACTGTATCTCTTACT
GAGGAATGTGCTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGATCATTTACTATACCTGTGTCAATAGGTGAAAAAGAGTTAGGTAGGGCACTTTGTGA
TTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAGTTAGCCGATAGGTCTATCACAT
ATCCTGAAGGTAAAATTGAGGATGTCTTAGTGAAAGTAGATAAATTCATATTTCCTGTTGATTTTATTATCCTATATTATGAGGCAGATAAAGATGTCCCAATTAGTCTT
GGTCGTCTGTTTTTGGCTACTGGTAGAACATTGATAGATGTTCAAAAAGGGGAATTAACAATGAGGGTTTATAAAGAGGAAGTGAAGTTTAATGTTTTTAAGGTCATGAA
GTATCTAGACGAAGTGGAAGATTGTTCTTTCATTAGGATTTTGGAGAACACAATTGTTGAGACAGCAATGCAGGATTTGGCAAACAAGCATTTGAAAGATTTGGATGAAA
GGAAAGCTCCTCTTATTAAGTCATCTCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGTCGGATCATCTAAAATATGAGTATCTTGGGGAGGGTGAGACGTTGCCC
ATTATTGTTGCATCAGATTTAATGCCAGAGGATGAAGAAGCCTTAATAAAGTTGCTGCAACAATATCGTAAGGCAATAGGTTGGACATTGGCTGACATTCAGGGAATCAA
CTCATCTTTCTGTATGCATAAAATCACGCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAGAAAAGGCTTAACCCTGTAATGAAAGAGGTTGTTAAGAAGGAGGTTA
TTAAATGGTTGGATGTTGGGATTATTTATCCAATTGCAGATAGCAATTGGGTAAACCTGATAGCGCGCATAGCGAGCCATGTTTCCCGGTACAATTTGGTTAGTCTTTGA
Protein sequenceShow/hide protein sequence
MGESGQFADSISLPFQGQDRMGAGNITVQDGIHFFPYWVVSHQQPPAVEHAAVVNQVAVESCVYCGEEHNYEFCHNNLASVFFCWRNHPNFAWGGQGSNVQAQQKVNQSG
FAKLQILPQHNKQALRQQNSESTLEAMMKEYMSRTNAAIQSNQASMRALELQVGHLANGLKARPQGKLPSDTEHPRREGKEQIQAVTLRSGKPLEERKEASKTQDVEKNS
DKNVVVEKEFKSGKGAGGNNIDAGASGSISDVEPPYVSPSPYVLPLPFPQRQRPKNQDGQFRKFLEILKQLHINIPLVEAIKQMPNYAKFLKDILTKKKRLGEFDTVSLT
EECAILKNGLPTKAKDPGSFTIPVSIGEKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILYYEADKDVPISL
GRLFLATGRTLIDVQKGELTMRVYKEEVKFNVFKVMKYLDEVEDCSFIRILENTIVETAMQDLANKHLKDLDERKAPLIKSSLIEAPTLDLKPLSDHLKYEYLGEGETLP
IIVASDLMPEDEEALIKLLQQYRKAIGWTLADIQGINSSFCMHKITLEEGSFRSIEQQKRLNPVMKEVVKKEVIKWLDVGIIYPIADSNWVNLIARIASHVSRYNLVSL