; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038380 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038380
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:16344987..16347337
RNA-Seq ExpressionLag0038380
SyntenyLag0038380
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]7.7e-6124.62Show/hide
Query:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTT-------------------------------------GYQECIPGWKQKTESAGLR--------
        M+KAYDRVEW ++ + ++ LG     V  +M C++T                                     G+   + G +++ +  G++        
Subjt:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTT-------------------------------------GYQECIPGWKQKTESAGLR--------

Query:  SHVLALRYPI---------CSLQMTVF-----------CFSGPGGGGNPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYI
        +H+L     I         C    T+F            +S      +PN        I   L V +V  H++YLGLPT+    +    + +KD++W +I
Subjt:  SHVLALRYPI---------CSLQMTVF-----------CFSGPGGGGNPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYI

Query:  HKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLC--------------------VPKGEW-----
          WK    S  GKE+L+K VLQA+PTY MSCF++P  L KE N I+ARFWW   ++ + +HW  W+ LC                    + K  W     
Subjt:  HKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLC--------------------VPKGEW-----

Query:  ----------------------------IFCMAEFDVGEIVAKRGLAVENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQ
                                     F       G+ +  +GL     +G  + V  DKW+P  S  +I+   ++ L   V  L      WNV L++
Subjt:  ----------------------------IFCMAEFDVGEIVAKRGLAVENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQ

Query:  SVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDSVGR------------------------------------KRG
         +F   + +A L++P       D L+WH+E++G+Y+V+SGYRLAC  + + S      V                                       R 
Subjt:  SVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDSVGR------------------------------------KRG

Query:  MVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANSFEE--FAVMCWWLWNRRNTKVVGGGN-----------
        +  + +CP C    E+  HA+W C+  K  W  S +  + +   + +  +L  W +L + ++  E+  FA +CW LWNRRN+ +  G +           
Subjt:  MVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANSFEE--FAVMCWWLWNRRNTKVVGGGN-----------

Query:  -LGGRDENGWIWVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRD
         L     N     +  LSH  T +GR+ S +        W PP +  YK+N D A+        +G V+R++ G+ M   ++ I         E MA  +
Subjt:  -LGGRDENGWIWVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRD

Query:  SLLVAKEAGLLRLEVETDS
         L  A + G     +E D+
Subjt:  SLLVAKEAGLLRLEVETDS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.2e-6927.61Show/hide
Query:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTTGYQECIPGWKQKTESAGLRSHVLALRYPICSLQMTVFCFSGPGGG------------------G
        MSKAYDRVEW ++E  ++ +G +         C     +  +  +          S V AL     SL M V  F    GG                   
Subjt:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTTGYQECIPGWKQKTESAGLRSHVLALRYPICSLQMTVFCFSGPGGG------------------G

Query:  NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILA
         P   ++    I + L V +V    +YLGLPT  P  + M   ++KDR+W ++  WK   FS GGKEVLIK V QA+P Y MSCF+LP  L++E + I A
Subjt:  NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILA

Query:  RFWWGGEEEGKKVHWASWKKLCVPKGEWIFCMAEFDV-----------------------------------------------------GEIVAKRGLA
        RFWWG  +E KK+HW +W  L +PK E      + ++                                                     G  + K+GL 
Subjt:  RFWWGGEEEGKKVHWASWKKLCVPKGEWIFCMAEFDV-----------------------------------------------------GEIVAKRGLA

Query:  VENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLL-CPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACF
            +G+ V +  D W+P   T++I+ +  + L  RVS L+   +  W   +V+  F   +A+ IL +P  R    D+L+W++EK G+Y+VRSGY++A  
Subjt:  VENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLL-CPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACF

Query:  ----LRGRASSSNED-----------------------------SVG---RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIR
            ++  +SSS+E+                               G    KRG+ ++  C  C    E + H  W CK+ +  W  S F  L     +R
Subjt:  ----LRGRASSSNED-----------------------------SVG---RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIR

Query:  NAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI--WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICK
         + +       ++    FEE  V+ W LWN+RN +               +  W ++Y   FR       +G +     + W PP+   YK+NTDA+   
Subjt:  NAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI--WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICK

Query:  ETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGL
          +   LG +I + +G++M    K + ++Q VD  EA+A  + L +A E G+
Subjt:  ETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGL

XP_024042628.1 uncharacterized protein LOC112099434 [Citrus clementina]2.6e-6130.69Show/hide
Query:  NVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARF
        NV       I +   +K+V+ H+RYLGLP++   +K      +K ++ + I  W+H  FS+GGKEVLIK V QAVP Y MS F+LPV    +  +++AR+
Subjt:  NVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARF

Query:  WWGGEEEGKKVHWASWKKLCVPK---GEWIFCMAEFDVGEIVAKRG-LAVENRDGNLVNVIRDKWIPRSS----------------------TMRIIETN
        WWG +E+ + +HWASW KL   K   G     ++ F+   +VAK+G   ++N +  +  V++ K+    +                      T + I   
Subjt:  WWGGEEEGKKVHWASWKKLCVPK---GEWIFCMAEFDVGEIVAKRG-LAVENRDGNLVNVIRDKWIPRSS----------------------TMRIIETN

Query:  EIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDSVGRKRGMVVSIMCPRCRV
         +  D  VS L+  DN W   L+   F   DA+AI+ +P  R  + D+++WH+++ G Y+V+SGY++A  L+ +   +   SV                 
Subjt:  EIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDSVGRKRGMVVSIMCPRCRV

Query:  AEETTFHALWECKWVKRQWHFS----PFYPLPDPQSIRNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIWVSEYLSHFRT
          E  FHAL ECK  ++ W  +     F  +  P  +    +L+      M    FE  AV CW +W  RN  +      G +     + V+   +    
Subjt:  AEETTFHALWECKWVKRQWHFS----PFYPLPDPQSIRNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIWVSEYLSHFRT

Query:  FNGRRGSGEIV-----RREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVET
        +   + SG I      ++    W PP    +K+N DAAI KE     LGAVIRD+ G I+   +K      +V   EA A+R  L VA +AGL  L +ET
Subjt:  FNGRRGSGEIV-----RREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVET

Query:  DSARVAAMVRSKQNDYSE
        DS  VA ++ +++   +E
Subjt:  DSARVAAMVRSKQNDYSE

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]2.4e-6229.91Show/hide
Query:  PNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILAR
        P VDE +K  I   LGV +V FH+RYLGLPTV    K    K + +R+  ++ +W   F S  GK VL+KVV QA+PT+ M+ F+LP  + K     +A 
Subjt:  PNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILAR

Query:  FWWGGEEEGKKVHWASWKKLCVPKGE--------------------WIFCM----------------------AEFDVG------EIVAKRGLAVENR--
        FWWG  +  K +HW  W +LC  K +                    W   M                      A  +VG       ++  R L +     
Subjt:  FWWGGEEEGKKVHWASWKKLCVPKGE--------------------WIFCM----------------------AEFDVG------EIVAKRGLAVENR--

Query:  ---DGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLR
            G  + V  DKW+P   T R +  +  +L+++VS L+     WN  L++S F+  + + IL +P       D +VWH+ K G YTV+SG  LA  L+
Subjt:  ---DGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLR

Query:  -------GRASSSNEDSVG--------------------------------RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSI
               G ++SSN++                                    +R +  S +C RC   EETT H +W C W K+ W FS    +      
Subjt:  -------GRASSSNEDSVG--------------------------------RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSI

Query:  RNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIW-VSEYLSHFRTFNGRRGSGEIV-----RREGVRWSPPNSQNYKLNTD
         +  DL             E F+V+CW LW  RN     G      +    +W  +E+L++F+  + +R +  +      +R  V+W PP +   KLNTD
Subjt:  RNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIW-VSEYLSHFRTFNGRRGSGEIV-----RREGVRWSPPNSQNYKLNTD

Query:  AAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARVAAMVRSKQNDYS-EGG
        AAI  + K  +LG V+RD +GK+     K++     + A+EA+A+   LL+ +EAG   L VE+DS  V   +   + D S EGG
Subjt:  AAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARVAAMVRSKQNDYS-EGG

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]3.1e-6229.72Show/hide
Query:  RYPICSLQMTVF-----CFSGPGGGGNPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVL
        +Y I S Q+  F     CF         NV   +K  +A+ +GVK+V  + +YLGLP+     K    +F+K+R+W  +  WK SFFSA  KEVLIK ++
Subjt:  RYPICSLQMTVF-----CFSGPGGGGNPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVL

Query:  QAVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGEW------------IFCMAEFDVGEIVAKRGLAVENRDGNLVNVIRDK
        QA+PTY MSCF+LP   +   + + ARFWWG  E+  K+HW   K L      +             F       G+ + ++G      + N V V+ D 
Subjt:  QAVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGEW------------IFCMAEFDVGEIVAKRGLAVENRDGNLVNVIRDK

Query:  WIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDSVGR-
        W+PR  T +I +   +   + V  L  P  +W+   V++VF   DA+ IL MP   +   DK++WH+ K G Y+VRSGYR+A  L+ R   S+ ++  + 
Subjt:  WIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDSVGR-

Query:  --------------------------------KRGMVVSIMCPRCRVAE-ETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANSF
                                         R + V   C RC   E ET FH LW C+  +  W  + FY         +    L   S        
Subjt:  --------------------------------KRGMVVSIMCPRCRVAE-ETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANSF

Query:  EEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI--WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKI
        E F V+ W LW  RN+   G    G + +   I  W S+YLS FR  N  +  G+  R+E   W PP ++ +K+N D  +       +   V+RD  G +
Subjt:  EEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI--WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKI

Query:  MLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLL
              ++  +Q++  L+A     ++   K+ G+L
Subjt:  MLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLL

TrEMBL top hitse value%identityAlignment
A0A2N9ESC9 Uncharacterized protein1.7e-6628.22Show/hide
Query:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTTG-----YQECIPGWKQK-TESAGLRSHVLALRYP-ICSL----QMTVFC---------------
        MSKAYDRVEW ++++ ++ +G     + LIM C++T        E + G  +K T    +    L  R P I +L       +FC               
Subjt:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTTG-----YQECIPGWKQK-TESAGLRSHVLALRYP-ICSL----QMTVFC---------------

Query:  -FSGPGGGG----------NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYV
         +    G            + N  +  ++ I   LGV  +  +++YLGLP++    K+     +K+R+W+ +  WK    S  G+EVLIK V+QA+PTY 
Subjt:  -FSGPGGGG----------NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYV

Query:  MSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGE--------------------W------------IFCMAEFDVGEIVA-------
        M+CF+LPV+L KE   I+ RFWWG   E +K+HW  W+KLC  KGE                    W            +F    F  G I+A       
Subjt:  MSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGE--------------------W------------IFCMAEFDVGEIVA-------

Query:  --------------KRGLAVENRDGNLVNVIRDKWIPRSSTMRIIE-TNEIDLDMRVSRLL-CPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLV
                      + GL+    DG  + +    W+      R++     +  + RV+ L+     +WNV  VQ++F   DAEAIL++P       DK  
Subjt:  --------------KRGLAVENRDGNLVNVIRDKWIPRSSTMRIIE-TNEIDLDMRVSRLL-CPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLV

Query:  WHHEKHGLYTVRSGYRL----------------------------------ACFLRGRASSSNEDSVG-RKRGMVVSIMCPRCRVAEETTFHALWECKWV
        W   + G Y+VRSGYRL                                    FL      S    +G  KR +  + +C  CR   E   HALW+C  V
Subjt:  WHHEKHGLYTVRSGYRL----------------------------------ACFLRGRASSSNEDSVG-RKRGMVVSIMCPRCRVAEETTFHALWECKWV

Query:  KRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANS--FEEFAVMCWWLWNRRNTKVVGGGNLGGRDENG-WIWVSEYLSHFRTFNGRRGSGEIVRREGV
         + W  +P +      S  + ++L   C +  + +    E+FAV CW LW++RN   +    L   D +  W      L  +          E  +   V
Subjt:  KRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANS--FEEFAVMCWWLWNRRNTKVVGGGNLGGRDENG-WIWVSEYLSHFRTFNGRRGSGEIVRREGV

Query:  RWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARVAAMVRSK
        RW P  +  YK+N D AI KE+  G +G VIRD  G ++ T  + I     V+ +EA+A R +++ AKE G+  +E E DS  V   + S+
Subjt:  RWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARVAAMVRSK

A0A2N9F7C1 Uncharacterized protein1.5e-6524.93Show/hide
Query:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTT-------------------GYQE-----------CIPGWKQKTESAGLRSHVLAL---------
        MSKAYDRVEW ++++ ++ +G     + LIM C++T                   G ++           C  G       A L+  +  +         
Subjt:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTT-------------------GYQE-----------CIPGWKQKTESAGLRSHVLAL---------

Query:  ---------------RYPICSLQMTVFCFSGPGGGG-----------NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYI
                         P C     +        G            + N  + M++ +   LGV  +  +++YLGLP++    K+     +K+R+W+ +
Subjt:  ---------------RYPICSLQMTVFCFSGPGGGG-----------NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYI

Query:  HKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGE---WIFCMAEFDVGEIVAKRGLAVEN
          WK    S  G+E+LIK V+QA+PTY M+CF+LPV+L KE   I+ RFWWG   E +K+HW  W+KLC PKG        + +F++  +  +    + N
Subjt:  HKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGE---WIFCMAEFDVGEIVAKRGLAVEN

Query:  RDGNLVNVIRDKWIPRSSTMRIIETN---------------------------------------------------EIDLDMRVSRLL-CPDNSWNVGL
         +  L  V   K+ P  + M   E N                                                   ++  D RV  L+      WN+  
Subjt:  RDGNLVNVIRDKWIPRSSTMRIIETN---------------------------------------------------EIDLDMRVSRLL-CPDNSWNVGL

Query:  VQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRL----------------------------------ACFLRGRASSSNEDSVGR-KR
        VQS+F   DAEAIL++P       DK+ W   + G Y+VRSGY+L                                    FL      S   + G  +R
Subjt:  VQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRL----------------------------------ACFLRGRASSSNEDSVGR-KR

Query:  GMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIWV
         ++ + +C  C+   E + HALW C  V + W+ +P + +   +  R+ +DL+         N  E+ AV CW +WN+RN       +    ++   +W 
Subjt:  GMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIWV

Query:  -SEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLR
         ++ + H           +  +    RW  P +  YK+N D AI K++ SG +G VIRD+ G+++ T  + +     V+ +EA+A R +++ A+E G+  
Subjt:  -SEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLR

Query:  LEVETDSARV
        +EVE D+  +
Subjt:  LEVETDSARV

A0A2N9HS90 RNase H domain-containing protein8.5e-6627.53Show/hide
Query:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVT------------TGYQECIPGWKQKTESAGLRSHVLALRYPICSLQMTVFCFSGPGGGG------
        MSKAYDRVEW ++EK +  +G     + LI+ C++            TG+     G +Q       R+ +L  +     +Q  +  +    G        
Subjt:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVT------------TGYQECIPGWKQKTESAGLRSHVLALRYPICSLQMTVFCFSGPGGGG------

Query:  ----NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECN
            + N  + +++ I   LGV  +  +++YLGLP++    K+     +K+R+W+ +  WK    S  G+EVLIK V+QA+PTY M+CF+LPV+L KE  
Subjt:  ----NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECN

Query:  RILARFWWGGEEEGKKVHWASWKKLCVPKGE--------------------W------------IFCMAEFDVGEIV-----------------AK----
         I+ RFWWG   + +K+HW  W+K+C  KGE                    W            +F    F  G I+                 AK    
Subjt:  RILARFWWGGEEEGKKVHWASWKKLCVPKGE--------------------W------------IFCMAEFDVGEIV-----------------AK----

Query:  RGLAVENRDGNLVNVIRDKWIPRSSTMRIIE-TNEIDLDMRVSRLLCPDNS-WNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGY
         GL     DG  + +    W+      RI+     + +D RV  L+      WN+  +Q++F   D +AIL++P       D+L W   ++G Y+VRSGY
Subjt:  RGLAVENRDGNLVNVIRDKWIPRSSTMRIIE-TNEIDLDMRVSRLLCPDNS-WNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGY

Query:  RLAC----------------------FLRGRASS--------SNEDSVGRKRGM-----VVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDP
        +L C                        R R  +        ++ DS+  K G+       + +C  CR   E + HALW C  V + W  +P +     
Subjt:  RLAC----------------------FLRGRASS--------SNEDSVGRKRGM-----VVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDP

Query:  QSIRNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI------WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKL
         +  + +DL+     +     FE+FA   W LW++RN   +   +    D +  I       +SEYL+   T N      + ++   VRW PP+S  +K+
Subjt:  QSIRNAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI------WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKL

Query:  NTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARV
        N D AI +E  +G LG VIRD+ G ++ T  + +      + +EA+A R ++  A E G+  +E+E D+  V
Subjt:  NTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARV

A0A6J1DAR4 uncharacterized protein LOC1110189545.7e-7027.61Show/hide
Query:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTTGYQECIPGWKQKTESAGLRSHVLALRYPICSLQMTVFCFSGPGGG------------------G
        MSKAYDRVEW ++E  ++ +G +         C     +  +  +          S V AL     SL M V  F    GG                   
Subjt:  MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTTGYQECIPGWKQKTESAGLRSHVLALRYPICSLQMTVFCFSGPGGG------------------G

Query:  NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILA
         P   ++    I + L V +V    +YLGLPT  P  + M   ++KDR+W ++  WK   FS GGKEVLIK V QA+P Y MSCF+LP  L++E + I A
Subjt:  NPNVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILA

Query:  RFWWGGEEEGKKVHWASWKKLCVPKGEWIFCMAEFDV-----------------------------------------------------GEIVAKRGLA
        RFWWG  +E KK+HW +W  L +PK E      + ++                                                     G  + K+GL 
Subjt:  RFWWGGEEEGKKVHWASWKKLCVPKGEWIFCMAEFDV-----------------------------------------------------GEIVAKRGLA

Query:  VENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLL-CPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACF
            +G+ V +  D W+P   T++I+ +  + L  RVS L+   +  W   +V+  F   +A+ IL +P  R    D+L+W++EK G+Y+VRSGY++A  
Subjt:  VENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLL-CPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACF

Query:  ----LRGRASSSNED-----------------------------SVG---RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIR
            ++  +SSS+E+                               G    KRG+ ++  C  C    E + H  W CK+ +  W  S F  L     +R
Subjt:  ----LRGRASSSNED-----------------------------SVG---RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIR

Query:  NAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI--WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICK
         + +       ++    FEE  V+ W LWN+RN +               +  W ++Y   FR       +G +     + W PP+   YK+NTDA+   
Subjt:  NAADLLWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWI--WVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICK

Query:  ETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGL
          +   LG +I + +G++M    K + ++Q VD  EA+A  + L +A E G+
Subjt:  ETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGL

A0A803QQT2 Uncharacterized protein1.4e-6828.98Show/hide
Query:  NVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARF
        NV  +++  +A  LGV+ V  H +YLGLP+     K   L  +K+++WA +  WK S FS  GKEVLIKV++QA+PTY MSCF+LP   +   +R+ +RF
Subjt:  NVDEQMKQAIASTLGVKLVAFHDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARF

Query:  WWGGEEEGKKVHWASWKKLCVPKGE--------------------W---------------------------------IFCMAEFDVGEIVAKRGLAVE
        WWG  ++ KK+HW  W+ LC PK +                    W                                  F       G+ +  +G    
Subjt:  WWGGEEEGKKVHWASWKKLCVPKGE--------------------W---------------------------------IFCMAEFDVGEIVAKRGLAVE

Query:  NRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRG
          +G  V V+ D W+PR  T ++ +   +  ++ V+ L   D  W+ G ++S+F   D + IL +P   +   DK++WH+ K+G Y+V+SGYR+A     
Subjt:  NRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRG

Query:  RASSSNEDSVGR---------------------------------KRGMVVSIMCPRCRV-AEETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADL
            SNE S+ +                                 KRG+  S++C RC    +E+  HALWECK  K  W  S  Y         +   +
Subjt:  RASSSNEDSVGR---------------------------------KRGMVVSIMCPRCRV-AEETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADL

Query:  LWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIWVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNL
        L   +        E F ++ W +WN RNT V GG +   + E    W   +L+ FR   GR  S      E  RW PP      +N DA + +      L
Subjt:  LWWCSLNMIANSFEEFAVMCWWLWNRRNTKVVGGGNLGGRDENGWIWVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNL

Query:  GAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARVAAMVRSKQN
        G V+RD+ G ++     ++        LE MAI+  + V  +  L R  VETD  +   ++++K+N
Subjt:  GAVIRDSKGKIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETDSARVAAMVRSKQN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein6.8e-0721.37Show/hide
Query:  RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFY---PLPDPQSIRNAADLLWWCSLNMIANSFEEFAV--MCWWLWNRRNTKVVG-------
        R R +    +C RC + EET  H ++ C + +  W  +          P S  +  + L   S     NS + F    + W LW  RN  +         
Subjt:  RKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFY---PLPDPQSIRNAADLLWWCSLNMIANSFEEFAV--MCWWLWNRRNTKVVG-------

Query:  -GGNLGGRDENGWIWVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMA
             G +D   W+  +E   +    +      +  RR+  +W+PP     K N D+   + +     G  IR+  G I+L     +         EA+ 
Subjt:  -GGNLGGRDENGWIWVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDALEAMA

Query:  IRDSLLVAKEAGLLRLEVETDSARVAAMVRSKQN
           +L V    GL  +  E+DS  +  ++ + ++
Subjt:  IRDSLLVAKEAGLLRLEVETDSARVAAMVRSKQN

AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1)1.4e-0434.38Show/hide
Query:  LVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPD------NSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLA
        L  V +D WIP   T+       I L++R S L   D      N W +  +Q++   VD   IL +   R    D   W H K G YTV+SGY +A
Subjt:  LVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPD------NSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGLYTVRSGYRLA

AT3G09510.1 Ribonuclease H-like superfamily protein2.5e-0923.31Show/hide
Query:  GEIVAKRGLAVENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNS---WNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGL
        G  + K+G      DG  + +  D  I  S   R + T E   +M ++ L     S   W+   +       D   I  +   +   PDK++W++   G 
Subjt:  GEIVAKRGLAVENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNS---WNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHGL

Query:  YTVRSGY----------------------------------RLACFLRGRASSSNEDSVGR--KRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFS
        YTVRSGY                                  +L  FL  RA S    +  R   RGM +   CPRC    E+  HAL+ C +    W  S
Subjt:  YTVRSGY----------------------------------RLACFLRGRASSSNEDSVGR--KRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFS

Query:  PFYPLPDPQSIRNAADLLWWCSLNMIANSFEEF---------------------AVMCWWLWNRRNTKVVGGGN-------LGGRDENGWIWVSEYLSHF
              D   IRN           +++N FEE                        + W +W  RN  V            L  + E    W++   SH 
Subjt:  PFYPLPDPQSIRNAADLLWWCSLNMIANSFEEF---------------------AVMCWWLWNRRNTKVVGGGN-------LGGRDENGWIWVSEYLSHF

Query:  RTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKG-KIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETD
        +T +  R   E      + W  P +   K N DA    +      G +IR+  G  I    MKL H    ++A E  A+  +L      G  ++ +E D
Subjt:  RTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKG-KIMLTFMKLIHHVQDVDALEAMAIRDSLLVAKEAGLLRLEVETD

AT4G29090.1 Ribonuclease H-like superfamily protein5.4e-1222.45Show/hide
Query:  AVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGE--------------------W---------------------------
        A+PTY M+CF LP ++ K+   +LA FWW  ++E K +HW +W  L   K E                    W                           
Subjt:  AVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGE--------------------W---------------------------

Query:  ------IFCMAEFDVGEIVAKRGLAVENRDGNLVNVIRDKWI---PRSSTMRI-----IETNEIDLDMRVSRLLCPD-NSWNVGLVQSVFQGVDAEAILE
               F        + + ++G      +G  + + R KW+   P S+ +R+      E   +   ++VS L+      W   +++ +F  V+ + I E
Subjt:  ------IFCMAEFDVGEIVAKRGLAVENRDGNLVNVIRDKWI---PRSSTMRI-----IETNEIDLDMRVSRLLCPD-NSWNVGLVQSVFQGVDAEAILE

Query:  MPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDS
        +        D   W +   G YTV+SGY +   +  + SS  E S
Subjt:  MPRRRFPTPDKLVWHHEKHGLYTVRSGYRLACFLRGRASSSNEDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAGGCCTACGATAGAGTGGAGTGGTTCTATATAGAGAAGTTTCTGATTGCCTTGGGGTTGGAAGGCCATCTGGTTAGACTGATTATGAGATGTGTGACC
ACGGGTTATCAAGAATGTATTCCTGGCTGGAAACAAAAAACAGAGTCTGCAGGCTTAAGATCTCACGTTTTAGCCCTTCGATATCCCATTTGTTCTTTGCAGATG
ACTGTTTTTTGTTTTTCAGGGCCAGGGGGCGGAGGCAATCCAAATGTGGATGAGCAAATGAAACAAGCTATCGCCTCGACGTTGGGGGTGAAGCTTGTTGCTTTT
CATGATCGGTACCTAGGTCTCCCAACGGTTTTTCCAGGCCGAAAGGTGATGTCACTGAAGTTTGTGAAGGACCGAATGTGGGCATATATTCATAAGTGGAAGCAT
TCTTTTTTCTCAGCAGGGGGAAAGGAAGTTCTTATAAAAGTCGTACTACAGGCAGTTCCTACTTATGTTATGTCTTGCTTTCAGTTGCCAGTAAGTTTAGTTAAA
GAATGTAATAGAATTCTTGCTAGATTTTGGTGGGGAGGTGAGGAGGAAGGGAAAAAGGTGCACTGGGCATCGTGGAAGAAGTTGTGTGTGCCTAAAGGGGAGTGG
ATCTTTTGTATGGCGGAGTTTGATGTGGGGGAGATCGTTGCTAAAAGAGGGCTTGCGGTGGAGAATAGGGATGGGAATCTAGTTAACGTCATTCGAGATAAGTGG
ATTCCTAGGAGCTCAACTATGAGGATTATTGAGACCAATGAGATTGATCTTGATATGAGGGTCAGTAGGCTCCTATGCCCAGATAACTCATGGAATGTGGGCTTA
GTCCAGTCGGTTTTTCAAGGGGTGGATGCTGAAGCCATTCTTGAAATGCCAAGGCGACGGTTTCCAACCCCTGATAAGCTGGTTTGGCATCATGAGAAGCATGGA
TTGTATACGGTTCGAAGCGGCTATAGGTTGGCATGTTTTTTGAGGGGTAGAGCGAGCAGTTCCAATGAGGACTCGGTTGGGAGGAAGAGGGGGATGGTGGTGTCT
ATTATGTGCCCTAGATGTAGGGTGGCGGAGGAGACGACTTTCCATGCTCTTTGGGAGTGCAAGTGGGTGAAGAGGCAGTGGCATTTTTCCCCTTTTTACCCGCTA
CCCGACCCTCAATCTATTAGAAATGCTGCTGATTTGCTATGGTGGTGTAGCTTGAATATGATTGCGAATTCTTTTGAGGAATTTGCTGTTATGTGTTGGTGGTTG
TGGAATAGGAGGAATACAAAGGTTGTTGGTGGAGGGAATTTGGGAGGAAGGGATGAGAATGGTTGGATTTGGGTCTCTGAGTATCTTAGCCACTTTAGGACCTTT
AATGGAAGAAGGGGGTCGGGGGAAATAGTCCGAAGGGAAGGGGTTCGGTGGTCACCACCTAATTCACAAAACTATAAGCTCAACACAGATGCAGCTATATGTAAA
GAAACAAAATCCGGCAATCTAGGGGCAGTTATCCGAGATTCTAAAGGGAAGATAATGCTTACCTTTATGAAATTGATCCATCATGTGCAAGATGTGGATGCTCTA
GAGGCTATGGCCATCCGTGATAGCTTGCTTGTTGCTAAGGAAGCTGGTCTGCTGAGGTTGGAGGTGGAAACGGACTCTGCTCGTGTGGCGGCCATGGTTCGGTCG
AAGCAGAACGATTACTCTGAGGGAGGTGAATCGGGTGGCACACATGGCGGCGAGGCAAGTGATGGAGCTGGGCATTCAGGGTGTGTGGTTGGAGGAGACGCCGGC
GTCTTTGGAGGAGGTGTATCGCAGCGAAATTTTGGACAGTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAGGCCTACGATAGAGTGGAGTGGTTCTATATAGAGAAGTTTCTGATTGCCTTGGGGTTGGAAGGCCATCTGGTTAGACTGATTATGAGATGTGTGACC
ACGGGTTATCAAGAATGTATTCCTGGCTGGAAACAAAAAACAGAGTCTGCAGGCTTAAGATCTCACGTTTTAGCCCTTCGATATCCCATTTGTTCTTTGCAGATG
ACTGTTTTTTGTTTTTCAGGGCCAGGGGGCGGAGGCAATCCAAATGTGGATGAGCAAATGAAACAAGCTATCGCCTCGACGTTGGGGGTGAAGCTTGTTGCTTTT
CATGATCGGTACCTAGGTCTCCCAACGGTTTTTCCAGGCCGAAAGGTGATGTCACTGAAGTTTGTGAAGGACCGAATGTGGGCATATATTCATAAGTGGAAGCAT
TCTTTTTTCTCAGCAGGGGGAAAGGAAGTTCTTATAAAAGTCGTACTACAGGCAGTTCCTACTTATGTTATGTCTTGCTTTCAGTTGCCAGTAAGTTTAGTTAAA
GAATGTAATAGAATTCTTGCTAGATTTTGGTGGGGAGGTGAGGAGGAAGGGAAAAAGGTGCACTGGGCATCGTGGAAGAAGTTGTGTGTGCCTAAAGGGGAGTGG
ATCTTTTGTATGGCGGAGTTTGATGTGGGGGAGATCGTTGCTAAAAGAGGGCTTGCGGTGGAGAATAGGGATGGGAATCTAGTTAACGTCATTCGAGATAAGTGG
ATTCCTAGGAGCTCAACTATGAGGATTATTGAGACCAATGAGATTGATCTTGATATGAGGGTCAGTAGGCTCCTATGCCCAGATAACTCATGGAATGTGGGCTTA
GTCCAGTCGGTTTTTCAAGGGGTGGATGCTGAAGCCATTCTTGAAATGCCAAGGCGACGGTTTCCAACCCCTGATAAGCTGGTTTGGCATCATGAGAAGCATGGA
TTGTATACGGTTCGAAGCGGCTATAGGTTGGCATGTTTTTTGAGGGGTAGAGCGAGCAGTTCCAATGAGGACTCGGTTGGGAGGAAGAGGGGGATGGTGGTGTCT
ATTATGTGCCCTAGATGTAGGGTGGCGGAGGAGACGACTTTCCATGCTCTTTGGGAGTGCAAGTGGGTGAAGAGGCAGTGGCATTTTTCCCCTTTTTACCCGCTA
CCCGACCCTCAATCTATTAGAAATGCTGCTGATTTGCTATGGTGGTGTAGCTTGAATATGATTGCGAATTCTTTTGAGGAATTTGCTGTTATGTGTTGGTGGTTG
TGGAATAGGAGGAATACAAAGGTTGTTGGTGGAGGGAATTTGGGAGGAAGGGATGAGAATGGTTGGATTTGGGTCTCTGAGTATCTTAGCCACTTTAGGACCTTT
AATGGAAGAAGGGGGTCGGGGGAAATAGTCCGAAGGGAAGGGGTTCGGTGGTCACCACCTAATTCACAAAACTATAAGCTCAACACAGATGCAGCTATATGTAAA
GAAACAAAATCCGGCAATCTAGGGGCAGTTATCCGAGATTCTAAAGGGAAGATAATGCTTACCTTTATGAAATTGATCCATCATGTGCAAGATGTGGATGCTCTA
GAGGCTATGGCCATCCGTGATAGCTTGCTTGTTGCTAAGGAAGCTGGTCTGCTGAGGTTGGAGGTGGAAACGGACTCTGCTCGTGTGGCGGCCATGGTTCGGTCG
AAGCAGAACGATTACTCTGAGGGAGGTGAATCGGGTGGCACACATGGCGGCGAGGCAAGTGATGGAGCTGGGCATTCAGGGTGTGTGGTTGGAGGAGACGCCGGC
GTCTTTGGAGGAGGTGTATCGCAGCGAAATTTTGGACAGTTTTAG
Protein sequenceShow/hide protein sequence
MSKAYDRVEWFYIEKFLIALGLEGHLVRLIMRCVTTGYQECIPGWKQKTESAGLRSHVLALRYPICSLQMTVFCFSGPGGGGNPNVDEQMKQAIASTLGVKLVAF
HDRYLGLPTVFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQAVPTYVMSCFQLPVSLVKECNRILARFWWGGEEEGKKVHWASWKKLCVPKGEW
IFCMAEFDVGEIVAKRGLAVENRDGNLVNVIRDKWIPRSSTMRIIETNEIDLDMRVSRLLCPDNSWNVGLVQSVFQGVDAEAILEMPRRRFPTPDKLVWHHEKHG
LYTVRSGYRLACFLRGRASSSNEDSVGRKRGMVVSIMCPRCRVAEETTFHALWECKWVKRQWHFSPFYPLPDPQSIRNAADLLWWCSLNMIANSFEEFAVMCWWL
WNRRNTKVVGGGNLGGRDENGWIWVSEYLSHFRTFNGRRGSGEIVRREGVRWSPPNSQNYKLNTDAAICKETKSGNLGAVIRDSKGKIMLTFMKLIHHVQDVDAL
EAMAIRDSLLVAKEAGLLRLEVETDSARVAAMVRSKQNDYSEGGESGGTHGGEASDGAGHSGCVVGGDAGVFGGGVSQRNFGQF