; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039114 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039114
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:36324760..36328155
RNA-Seq ExpressionLag0039114
SyntenyLag0039114
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]4.5e-4727.99Show/hide
Query:  ENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFD
        ++ S+K EKL + + +   +  I+    E  ++ +  + + K +T K IN E FKS +  IW  + EV  + +G N+F+  FQN   R++++EGGPW FD
Subjt:  ENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFD

Query:  RGLLIFEEIRGHERYTSINFR-------------------------------------RTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQV
        + LL+  E  G E+ T + FR                                      +G+ C G+ +R+RV I+V NPLKR +++ +G   +   + +
Subjt:  RGLLIFEEIRGHERYTSINFR-------------------------------------RTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQV

Query:  KYEKLPDYCYGCGRIGHQLRDCEETGK--NSGEELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRG---SIRGRGRNCDSTNRDNSEEEEDSEKDDD
         YE+LP++CY CG+IGH +RDC    K   S    +FGPW++  S  +S+G    +     +R+G       ++R +G    +  +D+S    D E+ D 
Subjt:  KYEKLPDYCYGCGRIGHQLRDCEETGK--NSGEELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRG---SIRGRGRNCDSTNRDNSEEEEDSEKDDD

Query:  RNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERT--VTEQD--------KKEKVRESAEGNGQGIPSNI-AKEPVKSKQEAESVSTNHSN
              +L SG                   T+E   T T + RT  V++Q+         KEK+ E +  N + + + +    PV        +  N SN
Subjt:  RNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERT--VTEQD--------KKEKVRESAEGNGQGIPSNI-AKEPVKSKQEAESVSTNHSN

Query:  QKREDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRT
        Q+    S++   K  ++K  +           R++R   G            +++   + LG       +                 +Y++R  +KI   
Subjt:  QKREDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRT

Query:  LNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGG
            + F V   G+GGGL LLWK +++VSI ++++GHIDA+IK +    WRF+GFYG P    R  SW+L+ +L    +L W++ GDFNEI+   EKKGG
Subjt:  LNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGG

Query:  AKRNPRQANSW
          R+    +S+
Subjt:  AKRNPRQANSW

TXG72251.1 hypothetical protein EZV62_000830 [Acer yangbiense]7.1e-4529.66Show/hide
Query:  EKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFEEIRGHERY
        E++++ GI+D D            V  +L+ KK+N E FK +M +IW+    V  + VG N+F   F NK+ R +V + GPW+F   L+  E++ G    
Subjt:  EKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFEEIRGHERY

Query:  TSINFRRTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDYCYGCGRIGHQLRD-CEETGKNS---GEELQFGPWLKQDSPIKSR
             +   ++ G                   +K++ GT  E   + +KYE+LPD+CY CGRIGH +++  +E  + +   G   +FG WLK  +  K +
Subjt:  TSINFRRTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDYCYGCGRIGHQLRD-CEETGKNS---GEELQFGPWLKQDSPIKSR

Query:  GKERPEQEKGLNRQGRGRGSIRGRGRNCD-STNRDNSEEEEDSE-------------------KDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRTVE
         +         N QG G  S R R  +   +T  D S                          K+    T  +  GSG   A     DES       + E
Subjt:  GKERPEQEKGLNRQGRGRGSIRGRGRNCD-STNRDNSEEEEDSE-------------------KDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRTVE

Query:  KFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIAKEPVKSKQEAESVSTNHSNQKREDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCW
          Q +  +   V+   +       +  + +   +   K   +  Q  +      S   R  D K+   KT         S+     LL V    +     
Subjt:  KFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIAKEPVKSKQEAESVSTNHSNQKREDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCW

Query:  TAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKK
          PS TM  LSWN R LGNPRA  AL  L++ +   +VFL ETK N    E+I+ +  F    CV + G  GGLLLLWK  + VS+ ++   HIDA + +
Subjt:  TAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKK

Query:  AHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEK
          GF WRF+ FYG P   KR  SW L+  L + D+L W+ GGDFNE ++  +K
Subjt:  AHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEK

XP_018816058.1 uncharacterized protein LOC108987582 [Juglans regia]1.7e-4628.18Show/hide
Query:  INTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFEEIRGHERYTSINFRR------------------TGKM----
        +N + FKS M KIW  EG +  K VG N     FQ  + + KV+ G PW+FDR L+  +E+ G      I F R                   G M    
Subjt:  INTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFEEIRGHERYTSINFRR------------------TGKM----

Query:  -------------CG-GETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDYCYGCGRIGHQLRDCEETGKNSGE-ELQFGPWLKQDSPIKSR
                     CG G  LR +V++++  PL R   + V     ++WI  KYE+LP +C+ CG I H  +    T  N  E   Q+GPWL+  +P+K  
Subjt:  -------------CG-GETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDYCYGCGRIGHQLRDCEETGKNSGE-ELQFGPWLKQDSPIKSR

Query:  GKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRNTGPAQLGSGGEPAAGK-----GNDESSWELCPRTVEKFQTATCIERTVTEQ
        G +      G  +  + + S        +S  R   E +E      D  T   +  S  E   G       ND       P  +E  + +   +  V+ +
Subjt:  GKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRNTGPAQLGSGGEPAAGK-----GNDESSWELCPRTVEKFQTATCIERTVTEQ

Query:  DKKEKVRESAEGN-----------------GQGIPSNIAKEPVKSKQEAESVS--TNHSNQKREDDSKEKTAKTQSEKEGR---GSSKETKTQLLRVSRR
        D+     ES +                   G  +  +I+       Q+ + +S      + K     K       S   G+   G+ ++ + +LL VS  
Subjt:  DKKEKVRESAEGN-----------------GQGIPSNIAKEPVKSKQEAESVS--TNHSNQKREDDSKEKTAKTQSEKEGR---GSSKETKTQLLRVSRR

Query:  D------------------IGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLL
        +                  +    W  P  TM  +SWN RGLGNPR +  L  L++   P ++F +ETK ++   E++ + L+F++   V ++G  GGL 
Subjt:  D------------------IGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLL

Query:  LLWKKEMDVSISTYSEGHIDAIIKKA--HGFWRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSWTTTTLET
        L+WK+ +D+SI  YS  HI A +K+      W  +GFYG+PET KR  SW+L++ +   + +AWL  GDFNEI   +EK GG  R  RQ   +    +E 
Subjt:  LLWKKEMDVSISTYSEGHIDAIIKKA--HGFWRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSWTTTTLET

Query:  SSHGGGEETKGTKF
        S       T+G+KF
Subjt:  SSHGGGEETKGTKF

XP_042939567.1 uncharacterized protein LOC122274609 [Carya illinoinensis]3.5e-4429.1Show/hide
Query:  WTVQGICVCLIGDLWRSAKGVTIVMEENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVG
        +T  G+ +C       +  G  ++MEE  ++  E+LK+SE+E   V G++   +E+       + V KI +E+KI  E  +S M KI  +E  +  +++G
Subjt:  WTVQGICVCLIGDLWRSAKGVTIVMEENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVG

Query:  TNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFEEIRGHERYTSINFRRTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDYCYG
         N F  TF +KK + KVM G PW FD  L + ++  G  +   I+F            +V   I + N     + I +G    +    V+   + +    
Subjt:  TNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFEEIRGHERYTSINFRRTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDYCYG

Query:  CGRIGHQLRDCEETGK--NSGEELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRNTGPAQLGSGGEP
         GR+       E+ G+  + G +LQ+G W++    I    +    + K ++ +    GS    G    S +   S + ++ EK+D+           G  
Subjt:  CGRIGHQLRDCEETGK--NSGEELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRNTGPAQLGSGGEP

Query:  AAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIA----KEPVKSKQEAESVSTNHSNQKREDDSKEKTAKTQSEKEGR
          GKG + S        +   +    +ER V +  + E      E + Q I  N+A     E ++ +   E+V  +  N                  EG+
Subjt:  AAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIA----KEPVKSKQEAESVSTNHSNQKREDDSKEKTAKTQSEKEGR

Query:  GSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLL
         +S + + +    S +  GGGC    S  MK +SWN RG+GNPR ++ L  L   NK  +VFLIET       E++KR L  E  F V + GK GGL L 
Subjt:  GSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLL

Query:  WKKEMDVSISTYSEGHIDA-IIKKAHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSW
        W+   +V I +YS  HI+A + ++  G  W F+GFY +PET KR FSW+L+ +L  R+   W + GDFNEI+ ++EK GG +R   Q N +
Subjt:  WKKEMDVSISTYSEGHIDA-IIKKAHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSW

XP_042954615.1 uncharacterized protein LOC122291031 [Carya illinoinensis]1.9e-5029.65Show/hide
Query:  MEENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCK-----ILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVME
        MEE  S++ EKL ++E+EK      E   L  H+   G           I+ EK IN E FKS M KIW     +   +VG N F   F + +  ++V+E
Subjt:  MEENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCK-----ILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVME

Query:  GGPWNFDRGLLIFEEIRGHERYTSINFRR------------------TGKMCG------------------GETLRVRVKIEVQNPLKRAVKIRVGTMAE
        G PW FDR LL  +   G +    I F +                  TG + G                  G+ LR+RV++++   L R   + V    +
Subjt:  GGPWNFDRGLLIFEEIRGHERYTSINFRR------------------TGKMCG------------------GETLRVRVKIEVQNPLKRAVKIRVGTMAE

Query:  EEWIQVKYEKLPDYCYGCGRIGHQLRDCEETGKNSGEELQFGPWLK------QDSPIKSRGK----ERPEQEKGLNRQGRGRGSIRG---------RGRN
        + W+  KYE+LP  C+ CG I H    C       G + Q+G WL+       D  +K  G+    E    ++   R+  GR  ++G         + RN
Subjt:  EEWIQVKYEKLPDYCYGCGRIGHQLRDCEETGKNSGEELQFGPWLK------QDSPIKSRGK----ERPEQEKGLNRQGRGRGSIRG---------RGRN

Query:  CDSTNRD----NSEEEEDSEKDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRE----SAEGNGQGIP-SNIAK
          S   D    +  + + +E+   +   P Q+G+  +  AGKG    +    P   EK      +ER + EQ +   +      + E    GI  + I+ 
Subjt:  CDSTNRD----NSEEEEDSEKDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRE----SAEGNGQGIP-SNIAK

Query:  EPVKSKQEAESVSTNH---SNQKREDDS---KEKTAKTQSEKEGRG--------SSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIR
        + V      ES  ++     NQ ++D +   +E+  KT++   G G        S K  K+ +L+ S+R        AP  +++ +SWN+RGLGNP  +R
Subjt:  EPVKSKQEAESVSTNH---SNQKREDDS---KEKTAKTQSEKEGRG--------SSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIR

Query:  ALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIK---KAHGFWRFSGFYGNPETEKRQF
         LR L     P I+FL ET+ +    EKIK  L F N   V N   GGG+ LLWK  +D+ I+ YS+ HI A IK   +    W  +G YG+PE  +R  
Subjt:  ALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIK---KAHGFWRFSGFYGNPETEKRQF

Query:  SWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSWTTTTLE
        +W+L++ L      AWL+ GDFNEI+   EK+GG  R   Q   ++   +E
Subjt:  SWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSWTTTTLE

TrEMBL top hitse value%identityAlignment
A0A2N9G0G1 Uncharacterized protein3.3e-4827.61Show/hide
Query:  ENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFD
        E+     EK  +SE+E  +V      DL             K LT + +N E+       +W  +     + +  N     F+++  R +VM G PW +D
Subjt:  ENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFD

Query:  RGLLIFEEIRGHERYTSINFRRT--------------------------GKMC----------GGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVK
        + L+I + I   E    + F  T                          GK+           GG+ +R+RV I++  PL R  K  +     E WI  K
Subjt:  RGLLIFEEIRGHERYTSINFRRT--------------------------GKMC----------GGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVK

Query:  YEKLPDYCYGCGRIGHQLRDCEETGKNSG----EELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRN
        YE+LP++CY CG + H  +DC    +N      E+ QFGPWL+  +       ERP ++  +  +                 +   ++++  +  D + N
Subjt:  YEKLPDYCYGCGRIGHQLRDCEETGKNSG----EELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRN

Query:  TGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIAKEPVKSKQEAESVSTNHSNQKREDDSKEKTAK
         G                                T+   ER   EQ  ++++R+         P N+   P        S+S +  N +        TA 
Subjt:  TGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIAKEPVKSKQEAESVSTNHSNQKREDDSKEKTAK

Query:  TQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKG
         Q +++   S K++  +L R             PS  M  L+WN RGLGN R ++ +  LV +  P +VFLIET  +E   E+++  L FEN F   ++ 
Subjt:  TQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKG

Query:  KGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAH-GFWRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQ
        KGGGL LLWKKE+++ + ++S  HIDA++ +     WRF+GFYG PET  R+ SWNL+ +L+ +  L W   GDFNE+V   EK+G   R+ RQ
Subjt:  KGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAH-GFWRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQ

A0A2N9G7B6 Uncharacterized protein2.7e-5027.77Show/hide
Query:  EKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFE
        EK  +SE+E  +V      DL             K LT + +N E+       +W  +     + +  N     F+++  R +VM G PW +D+ L+I +
Subjt:  EKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFE

Query:  EIRGHERYTSINFRRT--------------------------GKMC----------GGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDY
         I   E    + F  T                          GK+            G+ +R+RV I++  PL R  K  +     E WI  KYE+LP++
Subjt:  EIRGHERYTSINFRRT--------------------------GKMC----------GGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDY

Query:  CYGCGRIGHQLRDCEETGKNSG----EELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRNTGPAQLG
        CY CG + H  +DC    +N      E+ QFGPWL+  +       ERP ++  +  +G                 + +  ++         NT   Q  
Subjt:  CYGCGRIGHQLRDCEETGKNSG----EELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRNTGPAQLG

Query:  SGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIAKEPVKSKQEAESVSTNHSNQKREDDSKEKTAKTQSEKEG
        S  +P                      T+  I   +  Q                 P   + + + S   + +V   H    ++D     T     E  G
Subjt:  SGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIAKEPVKSKQEAESVSTNHSNQKREDDSKEKTAKTQSEKEG

Query:  RGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLL
                      S+  IGGGC   P   M  L+WN RGLGNPR ++ +  LV +  P +VFLIET  +E   E+++  L FEN F   ++ KGGGL L
Subjt:  RGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLL

Query:  LWKKEMDVSISTYSEGHIDAIIKKAH-GFWRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQ
        LWKKE+++ + ++S  HIDA++ +     WRF+GFYG PET  R+ SWNL+ +L+ +  L W   GDFNE+V   EK+G   R+ RQ
Subjt:  LWKKEMDVSISTYSEGHIDAIIKKAH-GFWRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQ

A0A5C7H9Y2 CCHC-type domain-containing protein2.2e-4727.99Show/hide
Query:  ENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFD
        ++ S+K EKL + + +   +  I+    E  ++ +  + + K +T K IN E FKS +  IW  + EV  + +G N+F+  FQN   R++++EGGPW FD
Subjt:  ENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFD

Query:  RGLLIFEEIRGHERYTSINFR-------------------------------------RTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQV
        + LL+  E  G E+ T + FR                                      +G+ C G+ +R+RV I+V NPLKR +++ +G   +   + +
Subjt:  RGLLIFEEIRGHERYTSINFR-------------------------------------RTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQV

Query:  KYEKLPDYCYGCGRIGHQLRDCEETGK--NSGEELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRG---SIRGRGRNCDSTNRDNSEEEEDSEKDDD
         YE+LP++CY CG+IGH +RDC    K   S    +FGPW++  S  +S+G    +     +R+G       ++R +G    +  +D+S    D E+ D 
Subjt:  KYEKLPDYCYGCGRIGHQLRDCEETGK--NSGEELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRG---SIRGRGRNCDSTNRDNSEEEEDSEKDDD

Query:  RNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERT--VTEQD--------KKEKVRESAEGNGQGIPSNI-AKEPVKSKQEAESVSTNHSN
              +L SG                   T+E   T T + RT  V++Q+         KEK+ E +  N + + + +    PV        +  N SN
Subjt:  RNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERT--VTEQD--------KKEKVRESAEGNGQGIPSNI-AKEPVKSKQEAESVSTNHSN

Query:  QKREDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRT
        Q+    S++   K  ++K  +           R++R   G            +++   + LG       +                 +Y++R  +KI   
Subjt:  QKREDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRT

Query:  LNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGG
            + F V   G+GGGL LLWK +++VSI ++++GHIDA+IK +    WRF+GFYG P    R  SW+L+ +L    +L W++ GDFNEI+   EKKGG
Subjt:  LNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAHGF-WRFSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGG

Query:  AKRNPRQANSW
          R+    +S+
Subjt:  AKRNPRQANSW

A0A7N2LIH6 Uncharacterized protein1.3e-4728.48Show/hide
Query:  VTIVMEENFSQKMEKLKISEKEKAQVIGIEDEDLE------EHDKGIGET-AVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKG
        V I+M E   +  +KL ++E         EDED++         K +G+   V KILT++ +  E  K  M  +W     +   ++G +LF   F + + 
Subjt:  VTIVMEENFSQKMEKLKISEKEKAQVIGIEDEDLE------EHDKGIGET-AVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKG

Query:  RRKVMEGGPWNFDRGLLIFEEIRGHERYTSINFRRT--------------------------GKMC----------GGETLRVRVKIEVQNPLKRAVKIR
        ++KVME  PW++++ L++ +E  G      I  + T                          GK+            G+ LRVR++ +    L R  K+ 
Subjt:  RRKVMEGGPWNFDRGLLIFEEIRGHERYTSINFRRT--------------------------GKMC----------GGETLRVRVKIEVQNPLKRAVKIR

Query:  VGTMAEEEWIQVKYEKLPDYCYGCGRIGHQLRDCEE--TGKNSGEE--LQFGPWLKQDSPIKSRGK-----------ERPEQEKGLNRQGRGRGSIRGRG
        +    E  W+  KYE+LP++CY CGR+ H  +DC E   G+N G+E   Q+G WL+ + P +S G+           ER +  +    + + R  ++ R 
Subjt:  VGTMAEEEWIQVKYEKLPDYCYGCGRIGHQLRDCEE--TGKNSGEE--LQFGPWLKQDSPIKSRGK-----------ERPEQEKGLNRQGRGRGSIRGRG

Query:  RNCDSTNRDNSEEEEDSEKDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRT------VEKFQTATCIE------RTVTEQDKKEKVR-ESAEGNGQGI
        +   +  R +  E+   +KD  R +     G G      KG      EL           EKF+    I         V  +D ++K++ E    +   +
Subjt:  RNCDSTNRDNSEEEEDSEKDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRT------VEKFQTATCIE------RTVTEQDKKEKVR-ESAEGNGQGI

Query:  PSNIAKEPVKSKQEAESVS-TNHSNQKREDDSKEKTAKTQSEKEGRGS----------------SKETKTQLLRVS----RRD---IGGGCWTAPSDTMK
         S   +E  K KQ  +     N  NQ   ++     A T  +++G  S                +K++      VS    R+D    GGGC  AP  +M 
Subjt:  PSNIAKEPVKSKQEAESVS-TNHSNQKREDDSKEKTAKTQSEKEGRGS----------------SKETKTQLLRVS----RRD---IGGGCWTAPSDTMK

Query:  TLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKA--HGFWR
         L+WN RGLG   A+R L   V+   P +VFL+ETK +    +  +  L F  G  VP+ G+ GGL LLWK+  D+   + S  HID ++  A   G WR
Subjt:  TLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKA--HGFWR

Query:  FSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSW
         +GFYG+P+T KR  SW L+E L+ + ++ WL+ GDFNEIV   EK G   R+  Q +++
Subjt:  FSGFYGNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSW

A0A7N2R0C3 Reverse transcriptase domain-containing protein3.0e-4927.5Show/hide
Query:  MEENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWN
        M E      ++LK++E+E+  ++ + DE +    +   +    K+++ K +  E  +  +  +W     +    +G  LF   F++++ +R+VM+  PW+
Subjt:  MEENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNLEGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWN

Query:  FDRGLLIFEEIRGHERYTSI------------------NFRRTGKMCG------------------GETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQ
        +++ L++F+E  G E    I                    + TGK  G                  G  LRVRV+I+V   L R  KI +    E  W+ 
Subjt:  FDRGLLIFEEIRGHERYTSI------------------NFRRTGKMCG------------------GETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQ

Query:  VKYEKLPDYCYGCGRIGHQLRDC-EETGKN-SGEE--LQFGPWLKQDSPIKSRGKERPEQEKGL-----NRQGRGRGSIRGRGRNCDSTNRDNSEEEEDS
         KYE+LP++CY CG + H L+DC EE GK+ +GEE  LQ+G WL+ + PI+  G +    +K +     N++       +GR    +   R+  E E  S
Subjt:  VKYEKLPDYCYGCGRIGHQLRDC-EETGKN-SGEE--LQFGPWLKQDSPIKSRGKERPEQEKGL-----NRQGRGRGSIRGRGRNCDSTNRDNSEEEEDS

Query:  EKDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNG------QGIPSNIAKE-----------------
           D R      L  GGE    K  +    E    ++ +  +   +E         E+ RE   GNG      +G   N+ K                  
Subjt:  EKDDDRNTGPAQLGSGGEPAAGKGNDESSWELCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNG------QGIPSNIAKE-----------------

Query:  -------PVKSKQ-------------EAESVSTNHSNQKR------EDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWN
               P K+K               A  +  +  + KR      + + KE  +  Q +++G  + +E    +    RR         P   M  L+WN
Subjt:  -------PVKSKQ-------------EAESVSTNHSNQKR------EDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAPSDTMKTLSWN

Query:  ARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAHGF--WRFSGFY
         RG+G+  A+RAL   V++  P +VFL ETK ++R  + ++R L    G  VP+ G+ GGL +LW++ +DVS+ + S  HID ++  ++G   WR +GFY
Subjt:  ARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAHGF--WRFSGFY

Query:  GNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQ
        G+P+   R  SW L+E LS + ++ W++ GDFNEI++  EK G  +R+ RQ
Subjt:  GNPETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACCAGGAAACCAACCCGGAAGAAGGTCAGACCAAAGGGTTGGTCGAGGCCGACCATTCAGCCTGCTTGCGAGGGCTGAAATCGTTCGCCTCGACTCAGTCCTTGC
TGCCTCTGGCCGCCCCGGGTCCGCCTGGTCCGTCCCAAAACGCCTTCGAATTCCTAAAAGCCCTAACGAAAGACACAGCATCGAAGTGCAGGTGATCTACACCACATCGG
TGTGCAGTGGTTCTTACTGGTCTTGCAGGTCACGTTTTCTCCGTCTTCAACAAATTCATTGCTGGTGTCACATGAAGGTTTTTAACTTGCCACACACAACTGAAATCGTG
GATAATGAAGAAGAAACACAAGGTACGAGGATAGCATACCAGAGGGTCGTGCGAAAGGAGTGGACAGTGCAGGGCATATGTGTATGCTTAATCGGAGATCTGTGGAGATC
TGCAAAGGGAGTTACGATAGTTATGGAAGAAAACTTCAGTCAGAAGATGGAGAAACTAAAAATATCAGAAAAAGAAAAGGCGCAAGTAATAGGTATAGAAGATGAAGATC
TGGAAGAGCATGACAAGGGGATTGGCGAGACAGCGGTGTGCAAGATCCTAACAGAGAAGAAAATAAACACCGAAAACTTCAAATCAATGATGCCAAAAATCTGGAATCTA
GAAGGAGAAGTGAACAACAAGAAAGTGGGAACAAATCTATTTGAGTGCACATTTCAAAACAAAAAGGGAAGGAGGAAAGTCATGGAAGGGGGACCTTGGAACTTCGACCG
AGGCCTCCTTATATTTGAAGAAATCAGAGGGCATGAGAGATACACATCGATCAATTTCAGACGAACAGGGAAAATGTGTGGAGGAGAAACACTCAGAGTCAGAGTAAAAA
TAGAAGTTCAAAATCCGTTAAAGAGAGCTGTCAAAATTAGAGTGGGAACTATGGCAGAGGAAGAGTGGATTCAAGTCAAATATGAAAAGCTACCAGATTATTGCTATGGT
TGCGGGCGGATTGGACACCAATTGAGAGATTGTGAAGAAACAGGAAAAAACAGTGGTGAAGAATTACAATTTGGGCCATGGCTAAAACAAGACTCTCCAATAAAGAGTAG
AGGTAAAGAAAGGCCCGAGCAAGAAAAAGGATTGAACAGGCAGGGAAGGGGTCGCGGTAGCATCAGAGGAAGAGGAAGAAATTGCGACTCGACCAACAGGGACAACAGCG
AAGAAGAAGAAGACTCAGAGAAAGATGATGACCGGAACACTGGCCCAGCCCAGCTGGGAAGCGGCGGAGAACCGGCAGCCGGGAAGGGAAATGATGAAAGCTCGTGGGAA
CTGTGCCCGAGAACAGTAGAAAAGTTCCAAACGGCTACCTGCATTGAGAGGACAGTGACTGAACAAGACAAAAAGGAAAAAGTCAGAGAGTCTGCTGAAGGGAATGGGCA
GGGGATTCCATCGAATATTGCAAAAGAACCTGTAAAGTCAAAGCAGGAGGCTGAATCAGTCAGTACCAATCACTCTAATCAGAAGAGAGAAGATGATAGCAAAGAAAAAA
CAGCAAAAACTCAGAGCGAAAAAGAAGGCAGAGGAAGTTCTAAAGAAACCAAAACTCAGTTACTTCGAGTCAGTCGAAGGGATATCGGCGGAGGCTGTTGGACAGCCCCG
TCGGACACCATGAAAACCTTAAGTTGGAATGCTCGAGGTCTGGGGAATCCTCGAGCGATCCGAGCTCTTCGCTTCCTAGTGGAGAGTAATAAACCCCAAATTGTTTTCCT
CATAGAGACCAAGTATAATGAGAGAAGTTGTGAGAAGATTAAGAGGACTCTGAACTTCGAAAATGGGTTTTGTGTGCCCAACAAGGGCAAAGGGGGTGGATTATTGCTGT
TATGGAAAAAAGAGATGGATGTTAGTATTTCTACTTACTCTGAAGGTCATATAGATGCTATCATAAAAAAAGCTCATGGCTTTTGGAGATTTTCAGGCTTCTATGGCAAC
CCAGAAACAGAAAAACGACAATTCTCGTGGAACCTTATGGAGAAGTTGAGCGAGAGGGATGACTTGGCTTGGTTAATAGGCGGTGATTTCAATGAGATTGTTTCGGAGTC
TGAGAAGAAGGGTGGAGCTAAAAGGAACCCGAGGCAAGCAAACTCATGGACGACGACTACACTGGAAACAAGTTCACATGGCGGAGGGGAAGAAACAAAAGGAACCAAAT
TTGCGAAAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGACCAGGAAACCAACCCGGAAGAAGGTCAGACCAAAGGGTTGGTCGAGGCCGACCATTCAGCCTGCTTGCGAGGGCTGAAATCGTTCGCCTCGACTCAGTCCTTGC
TGCCTCTGGCCGCCCCGGGTCCGCCTGGTCCGTCCCAAAACGCCTTCGAATTCCTAAAAGCCCTAACGAAAGACACAGCATCGAAGTGCAGGTGATCTACACCACATCGG
TGTGCAGTGGTTCTTACTGGTCTTGCAGGTCACGTTTTCTCCGTCTTCAACAAATTCATTGCTGGTGTCACATGAAGGTTTTTAACTTGCCACACACAACTGAAATCGTG
GATAATGAAGAAGAAACACAAGGTACGAGGATAGCATACCAGAGGGTCGTGCGAAAGGAGTGGACAGTGCAGGGCATATGTGTATGCTTAATCGGAGATCTGTGGAGATC
TGCAAAGGGAGTTACGATAGTTATGGAAGAAAACTTCAGTCAGAAGATGGAGAAACTAAAAATATCAGAAAAAGAAAAGGCGCAAGTAATAGGTATAGAAGATGAAGATC
TGGAAGAGCATGACAAGGGGATTGGCGAGACAGCGGTGTGCAAGATCCTAACAGAGAAGAAAATAAACACCGAAAACTTCAAATCAATGATGCCAAAAATCTGGAATCTA
GAAGGAGAAGTGAACAACAAGAAAGTGGGAACAAATCTATTTGAGTGCACATTTCAAAACAAAAAGGGAAGGAGGAAAGTCATGGAAGGGGGACCTTGGAACTTCGACCG
AGGCCTCCTTATATTTGAAGAAATCAGAGGGCATGAGAGATACACATCGATCAATTTCAGACGAACAGGGAAAATGTGTGGAGGAGAAACACTCAGAGTCAGAGTAAAAA
TAGAAGTTCAAAATCCGTTAAAGAGAGCTGTCAAAATTAGAGTGGGAACTATGGCAGAGGAAGAGTGGATTCAAGTCAAATATGAAAAGCTACCAGATTATTGCTATGGT
TGCGGGCGGATTGGACACCAATTGAGAGATTGTGAAGAAACAGGAAAAAACAGTGGTGAAGAATTACAATTTGGGCCATGGCTAAAACAAGACTCTCCAATAAAGAGTAG
AGGTAAAGAAAGGCCCGAGCAAGAAAAAGGATTGAACAGGCAGGGAAGGGGTCGCGGTAGCATCAGAGGAAGAGGAAGAAATTGCGACTCGACCAACAGGGACAACAGCG
AAGAAGAAGAAGACTCAGAGAAAGATGATGACCGGAACACTGGCCCAGCCCAGCTGGGAAGCGGCGGAGAACCGGCAGCCGGGAAGGGAAATGATGAAAGCTCGTGGGAA
CTGTGCCCGAGAACAGTAGAAAAGTTCCAAACGGCTACCTGCATTGAGAGGACAGTGACTGAACAAGACAAAAAGGAAAAAGTCAGAGAGTCTGCTGAAGGGAATGGGCA
GGGGATTCCATCGAATATTGCAAAAGAACCTGTAAAGTCAAAGCAGGAGGCTGAATCAGTCAGTACCAATCACTCTAATCAGAAGAGAGAAGATGATAGCAAAGAAAAAA
CAGCAAAAACTCAGAGCGAAAAAGAAGGCAGAGGAAGTTCTAAAGAAACCAAAACTCAGTTACTTCGAGTCAGTCGAAGGGATATCGGCGGAGGCTGTTGGACAGCCCCG
TCGGACACCATGAAAACCTTAAGTTGGAATGCTCGAGGTCTGGGGAATCCTCGAGCGATCCGAGCTCTTCGCTTCCTAGTGGAGAGTAATAAACCCCAAATTGTTTTCCT
CATAGAGACCAAGTATAATGAGAGAAGTTGTGAGAAGATTAAGAGGACTCTGAACTTCGAAAATGGGTTTTGTGTGCCCAACAAGGGCAAAGGGGGTGGATTATTGCTGT
TATGGAAAAAAGAGATGGATGTTAGTATTTCTACTTACTCTGAAGGTCATATAGATGCTATCATAAAAAAAGCTCATGGCTTTTGGAGATTTTCAGGCTTCTATGGCAAC
CCAGAAACAGAAAAACGACAATTCTCGTGGAACCTTATGGAGAAGTTGAGCGAGAGGGATGACTTGGCTTGGTTAATAGGCGGTGATTTCAATGAGATTGTTTCGGAGTC
TGAGAAGAAGGGTGGAGCTAAAAGGAACCCGAGGCAAGCAAACTCATGGACGACGACTACACTGGAAACAAGTTCACATGGCGGAGGGGAAGAAACAAAAGGAACCAAAT
TTGCGAAAGACTAG
Protein sequenceShow/hide protein sequence
MGPGNQPGRRSDQRVGRGRPFSLLARAEIVRLDSVLAASGRPGSAWSVPKRLRIPKSPNERHSIEVQVIYTTSVCSGSYWSCRSRFLRLQQIHCWCHMKVFNLPHTTEIV
DNEEETQGTRIAYQRVVRKEWTVQGICVCLIGDLWRSAKGVTIVMEENFSQKMEKLKISEKEKAQVIGIEDEDLEEHDKGIGETAVCKILTEKKINTENFKSMMPKIWNL
EGEVNNKKVGTNLFECTFQNKKGRRKVMEGGPWNFDRGLLIFEEIRGHERYTSINFRRTGKMCGGETLRVRVKIEVQNPLKRAVKIRVGTMAEEEWIQVKYEKLPDYCYG
CGRIGHQLRDCEETGKNSGEELQFGPWLKQDSPIKSRGKERPEQEKGLNRQGRGRGSIRGRGRNCDSTNRDNSEEEEDSEKDDDRNTGPAQLGSGGEPAAGKGNDESSWE
LCPRTVEKFQTATCIERTVTEQDKKEKVRESAEGNGQGIPSNIAKEPVKSKQEAESVSTNHSNQKREDDSKEKTAKTQSEKEGRGSSKETKTQLLRVSRRDIGGGCWTAP
SDTMKTLSWNARGLGNPRAIRALRFLVESNKPQIVFLIETKYNERSCEKIKRTLNFENGFCVPNKGKGGGLLLLWKKEMDVSISTYSEGHIDAIIKKAHGFWRFSGFYGN
PETEKRQFSWNLMEKLSERDDLAWLIGGDFNEIVSESEKKGGAKRNPRQANSWTTTTLETSSHGGGEETKGTKFAKD