; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039162 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039162
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold10:41206276..41218748
RNA-Seq ExpressionSpg039162
SyntenySpg039162
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69274.1 TatD related DNase [Prunus dulcis]3.5e-5329.26Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ
        + NGKPRGK  A+RGL QGD                ++  A D  +V G   G D ++VSHLQFA +T+FF   KE Y+ NL  +L  F  +S +KI + 
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ

Query:  KSSIIGIN--------------CE------------------------------HSKLASW----------ASLV-------------------------
        KS I+GIN              CE                                +L  W           +L+                         
Subjt:  KSSIIGIN--------------CE------------------------------HSKLASW----------ASLV-------------------------

Query:  ----------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFP
                  GVEEG   HLV+W  V+K  E GGLGI +LR RN+ L AKWLWRF  E  SLWHR+I SKYG     W +    K S +N W  ++ G+ 
Subjt:  ----------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFP

Query:  LFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIW--
         F    + S+G+  K  FWEDFW+ E  L  LFP+L +LS +    +A    + +   +      R L++ E  ++  LL IL +  +   R D R W  
Subjt:  LFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIW--

Query:  ------TPKPSKGFFL---PLLFPNL------------------------------------MCFSSP-----GNKEETLNHLLGDCLFASSLWNRFFRT
              + K  + F +    ++FP                                      MC S           E ++HL   C ++  LW R    
Subjt:  ------TPKPSKGFFL---PLLFPNL------------------------------------MCFSSP-----GNKEETLNHLLGDCLFASSLWNRFFRT

Query:  FGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLE-EEIWESIRFNTFLWAFVGWPF
         G      + C  ++   L  S    K  IL      A  WNIW+ERNR IF+G   +  EE+W+ I+F   LWA V   F
Subjt:  FGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLE-EEIWESIRFNTFLWAFVGWPF

CAN69913.1 hypothetical protein VITISV_042568 [Vitis vinifera]2.2e-5531.12Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCS--EKENYFRNLNGLLSFFEAISMLKIK
        L NG  +G + A+RGL QGD                +++ A ++ ++EGF++G +  +VSHLQFA +T+FF +  E+E   + L  LL  F  IS LK+ 
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCS--EKENYFRNLNGLLSFFEAISMLKIK

Query:  RQKSSIIGINCEHSKLASWASLV----------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKY
          KS I GIN + + L+    ++                GV EG   HLV+W+VV KP  +GGLG  N+  RN  LL KWLWR+  E  +LWH+VI+S Y
Subjt:  RQKSSIIGINCEHSKLASWASLV----------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKY

Query:  GPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQL-RNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTD
        G H   W +   ++ S +  W  +A  F  FS   +  +G+  +  FWED W G++PL + +P+L R + DK ++ ++S+L    P  S +L   R L+D
Subjt:  GPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQL-RNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTD

Query:  RETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL--------PLLFPNLMCFSSP------------GNKE--------------------
         E  DL  L+  L D  +     D+R+W    S     K FFL        P  FP+   ++S              +K+                    
Subjt:  RETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL--------PLLFPNLMCFSSP------------GNKE--------------------

Query:  --------ETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFN
                E+ +HL   C     LW+R F+   +     R    M+           +G +LWQA   A +  +W ERN  IF       E +W+SI F 
Subjt:  --------ETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFN

Query:  TFLWAFVGWPF
          LWAF    F
Subjt:  TFLWAFVGWPF

RVX02255.1 NAD-dependent malic enzyme 62 kDa isoform, mitochondrial [Vitis vinifera]2.0e-5631.52Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ
        L NG  +G + A+RGL QGD                L+  A + G+ EGF +G D  +VS LQFA +T+FF      + +NL  +   F  +S LKI  +
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ

Query:  KSSIIG--INCEHSKLASWASLV-------------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSL
        KS+I G  +      +  W  +V                         G  EG   HLV+WEVVS+P E+GGLG   + LRN  LL KWLWRF  E+  L
Subjt:  KSSIIG--INCEHSKLASWASLV-------------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSL

Query:  WHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSL
        WH+VIVS YG HP  W +   ++ S +  W  +A  F  FS  ++  +G+  +  FWED W G + LCS F  L  +      +V+++L +  P  + +L
Subjt:  WHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSL

Query:  GLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL-------PLLF-----------PNLM-----------------------
           R LTD E   L  L+S LS         DSR W+   S     K FFL       P+LF           P+ +                       
Subjt:  GLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL-------PLLF-----------PNLM-----------------------

Query:  --------CFSSPGNKEETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEE
                C    GN  E+++HL   C     LWN+ F+  G+     R    M+           +GK LWQ      +W +W ERN  IF     LEE
Subjt:  --------CFSSPGNKEETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEE

Query:  EIWESIRFNTFLWA
         +W+ I F + LWA
Subjt:  EIWESIRFNTFLWA

RVX17938.1 CDP-diacylglycerol--serine O-phosphatidyltransferase 1 [Vitis vinifera]7.7e-5326.75Show/hide
Query:  THLLKLV-RILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKL--YDIRAMKELESFRGH
        T L K++ ++LAG   +V     H T+   V  G+  L  +  A    +     G +  V  + + +  + V     D ++++  + IR  K +   RG 
Subjt:  THLLKLV-RILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKL--YDIRAMKELESFRGH

Query:  RKDVT--ALFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAI
           V+   L NG  +G + A+RGL QGD                ++L A ++ V+EGFK+G +  +VSHLQFA +T+FF S +E     L  +L  F  I
Subjt:  RKDVT--ALFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAI

Query:  SMLKIKRQKSSIIGINCEHSKLASWASLV-----------------------------------------------------------------------
        S LK+   KS+I GIN E + L+  A ++                                                                       
Subjt:  SMLKIKRQKSSIIGINCEHSKLASWASLV-----------------------------------------------------------------------

Query:  ------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLW
                          GV EG   HLV W+VV KP   GGLG   + +RN  LL KWLWR+  E  +LWH+VI+S YG H   W   + ++ S +  W
Subjt:  ------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLW

Query:  ALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGR
          +A  +  FS   +  +G+  +  FW+D W GE+PL   +P+L  +       ++SIL S  P  S +    R L+D E  DL  L+       I    
Subjt:  ALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGR

Query:  RDSRIWTPKPS-----KGFFL--------PLLFPNLMCFS----------------------------------SPG------NKEETLNHLLGDCLFAS
         D R W+  PS     K FFL        P +FP    ++                                  SP          ET++HL   C    
Subjt:  RDSRIWTPKPS-----KGFFL--------PLLFPNLMCFS----------------------------------SPG------NKEETLNHLLGDCLFAS

Query:  SLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFNTFLWAF
         LW+R F++  +     R    M+        F  +G +LWQ    A +W +W ERN  IF       E +W+SI F T  WAF
Subjt:  SLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFNTFLWAF

XP_022151711.1 uncharacterized protein LOC111019624 [Momordica charantia]2.4e-9442.56Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ
        L NG+PRGKI A+RGL QGD                L+   V+K  +E F++  +   +SHLQFA NTL FCS     F NLNGLL FFEAIS LKI R 
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ

Query:  KSSIIGINCEHSKLASWASLV--------------------------------------------------------GVEEGGGAHLVKWEVVSKPIEVG
        KSS++GINCE  KL+ WA+L                                                         GVEEGGGAHLV W+ VSKP+E G
Subjt:  KSSIIGINCEHSKLASWASLV--------------------------------------------------------GVEEGGGAHLVKWEVVSKPIEVG

Query:  GLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLF
        GLG+ NLRLRN+  LAKWLWRF  E  +LW ++IVSKY  HP DW+     K S  N W  +AS FP+FS  ++ S+GD    YFWED W+G KPL   F
Subjt:  GLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLF

Query:  PQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPSKGFFLPLLFPNLMCFSSPGNKEETLNHLLG
        P++  LS+K L SVA +L   +  SS+SLGLSR LTD E+ ++A+LL +L   +   GR D R+W P P  GF     F  L+  SS        +    
Subjt:  PQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPSKGFFLPLLFPNLMCFSSPGNKEETLNHLLG

Query:  DCL--FASSLWN----RFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIE
          +  + +SLW     +  + FG    QG D GS++        FR++G+ LWQACF A+LW IWLERN  +FRG+E
Subjt:  DCL--FASSLWN----RFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIE

TrEMBL top hitse value%identityAlignment
A0A438IZX2 NAD-dependent malic enzyme 62 kDa isoform, mitochondrial9.5e-5731.52Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ
        L NG  +G + A+RGL QGD                L+  A + G+ EGF +G D  +VS LQFA +T+FF      + +NL  +   F  +S LKI  +
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ

Query:  KSSIIG--INCEHSKLASWASLV-------------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSL
        KS+I G  +      +  W  +V                         G  EG   HLV+WEVVS+P E+GGLG   + LRN  LL KWLWRF  E+  L
Subjt:  KSSIIG--INCEHSKLASWASLV-------------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSL

Query:  WHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSL
        WH+VIVS YG HP  W +   ++ S +  W  +A  F  FS  ++  +G+  +  FWED W G + LCS F  L  +      +V+++L +  P  + +L
Subjt:  WHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSL

Query:  GLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL-------PLLF-----------PNLM-----------------------
           R LTD E   L  L+S LS         DSR W+   S     K FFL       P+LF           P+ +                       
Subjt:  GLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL-------PLLF-----------PNLM-----------------------

Query:  --------CFSSPGNKEETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEE
                C    GN  E+++HL   C     LWN+ F+  G+     R    M+           +GK LWQ      +W +W ERN  IF     LEE
Subjt:  --------CFSSPGNKEETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEE

Query:  EIWESIRFNTFLWA
         +W+ I F + LWA
Subjt:  EIWESIRFNTFLWA

A0A438K9P9 CDP-diacylglycerol--serine O-phosphatidyltransferase 13.7e-5326.75Show/hide
Query:  THLLKLV-RILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKL--YDIRAMKELESFRGH
        T L K++ ++LAG   +V     H T+   V  G+  L  +  A    +     G +  V  + + +  + V     D ++++  + IR  K +   RG 
Subjt:  THLLKLV-RILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKL--YDIRAMKELESFRGH

Query:  RKDVT--ALFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAI
           V+   L NG  +G + A+RGL QGD                ++L A ++ V+EGFK+G +  +VSHLQFA +T+FF S +E     L  +L  F  I
Subjt:  RKDVT--ALFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAI

Query:  SMLKIKRQKSSIIGINCEHSKLASWASLV-----------------------------------------------------------------------
        S LK+   KS+I GIN E + L+  A ++                                                                       
Subjt:  SMLKIKRQKSSIIGINCEHSKLASWASLV-----------------------------------------------------------------------

Query:  ------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLW
                          GV EG   HLV W+VV KP   GGLG   + +RN  LL KWLWR+  E  +LWH+VI+S YG H   W   + ++ S +  W
Subjt:  ------------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLW

Query:  ALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGR
          +A  +  FS   +  +G+  +  FW+D W GE+PL   +P+L  +       ++SIL S  P  S +    R L+D E  DL  L+       I    
Subjt:  ALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGR

Query:  RDSRIWTPKPS-----KGFFL--------PLLFPNLMCFS----------------------------------SPG------NKEETLNHLLGDCLFAS
         D R W+  PS     K FFL        P +FP    ++                                  SP          ET++HL   C    
Subjt:  RDSRIWTPKPS-----KGFFL--------PLLFPNLMCFS----------------------------------SPG------NKEETLNHLLGDCLFAS

Query:  SLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFNTFLWAF
         LW+R F++  +     R    M+        F  +G +LWQ    A +W +W ERN  IF       E +W+SI F T  WAF
Subjt:  SLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFNTFLWAF

A0A5H2XQW2 TatD related DNase1.7e-5329.26Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ
        + NGKPRGK  A+RGL QGD                ++  A D  +V G   G D ++VSHLQFA +T+FF   KE Y+ NL  +L  F  +S +KI + 
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ

Query:  KSSIIGIN--------------CE------------------------------HSKLASW----------ASLV-------------------------
        KS I+GIN              CE                                +L  W           +L+                         
Subjt:  KSSIIGIN--------------CE------------------------------HSKLASW----------ASLV-------------------------

Query:  ----------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFP
                  GVEEG   HLV+W  V+K  E GGLGI +LR RN+ L AKWLWRF  E  SLWHR+I SKYG     W +    K S +N W  ++ G+ 
Subjt:  ----------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFP

Query:  LFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIW--
         F    + S+G+  K  FWEDFW+ E  L  LFP+L +LS +    +A    + +   +      R L++ E  ++  LL IL +  +   R D R W  
Subjt:  LFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIW--

Query:  ------TPKPSKGFFL---PLLFPNL------------------------------------MCFSSP-----GNKEETLNHLLGDCLFASSLWNRFFRT
              + K  + F +    ++FP                                      MC S           E ++HL   C ++  LW R    
Subjt:  ------TPKPSKGFFL---PLLFPNL------------------------------------MCFSSP-----GNKEETLNHLLGDCLFASSLWNRFFRT

Query:  FGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLE-EEIWESIRFNTFLWAFVGWPF
         G      + C  ++   L  S    K  IL      A  WNIW+ERNR IF+G   +  EE+W+ I+F   LWA V   F
Subjt:  FGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLE-EEIWESIRFNTFLWAFVGWPF

A0A6J1DFI2 uncharacterized protein LOC1110196241.1e-9442.56Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ
        L NG+PRGKI A+RGL QGD                L+   V+K  +E F++  +   +SHLQFA NTL FCS     F NLNGLL FFEAIS LKI R 
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQ

Query:  KSSIIGINCEHSKLASWASLV--------------------------------------------------------GVEEGGGAHLVKWEVVSKPIEVG
        KSS++GINCE  KL+ WA+L                                                         GVEEGGGAHLV W+ VSKP+E G
Subjt:  KSSIIGINCEHSKLASWASLV--------------------------------------------------------GVEEGGGAHLVKWEVVSKPIEVG

Query:  GLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLF
        GLG+ NLRLRN+  LAKWLWRF  E  +LW ++IVSKY  HP DW+     K S  N W  +AS FP+FS  ++ S+GD    YFWED W+G KPL   F
Subjt:  GLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLF

Query:  PQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPSKGFFLPLLFPNLMCFSSPGNKEETLNHLLG
        P++  LS+K L SVA +L   +  SS+SLGLSR LTD E+ ++A+LL +L   +   GR D R+W P P  GF     F  L+  SS        +    
Subjt:  PQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPSKGFFLPLLFPNLMCFSSPGNKEETLNHLLG

Query:  DCL--FASSLWN----RFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIE
          +  + +SLW     +  + FG    QG D GS++        FR++G+ LWQACF A+LW IWLERN  +FRG+E
Subjt:  DCL--FASSLWN----RFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIE

A5BAF8 Reverse transcriptase domain-containing protein1.0e-5531.12Show/hide
Query:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCS--EKENYFRNLNGLLSFFEAISMLKIK
        L NG  +G + A+RGL QGD                +++ A ++ ++EGF++G +  +VSHLQFA +T+FF +  E+E   + L  LL  F  IS LK+ 
Subjt:  LFNGKPRGKIFATRGLWQGD----------------LVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCS--EKENYFRNLNGLLSFFEAISMLKIK

Query:  RQKSSIIGINCEHSKLASWASLV----------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKY
          KS I GIN + + L+    ++                GV EG   HLV+W+VV KP  +GGLG  N+  RN  LL KWLWR+  E  +LWH+VI+S Y
Subjt:  RQKSSIIGINCEHSKLASWASLV----------------GVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKY

Query:  GPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQL-RNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTD
        G H   W +   ++ S +  W  +A  F  FS   +  +G+  +  FWED W G++PL + +P+L R + DK ++ ++S+L    P  S +L   R L+D
Subjt:  GPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCSIGDSSKTYFWEDFWVGEKPLCSLFPQL-RNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTD

Query:  RETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL--------PLLFPNLMCFSSP------------GNKE--------------------
         E  DL  L+  L D  +     D+R+W    S     K FFL        P  FP+   ++S              +K+                    
Subjt:  RETTDLASLLSILSDRTIHLGRRDSRIWTPKPS-----KGFFL--------PLLFPNLMCFSSP------------GNKE--------------------

Query:  --------ETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFN
                E+ +HL   C     LW+R F+   +     R    M+           +G +LWQA   A +  +W ERN  IF       E +W+SI F 
Subjt:  --------ETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFN

Query:  TFLWAFVGWPF
          LWAF    F
Subjt:  TFLWAFVGWPF

SwissProt top hitse value%identityAlignment
P0CS46 Polyadenylation factor subunit 25.2e-2849.09Show/hide
Query:  LPDIWTHL-LKLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESF
        L  IW++   K  R L+GHGWDV+ VDWHPTK L+VSG KD LVK WD +TGK+L + H  K+T+   +W+ +G+ V T+ +D +I+L+DIR  +ELE  
Subjt:  LPDIWTHL-LKLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESF

Query:  RGHRKDVTAL
        +GH K+V  +
Subjt:  RGHRKDVTAL

P0CS47 Polyadenylation factor subunit 25.2e-2849.09Show/hide
Query:  LPDIWTHL-LKLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESF
        L  IW++   K  R L+GHGWDV+ VDWHPTK L+VSG KD LVK WD +TGK+L + H  K+T+   +W+ +G+ V T+ +D +I+L+DIR  +ELE  
Subjt:  LPDIWTHL-LKLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESF

Query:  RGHRKDVTAL
        +GH K+V  +
Subjt:  RGHRKDVTAL

Q6NLV4 Flowering time control protein FY1.0e-4470Show/hide
Query:  LLICKCLPDIWTHLLKLVRI-----LAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYD
        L  C C  D    +    +      L GHGWDVKSVDWHPTKSLLVSGGKD LVKLWD ++G+ELCS HGHKN VL VKWNQNGNW+LT+SKDQIIKLYD
Subjt:  LLICKCLPDIWTHLLKLVRI-----LAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYD

Query:  IRAMKELESFRGHRKDVTAL
        IR MKEL+SFRGH KDVT+L
Subjt:  IRAMKELESFRGHRKDVTAL

Q6NLV4 Flowering time control protein FY7.3e-0678.57Show/hide
Query:  ALAWHPFHEEYFVSGSFDGSIFHWLVGY
        +LAWHP HEEYFVSGS DGSI HW+VG+
Subjt:  ALAWHPFHEEYFVSGSFDGSIFHWLVGY

Q8K4P0 pre-mRNA 3' end processing protein WDR331.1e-3050.35Show/hide
Query:  RILAGHGWDVKSVDWHPTKSLLVSGGKDNL--VKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMK-ELESFRGHRKDVTAL
        RIL GHG DVK VDWHPTK L+VSG KD+   +K WD KTG+ L + H HKNTV+ VK N NGNW+LT+S+D + KL+DIR +K EL+ FRGH+K+ TA+
Subjt:  RILAGHGWDVKSVDWHPTKSLLVSGGKDNL--VKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMK-ELESFRGHRKDVTAL

Query:  FNGKPRGKIFATRGLWQGDLVLSAVD-KGVVEGFKMGSDGL
                +FA+ G   G L+   V  +  V G +M  +G+
Subjt:  FNGKPRGKIFATRGLWQGDLVLSAVD-KGVVEGFKMGSDGL

Q9C0J8 pre-mRNA 3' end processing protein WDR331.1e-3050.35Show/hide
Query:  RILAGHGWDVKSVDWHPTKSLLVSGGKDNL--VKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMK-ELESFRGHRKDVTAL
        RIL GHG DVK VDWHPTK L+VSG KD+   +K WD KTG+ L + H HKNTV+ VK N NGNW+LT+S+D + KL+DIR +K EL+ FRGH+K+ TA+
Subjt:  RILAGHGWDVKSVDWHPTKSLLVSGGKDNL--VKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMK-ELESFRGHRKDVTAL

Query:  FNGKPRGKIFATRGLWQGDLVLSAVD-KGVVEGFKMGSDGL
                +FA+ G   G L+   V  +  V G +M  +G+
Subjt:  FNGKPRGKIFATRGLWQGDLVLSAVD-KGVVEGFKMGSDGL

Arabidopsis top hitse value%identityAlignment
AT4G15900.1 pleiotropic regulatory locus 12.2e-1328.1Show/hide
Query:  KLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESFRGHRKDVTAL
        K++R   GH   V  +  HPT  +L++GG+D++ ++WD +T  ++ +  GH NTV  V        V+T S D  IK +D+R  K + +   H+K V A+
Subjt:  KLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESFRGHRKDVTAL

Query:  FNGKPRGKIFATR--------GLWQGDLV--LSAVDKGVVEGFKMGSDGLQVS
            P+   FA+          L +G+    + +  K ++    +  DG+ V+
Subjt:  FNGKPRGKIFATR--------GLWQGDLV--LSAVDKGVVEGFKMGSDGLQVS

AT5G13480.1 Transducin/WD40 repeat-like superfamily protein7.4e-4670Show/hide
Query:  LLICKCLPDIWTHLLKLVRI-----LAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYD
        L  C C  D    +    +      L GHGWDVKSVDWHPTKSLLVSGGKD LVKLWD ++G+ELCS HGHKN VL VKWNQNGNW+LT+SKDQIIKLYD
Subjt:  LLICKCLPDIWTHLLKLVRI-----LAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYD

Query:  IRAMKELESFRGHRKDVTAL
        IR MKEL+SFRGH KDVT+L
Subjt:  IRAMKELESFRGHRKDVTAL

AT5G13480.1 Transducin/WD40 repeat-like superfamily protein5.2e-0778.57Show/hide
Query:  ALAWHPFHEEYFVSGSFDGSIFHWLVGY
        +LAWHP HEEYFVSGS DGSI HW+VG+
Subjt:  ALAWHPFHEEYFVSGSFDGSIFHWLVGY

AT5G13480.2 Transducin/WD40 repeat-like superfamily protein7.4e-4670Show/hide
Query:  LLICKCLPDIWTHLLKLVRI-----LAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYD
        L  C C  D    +    +      L GHGWDVKSVDWHPTKSLLVSGGKD LVKLWD ++G+ELCS HGHKN VL VKWNQNGNW+LT+SKDQIIKLYD
Subjt:  LLICKCLPDIWTHLLKLVRI-----LAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYD

Query:  IRAMKELESFRGHRKDVTAL
        IR MKEL+SFRGH KDVT+L
Subjt:  IRAMKELESFRGHRKDVTAL

AT5G13480.2 Transducin/WD40 repeat-like superfamily protein5.2e-0778.57Show/hide
Query:  ALAWHPFHEEYFVSGSFDGSIFHWLVGY
        +LAWHP HEEYFVSGS DGSI HW+VG+
Subjt:  ALAWHPFHEEYFVSGSFDGSIFHWLVGY

AT5G23430.1 Transducin/WD40 repeat-like superfamily protein7.5e-1433Show/hide
Query:  KLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESFRGHRKDVTAL
        K+VR L GH  +  SVD+HP      SG  D  +K+WD +    + ++ GH   V  +++  +G WV++  +D I+K++D+ A K L  F+ H   + +L
Subjt:  KLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESFRGHRKDVTAL

AT5G23430.2 Transducin/WD40 repeat-like superfamily protein7.5e-1433Show/hide
Query:  KLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESFRGHRKDVTAL
        K+VR L GH  +  SVD+HP      SG  D  +K+WD +    + ++ GH   V  +++  +G WV++  +D I+K++D+ A K L  F+ H   + +L
Subjt:  KLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRAMKELESFRGHRKDVTAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACATGGGAAGGCATCTCTAGGTTTGATAATTTTGTTAATCTGTAAATGTTTGCCTGATATTTGGACACATTTACTAAAATTGGTCAGGATACTGGCAGGCCATGG
GTGGGATGTGAAAAGTGTTGATTGGCACCCCACAAAGTCTCTTCTAGTTTCAGGTGGCAAAGACAACCTCGTAAAGCTTTGGGATGCAAAAACTGGGAAAGAGCTTTGCT
CATTTCATGGTCATAAAAACACAGTACTATGTGTCAAATGGAATCAAAATGGTAATTGGGTGCTGACTTCTTCAAAGGATCAGATCATCAAGCTTTATGACATCAGGGCT
ATGAAAGAACTTGAGTCGTTCCGTGGGCATAGAAAGGACGTGACTGCATTGTTTAATGGCAAACCTAGAGGGAAGATTTTTGCTACTAGAGGCCTCTGGCAAGGAGACCT
AGTGTTGTCAGCGGTGGATAAAGGTGTGGTGGAAGGATTTAAGATGGGCTCGGACGGTTTGCAGGTGTCTCATCTCCAGTTTGCTGGCAATACCCTCTTCTTTTGCTCAG
AGAAGGAAAATTATTTTAGGAACTTAAATGGTTTGTTGTCCTTCTTTGAAGCCATATCCATGCTTAAAATTAAGCGTCAGAAAAGCTCCATCATTGGGATCAATTGTGAG
CACTCTAAGCTTGCTTCCTGGGCCTCGCTTGTGGGGGTGGAGGAAGGAGGGGGAGCCCACCTTGTCAAGTGGGAAGTTGTCTCCAAGCCGATTGAGGTAGGAGGATTGGG
TATTCAGAATCTTAGACTGCGTAATCAGACTCTTCTGGCGAAGTGGTTGTGGCGTTTCTCCTTTGAGCAAGGCTCTTTGTGGCATAGAGTTATTGTGAGCAAGTATGGAC
CGCATCCTTTCGATTGGGTTTCTGGTCATAGGCTTAAGGGTTCTGGCAAAAACCTTTGGGCTCTAGTTGCTTCGGGCTTTCCTTTGTTCTCTAATCATATCCAATGCTCC
ATTGGGGACAGTTCGAAAACTTACTTTTGGGAGGATTTCTGGGTGGGGGAAAAACCTTTGTGTTCCCTTTTTCCTCAACTTCGTAACCTATCTGATAAGATGTTGCACTC
GGTAGCCTCTATTTTGCCTTCCTTTGACCCCTCATCGTCCCTTTCCTTGGGCCTCAGTCGTCCTCTTACCGATCGTGAGACGACCGACCTTGCTAGCCTTCTTTCCATTC
TTTCTGATCGGACCATTCATCTCGGGAGGAGAGACTCGCGTATCTGGACTCCTAAACCCTCTAAAGGATTTTTCTTGCCGCTCTTATTTCCAAACCTTATGTGCTTCTCC
TCCCCAGGGAATAAGGAGGAGACTCTAAACCATTTGCTTGGTGATTGCTTGTTCGCATCTTCTCTTTGGAATCGCTTCTTTCGGACTTTTGGAGTGGCCTCAGCTCAAGG
CAGGGATTGTGGGTCCATGATTGAGGAAGTTCTCCTCAACTCTCATTTTCGTGATAAAGGAAAGATTTTATGGCAGGCGTGCTTTTTTGCTACTTTGTGGAACATTTGGC
TTGAGAGAAATAGGTGCATTTTTAGAGGGATTGAGAGTTTAGAGGAGGAGATTTGGGAGTCAATTAGATTTAACACTTTCTTATGGGCGTTTGTTGGTTGGCCTTTTTAT
ACTAATACTAAGGTGATCGGTTTTGCAGCATTGGCCTGGCACCCTTTCCATGAAGAGTATTTTGTTAGTGGGAGTTTTGATGGCTCTATTTTTCATTGGCTTGTTGGGTA
TGTTTGTGCTTTAGTTTCAATCTCTTGTGTTGTTTATAATCTATCTCTACGGTTGGGAATAGTTTTGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTACATGGGAAGGCATCTCTAGGTTTGATAATTTTGTTAATCTGTAAATGTTTGCCTGATATTTGGACACATTTACTAAAATTGGTCAGGATACTGGCAGGCCATGG
GTGGGATGTGAAAAGTGTTGATTGGCACCCCACAAAGTCTCTTCTAGTTTCAGGTGGCAAAGACAACCTCGTAAAGCTTTGGGATGCAAAAACTGGGAAAGAGCTTTGCT
CATTTCATGGTCATAAAAACACAGTACTATGTGTCAAATGGAATCAAAATGGTAATTGGGTGCTGACTTCTTCAAAGGATCAGATCATCAAGCTTTATGACATCAGGGCT
ATGAAAGAACTTGAGTCGTTCCGTGGGCATAGAAAGGACGTGACTGCATTGTTTAATGGCAAACCTAGAGGGAAGATTTTTGCTACTAGAGGCCTCTGGCAAGGAGACCT
AGTGTTGTCAGCGGTGGATAAAGGTGTGGTGGAAGGATTTAAGATGGGCTCGGACGGTTTGCAGGTGTCTCATCTCCAGTTTGCTGGCAATACCCTCTTCTTTTGCTCAG
AGAAGGAAAATTATTTTAGGAACTTAAATGGTTTGTTGTCCTTCTTTGAAGCCATATCCATGCTTAAAATTAAGCGTCAGAAAAGCTCCATCATTGGGATCAATTGTGAG
CACTCTAAGCTTGCTTCCTGGGCCTCGCTTGTGGGGGTGGAGGAAGGAGGGGGAGCCCACCTTGTCAAGTGGGAAGTTGTCTCCAAGCCGATTGAGGTAGGAGGATTGGG
TATTCAGAATCTTAGACTGCGTAATCAGACTCTTCTGGCGAAGTGGTTGTGGCGTTTCTCCTTTGAGCAAGGCTCTTTGTGGCATAGAGTTATTGTGAGCAAGTATGGAC
CGCATCCTTTCGATTGGGTTTCTGGTCATAGGCTTAAGGGTTCTGGCAAAAACCTTTGGGCTCTAGTTGCTTCGGGCTTTCCTTTGTTCTCTAATCATATCCAATGCTCC
ATTGGGGACAGTTCGAAAACTTACTTTTGGGAGGATTTCTGGGTGGGGGAAAAACCTTTGTGTTCCCTTTTTCCTCAACTTCGTAACCTATCTGATAAGATGTTGCACTC
GGTAGCCTCTATTTTGCCTTCCTTTGACCCCTCATCGTCCCTTTCCTTGGGCCTCAGTCGTCCTCTTACCGATCGTGAGACGACCGACCTTGCTAGCCTTCTTTCCATTC
TTTCTGATCGGACCATTCATCTCGGGAGGAGAGACTCGCGTATCTGGACTCCTAAACCCTCTAAAGGATTTTTCTTGCCGCTCTTATTTCCAAACCTTATGTGCTTCTCC
TCCCCAGGGAATAAGGAGGAGACTCTAAACCATTTGCTTGGTGATTGCTTGTTCGCATCTTCTCTTTGGAATCGCTTCTTTCGGACTTTTGGAGTGGCCTCAGCTCAAGG
CAGGGATTGTGGGTCCATGATTGAGGAAGTTCTCCTCAACTCTCATTTTCGTGATAAAGGAAAGATTTTATGGCAGGCGTGCTTTTTTGCTACTTTGTGGAACATTTGGC
TTGAGAGAAATAGGTGCATTTTTAGAGGGATTGAGAGTTTAGAGGAGGAGATTTGGGAGTCAATTAGATTTAACACTTTCTTATGGGCGTTTGTTGGTTGGCCTTTTTAT
ACTAATACTAAGGTGATCGGTTTTGCAGCATTGGCCTGGCACCCTTTCCATGAAGAGTATTTTGTTAGTGGGAGTTTTGATGGCTCTATTTTTCATTGGCTTGTTGGGTA
TGTTTGTGCTTTAGTTTCAATCTCTTGTGTTGTTTATAATCTATCTCTACGGTTGGGAATAGTTTTGTTTTGA
Protein sequenceShow/hide protein sequence
MLHGKASLGLIILLICKCLPDIWTHLLKLVRILAGHGWDVKSVDWHPTKSLLVSGGKDNLVKLWDAKTGKELCSFHGHKNTVLCVKWNQNGNWVLTSSKDQIIKLYDIRA
MKELESFRGHRKDVTALFNGKPRGKIFATRGLWQGDLVLSAVDKGVVEGFKMGSDGLQVSHLQFAGNTLFFCSEKENYFRNLNGLLSFFEAISMLKIKRQKSSIIGINCE
HSKLASWASLVGVEEGGGAHLVKWEVVSKPIEVGGLGIQNLRLRNQTLLAKWLWRFSFEQGSLWHRVIVSKYGPHPFDWVSGHRLKGSGKNLWALVASGFPLFSNHIQCS
IGDSSKTYFWEDFWVGEKPLCSLFPQLRNLSDKMLHSVASILPSFDPSSSLSLGLSRPLTDRETTDLASLLSILSDRTIHLGRRDSRIWTPKPSKGFFLPLLFPNLMCFS
SPGNKEETLNHLLGDCLFASSLWNRFFRTFGVASAQGRDCGSMIEEVLLNSHFRDKGKILWQACFFATLWNIWLERNRCIFRGIESLEEEIWESIRFNTFLWAFVGWPFY
TNTKVIGFAALAWHPFHEEYFVSGSFDGSIFHWLVGYVCALVSISCVVYNLSLRLGIVLF