; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04130
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr1:2692989..2699613
RNA-Seq ExpressionMoc01g04130
SyntenyMoc01g04130
Gene Ontology termsNA
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR023780 - Chromo domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ97017.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-9136.03Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL  RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKNRELR+ V+ DD    LED EM      GE     +  V+  LN++VG++ 
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST

Query:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK
        P T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G 
Subjt:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK

Query:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL
        M  DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+       E    +A+EE   E  +L
Subjt:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL

TYK17386.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa]1.6e-9135.86Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL  RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKNRELR+ V+ DD    LED EM      GE     +  V+  LN++VG++ 
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST

Query:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK
        P T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G 
Subjt:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK

Query:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNAAEKGS----VEALEELPPEVEEL
        M  DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+   +        +A+EE   E  +L
Subjt:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNAAEKGS----VEALEELPPEVEEL

TYK23724.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.6e-9135.41Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL +RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTP
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKN+ELR+ V+ DD    LED    + A   +    +  V+  LN++VG++ P
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTP

Query:  RTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKM
         T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G M
Subjt:  RTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKM

Query:  EADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL
          DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+       E    +A+EE   E  +L
Subjt:  EADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]1.6e-10741.76Show/hide
Query:  KELETRCEVVEKDVADMKIQITSLHQDFAEQKK-----------VLQEVNMTLKAFIAQVIRGTSMSLLMGTGETSVTKGKTVE------GENPEKEDGN
        K+LE RCE  EK+V+ +K  +  +     EQ+K             +E    L  F+ +V +G S         +S  K K  E      GE    +   
Subjt:  KELETRCEVVEKDVADMKIQITSLHQDFAEQKK-----------VLQEVNMTLKAFIAQVIRGTSMSLLMGTGETSVTKGKTVE------GENPEKEDGN

Query:  RGTDKNRFKKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEGK------------
         G ++N+FKKVEMP+F G +  +W+FRAERYFEI  L+++EKM V IISF+G A+ WY ++ NR+ F DWN+LK R+ +RF  +K+              
Subjt:  RGTDKNRFKKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEGK------------

Query:  -----------QSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKG---------------LDQGTGASQAAKKEPPFKKLSEAEFQKGRERGLCFRCE
                    SA+LP ++++V+E        P       A  PK                  +    +Q  +KE   KKL+E E+Q+ +++GLCFR E
Subjt:  -----------QSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKG---------------LDQGTGASQAAKKEPPFKKLSEAEFQKGRERGLCFRCE

Query:  EKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTPRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAK
        EKY++GH+CKN+EL+VFV+HDDE  EL+ E       G +     +  +  LNT+VG STP T+K+ G+I+  +VV+LIDCGATHNFIS  L D   L  
Subjt:  EKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTPRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAK

Query:  TKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKMEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQ
         +T NYGVIMG+G  V+G G+CKGI+L LPELT+ E+FLPLELG LDVVLGMQWL T G ME DW ALT+ F  G  +I ++GDPTL  +EV++K+LA+ 
Subjt:  TKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKMEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQ

Query:  WEAQDQGYLVKMRALNAAEKGSVEALEELPP-----EVEELIHATP
        WE  DQG+LV++RA +A  +G  EA+E + P     E+E LI   P
Subjt:  WEAQDQGYLVKMRALNAAEKGSVEALEELPP-----EVEELIHATP

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]2.7e-0647.54Show/hide
Query:  LTEEWEWMVEPDSVFAYQTNPTTGEVEALVNWKGLPEEEVTWESVRDINQQFPEFHLEDKL
        L +++E + EP+ VFA   N  T E+EALV+WK   E + TWE++     QFP FHLEDK+
Subjt:  LTEEWEWMVEPDSVFAYQTNPTTGEVEALVNWKGLPEEEVTWESVRDINQQFPEFHLEDKL

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]1.6e-9136.03Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL  RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKNRELR+ V+ DD    LED EM      GE     +  V+  LN++VG++ 
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST

Query:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK
        P T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G 
Subjt:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK

Query:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL
        M  DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+       E    +A+EE   E  +L
Subjt:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL

TrEMBL top hitse value%identityAlignment
A0A5A7T606 Gypsy/ty3 element polyprotein7.8e-9236.03Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL  RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKNRELR+ V+ DD    LED EM      GE     +  V+  LN++VG++ 
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST

Query:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK
        P T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G 
Subjt:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK

Query:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL
        M  DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+       E    +A+EE   E  +L
Subjt:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL

A0A5D3D378 Gypsy/ty3 element polyprotein7.8e-9235.86Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL  RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKNRELR+ V+ DD    LED EM      GE     +  V+  LN++VG++ 
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST

Query:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK
        P T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G 
Subjt:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK

Query:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNAAEKGS----VEALEELPPEVEEL
        M  DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+   +        +A+EE   E  +L
Subjt:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNAAEKGS----VEALEELPPEVEEL

A0A5D3DJA9 Ty3/gypsy retrotransposon protein1.7e-9135.41Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL +RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTP
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKN+ELR+ V+ DD    LED    + A   +    +  V+  LN++VG++ P
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTP

Query:  RTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKM
         T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G M
Subjt:  RTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKM

Query:  EADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL
          DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+       E    +A+EE   E  +L
Subjt:  EADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL

A0A6J1DN22 Reverse transcriptase7.8e-10841.76Show/hide
Query:  KELETRCEVVEKDVADMKIQITSLHQDFAEQKK-----------VLQEVNMTLKAFIAQVIRGTSMSLLMGTGETSVTKGKTVE------GENPEKEDGN
        K+LE RCE  EK+V+ +K  +  +     EQ+K             +E    L  F+ +V +G S         +S  K K  E      GE    +   
Subjt:  KELETRCEVVEKDVADMKIQITSLHQDFAEQKK-----------VLQEVNMTLKAFIAQVIRGTSMSLLMGTGETSVTKGKTVE------GENPEKEDGN

Query:  RGTDKNRFKKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEGK------------
         G ++N+FKKVEMP+F G +  +W+FRAERYFEI  L+++EKM V IISF+G A+ WY ++ NR+ F DWN+LK R+ +RF  +K+              
Subjt:  RGTDKNRFKKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEGK------------

Query:  -----------QSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKG---------------LDQGTGASQAAKKEPPFKKLSEAEFQKGRERGLCFRCE
                    SA+LP ++++V+E        P       A  PK                  +    +Q  +KE   KKL+E E+Q+ +++GLCFR E
Subjt:  -----------QSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKG---------------LDQGTGASQAAKKEPPFKKLSEAEFQKGRERGLCFRCE

Query:  EKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTPRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAK
        EKY++GH+CKN+EL+VFV+HDDE  EL+ E       G +     +  +  LNT+VG STP T+K+ G+I+  +VV+LIDCGATHNFIS  L D   L  
Subjt:  EKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTPRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAK

Query:  TKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKMEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQ
         +T NYGVIMG+G  V+G G+CKGI+L LPELT+ E+FLPLELG LDVVLGMQWL T G ME DW ALT+ F  G  +I ++GDPTL  +EV++K+LA+ 
Subjt:  TKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKMEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQ

Query:  WEAQDQGYLVKMRALNAAEKGSVEALEELPP-----EVEELIHATP
        WE  DQG+LV++RA +A  +G  EA+E + P     E+E LI   P
Subjt:  WEAQDQGYLVKMRALNAAEKGSVEALEELPP-----EVEELIHATP

A0A6J1DN22 Reverse transcriptase1.3e-0647.54Show/hide
Query:  LTEEWEWMVEPDSVFAYQTNPTTGEVEALVNWKGLPEEEVTWESVRDINQQFPEFHLEDKL
        L +++E + EP+ VFA   N  T E+EALV+WK   E + TWE++     QFP FHLEDK+
Subjt:  LTEEWEWMVEPDSVFAYQTNPTTGEVEALVNWKGLPEEEVTWESVRDINQQFPEFHLEDKL

A0A6J1DN22 Reverse transcriptase7.8e-9236.03Show/hide
Query:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF
        MT K +E R +  EK++  +K ++    Q     +K +++++  L        R    S L G  TG+  +     V    EGE     +   G D+ +F
Subjt:  MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMG--TGETSVTKGKTV----EGENPEKEDGNRGTDKNRF

Query:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------
        KK+EMP+F G + + WI+RAE YF++  L + EK+ +AI+S +G  L W+R+  NR++FR W +LKERL  RF   + G                     
Subjt:  KKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEG---------------------

Query:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------
          + SA LP+++++VL  AF NGLDPV+R EV A+   GL+    A++ A                                                  
Subjt:  --KQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQAA--------------------------------------------------

Query:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST
                    +++  F++ +++E Q  R++GLC+RCEE ++ GH+CKNRELR+ V+ DD    LED EM      GE     +  V+  LN++VG++ 
Subjt:  ------------KKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELED-EMQQEDASGEKGNGETQAVQCQLNTIVGIST

Query:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK
        P T K+ G ++  ++V+++DCGATHNFISL L + LKL   +T NYGVIMGSG AV+G G+CKGI + LP ++++EDFLPLELG +D+VLGMQWLQ  G 
Subjt:  PRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGK

Query:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL
        M  DW ALT+ F  G+ K+ ++GDP+L  +E+S+K L K W+  DQG+LV  RA+       E    +A+EE   E  +L
Subjt:  MEADWPALTLRFDRGEKKITIRGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNA----AEKGSVEALEELPPEVEEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53705.1 aminoacyl-tRNA ligases;nucleotide binding;ATP binding2.7e-0441.51Show/hide
Query:  MGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGK--LDVVLGMQWLQTMGK
        MG G  ++  G C GI L + E  ++ED+L L+L K   DV+LG +WL  +G+
Subjt:  MGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGK--LDVVLGMQWLQTMGK

AT1G67020.1 unknown protein3.0e-1134.74Show/hide
Query:  GENPEKEDGNRGTDKNRFKKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAK
        GE+P   D NR    +  +++EMP+F G  +  W  + ER+F + +  D +K+ +  +S +G+AL W+    +  +FRDWN  ++RL  RF P K
Subjt:  GENPEKEDGNRGTDKNRFKKVEMPIFVGPELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAK

AT3G29750.1 Eukaryotic aspartyl protease family protein4.5e-1534.11Show/hide
Query:  IVGISTPRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGK--LDVVLGM
        ++ ++  + ++  G I  H VV+ ID GAT NFI + LA  LKL  + T    V++G    ++  G C GI L + E+ + E+FL L+L K  +DV+LG 
Subjt:  IVGISTPRTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGK--LDVVLGM

Query:  QWLQTMGKMEADWPALTLRFDRGEKKITI
        +WL  +G+   +W      F   ++ IT+
Subjt:  QWLQTMGKMEADWPALTLRFDRGEKKITI

AT3G30770.1 Eukaryotic aspartyl protease family protein1.1e-1030.25Show/hide
Query:  RTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGK--LDVVLGMQWLQTMG
        + ++  G I  H VV++ID GAT+NFIS  LA  LKL  + T    V++G    ++  G C GI L + E+ + E+FL L+L K  +DV+LG    Q + 
Subjt:  RTIKVNGVIQGHDVVLLIDCGATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGK--LDVVLGMQWLQTMG

Query:  KMEADWPALTLRFDRGEKKITI-RGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNAAEK
        +    W      F   ++ +T+   D  L  +   +K  ++  + +   YL     L    K
Subjt:  KMEADWPALTLRFDRGEKKITI-RGDPTLACIEVSMKRLAKQWEAQDQGYLVKMRALNAAEK

AT3G44713.1 unknown protein1.6e-0434.48Show/hide
Query:  ELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKE-------GKQSAALPQLSDE
        EL +WI   E +F  +  TDDEKM +A    +G A AW+        FR W  L++ L  RF   K+        KQ   L +  DE
Subjt:  ELEAWIFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKE-------GKQSAALPQLSDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGTCAAAGAGCTAGAAACCAGGTGTGAGGTGGTGGAGAAGGATGTTGCAGACATGAAGATCCAGATCACGTCGCTGCACCAAGACTTCGCTGAGCAAAAGAAGGT
GCTCCAGGAGGTCAACATGACGCTGAAAGCATTCATTGCTCAAGTTATTCGGGGTACCTCGATGTCTTTGTTGATGGGGACTGGGGAGACTTCGGTGACAAAAGGCAAAA
CGGTAGAAGGGGAGAATCCGGAGAAAGAAGATGGAAACAGGGGGACGGATAAAAATCGCTTCAAGAAGGTAGAGATGCCTATCTTCGTGGGTCCGGAGTTGGAAGCGTGG
ATATTCAGAGCGGAGAGGTACTTCGAGATACAGAAGCTAACGGACGATGAGAAGATGATGGTTGCTATCATCAGCTTCGACGGCATCGCTCTAGCGTGGTATAGGTACAC
GAATAACCGAGAGAAGTTTCGTGACTGGAATGATCTCAAAGAACGCCTCAGGGACCGGTTCCATCCGGCGAAAGAAGGGAAGCAGTCCGCCGCCCTCCCTCAACTCTCCG
ACGAGGTATTGGAAAGTGCCTTCCTAAACGGACTGGATCCAGTGATGCGAGCTGAAGTTCGGGCCCTAGAGCCCAAAGGCCTGGACCAGGGAACGGGAGCGTCACAAGCA
GCCAAGAAGGAACCACCCTTCAAAAAGCTATCGGAAGCAGAGTTTCAGAAAGGAAGGGAGCGTGGCCTGTGTTTCCGCTGTGAGGAGAAGTATACTGTAGGCCACAAGTG
CAAGAACAGAGAGCTCAGAGTTTTTGTTCTGCACGACGACGAGATGTTCGAGCTGGAGGATGAGATGCAACAAGAGGACGCATCAGGGGAGAAGGGAAATGGTGAAACCC
AAGCCGTCCAATGTCAGCTTAATACCATTGTGGGGATCTCTACCCCTAGAACAATCAAAGTTAATGGGGTGATACAAGGGCATGATGTGGTGTTACTGATAGACTGTGGG
GCAACCCACAACTTCATCTCTCTCACTTTGGCTGATGAACTGAAGTTGGCAAAAACAAAAACACTCAACTATGGAGTAATAATGGGCTCGGGAGCAGCGGTCAAAGGGAA
CGGCGTCTGCAAGGGGATTGTCTTATCATTACCAGAACTGACGGTGATGGAAGACTTCCTTCCTCTAGAATTGGGGAAGTTGGATGTAGTCCTGGGAATGCAGTGGCTCC
AAACGATGGGAAAAATGGAAGCAGACTGGCCAGCTCTTACACTACGTTTCGACAGAGGAGAAAAGAAAATAACTATCAGGGGGGATCCTACGCTGGCATGCATCGAGGTC
TCAATGAAGAGGTTGGCTAAACAATGGGAGGCTCAAGACCAAGGCTATTTGGTAAAGATGAGGGCGCTCAATGCAGCTGAGAAGGGCTCAGTGGAGGCCCTAGAGGAATT
ACCCCCGGAAGTAGAGGAACTCATTCATGCCACCCCTACCCTAGCAGTTCTATCAGCACTCGCAGTATTAGATGTAGAAGTTGTCAAACGAGAAATTGCTGAGGATCCAC
ACTTCTCCAAAGTGTTAGCGGATCTGGATGATGACCCGGAGGCAGCCACGGCAACTCAGTTTATTCCTTCTCAACTGACCGAGGAGTGGGAATGGATGGTGGAACCTGAT
TCAGTGTTTGCTTATCAGACCAATCCCACCACGGGAGAAGTCGAAGCCTTAGTCAATTGGAAAGGTTTACCAGAAGAAGAAGTAACCTGGGAGTCTGTGCGCGACATCAA
CCAACAATTTCCTGAGTTTCACCTTGAGGACAAGCTTTTGTTTGTAGGGAGAGATTACCCACCCTCTCGAATTGGTGGGTCTGTTTTGTGCCTACGGGCTCGTTGTTCTT
GCTTTGTATTGCCTACGGGCTTAATCGAAAGGAGTACATGTCCTAATGTTTCTTTGCCTTCATTTGCGACATCTTGTTGGATTCTTGATTCTGGGGCTTCCACACATATT
TGTTTTTCTCGTGATCTCTTTACAACCCTTCTGCCTATATCTACGTATGTGATATTGCCTGATGGATCTCGTCTTCATGTCCTCTTTATTGGCACGGTGAAGCTTCTTGG
TTCTATTATTCTCAAGAATGTTCTTTTTGTGCCTCAATTTCGCTACAACATGTTGTCTATTAGTGCTCTCACTGCACATGACTCTGTTTTTGTGCAATTCTATATTGATC
ACTGTACTATTCAGGAAAGGTCCACTTTGAAGACGATTGGCAAGGGTAAATTATGTGATGGACTTTATCTGCTCAATCTGTCTGATGTTCTGTCTCCAGTTTTGGTTAAT
GCTATCACAGTCCGAGATCAGACTACTCTGTGGCCTGCCAGGCATGATTCTTCAGATGTTACTTGTTTCGGGGATGTACTTGCTACTAATGTTGCTGTTCATGACTCAAT
TTTGCATACTTCTCTTGATATTATTGCTATGTCCAATGCTGCATCATCTGCTAATCCTCCTAGTCCTATGGATATGACTAGTCCTTCTCCCATGGATAGTCATACAGTTG
ATGTCAATCCCCCAACCGTTGTTGACATTGATATAGTTGTACCTAATGATGTAACCCATCCTATTGAGTTGTCTAATCCCATTGTTATTACCAACCCGAGTTCAATTGTC
TCTACCGACATAGTTGTCCATCTTTCGGCTCGAATTCAGCTGCTTCTAATGACATAG
mRNA sequenceShow/hide mRNA sequence
ATGACGGTCAAAGAGCTAGAAACCAGGTGTGAGGTGGTGGAGAAGGATGTTGCAGACATGAAGATCCAGATCACGTCGCTGCACCAAGACTTCGCTGAGCAAAAGAAGGT
GCTCCAGGAGGTCAACATGACGCTGAAAGCATTCATTGCTCAAGTTATTCGGGGTACCTCGATGTCTTTGTTGATGGGGACTGGGGAGACTTCGGTGACAAAAGGCAAAA
CGGTAGAAGGGGAGAATCCGGAGAAAGAAGATGGAAACAGGGGGACGGATAAAAATCGCTTCAAGAAGGTAGAGATGCCTATCTTCGTGGGTCCGGAGTTGGAAGCGTGG
ATATTCAGAGCGGAGAGGTACTTCGAGATACAGAAGCTAACGGACGATGAGAAGATGATGGTTGCTATCATCAGCTTCGACGGCATCGCTCTAGCGTGGTATAGGTACAC
GAATAACCGAGAGAAGTTTCGTGACTGGAATGATCTCAAAGAACGCCTCAGGGACCGGTTCCATCCGGCGAAAGAAGGGAAGCAGTCCGCCGCCCTCCCTCAACTCTCCG
ACGAGGTATTGGAAAGTGCCTTCCTAAACGGACTGGATCCAGTGATGCGAGCTGAAGTTCGGGCCCTAGAGCCCAAAGGCCTGGACCAGGGAACGGGAGCGTCACAAGCA
GCCAAGAAGGAACCACCCTTCAAAAAGCTATCGGAAGCAGAGTTTCAGAAAGGAAGGGAGCGTGGCCTGTGTTTCCGCTGTGAGGAGAAGTATACTGTAGGCCACAAGTG
CAAGAACAGAGAGCTCAGAGTTTTTGTTCTGCACGACGACGAGATGTTCGAGCTGGAGGATGAGATGCAACAAGAGGACGCATCAGGGGAGAAGGGAAATGGTGAAACCC
AAGCCGTCCAATGTCAGCTTAATACCATTGTGGGGATCTCTACCCCTAGAACAATCAAAGTTAATGGGGTGATACAAGGGCATGATGTGGTGTTACTGATAGACTGTGGG
GCAACCCACAACTTCATCTCTCTCACTTTGGCTGATGAACTGAAGTTGGCAAAAACAAAAACACTCAACTATGGAGTAATAATGGGCTCGGGAGCAGCGGTCAAAGGGAA
CGGCGTCTGCAAGGGGATTGTCTTATCATTACCAGAACTGACGGTGATGGAAGACTTCCTTCCTCTAGAATTGGGGAAGTTGGATGTAGTCCTGGGAATGCAGTGGCTCC
AAACGATGGGAAAAATGGAAGCAGACTGGCCAGCTCTTACACTACGTTTCGACAGAGGAGAAAAGAAAATAACTATCAGGGGGGATCCTACGCTGGCATGCATCGAGGTC
TCAATGAAGAGGTTGGCTAAACAATGGGAGGCTCAAGACCAAGGCTATTTGGTAAAGATGAGGGCGCTCAATGCAGCTGAGAAGGGCTCAGTGGAGGCCCTAGAGGAATT
ACCCCCGGAAGTAGAGGAACTCATTCATGCCACCCCTACCCTAGCAGTTCTATCAGCACTCGCAGTATTAGATGTAGAAGTTGTCAAACGAGAAATTGCTGAGGATCCAC
ACTTCTCCAAAGTGTTAGCGGATCTGGATGATGACCCGGAGGCAGCCACGGCAACTCAGTTTATTCCTTCTCAACTGACCGAGGAGTGGGAATGGATGGTGGAACCTGAT
TCAGTGTTTGCTTATCAGACCAATCCCACCACGGGAGAAGTCGAAGCCTTAGTCAATTGGAAAGGTTTACCAGAAGAAGAAGTAACCTGGGAGTCTGTGCGCGACATCAA
CCAACAATTTCCTGAGTTTCACCTTGAGGACAAGCTTTTGTTTGTAGGGAGAGATTACCCACCCTCTCGAATTGGTGGGTCTGTTTTGTGCCTACGGGCTCGTTGTTCTT
GCTTTGTATTGCCTACGGGCTTAATCGAAAGGAGTACATGTCCTAATGTTTCTTTGCCTTCATTTGCGACATCTTGTTGGATTCTTGATTCTGGGGCTTCCACACATATT
TGTTTTTCTCGTGATCTCTTTACAACCCTTCTGCCTATATCTACGTATGTGATATTGCCTGATGGATCTCGTCTTCATGTCCTCTTTATTGGCACGGTGAAGCTTCTTGG
TTCTATTATTCTCAAGAATGTTCTTTTTGTGCCTCAATTTCGCTACAACATGTTGTCTATTAGTGCTCTCACTGCACATGACTCTGTTTTTGTGCAATTCTATATTGATC
ACTGTACTATTCAGGAAAGGTCCACTTTGAAGACGATTGGCAAGGGTAAATTATGTGATGGACTTTATCTGCTCAATCTGTCTGATGTTCTGTCTCCAGTTTTGGTTAAT
GCTATCACAGTCCGAGATCAGACTACTCTGTGGCCTGCCAGGCATGATTCTTCAGATGTTACTTGTTTCGGGGATGTACTTGCTACTAATGTTGCTGTTCATGACTCAAT
TTTGCATACTTCTCTTGATATTATTGCTATGTCCAATGCTGCATCATCTGCTAATCCTCCTAGTCCTATGGATATGACTAGTCCTTCTCCCATGGATAGTCATACAGTTG
ATGTCAATCCCCCAACCGTTGTTGACATTGATATAGTTGTACCTAATGATGTAACCCATCCTATTGAGTTGTCTAATCCCATTGTTATTACCAACCCGAGTTCAATTGTC
TCTACCGACATAGTTGTCCATCTTTCGGCTCGAATTCAGCTGCTTCTAATGACATAG
Protein sequenceShow/hide protein sequence
MTVKELETRCEVVEKDVADMKIQITSLHQDFAEQKKVLQEVNMTLKAFIAQVIRGTSMSLLMGTGETSVTKGKTVEGENPEKEDGNRGTDKNRFKKVEMPIFVGPELEAW
IFRAERYFEIQKLTDDEKMMVAIISFDGIALAWYRYTNNREKFRDWNDLKERLRDRFHPAKEGKQSAALPQLSDEVLESAFLNGLDPVMRAEVRALEPKGLDQGTGASQA
AKKEPPFKKLSEAEFQKGRERGLCFRCEEKYTVGHKCKNRELRVFVLHDDEMFELEDEMQQEDASGEKGNGETQAVQCQLNTIVGISTPRTIKVNGVIQGHDVVLLIDCG
ATHNFISLTLADELKLAKTKTLNYGVIMGSGAAVKGNGVCKGIVLSLPELTVMEDFLPLELGKLDVVLGMQWLQTMGKMEADWPALTLRFDRGEKKITIRGDPTLACIEV
SMKRLAKQWEAQDQGYLVKMRALNAAEKGSVEALEELPPEVEELIHATPTLAVLSALAVLDVEVVKREIAEDPHFSKVLADLDDDPEAATATQFIPSQLTEEWEWMVEPD
SVFAYQTNPTTGEVEALVNWKGLPEEEVTWESVRDINQQFPEFHLEDKLLFVGRDYPPSRIGGSVLCLRARCSCFVLPTGLIERSTCPNVSLPSFATSCWILDSGASTHI
CFSRDLFTTLLPISTYVILPDGSRLHVLFIGTVKLLGSIILKNVLFVPQFRYNMLSISALTAHDSVFVQFYIDHCTIQERSTLKTIGKGKLCDGLYLLNLSDVLSPVLVN
AITVRDQTTLWPARHDSSDVTCFGDVLATNVAVHDSILHTSLDIIAMSNAASSANPPSPMDMTSPSPMDSHTVDVNPPTVVDIDIVVPNDVTHPIELSNPIVITNPSSIV
STDIVVHLSARIQLLLMT