; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026089 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026089
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr10:28957543..28960979
RNA-Seq ExpressionLag0026089
SyntenyLag0026089
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOX99415.1 Uncharacterized protein TCM_008095 [Theobroma cacao]2.4e-10229.36Show/hide
Query:  NVQQIHGDSLVEDVISQFKD---VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSL
        + QQ  GD L +D  S   D   +   ++    +   W+K     +A F  KYGH+ARL+ VQV+  +LKA+++ WDP+YRCF F  +DM PTIEEY +L
Subjt:  NVQQIHGDSLVEDVISQFKD---VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSL

Query:  LHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIE
        L +      + Y   Q+   +R L+ ++G I ++++++ ++ KG+N C+P  ++ +   K  + ++   ++AL I+ +V+FPKV G++E  ++  F ++ 
Subjt:  LHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIE

Query:  IGVDPIIPVLAETFQ---------------------------FKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS------------
           +P   +LA+T +                           F+C   KF   ++    PI EF +S W  +   +  W + F EL S            
Subjt:  IGVDPIIPVLAETFQ---------------------------FKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS------------

Query:  ----------------------------------------------------------------------------GQFHDDTTEAYKTWHANRAKTVLV
                                                                                    G++ D+ T  Y+ WH  R K V V
Subjt:  ----------------------------------------------------------------------------GQFHDDTTEAYKTWHANRAKTVLV

Query:  SPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLEIKQGLLCNAELEKELNRLKGSV--------SKQEQLEKEISALDTEARDLN
         PK   +  L   V P+ +   +  E    +    N+S  Q  E LQ E ++     +E  K++ +++G          +   +L++EI   + +  +L 
Subjt:  SPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLEIKQGLLCNAELEKELNRLKGSV--------SKQEQLEKEISALDTEARDLN

Query:  RRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEVEEKNETLCRTIDD------------MQLTLKIR------------
             LRR     QV  +  + Q+L+Q+ E   L  +M   +       Q  T ++E  E+  +   +             ++  K R            
Subjt:  RRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEVEEKNETLCRTIDD------------MQLTLKIR------------

Query:  -------EDQLGELINDN-----------------------KGSKRVRSVTQCTPRTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGK-IIP
               E QL + ++++                          +RV +  Q   R RIME++  E    ME+ ++++ E   KM  ++++L +GK  I 
Subjt:  -------EDQLGELINDN-----------------------KGSKRVRSVTQCTPRTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGK-IIP

Query:  DIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVHVHNQP---GFSLPTKVPPKMTIIVPNLDDPEIRKEL---------
        + AQS    N P      +   P Y P +  P  Q     HP    P + N    + +QP    F   + + P  +I +P+LDDP+ +++L         
Subjt:  DIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVHVHNQP---GFSLPTKVPPKMTIIVPNLDDPEIRKEL---------

Query:  -------------------------TGDATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDN
                                 T DAT+LCL PDV++P KFKVP+FEKYDG  CP  H+ MYCRKMAA   +DKLLI+ FQDSL G  +RWY+QLD 
Subjt:  -------------------------TGDATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDN

Query:  THICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEY
          I +WK LA +F+ QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ + INTL++PFY+++IG+A  NF+D++   E IE 
Subjt:  THICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEY

Query:  GIKHGKIIDTAGSSTAIKGVRSKKKEGEVQMI
         IK GKI     +S+  KG   KKKEG+VQ +
Subjt:  GIKHGKIIDTAGSSTAIKGVRSKKKEGEVQMI

KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]1.5e-12535.39Show/hide
Query:  VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRA
        + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTIEEY++L+ +P     + Y Y ++LT++R+
Subjt:  VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRA

Query:  LSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETFQ--------
        LS  +G I  S+++KQ+K K    C+P++Y+  L +      K L+L+ALCI+  V+FP++ GYVEE +VK+F  IE GV+P+IP++AETF+        
Subjt:  LSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETFQ--------

Query:  -----FKC--------------------PEIKFSSSWNKLQNPISEFVQSGWSSSSPERSA------------------------------WE-------
             F C                    P+IKF+S WNK +N I EF  + W    PE  A                              WE       
Subjt:  -----FKC--------------------PEIKFSSSWNKLQNPISEFVQSGWSSSSPERSA------------------------------WE-------

Query:  ------------AFFSELKSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTKQV-ACEKECDELRKANSSLVQENERLQLEIKQGLL
                     F  ++KS +  + TTE Y  W A+R   + ++P       ++  ++P+ + K+    EKE   LR+ N  L  ENE+L+ E+K+ + 
Subjt:  ------------AFFSELKSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTKQV-ACEKECDELRKANSSLVQENERLQLEIKQGLL

Query:  CNAELEKELNRLK--------GSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEV
              ++L+  K        G    Q  +E+  S L  EA   N  + +    SE SQ+ +  +  +V+    + A L E  +E+     +        
Subjt:  CNAELEKELNRLK--------GSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEV

Query:  EEKNETLCRTIDDMQLTLKIREDQLGELIN-DNKGSKRVRSVTQCTPRTRIMEEQST-------EMEKTKKDIEELREKMDAILVALERGKIIPDIAQSS
        +++ ETL R  DD    ++   ++L ++    ++ S + R++ Q    TR  +++ +       +    ++D+E L++++  IL  L  G+    +  SS
Subjt:  EEKNETLCRTIDDMQLTLKIREDQLGELIN-DNKGSKRVRSVTQCTPRTRIMEEQST-------EMEKTKKDIEELREKMDAILVALERGKIIPDIAQSS

Query:  STMNDPPIRQSTE-------GTTPKYHPLYNIPVEQHPFPF-------FKNEQVHVHNQPGFSLPTKVPPKMTIIVPNLDDPEIRKELTGDATKLCLAPD
            D  + Q  E       G TP+      +    +P  F          ++ H +N           P  T+I   ++  +   E  G   +L     
Subjt:  STMNDPPIRQSTE-------GTTPKYHPLYNIPVEQHPFPF-------FKNEQVHVHNQPGFSLPTKVPPKMTIIVPNLDDPEIRKELTGDATKLCLAPD

Query:  VILPPKFKVPE-FEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKK
          L  + +V E FEKY+G SCPK+HL+MYCRKM+AY  +DKLLIHCFQDSL+ P SRWYMQLD + +  WK+LADSFLKQYK+NIDMAPDRLDLQRMEKK
Subjt:  VILPPKFKVPE-FEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKK

Query:  STESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIKGVRSKKKEGEV
        + E+FKEYAQRWR+ AAQVQPPL DKEL+ M INTL++P+YD+M+GSASTNFSD++TI ERIE+G+K+G+I D A  +  +  +  KKKEGEV
Subjt:  STESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIKGVRSKKKEGEV

KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]1.6e-15137.89Show/hide
Query:  WEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYGH+A LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEY+++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  KQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETF-----------------------
        K +K KG    +P DY+  + Q + +EDK LTLLALCI+  V+FPK  GYV+ +++KLF ++E GV+PIIP+LAETF                       
Subjt:  KQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETF-----------------------

Query:  ------------------QFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS-------------------GQFH------------
                          +F+CP + FSS WN ++N ISEF  + W  + P + AW +FF++L S                   G FH            
Subjt:  ------------------QFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS-------------------GQFH------------

Query:  ------------------------------DDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLE
                                      +  T  Y+ W ANR K ++ +S ++  + K  +   P+Q  ++        EL + N  L QENE+L+ E
Subjt:  ------------------------------DDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLE

Query:  IKQGLLCNAELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRN
          Q +     L+ EL + K  +  Q++LE ++  LD E R +N+    ++ +    Q T+        +S   ++LK  ++   LH  +  L++      
Subjt:  IKQGLLCNAELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRN

Query:  QTITEVEEKN---------------------ETLCRTIDDMQLTLKIREDQLGELINDNKGSKRVRSVTQCTPRTRIMEEQSTEMEKTKKDIEELREKMD
        Q   E E  N                     E + +TI+ +++  K R +   E          +R       +++IMEE+  +M+K +++I  L E++ 
Subjt:  QTITEVEEKN---------------------ETLCRTIDDMQLTLKIREDQLGELINDNKGSKRVRSVTQCTPRTRIMEEQSTEMEKTKKDIEELREKMD

Query:  AI--LVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVHVHNQPGFSLPTKVPPKMTIIVPNL---DDPEIRKELTG
         I  L+++ +GK   D  QSS+ + D        G TP YH   N+P  +     F              L  +   K+ ++   L   ++ ++   +  
Subjt:  AI--LVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVHVHNQPGFSLPTKVPPKMTIIVPNL---DDPEIRKELTG

Query:  DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDR
        DAT+LCL P +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSL  P SRWY+QLDN HI  WK+LAD+FLKQYK NIDMAPDR
Subjt:  DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDR

Query:  LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKE
        LDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++M +NTL++PFY++MIG+ASTNFSDI+ I ERIEYGIKHG++ +       I KG  SKKKE
Subjt:  LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKE

Query:  GEVQMIGF
        GEV  IGF
Subjt:  GEVQMIGF

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]1.1e-9954.52Show/hide
Query:  EEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKNE
        +++ +E EKT+KDIEELREK+DAIL+ALE+GK    IA++S+ +++PP  Q   G  P                      Y+PLY+IP  Q P P  +  
Subjt:  EEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKNE

Query:  QVHVHNQPG---FSLPTKVPPKMTIIVPNLDDPEIRKE---------------------LTG-------DATKLCLAPDVILPPKFKVPEFEKYDGASCP
               P    F  P +V    T++  NL  P+  K+                     + G       DA++LCL   +++PPKFK+PEFEKY+G+SCP
Subjt:  QVHVHNQPG---FSLPTKVPPKMTIIVPNLDDPEIRKE---------------------LTG-------DATKLCLAPDVILPPKFKVPEFEKYDGASCP

Query:  KNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPP
        KNHLIMYCRKMAAY+QNDKLLIHCFQDSL GP S WYM LD+ H+ SWKNLADSFLKQYKHNIDM  DRLDLQ MEKK+ ESFKEY QRWRDTAAQ QPP
Subjt:  KNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPP

Query:  LADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIK-GVRSKKKEGEVQMIG
          DKELS+M INTLK PFYD+MIGSAST+FSDI+TI ERIEYG+ HG+I  T G S+  K    SKKKEGEVQM+G
Subjt:  LADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIK-GVRSKKKEGEVQMIG

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]9.7e-10458.73Show/hide
Query:  MEEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKNEQ----
        ME+Q  E EKT+KDIEELREK+D I + LE+GK   D A SS+ +++            PP+R   EG  P+   Y+PLY++P+ Q+P  F K  Q    
Subjt:  MEEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKNEQ----

Query:  --VHVHNQPGFSLPT-----KVPPKMTIIVPNLDDPEIRKELTG--------------DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM
          +    +   S PT       P K   +  N    E  + L                DA++LCL   +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKM
Subjt:  --VHVHNQPGFSLPT-----KVPPKMTIIVPNLDDPEIRKELTG--------------DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM

Query:  AAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCI
         AYVQN KLLIHCFQDSL G  SRWYMQLD++H+ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEK STESFKEYAQRWRDTAAQVQPPL DKELS M I
Subjt:  AAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCI

Query:  NTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKEGEV
        NTLK PFYD+MIGSASTNFSDIMTI ERIEYG++H +I  TA    A  K   SKKKEGE+
Subjt:  NTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKEGEV

TrEMBL top hitse value%identityAlignment
A0A061E378 G-patch domain-containing protein1.2e-10229.36Show/hide
Query:  NVQQIHGDSLVEDVISQFKD---VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSL
        + QQ  GD L +D  S   D   +   ++    +   W+K     +A F  KYGH+ARL+ VQV+  +LKA+++ WDP+YRCF F  +DM PTIEEY +L
Subjt:  NVQQIHGDSLVEDVISQFKD---VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSL

Query:  LHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIE
        L +      + Y   Q+   +R L+ ++G I ++++++ ++ KG+N C+P  ++ +   K  + ++   ++AL I+ +V+FPKV G++E  ++  F ++ 
Subjt:  LHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIE

Query:  IGVDPIIPVLAETFQ---------------------------FKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS------------
           +P   +LA+T +                           F+C   KF   ++    PI EF +S W  +   +  W + F EL S            
Subjt:  IGVDPIIPVLAETFQ---------------------------FKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS------------

Query:  ----------------------------------------------------------------------------GQFHDDTTEAYKTWHANRAKTVLV
                                                                                    G++ D+ T  Y+ WH  R K V V
Subjt:  ----------------------------------------------------------------------------GQFHDDTTEAYKTWHANRAKTVLV

Query:  SPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLEIKQGLLCNAELEKELNRLKGSV--------SKQEQLEKEISALDTEARDLN
         PK   +  L   V P+ +   +  E    +    N+S  Q  E LQ E ++     +E  K++ +++G          +   +L++EI   + +  +L 
Subjt:  SPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLEIKQGLLCNAELEKELNRLKGSV--------SKQEQLEKEISALDTEARDLN

Query:  RRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEVEEKNETLCRTIDD------------MQLTLKIR------------
             LRR     QV  +  + Q+L+Q+ E   L  +M   +       Q  T ++E  E+  +   +             ++  K R            
Subjt:  RRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEVEEKNETLCRTIDD------------MQLTLKIR------------

Query:  -------EDQLGELINDN-----------------------KGSKRVRSVTQCTPRTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGK-IIP
               E QL + ++++                          +RV +  Q   R RIME++  E    ME+ ++++ E   KM  ++++L +GK  I 
Subjt:  -------EDQLGELINDN-----------------------KGSKRVRSVTQCTPRTRIMEEQSTE----MEKTKKDIEELREKMDAILVALERGK-IIP

Query:  DIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVHVHNQP---GFSLPTKVPPKMTIIVPNLDDPEIRKEL---------
        + AQS    N P      +   P Y P +  P  Q     HP    P + N    + +QP    F   + + P  +I +P+LDDP+ +++L         
Subjt:  DIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQ-----HPF---PFFKNEQVHVHNQP---GFSLPTKVPPKMTIIVPNLDDPEIRKEL---------

Query:  -------------------------TGDATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDN
                                 T DAT+LCL PDV++P KFKVP+FEKYDG  CP  H+ MYCRKMAA   +DKLLI+ FQDSL G  +RWY+QLD 
Subjt:  -------------------------TGDATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDN

Query:  THICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEY
          I +WK LA +F+ QYKH  ++APDRL LQ MEKK +E+FKEYAQRWRDTAAQVQPPL DKE++ + INTL++PFY+++IG+A  NF+D++   E IE 
Subjt:  THICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEY

Query:  GIKHGKIIDTAGSSTAIKGVRSKKKEGEVQMI
         IK GKI     +S+  KG   KKKEG+VQ +
Subjt:  GIKHGKIIDTAGSSTAIKGVRSKKKEGEVQMI

A0A5A7SUT0 Reverse transcriptase7.5e-12635.39Show/hide
Query:  VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRA
        + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTIEEY++L+ +P     + Y Y ++LT++R+
Subjt:  VSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRA

Query:  LSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETFQ--------
        LS  +G I  S+++KQ+K K    C+P++Y+  L +      K L+L+ALCI+  V+FP++ GYVEE +VK+F  IE GV+P+IP++AETF+        
Subjt:  LSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETFQ--------

Query:  -----FKC--------------------PEIKFSSSWNKLQNPISEFVQSGWSSSSPERSA------------------------------WE-------
             F C                    P+IKF+S WNK +N I EF  + W    PE  A                              WE       
Subjt:  -----FKC--------------------PEIKFSSSWNKLQNPISEFVQSGWSSSSPERSA------------------------------WE-------

Query:  ------------AFFSELKSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTKQV-ACEKECDELRKANSSLVQENERLQLEIKQGLL
                     F  ++KS +  + TTE Y  W A+R   + ++P       ++  ++P+ + K+    EKE   LR+ N  L  ENE+L+ E+K+ + 
Subjt:  ------------AFFSELKSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTKQV-ACEKECDELRKANSSLVQENERLQLEIKQGLL

Query:  CNAELEKELNRLK--------GSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEV
              ++L+  K        G    Q  +E+  S L  EA   N  + +    SE SQ+ +  +  +V+    + A L E  +E+     +        
Subjt:  CNAELEKELNRLK--------GSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEV

Query:  EEKNETLCRTIDDMQLTLKIREDQLGELIN-DNKGSKRVRSVTQCTPRTRIMEEQST-------EMEKTKKDIEELREKMDAILVALERGKIIPDIAQSS
        +++ ETL R  DD    ++   ++L ++    ++ S + R++ Q    TR  +++ +       +    ++D+E L++++  IL  L  G+    +  SS
Subjt:  EEKNETLCRTIDDMQLTLKIREDQLGELIN-DNKGSKRVRSVTQCTPRTRIMEEQST-------EMEKTKKDIEELREKMDAILVALERGKIIPDIAQSS

Query:  STMNDPPIRQSTE-------GTTPKYHPLYNIPVEQHPFPF-------FKNEQVHVHNQPGFSLPTKVPPKMTIIVPNLDDPEIRKELTGDATKLCLAPD
            D  + Q  E       G TP+      +    +P  F          ++ H +N           P  T+I   ++  +   E  G   +L     
Subjt:  STMNDPPIRQSTE-------GTTPKYHPLYNIPVEQHPFPF-------FKNEQVHVHNQPGFSLPTKVPPKMTIIVPNLDDPEIRKELTGDATKLCLAPD

Query:  VILPPKFKVPE-FEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKK
          L  + +V E FEKY+G SCPK+HL+MYCRKM+AY  +DKLLIHCFQDSL+ P SRWYMQLD + +  WK+LADSFLKQYK+NIDMAPDRLDLQRMEKK
Subjt:  VILPPKFKVPE-FEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKK

Query:  STESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIKGVRSKKKEGEV
        + E+FKEYAQRWR+ AAQVQPPL DKEL+ M INTL++P+YD+M+GSASTNFSD++TI ERIE+G+K+G+I D A  +  +  +  KKKEGEV
Subjt:  STESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIKGVRSKKKEGEV

A0A5A7T1W2 Retrotrans_gag domain-containing protein7.9e-15237.89Show/hide
Query:  WEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYGH+A LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEY+++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  KQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETF-----------------------
        K +K KG    +P DY+  + Q + +EDK LTLLALCI+  V+FPK  GYV+ +++KLF ++E GV+PIIP+LAETF                       
Subjt:  KQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETF-----------------------

Query:  ------------------QFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS-------------------GQFH------------
                          +F+CP + FSS WN ++N ISEF  + W  + P + AW +FF++L S                   G FH            
Subjt:  ------------------QFKCPEIKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKS-------------------GQFH------------

Query:  ------------------------------DDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLE
                                      +  T  Y+ W ANR K ++ +S ++  + K  +   P+Q  ++        EL + N  L QENE+L+ E
Subjt:  ------------------------------DDTTEAYKTWHANRAKTVL-VSPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENERLQLE

Query:  IKQGLLCNAELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRN
          Q +     L+ EL + K  +  Q++LE ++  LD E R +N+    ++ +    Q T+        +S   ++LK  ++   LH  +  L++      
Subjt:  IKQGLLCNAELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRN

Query:  QTITEVEEKN---------------------ETLCRTIDDMQLTLKIREDQLGELINDNKGSKRVRSVTQCTPRTRIMEEQSTEMEKTKKDIEELREKMD
        Q   E E  N                     E + +TI+ +++  K R +   E          +R       +++IMEE+  +M+K +++I  L E++ 
Subjt:  QTITEVEEKN---------------------ETLCRTIDDMQLTLKIREDQLGELINDNKGSKRVRSVTQCTPRTRIMEEQSTEMEKTKKDIEELREKMD

Query:  AI--LVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVHVHNQPGFSLPTKVPPKMTIIVPNL---DDPEIRKELTG
         I  L+++ +GK   D  QSS+ + D        G TP YH   N+P  +     F              L  +   K+ ++   L   ++ ++   +  
Subjt:  AI--LVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVHVHNQPGFSLPTKVPPKMTIIVPNL---DDPEIRKELTG

Query:  DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDR
        DAT+LCL P +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSL  P SRWY+QLDN HI  WK+LAD+FLKQYK NIDMAPDR
Subjt:  DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDR

Query:  LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKE
        LDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++M +NTL++PFY++MIG+ASTNFSDI+ I ERIEYGIKHG++ +       I KG  SKKKE
Subjt:  LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKE

Query:  GEVQMIGF
        GEV  IGF
Subjt:  GEVQMIGF

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222313.2e-10054.79Show/hide
Query:  EEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKNE
        +++ +E EKT+KDIEELREK+DAIL+ALE+GK    IA++S+ +++PP  Q   G  P                      Y+PLY+IP  Q P P  +  
Subjt:  EEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKNE

Query:  QVHVHNQPG---FSLPTKVPPKMTIIVPNLDDPEIRKE---------------------LTG-------DATKLCLAPDVILPPKFKVPEFEKYDGASCP
               P    F  P +V    T++  NL  P+  K+                     + G       DA++LCL   +++PPKFK+PEFEKYDG+SCP
Subjt:  QVHVHNQPG---FSLPTKVPPKMTIIVPNLDDPEIRKE---------------------LTG-------DATKLCLAPDVILPPKFKVPEFEKYDGASCP

Query:  KNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPP
        KNHLIMYCRKMAAY+QNDKLLIHCFQDSL GP S WYM LD+ H+ SWKNLADSFLKQYKHNIDM  DRLDLQ MEKK+ ESFKEY QRWRDTAAQ QPP
Subjt:  KNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPP

Query:  LADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIK-GVRSKKKEGEVQMIG
          DKELS+M INTLK PFYD+MIGSAST+FSDI+TI ERIEYG+ HG+I  T G S+  K    SKKKEGEVQM+G
Subjt:  LADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAIK-GVRSKKKEGEVQMIG

A0A6J1DZ90 Ribonuclease H4.7e-10458.73Show/hide
Query:  MEEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKNEQ----
        ME+Q  E EKT+KDIEELREK+D I + LE+GK   D A SS+ +++            PP+R   EG  P+   Y+PLY++P+ Q+P  F K  Q    
Subjt:  MEEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKNEQ----

Query:  --VHVHNQPGFSLPT-----KVPPKMTIIVPNLDDPEIRKELTG--------------DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM
          +    +   S PT       P K   +  N    E  + L                DA++LCL   +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKM
Subjt:  --VHVHNQPGFSLPT-----KVPPKMTIIVPNLDDPEIRKELTG--------------DATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM

Query:  AAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCI
         AYVQN KLLIHCFQDSL G  SRWYMQLD++H+ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEK STESFKEYAQRWRDTAAQVQPPL DKELS M I
Subjt:  AAYVQNDKLLIHCFQDSLIGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCI

Query:  NTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKEGEV
        NTLK PFYD+MIGSASTNFSDIMTI ERIEYG++H +I  TA    A  K   SKKKEGE+
Subjt:  NTLKSPFYDKMIGSASTNFSDIMTIRERIEYGIKHGKIIDTAGSSTAI-KGVRSKKKEGEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAGATGTTATTTCTCAATTTAAGGATGTCAGTTTTCCAGAAAGTCAATTAGAAACAGTGAAACAGGC
TTGGGAAAAATTAACTATAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCGGCTCATGTATGTACAAGTCAATTATTCTGTATTAAAAGCTTTGATTC
GACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATCGACATGACTCCTACAATAGAGGAATATAAGTCCCTTCTGCATATGCCAGCACGAACAGAGGTTGAA
GCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACATTGAGAAACAAGTAAAGATAAAAGGAGAGAACAC
ATGTCTACCCCTGGACTACATTTTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGAATTAACTTTACTGGCATTGTGTATCTTTAATGTTGTTTTGTTTCCTAAAG
TATGTGGATATGTTGAGGAACGCATGGTCAAGTTGTTTGCTAAGATAGAAATAGGAGTTGATCCCATTATTCCTGTATTGGCAGAAACTTTTCAATTCAAATGTCCAGAG
ATCAAATTTTCAAGTTCCTGGAATAAGCTGCAAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTC
CGAACTAAAGAGTGGTCAGTTTCATGATGATACCACAGAGGCGTACAAAACATGGCATGCGAACAGAGCTAAAACCGTGCTTGTGTCACCAAAAATGAAAACCAAGATAA
AACTTAATGCAAAGGTGATACCAGATCAACAGACAAAACAAGTAGCATGCGAAAAGGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGG
CTGCAATTAGAAATAAAGCAAGGTTTGTTGTGCAATGCTGAACTAGAAAAAGAGTTGAACCGATTAAAGGGAAGTGTCAGCAAACAAGAACAGTTAGAAAAGGAAATTTC
AGCATTAGACACAGAGGCCCGCGACCTAAACAGAAGAATGCATCGATTAAGAAGGGATAGTGAAGTCTCCCAAGTAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGC
AACAATCTGAGATTGCCTCACTCCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGAGGAACCAAACGATTACTGAGGTAGAGGAAAAGAATGAAACGCTATGT
CGAACAATTGACGACATGCAATTAACGCTCAAGATTAGAGAAGATCAACTAGGGGAGCTCATCAACGACAACAAAGGGTCTAAGAGAGTCCGTTCAGTCACTCAATGTAC
GCCTCGGACAAGGATCATGGAAGAGCAAAGTACTGAGATGGAGAAAACAAAGAAAGATATTGAGGAGTTACGAGAAAAAATGGATGCCATTCTTGTCGCCCTAGAAAGAG
GCAAAATAATACCTGATATTGCTCAGTCCAGCAGTACAATGAATGACCCTCCAATCAGGCAATCAACAGAGGGTACTACTCCAAAATATCATCCATTGTACAATATTCCA
GTAGAGCAGCACCCATTTCCATTTTTCAAGAATGAGCAAGTGCATGTACACAATCAACCTGGATTTTCACTACCCACAAAGGTACCTCCCAAGATGACCATTATAGTTCC
CAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGATGCGACCAAGTTATGCTTGGCACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAA
AGTATGATGGAGCATCCTGCCCTAAAAACCATCTCATCATGTATTGCAGAAAGATGGCAGCATACGTCCAAAATGACAAACTGTTAATTCACTGCTTCCAGGATAGTCTT
ATTGGTCCAACATCTCGATGGTATATGCAGTTAGACAACACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGC
TCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGAAAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAG
ATAAGGAGTTGTCAACCATGTGTATTAATACTCTCAAATCTCCTTTCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTAGAGAGAGA
ATCGAGTACGGAATTAAGCATGGAAAGATAATTGATACGGCTGGATCATCGACAGCGATAAAGGGGGTTCGATCGAAGAAAAAAGAGGGAGAAGTTCAGATGATTGGTTT
CAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCAAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAGATGTTATTTCTCAATTTAAGGATGTCAGTTTTCCAGAAAGTCAATTAGAAACAGTGAAACAGGC
TTGGGAAAAATTAACTATAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCGGCTCATGTATGTACAAGTCAATTATTCTGTATTAAAAGCTTTGATTC
GACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATCGACATGACTCCTACAATAGAGGAATATAAGTCCCTTCTGCATATGCCAGCACGAACAGAGGTTGAA
GCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACATTGAGAAACAAGTAAAGATAAAAGGAGAGAACAC
ATGTCTACCCCTGGACTACATTTTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGAATTAACTTTACTGGCATTGTGTATCTTTAATGTTGTTTTGTTTCCTAAAG
TATGTGGATATGTTGAGGAACGCATGGTCAAGTTGTTTGCTAAGATAGAAATAGGAGTTGATCCCATTATTCCTGTATTGGCAGAAACTTTTCAATTCAAATGTCCAGAG
ATCAAATTTTCAAGTTCCTGGAATAAGCTGCAAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTC
CGAACTAAAGAGTGGTCAGTTTCATGATGATACCACAGAGGCGTACAAAACATGGCATGCGAACAGAGCTAAAACCGTGCTTGTGTCACCAAAAATGAAAACCAAGATAA
AACTTAATGCAAAGGTGATACCAGATCAACAGACAAAACAAGTAGCATGCGAAAAGGAATGTGATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGG
CTGCAATTAGAAATAAAGCAAGGTTTGTTGTGCAATGCTGAACTAGAAAAAGAGTTGAACCGATTAAAGGGAAGTGTCAGCAAACAAGAACAGTTAGAAAAGGAAATTTC
AGCATTAGACACAGAGGCCCGCGACCTAAACAGAAGAATGCATCGATTAAGAAGGGATAGTGAAGTCTCCCAAGTAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGC
AACAATCTGAGATTGCCTCACTCCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGAGGAACCAAACGATTACTGAGGTAGAGGAAAAGAATGAAACGCTATGT
CGAACAATTGACGACATGCAATTAACGCTCAAGATTAGAGAAGATCAACTAGGGGAGCTCATCAACGACAACAAAGGGTCTAAGAGAGTCCGTTCAGTCACTCAATGTAC
GCCTCGGACAAGGATCATGGAAGAGCAAAGTACTGAGATGGAGAAAACAAAGAAAGATATTGAGGAGTTACGAGAAAAAATGGATGCCATTCTTGTCGCCCTAGAAAGAG
GCAAAATAATACCTGATATTGCTCAGTCCAGCAGTACAATGAATGACCCTCCAATCAGGCAATCAACAGAGGGTACTACTCCAAAATATCATCCATTGTACAATATTCCA
GTAGAGCAGCACCCATTTCCATTTTTCAAGAATGAGCAAGTGCATGTACACAATCAACCTGGATTTTCACTACCCACAAAGGTACCTCCCAAGATGACCATTATAGTTCC
CAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGATGCGACCAAGTTATGCTTGGCACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAA
AGTATGATGGAGCATCCTGCCCTAAAAACCATCTCATCATGTATTGCAGAAAGATGGCAGCATACGTCCAAAATGACAAACTGTTAATTCACTGCTTCCAGGATAGTCTT
ATTGGTCCAACATCTCGATGGTATATGCAGTTAGACAACACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGC
TCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGAAAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAG
ATAAGGAGTTGTCAACCATGTGTATTAATACTCTCAAATCTCCTTTCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTAGAGAGAGA
ATCGAGTACGGAATTAAGCATGGAAAGATAATTGATACGGCTGGATCATCGACAGCGATAAAGGGGGTTCGATCGAAGAAAAAAGAGGGAGAAGTTCAGATGATTGGTTT
CAATTGA
Protein sequenceShow/hide protein sequence
MVKNVQQIHGDSLVEDVISQFKDVSFPESQLETVKQAWEKLTIDRKAKFTSKYGHLARLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYKSLLHMPARTEVE
AYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKGENTCLPLDYIFTLQQKFANEDKELTLLALCIFNVVLFPKVCGYVEERMVKLFAKIEIGVDPIIPVLAETFQFKCPE
IKFSSSWNKLQNPISEFVQSGWSSSSPERSAWEAFFSELKSGQFHDDTTEAYKTWHANRAKTVLVSPKMKTKIKLNAKVIPDQQTKQVACEKECDELRKANSSLVQENER
LQLEIKQGLLCNAELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDSEVSQVTLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEVEEKNETLC
RTIDDMQLTLKIREDQLGELINDNKGSKRVRSVTQCTPRTRIMEEQSTEMEKTKKDIEELREKMDAILVALERGKIIPDIAQSSSTMNDPPIRQSTEGTTPKYHPLYNIP
VEQHPFPFFKNEQVHVHNQPGFSLPTKVPPKMTIIVPNLDDPEIRKELTGDATKLCLAPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL
IGPTSRWYMQLDNTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMCINTLKSPFYDKMIGSASTNFSDIMTIRER
IEYGIKHGKIIDTAGSSTAIKGVRSKKKEGEVQMIGFN