; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010772 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010772
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr1:6057067..6062080
RNA-Seq ExpressionLag0010772
SyntenyLag0010772
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]1.0e-13330.09Show/hide
Query:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYG++A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  K---------------------------------------------------------------------------------------------------
        K                                                                                                   
Subjt:  K---------------------------------------------------------------------------------------------------

Query:  ----------------------QRSNFQVP----------------------------------------------------------------------
                               R +F  P                                                                      
Subjt:  ----------------------QRSNFQVP----------------------------------------------------------------------

Query:  ----------------------GISCKSHF--------------RVRAIRSTDRTSSTRKECD------------ELRKANSSLVQENERLQLEVKQGLL
                               I  K H+              R   I  +       KE              EL + N  L QENE+L+ E  Q + 
Subjt:  ----------------------GISCKSHF--------------RVRAIRSTDRTSSTRKECD------------ELRKANSSLVQENERLQLEVKQGLL

Query:  RNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEE
            L+ EL + K  +  Q++LE ++  LD E R +N+    ++ +    QAT+        +S   ++LK  ++   LH  +  L++     ++ IT+E
Subjt:  RNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEE

Query:  ------QYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVITPTQPNSKNNHKIARSPRIHRTYVTRYRTRIMEEQSTEMEKTR
               Y ++  D+    ++   L  + +  +  +R V++RA+  AE A   S   TP                I   Y TRY+++IMEE+  +M+K R
Subjt:  ------QYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVITPTQPNSKNNHKIARSPRIHRTYVTRYRTRIMEEQSTEMEKTR

Query:  KDIEELREKMDAI--LVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEKRANGR-RESSSSEK
        ++I  L E++  I  L+++ +GK   D  Q++N + D        G TP YH                            ++P  K  N       S +K
Subjt:  KDIEELREKMDAI--LVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEKRANGR-RESSSSEK

Query:  LEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWK
        L+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSLT PASRWY+QLD+ HI  WK
Subjt:  LEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWK

Query:  NLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR-
        +LAD+FLKQYK NIDMAPDRLDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ I ER   G +  R 
Subjt:  NLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNA--------
                                P +PPYPKWYD NARCDYHAG +GHSTENC ALK  VQ+LI AGWL+FKK     +VN NPLP+ +N         
Subjt:  ------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNA--------

Query:  ---------QGILSTNVSFSFEGPGVVRFTFLTVSQKMTQLPQYGEVDIIEECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQKQYLGT
                 + ++     F     G V   +L  + +     +  + +  E C    ++   +      K    N +       +    + K     L  
Subjt:  ---------QGILSTNVSFSFEGPGVVRFTFLTVSQKMTQLPQYGEVDIIEECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQKQYLGT

Query:  MNIRSNPCKDFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL
         ++  +   +  +G +     ++S+ FTDDE+PPEG GHTKALHI +KCK++ +A+VLVDNGS+LNIMPKSTL
Subjt:  MNIRSNPCKDFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL

XP_022143495.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]5.5e-14345.55Show/hide
Query:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD
        + + S+EK EVL+ERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL+ PASRWYMQLD
Subjt:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD

Query:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES
        S+H+ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+M+GSASTNFSDIM I ER  
Subjt:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES

Query:  TGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPN
         G R  R                              P +PPYP+W D NARCDYH GAIGHS ENCTALK+RVQALIKAGWLNFKKENGP+V+NNPLPN
Subjt:  TGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPN

Query:  HQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK-----------------------------------------MTQLPQYGE
        H N Q                 ++T +   FE   G G V   +L  + K                                         +    Q   
Subjt:  HQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK-----------------------------------------MTQLPQYGE

Query:  VDIIE----------ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP------------------------------------------------
        ++++E          E S  +LKPK LTI Y EKP  PN   +PITI +P P                                                
Subjt:  VDIIE----------ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP------------------------------------------------

Query:  --------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK-----------
                                                                            L +  Q +Y  T  +   P K           
Subjt:  --------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK-----------

Query:  ---------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL
                             D  +  V     +SS+TFTD+E+PPEGTGHTKALHI+VKCKNF +AKVLVDNGSSLNIMP+STL
Subjt:  ---------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]1.4e-13043.37Show/hide
Query:  EEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKN-
        +++ +E EKTRKDIEELREK+DAIL+ALE+GK    IA+T+N +++PP  Q   G  P                      Y+PLY+IP  Q P P  +  
Subjt:  EEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKN-

Query:  EQVP---VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIM
         Q P   +  +P    + PT     G        R+E+ SSEKLEVLEERLRAVEGTDVFGNIDA++LCL   +++PPKFK+PEFEKY+G+SCPKNHLIM
Subjt:  EQVP---VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIM

Query:  YCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKEL
        YCRKMAAY+QNDKLLIHCFQDSL+GP S WYM LDS H+ SWKNLADSFLKQYKHNIDM  DRLDLQ MEKK+ ESFKEY QRWRDTAAQ QPP  DKEL
Subjt:  YCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKEL

Query:  STMFINTLKSPFYDKMIGSASTNFSDIMTIERE-------------------------------------------------------------------
        S+MFINTLK PFYD+MIGSAST+FSDI+TI                                                                      
Subjt:  STMFINTLKSPFYDKMIGSASTNFSDIMTIERE-------------------------------------------------------------------

Query:  -----------------------------------STGTRANR----------------------------------PSKPPYPKWYDPNARCDYHAGAI
                                           + G + NR                                  P +PPYP WYD N RCDYHAGAI
Subjt:  -----------------------------------STGTRANR----------------------------------PSKPPYPKWYDPNARCDYHAGAI

Query:  GHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGILSTNVSFSFEGPGVVRFTFLTVSQKM--------------------
        GHSTENCTALK+RVQALIKAG L FKKEN PDV NNPLPNH+N        QGI S +       P    F  L     M                    
Subjt:  GHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGILSTNVSFSFEGPGVVRFTFLTVSQKM--------------------

Query:  --------------------------------TQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK
                                        TQ  Q   +D++E     E S  + KPKPLT+ YREKP  P++  RPITIQ+P P      K
Subjt:  --------------------------------TQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]2.8e-14753.25Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKN-EQVP-
        ME+Q  E EKTRKDIEELREK+D I + LE+GK   D A ++N +++            PP+R   EG  P+   Y+PLY++P+ Q+P  F K  +Q+P 
Subjt:  MEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKN-EQVP-

Query:  --VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM
          +  +P    S PT     G         + + S EK EVLEERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKM
Subjt:  --VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM

Query:  AAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFI
         AYVQN KLLIHCFQDSL G ASRWYMQLDS+H+ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEK STESFKEYAQRWRDTAAQVQPPL DKELS MFI
Subjt:  AAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFI

Query:  NTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTAL
        NTLK PFYD+MIGSASTNFSDIMTI ER   G R  R                              P +P YP+WYD NARCDYHAGAIGHSTENCTAL
Subjt:  NTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTAL

Query:  KHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGI--------LSTNVSFSFE---GPGVVRFTFLTVSQKMTQLP---------------
        K+RVQAL+KAGWLNFKKEN PDV+ NPL NHQN        QGI        + T     FE   G G V   +L  + K  +                 
Subjt:  KHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGI--------LSTNVSFSFE---GPGVVRFTFLTVSQKMTQLP---------------

Query:  --------------------------QYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK
                                  Q   ++++E     E S  +LKPK LTI Y EKP  P+   +PITI +P P   K  K
Subjt:  --------------------------QYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]9.7e-13239.82Show/hide
Query:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD
        + + S+EK EVLEERLRA+EGT VFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL+GPASRWYMQLD
Subjt:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD

Query:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES
        S+++ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+MIG+ASTNFSDIMTI ER  
Subjt:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES

Query:  TGTRANR---------------------------------------------------------------------------------------------
         G R  R                                                                                             
Subjt:  TGTRANR---------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLN
                                                             P +PPYP+WYD NARCDYHAGAIGHSTENCTALK+RVQALIKAGWLN
Subjt:  -----------------------------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLN

Query:  FKKENGPDVNNNPLPNHQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK----------------------------------
        FKKENGPDV+ NPLPNHQN Q                 + T +   FE   G G V   +L  + K                                  
Subjt:  FKKENGPDVNNNPLPNHQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK----------------------------------

Query:  -------MTQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP-------------------------------------
               +    Q   ++I+E     E S  +LKPK LTI Y EKP+ PN   +PITI +P P                                     
Subjt:  -------MTQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP-------------------------------------

Query:  -------------------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK
                                                                                       L +  Q +Y     +   P K
Subjt:  -------------------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK

Query:  --------------------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL
                                        D  +  V     ASS+TFTD+E+PPEGTGHTKALHI++KCKNF +AKVLVDNGSSLNIMP+STL
Subjt:  --------------------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL

TrEMBL top hitse value%identityAlignment
A0A5A7T1W2 Retrotrans_gag domain-containing protein5.0e-13430.09Show/hide
Query:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYG++A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  K---------------------------------------------------------------------------------------------------
        K                                                                                                   
Subjt:  K---------------------------------------------------------------------------------------------------

Query:  ----------------------QRSNFQVP----------------------------------------------------------------------
                               R +F  P                                                                      
Subjt:  ----------------------QRSNFQVP----------------------------------------------------------------------

Query:  ----------------------GISCKSHF--------------RVRAIRSTDRTSSTRKECD------------ELRKANSSLVQENERLQLEVKQGLL
                               I  K H+              R   I  +       KE              EL + N  L QENE+L+ E  Q + 
Subjt:  ----------------------GISCKSHF--------------RVRAIRSTDRTSSTRKECD------------ELRKANSSLVQENERLQLEVKQGLL

Query:  RNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEE
            L+ EL + K  +  Q++LE ++  LD E R +N+    ++ +    QAT+        +S   ++LK  ++   LH  +  L++     ++ IT+E
Subjt:  RNVELEKELNRLKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATL--------KSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEE

Query:  ------QYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVITPTQPNSKNNHKIARSPRIHRTYVTRYRTRIMEEQSTEMEKTR
               Y ++  D+    ++   L  + +  +  +R V++RA+  AE A   S   TP                I   Y TRY+++IMEE+  +M+K R
Subjt:  ------QYDRLSDDFGFARQNHATLRSKAEHMLTQIRRVTRRADELAEDARTLSKVITPTQPNSKNNHKIARSPRIHRTYVTRYRTRIMEEQSTEMEKTR

Query:  KDIEELREKMDAI--LVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEKRANGR-RESSSSEK
        ++I  L E++  I  L+++ +GK   D  Q++N + D        G TP YH                            ++P  K  N       S +K
Subjt:  KDIEELREKMDAI--LVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEKRANGR-RESSSSEK

Query:  LEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWK
        L+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++CP++HLIMYCRKMA ++ NDKLL+HCFQDSLT PASRWY+QLD+ HI  WK
Subjt:  LEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWK

Query:  NLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR-
        +LAD+FLKQYK NIDMAPDRLDLQRMEKKS+ESFKEYAQRWRD AA+VQPPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ I ER   G +  R 
Subjt:  NLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNA--------
                                P +PPYPKWYD NARCDYHAG +GHSTENC ALK  VQ+LI AGWL+FKK     +VN NPLP+ +N         
Subjt:  ------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNA--------

Query:  ---------QGILSTNVSFSFEGPGVVRFTFLTVSQKMTQLPQYGEVDIIEECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQKQYLGT
                 + ++     F     G V   +L  + +     +  + +  E C    ++   +      K    N +       +    + K     L  
Subjt:  ---------QGILSTNVSFSFEGPGVVRFTFLTVSQKMTQLPQYGEVDIIEECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQKQYLGT

Query:  MNIRSNPCKDFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL
         ++  +   +  +G +     ++S+ FTDDE+PPEG GHTKALHI +KCK++ +A+VLVDNGS+LNIMPKSTL
Subjt:  MNIRSNPCKDFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL

A0A6J1CNY7 Ribonuclease H1.6e-14345.69Show/hide
Query:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD
        + + S+EK EVL+ERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL+ PASRWYMQLD
Subjt:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD

Query:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES
        S+H+ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+M+GSASTNFSDIM I ER  
Subjt:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES

Query:  TGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPN
         G R  R                              P +PPYP+W D NARCDYH GAIGHS ENCTALK+RVQALIKAGWLNFKKENGPDV+NNPLPN
Subjt:  TGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPN

Query:  HQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK-----------------------------------------MTQLPQYGE
        H N Q                 ++T +   FE   G G V   +L  + K                                         +    Q   
Subjt:  HQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK-----------------------------------------MTQLPQYGE

Query:  VDIIE----------ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP------------------------------------------------
        ++++E          E S  +LKPK LTI Y EKP  PN   +PITI +P P                                                
Subjt:  VDIIE----------ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP------------------------------------------------

Query:  --------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK-----------
                                                                            L +  Q +Y  T  +   P K           
Subjt:  --------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK-----------

Query:  ---------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL
                             D  +  V     +SS+TFTD+E+PPEGTGHTKALHI+VKCKNF +AKVLVDNGSSLNIMP+STL
Subjt:  ---------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222314.0e-13143.52Show/hide
Query:  EEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKN-
        +++ +E EKTRKDIEELREK+DAIL+ALE+GK    IA+T+N +++PP  Q   G  P                      Y+PLY+IP  Q P P  +  
Subjt:  EEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMNDPPIRQSTEGTTPK---------------------YHPLYNIPVEQHPFPFFKN-

Query:  EQVP---VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIM
         Q P   +  +P    + PT     G        R+E+ SSEKLEVLEERLRAVEGTDVFGNIDA++LCL   +++PPKFK+PEFEKYDG+SCPKNHLIM
Subjt:  EQVP---VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIM

Query:  YCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKEL
        YCRKMAAY+QNDKLLIHCFQDSL+GP S WYM LDS H+ SWKNLADSFLKQYKHNIDM  DRLDLQ MEKK+ ESFKEY QRWRDTAAQ QPP  DKEL
Subjt:  YCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKEL

Query:  STMFINTLKSPFYDKMIGSASTNFSDIMTIERE-------------------------------------------------------------------
        S+MFINTLK PFYD+MIGSAST+FSDI+TI                                                                      
Subjt:  STMFINTLKSPFYDKMIGSASTNFSDIMTIERE-------------------------------------------------------------------

Query:  -----------------------------------STGTRANR----------------------------------PSKPPYPKWYDPNARCDYHAGAI
                                           + G + NR                                  P +PPYP WYD N RCDYHAGAI
Subjt:  -----------------------------------STGTRANR----------------------------------PSKPPYPKWYDPNARCDYHAGAI

Query:  GHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGILSTNVSFSFEGPGVVRFTFLTVSQKM--------------------
        GHSTENCTALK+RVQALIKAG L FKKEN PDV NNPLPNH+N        QGI S +       P    F  L     M                    
Subjt:  GHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGILSTNVSFSFEGPGVVRFTFLTVSQKM--------------------

Query:  --------------------------------TQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK
                                        TQ  Q   +D++E     E S  + KPKPLT+ YREKP  P++  RPITIQ+P P      K
Subjt:  --------------------------------TQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK

A0A6J1DZ90 Ribonuclease H1.4e-14753.25Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKN-EQVP-
        ME+Q  E EKTRKDIEELREK+D I + LE+GK   D A ++N +++            PP+R   EG  P+   Y+PLY++P+ Q+P  F K  +Q+P 
Subjt:  MEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMND------------PPIRQSTEGTTPK---YHPLYNIPVEQHPFPFFKN-EQVP-

Query:  --VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM
          +  +P    S PT     G         + + S EK EVLEERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKM
Subjt:  --VHNQPG--FSLPTEKRANG--------RRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKM

Query:  AAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFI
         AYVQN KLLIHCFQDSL G ASRWYMQLDS+H+ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEK STESFKEYAQRWRDTAAQVQPPL DKELS MFI
Subjt:  AAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFI

Query:  NTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTAL
        NTLK PFYD+MIGSASTNFSDIMTI ER   G R  R                              P +P YP+WYD NARCDYHAGAIGHSTENCTAL
Subjt:  NTLKSPFYDKMIGSASTNFSDIMTI-ERESTGTRANR------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTAL

Query:  KHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGI--------LSTNVSFSFE---GPGVVRFTFLTVSQKMTQLP---------------
        K+RVQAL+KAGWLNFKKEN PDV+ NPL NHQN        QGI        + T     FE   G G V   +L  + K  +                 
Subjt:  KHRVQALIKAGWLNFKKENGPDVNNNPLPNHQN-------AQGI--------LSTNVSFSFE---GPGVVRFTFLTVSQKMTQLP---------------

Query:  --------------------------QYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK
                                  Q   ++++E     E S  +LKPK LTI Y EKP  P+   +PITI +P P   K  K
Subjt:  --------------------------QYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQK

A0A6J1E2J7 Ribonuclease H4.7e-13239.82Show/hide
Query:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD
        + + S+EK EVLEERLRA+EGT VFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL+GPASRWYMQLD
Subjt:  RESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLD

Query:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES
        S+++ SWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+MIG+ASTNFSDIMTI ER  
Subjt:  STHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTI-ERES

Query:  TGTRANR---------------------------------------------------------------------------------------------
         G R  R                                                                                             
Subjt:  TGTRANR---------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLN
                                                             P +PPYP+WYD NARCDYHAGAIGHSTENCTALK+RVQALIKAGWLN
Subjt:  -----------------------------------------------------PSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLN

Query:  FKKENGPDVNNNPLPNHQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK----------------------------------
        FKKENGPDV+ NPLPNHQN Q                 + T +   FE   G G V   +L  + K                                  
Subjt:  FKKENGPDVNNNPLPNHQNAQ---------------GILSTNVSFSFE---GPGVVRFTFLTVSQK----------------------------------

Query:  -------MTQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP-------------------------------------
               +    Q   ++I+E     E S  +LKPK LTI Y EKP+ PN   +PITI +P P                                     
Subjt:  -------MTQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIP-------------------------------------

Query:  -------------------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK
                                                                                       L +  Q +Y     +   P K
Subjt:  -------------------------------------------------------------------------------LNIKVQKQYLGTMNIRSNPCK

Query:  --------------------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL
                                        D  +  V     ASS+TFTD+E+PPEGTGHTKALHI++KCKNF +AKVLVDNGSSLNIMP+STL
Subjt:  --------------------------------DFHTGFVTGFRYASSVTFTDDELPPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAGCAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGGAAGGCTAAATTTACAAGCAAATATGGCTATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACAT
TGAGAAACAAAGATCAAATTTTCAAGTTCCTGGAATAAGCTGCAAATCCCATTTCAGAGTTCGTGCAATCAGATCAACAGACAGAACAAGCAGCACGCGAAAAGAATGTG
ATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGCTGCAATTGGAGGTAAAGCAAGGTTTGTTGCGCAATGTTGAACTAGAAAAAGAGTTGAACCGA
TTAAAGGGCAGTGTCAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCATTAGACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAATGA
AGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATCTGAGATTGCCTCACTTCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGA
GGAACCAAACGATTACTGAGGAGCAGTACGACAGATTAAGCGATGATTTTGGGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAG
ATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAACACCTACACAGCCGAATAGCAAGAATAATCATAAGATAGCTCG
GTCACCTCGAATCCACCGCACCTACGTCACAAGATACAGGACAAGGATCATGGAAGAGCAAAGTACTGAGATGGAGAAAACAAGGAAAGATATTGAGGAGTTACGAGAAA
AAATGGATGCCATTCTTGTCGCCCTGGAAAGAGGCAAAATAATACCTGATATTGCTCAGACCAACAATACAATGAACGACCCTCCAATCCGGCAATCAACAGAGGGTACT
ACTCCAAAATATCATCCATTGTACAATATTCCAGTAGAGCAGCACCCGTTTCCATTTTTCAAGAATGAGCAAGTGCCTGTACACAATCAACCTGGATTTTCACTACCCAC
AGAGAAAAGAGCTAACGGGAGGAGAGAAAGTTCTTCTAGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATG
CGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAGAACCATCTCATCATGTAT
TGTAGGAAGATGGCAGCATACGTCCAAAATGACAAGCTGTTAATTCACTGCTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGACAGCACTCA
TATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAG
AAAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAGCTGTCAACCATGTTTATTAATACTCTCAAATCTCCT
TTCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTGAGAGAGAATCGACTGGCACCCGTGCCAATAGACCCAGTAAACCACCTTACCC
AAAGTGGTATGACCCAAATGCCCGTTGCGACTACCATGCAGGAGCAATTGGACATTCCACTGAAAACTGTACTGCACTCAAGCATAGGGTGCAAGCATTGATCAAGGCAG
GATGGTTGAACTTTAAGAAAGAAAATGGTCCAGATGTCAACAACAATCCTCTGCCAAACCATCAGAATGCACAAGGCATTCTATCGACCAATGTCTCATTTTCGTTTGAA
GGTCCAGGAGTTGTTAGATTCACATTTTTAACAGTTTCTCAAAAGATGACTCAGCTCCCTCAGTATGGGGAAGTTGATATTATAGAAGAATGCTCAAGGTTGTCTCTCAA
GCCAAAACCGTTAACAATTTCTTATCGCGAGAAGCCCAGTACCCCAAATTCCAAGCCAAGACCGATTACCATCCAGATTCCGATCCCTTTGAATATAAAAGTTCAAAAGC
AGTACCTTGGAACTATGAATATAAGGTCGAACCCCTGCAAAGATTTCCATACTGGCTTTGTTACTGGCTTCAGATACGCATCTTCAGTAACCTTCACAGATGATGAGTTA
CCACCAGAAGGCACCGGACACACTAAAGCCTTGCACATTACAGTTAAGTGCAAAAATTTTGCTGTGGCAAAAGTTCTAGTTGATAATGGTTCCTCCTTGAACATAATGCC
TAAATCCACGTTGAGAAATTGCCTGTTGATATGTCTCATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAGCAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGGAAGGCTAAATTTACAAGCAAATATGGCTATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACAT
TGAGAAACAAAGATCAAATTTTCAAGTTCCTGGAATAAGCTGCAAATCCCATTTCAGAGTTCGTGCAATCAGATCAACAGACAGAACAAGCAGCACGCGAAAAGAATGTG
ATGAATTGAGAAAAGCGAATTCATCATTGGTTCAAGAAAATGAAAGGCTGCAATTGGAGGTAAAGCAAGGTTTGTTGCGCAATGTTGAACTAGAAAAAGAGTTGAACCGA
TTAAAGGGCAGTGTCAGCAAACAAGAACAGTTAGAAAAAGAAATTTCAGCATTAGACACAGAGGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAATGA
AGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAACAATCTGAGATTGCCTCACTTCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGA
GGAACCAAACGATTACTGAGGAGCAGTACGACAGATTAAGCGATGATTTTGGGTTTGCGAGACAGAACCACGCGACACTACGAAGTAAAGCGGAACATATGCTCACTCAG
ATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAACACCTACACAGCCGAATAGCAAGAATAATCATAAGATAGCTCG
GTCACCTCGAATCCACCGCACCTACGTCACAAGATACAGGACAAGGATCATGGAAGAGCAAAGTACTGAGATGGAGAAAACAAGGAAAGATATTGAGGAGTTACGAGAAA
AAATGGATGCCATTCTTGTCGCCCTGGAAAGAGGCAAAATAATACCTGATATTGCTCAGACCAACAATACAATGAACGACCCTCCAATCCGGCAATCAACAGAGGGTACT
ACTCCAAAATATCATCCATTGTACAATATTCCAGTAGAGCAGCACCCGTTTCCATTTTTCAAGAATGAGCAAGTGCCTGTACACAATCAACCTGGATTTTCACTACCCAC
AGAGAAAAGAGCTAACGGGAGGAGAGAAAGTTCTTCTAGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATG
CGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAGAACCATCTCATCATGTAT
TGTAGGAAGATGGCAGCATACGTCCAAAATGACAAGCTGTTAATTCACTGCTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGACAGCACTCA
TATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACCGCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAG
AAAGCTTTAAAGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAGCTGTCAACCATGTTTATTAATACTCTCAAATCTCCT
TTCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTGAGAGAGAATCGACTGGCACCCGTGCCAATAGACCCAGTAAACCACCTTACCC
AAAGTGGTATGACCCAAATGCCCGTTGCGACTACCATGCAGGAGCAATTGGACATTCCACTGAAAACTGTACTGCACTCAAGCATAGGGTGCAAGCATTGATCAAGGCAG
GATGGTTGAACTTTAAGAAAGAAAATGGTCCAGATGTCAACAACAATCCTCTGCCAAACCATCAGAATGCACAAGGCATTCTATCGACCAATGTCTCATTTTCGTTTGAA
GGTCCAGGAGTTGTTAGATTCACATTTTTAACAGTTTCTCAAAAGATGACTCAGCTCCCTCAGTATGGGGAAGTTGATATTATAGAAGAATGCTCAAGGTTGTCTCTCAA
GCCAAAACCGTTAACAATTTCTTATCGCGAGAAGCCCAGTACCCCAAATTCCAAGCCAAGACCGATTACCATCCAGATTCCGATCCCTTTGAATATAAAAGTTCAAAAGC
AGTACCTTGGAACTATGAATATAAGGTCGAACCCCTGCAAAGATTTCCATACTGGCTTTGTTACTGGCTTCAGATACGCATCTTCAGTAACCTTCACAGATGATGAGTTA
CCACCAGAAGGCACCGGACACACTAAAGCCTTGCACATTACAGTTAAGTGCAAAAATTTTGCTGTGGCAAAAGTTCTAGTTGATAATGGTTCCTCCTTGAACATAATGCC
TAAATCCACGTTGAGAAATTGCCTGTTGATATGTCTCATATAA
Protein sequenceShow/hide protein sequence
MGIEDQATVRQWSENVQQIHGDSLVENVVSQFKDVSFPESQLEAVKQAWEKLTVDRKAKFTSKYGYLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQS
LLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQRSNFQVPGISCKSHFRVRAIRSTDRTSSTRKECDELRKANSSLVQENERLQLEVKQGLLRNVELEKELNR
LKGSVSKQEQLEKEISALDTEARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTITEEQYDRLSDDFGFARQNHATLRSKAEHMLTQ
IRRVTRRADELAEDARTLSKVITPTQPNSKNNHKIARSPRIHRTYVTRYRTRIMEEQSTEMEKTRKDIEELREKMDAILVALERGKIIPDIAQTNNTMNDPPIRQSTEGT
TPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEKRANGRRESSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMY
CRKMAAYVQNDKLLIHCFQDSLTGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDRLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSP
FYDKMIGSASTNFSDIMTIERESTGTRANRPSKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQGILSTNVSFSFE
GPGVVRFTFLTVSQKMTQLPQYGEVDIIEECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPIPLNIKVQKQYLGTMNIRSNPCKDFHTGFVTGFRYASSVTFTDDEL
PPEGTGHTKALHITVKCKNFAVAKVLVDNGSSLNIMPKSTLRNCLLICLI