; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005885 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005885
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:33030090..33035032
RNA-Seq ExpressionLag0005885
SyntenyLag0005885
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]1.2e-15030.84Show/hide
Query:  VRQWSENVQQIHGDS---LVEDAMSQFKDVSFPESQLETVKQAWEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V QWSE+ Q   G S    +E  + +   + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTI
Subjt:  VRQWSENVQQIHGDS---LVEDAMSQFKDVSFPESQLETVKQAWEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKERTHAYPWT-------TSLLFNKNLQMKT----------RIN---------
        EEYQ+L+ +P     + Y Y ++LT++R+LS  +G I  S+++KQ+K KE  +  P           LL  K L +            RI          
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKERTHAYPWT-------TSLLFNKNLQMKT----------RIN---------

Query:  -FTGIV---------------------TKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKV
         F GI                       +G G+F  CAP+L+IW+ SH++ P  F  P+IKF+S WNK RN I EF  + W    PE  A+         
Subjt:  -FTGIV---------------------TKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKV

Query:  EDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGRR---------
                                       G IAY+PLLV+RQ W +QF P    L+D+EF+Y+     D+I + V+AWK + K++S R          
Subjt:  EDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGRR---------

Query:  -------------------------------------------------------KEW-----NLCRTIDD--------------LQLTLKIREDQLGEL
                                                               K+W     N  R +D+              LQ +++  + QL E 
Subjt:  -------------------------------------------------------KEW-----NLCRTIDD--------------LQLTLKIREDQLGEL

Query:  INDNKGLRESVQSLNVRLGKYQDATDRLMKDYTYLKEQYDRLSDDFGFVRQNHATLRK------------------------------------------
         + N  L+++V S   +L   + A + +  DY  L E+Y  +S DF   +  + TLR+                                          
Subjt:  INDNKGLRESVQSLNVRLGKYQDATDRLMKDYTYLKEQYDRLSDDFGFVRQNHATLRK------------------------------------------

Query:  ---------------------------------------------GTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLPTEVP-PKVTITV---
                                                     GT+ +     N  +E  P   P F  ++         + PT  P P    T    
Subjt:  ---------------------------------------------GTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLPTEVP-PKVTITV---

Query:  PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLI
         + ++P     + GG+K+S    S  +LE LEERLR +E                            +FEKY+G SCPK+HL+MYCRKM+AY  +DKLLI
Subjt:  PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLI

Query:  HCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKM
        HCFQDSL+ PASRWYMQLD + +  WK+LADSFLKQYK+NIDMAPD LDLQRMEKK+ E+FKEYAQRWR+ AAQVQPPL DKEL+ MFINTL++P+YD+M
Subjt:  HCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKM

Query:  IGSASTNFSDIMTIGERIEYEIKHGKIID------RLDHRSEKGGSIEE--------------------------KRGRS--------------------
        +GSASTNFSD++TIGERIE+ +K+G+I D      R+    +K G + E                            G+S                    
Subjt:  IGSASTNFSDIMTIGERIEYEIKHGKIID------RLDHRSEKGGSIEE--------------------------KRGRS--------------------

Query:  -----------------------SDDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAID-
                                +  +PPYPKWYDPNA+C+YHAGA+GHSTENC  LK +VQ+L+KAGWL FKK    PDVN NPLPNH+   +NA+D 
Subjt:  -----------------------SDDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAID-

Query:  --------------------------------------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIEEC
                                                                            +VQ+L+DS  L + Q  +Q     EV++I   
Subjt:  --------------------------------------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIEEC

Query:  S-------RLSLKPKPLTISYREKPSTPN--SKPRPITIQIPTPFEYKSSKA-VPWNYEYKDIT-----------------VDNLDNIVGNITASSSVTF
        S         +   KPL I Y EKPS  +   KP+ +T++IP     +S +   P N   KD++                  D+L+++    T  +++  
Subjt:  S-------RLSLKPKPLTISYREKPSTPN--SKPRPITIQIPTPFEYKSSKA-VPWNYEYKDIT-----------------VDNLDNIVGNITASSSVTF

Query:  TD------------DELPPEGTGHTKALHITVKCKN
         +            +E+PPEGTGHTKALHI+VKCK+
Subjt:  TD------------DELPPEGTGHTKALHITVKCKN

KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]5.8e-18534.93Show/hide
Query:  WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYGH+A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  KQVKIKERTHAYPWT------------------------TSLLFNKN------------LQMKTRIN------------FTGIVTKG--------TGRFI
        K +K K      P+                          +++F K              QM+  +N                  KG         G+  
Subjt:  KQVKIKERTHAYPWT------------------------TSLLFNKN------------LQMKTRIN------------FTGIVTKG--------TGRFI

Query:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIA
         C PLLYIW+ SH+K P EF+CP + FSS WN +RN ISEF  + W  + P + AW +FF++L  E+V+W+A WM  + +IY+CG F S+PLLGPWG + 
Subjt:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIA

Query:  YAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFC-----------------------------------------------------KDRIQKIVKAW-K
        Y PLLV+RQ+W++QFIP TH LK  +  + +G                                                        ++++K    W  
Subjt:  YAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFC-----------------------------------------------------KDRIQKIVKAW-K

Query:  MITKIQSGRRKEWNLCRTIDDLQLTLKIREDQLGELINDNKGLRESVQSLNVRLGKYQDATDR----------------------------------LMK
          T +Q+   K  +  +  D L+  L+  + ++  +   N+ ++    +L   +G +    +R                                  L  
Subjt:  MITKIQSGRRKEWNLCRTIDDLQLTLKIREDQLGELINDNKGLRESVQSLNVRLGKYQDATDR----------------------------------LMK

Query:  DYTYLKEQYDRLSDDFGF----VRQNHATLR----------------KGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTI-----
        DY  +K  YD  + DF      V Q    LR                   TP  HP YN   +       K++ +    Q   +L  +V   + +     
Subjt:  DYTYLKEQYDRLSDDFGF----VRQNHATLR----------------KGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTI-----

Query:  ------------TVPNLDD--------------PEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS
                     + + DD              P ++   T   ++ S +KL+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++
Subjt:  ------------TVPNLDD--------------PEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS

Query:  CPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQ
        CP++HLIMYCRKMA ++ NDKLL+HCFQDSL  PASRWY+QLD+ HI  WK+LAD+FLKQYK NIDMAPD LDLQRMEKKS+ESFKEYAQRWRD AA+VQ
Subjt:  CPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQ

Query:  PPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIID-RLDHRSEKGGSIEEKR------------------------------
        PPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ IGERIEY IKHG++ +   ++   K G+I +K+                              
Subjt:  PPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIID-RLDHRSEKGGSIEEKR------------------------------

Query:  ---------------------------------------GRSSDD-------------------------------CKPPYPKWYDPNARCDYHAGAIGH
                                                +++ D                                +PPYPKWYD NARCDYHAG +GH
Subjt:  ---------------------------------------GRSSDD-------------------------------CKPPYPKWYDPNARCDYHAGAIGH

Query:  STENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAIDQVQELLDSHFLTVSQKMVQLPQYGEVDIIEECSRLSLKPKPLTISYREKPS
        STENC ALK  VQ+LI AGWL+FKK     +VN NPLP+ +N +VN +D + E   +    +   M  +  +G    + E   +S +     I Y  K  
Subjt:  STENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAIDQVQELLDSHFLTVSQKMVQLPQYGEVDIIEECSRLSLKPKPLTISYREKPS

Query:  TPNSKP--RPITIQIP---TPFEYKSSKAVPWNYEY-----------------------KDITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHI
          N+K   +   ++IP      EYK         E+                        DI+V+    I+GNIT+S+S+ FTDDE+PPEG GHTKALHI
Subjt:  TPNSKP--RPITIQIP---TPFEYKSSKAVPWNYEY-----------------------KDITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHI

Query:  TVKCKNFV
         +KCK++V
Subjt:  TVKCKNFV

XP_022143495.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]1.0e-16049.33Show/hide
Query:  GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQL
        G+   S+EK EVL+ERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL  PASRWYMQL
Subjt:  GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQL

Query:  DSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERI
        DS+H+ SWKNLADSFLKQYKHNIDMAPD LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+M+GSASTNFSDIM IGERI
Subjt:  DSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERI

Query:  EYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLP
        EY ++HG+I    D    ++K    ++K G  +    D  +PPYP+W D NARCDYH GAIGHS ENCTALK+RVQALIKAGWLNFKKENGP+V+NNPLP
Subjt:  EYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLP

Query:  NHQNAQVNAID---------------------------------------------------------------------QVQELLDSHFLTV--SQKMV
        NH N Q+NAI+                                                                      VQELLDS  LTV  S +  
Subjt:  NHQNAQVNAID---------------------------------------------------------------------QVQELLDSHFLTV--SQKMV

Query:  QLPQYGEVDIIE----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNYEYK-------------------------------
         +    +V + E    E S  +LKPK LTI Y EKP  PN   +PITI +P PFEYKSSKAVPW YE K                               
Subjt:  QLPQYGEVDIIE----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNYEYK-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK
                          DITVDNL N+VGNITASSS+TFTD+E+PPEGTGHTKALHI+VKCKNF+  K
Subjt:  ------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]2.6e-15357.09Show/hide
Query:  YHPLYNIPVEQHPFPFFKN-EQVPVHNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPD
        Y+PLY++P+ Q+P  F K  +Q+P +    F  P ++  PP    TV NL     + +    +   S EK EVLEERLRA+EGTDVFGNIDA++LCLV  
Subjt:  YHPLYNIPVEQHPFPFFKN-EQVPVHNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPD

Query:  VILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKS
        +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKM AYVQN KLLIHCFQDSL G ASRWYMQLDS+H+ SWKNLADSFLKQYKHNIDMAPD LDLQRMEK S
Subjt:  VILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKS

Query:  TESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DD
        TESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+MIGSASTNFSDIMTIGERIEY ++H +I    D    ++K    ++K G  +    D 
Subjt:  TESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DD

Query:  CKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID----------------------------
         +P YP+WYD NARCDYHAGAIGHSTENCTALK+RVQAL+KAGWLNFKKEN PDV+ NPL NHQN Q+NAI+                            
Subjt:  CKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID----------------------------

Query:  -----------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPN
                                                 +VQELLDS  LTV+       Q   ++++E     E S  +LKPK LTI Y EKP  P+
Subjt:  -----------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPN

Query:  SKPRPITIQIPTPFEYKSSKAVPWNYEYK
           +PITI +P PFEYKSSKAVPW Y+ K
Subjt:  SKPRPITIQIPTPFEYKSSKAVPWNYEYK

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]2.1e-15042.38Show/hide
Query:  TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHC
        TV NL     + +L  G+   S+EK EVLEERLRA+EGT VFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHC
Subjt:  TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHC

Query:  FQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIG
        FQDSL GPASRWYMQLDS+++ SWKNLADSFLKQYKHNIDMAPD LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+MIG
Subjt:  FQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIG

Query:  SASTNFSDIMTIGERIEYEIKHGKIIDRLD---------HRSEKGGSIE---------------------------------------------------
        +ASTNFSDIMTIGERIEY ++HG+I   +D         H  +K G ++                                                   
Subjt:  SASTNFSDIMTIGERIEYEIKHGKIIDRLD---------HRSEKGGSIE---------------------------------------------------

Query:  -----------------EKRGRSS---------------------------------------------DDCKPPYPKWYDPNARCDYHAGAIGHSTENC
                         + RG+                                               D  +PPYP+WYD NARCDYHAGAIGHSTENC
Subjt:  -----------------EKRGRSS---------------------------------------------DDCKPPYPKWYDPNARCDYHAGAIGHSTENC

Query:  TALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID---------------------------------------------------------
        TALK+RVQALIKAGWLNFKKENGPDV+ NPLPNHQN Q+NAI+                                                         
Subjt:  TALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID---------------------------------------------------------

Query:  ------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNY---
                    +VQELLDS  LTV+       Q   ++I+E     E S  +LKPK LTI Y EKP+ PN   +PITI +P PFEYKSSKAVPW Y   
Subjt:  ------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNY---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------EYK---------------------------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK
               EYK                                       DITVDNL N+VGNI+ +SS+TFTD+E+PPEGTGHTKALHI++KCKNF+  K
Subjt:  -------EYK---------------------------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK

TrEMBL top hitse value%identityAlignment
A0A5A7SUT0 Reverse transcriptase5.9e-15130.84Show/hide
Query:  VRQWSENVQQIHGDS---LVEDAMSQFKDVSFPESQLETVKQAWEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI
        V QWSE+ Q   G S    +E  + +   + F  + LE +K  WE L  +++AKF   YG +  L+Y  +N S L+AL   WDP  +CFTF + D+TPTI
Subjt:  VRQWSENVQQIHGDS---LVEDAMSQFKDVSFPESQLETVKQAWEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTI

Query:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKERTHAYPWT-------TSLLFNKNLQMKT----------RIN---------
        EEYQ+L+ +P     + Y Y ++LT++R+LS  +G I  S+++KQ+K KE  +  P           LL  K L +            RI          
Subjt:  EEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKERTHAYPWT-------TSLLFNKNLQMKT----------RIN---------

Query:  -FTGIV---------------------TKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKV
         F GI                       +G G+F  CAP+L+IW+ SH++ P  F  P+IKF+S WNK RN I EF  + W    PE  A+         
Subjt:  -FTGIV---------------------TKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKV

Query:  EDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGRR---------
                                       G IAY+PLLV+RQ W +QF P    L+D+EF+Y+     D+I + V+AWK + K++S R          
Subjt:  EDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCKDRIQKIVKAWKMITKIQSGRR---------

Query:  -------------------------------------------------------KEW-----NLCRTIDD--------------LQLTLKIREDQLGEL
                                                               K+W     N  R +D+              LQ +++  + QL E 
Subjt:  -------------------------------------------------------KEW-----NLCRTIDD--------------LQLTLKIREDQLGEL

Query:  INDNKGLRESVQSLNVRLGKYQDATDRLMKDYTYLKEQYDRLSDDFGFVRQNHATLRK------------------------------------------
         + N  L+++V S   +L   + A + +  DY  L E+Y  +S DF   +  + TLR+                                          
Subjt:  INDNKGLRESVQSLNVRLGKYQDATDRLMKDYTYLKEQYDRLSDDFGFVRQNHATLRK------------------------------------------

Query:  ---------------------------------------------GTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLPTEVP-PKVTITV---
                                                     GT+ +     N  +E  P   P F  ++         + PT  P P    T    
Subjt:  ---------------------------------------------GTTPKYHPLYNIPVEQHPF--PFFKNEQVPVHNQPGFSLPTEVP-PKVTITV---

Query:  PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLI
         + ++P     + GG+K+S    S  +LE LEERLR +E                            +FEKY+G SCPK+HL+MYCRKM+AY  +DKLLI
Subjt:  PNLDDPEIRKELTGGEKVS----SSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLI

Query:  HCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKM
        HCFQDSL+ PASRWYMQLD + +  WK+LADSFLKQYK+NIDMAPD LDLQRMEKK+ E+FKEYAQRWR+ AAQVQPPL DKEL+ MFINTL++P+YD+M
Subjt:  HCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKM

Query:  IGSASTNFSDIMTIGERIEYEIKHGKIID------RLDHRSEKGGSIEE--------------------------KRGRS--------------------
        +GSASTNFSD++TIGERIE+ +K+G+I D      R+    +K G + E                            G+S                    
Subjt:  IGSASTNFSDIMTIGERIEYEIKHGKIID------RLDHRSEKGGSIEE--------------------------KRGRS--------------------

Query:  -----------------------SDDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAID-
                                +  +PPYPKWYDPNA+C+YHAGA+GHSTENC  LK +VQ+L+KAGWL FKK    PDVN NPLPNH+   +NA+D 
Subjt:  -----------------------SDDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAID-

Query:  --------------------------------------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIEEC
                                                                            +VQ+L+DS  L + Q  +Q     EV++I   
Subjt:  --------------------------------------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIEEC

Query:  S-------RLSLKPKPLTISYREKPSTPN--SKPRPITIQIPTPFEYKSSKA-VPWNYEYKDIT-----------------VDNLDNIVGNITASSSVTF
        S         +   KPL I Y EKPS  +   KP+ +T++IP     +S +   P N   KD++                  D+L+++    T  +++  
Subjt:  S-------RLSLKPKPLTISYREKPSTPN--SKPRPITIQIPTPFEYKSSKA-VPWNYEYKDIT-----------------VDNLDNIVGNITASSSVTF

Query:  TD------------DELPPEGTGHTKALHITVKCKN
         +            +E+PPEGTGHTKALHI+VKCK+
Subjt:  TD------------DELPPEGTGHTKALHITVKCKN

A0A5A7T1W2 Retrotrans_gag domain-containing protein2.8e-18534.93Show/hide
Query:  WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE
        WE LT  R+  F+ KYGH+A+LMY+ VNY  L+A+I   DPAY CFTFGS D+ PTIEEYQ++L MP +     Y ++ + T KR LS  L  +  ++I+
Subjt:  WEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQSLLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIE

Query:  KQVKIKERTHAYPWT------------------------TSLLFNKN------------LQMKTRIN------------FTGIVTKG--------TGRFI
        K +K K      P+                          +++F K              QM+  +N                  KG         G+  
Subjt:  KQVKIKERTHAYPWT------------------------TSLLFNKN------------LQMKTRIN------------FTGIVTKG--------TGRFI

Query:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIA
         C PLLYIW+ SH+K P EF+CP + FSS WN +RN ISEF  + W  + P + AW +FF++L  E+V+W+A WM  + +IY+CG F S+PLLGPWG + 
Subjt:  GCAPLLYIWVLSHVKCPPEFKCPEIKFSSSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIA

Query:  YAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFC-----------------------------------------------------KDRIQKIVKAW-K
        Y PLLV+RQ+W++QFIP TH LK  +  + +G                                                        ++++K    W  
Subjt:  YAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFC-----------------------------------------------------KDRIQKIVKAW-K

Query:  MITKIQSGRRKEWNLCRTIDDLQLTLKIREDQLGELINDNKGLRESVQSLNVRLGKYQDATDR----------------------------------LMK
          T +Q+   K  +  +  D L+  L+  + ++  +   N+ ++    +L   +G +    +R                                  L  
Subjt:  MITKIQSGRRKEWNLCRTIDDLQLTLKIREDQLGELINDNKGLRESVQSLNVRLGKYQDATDR----------------------------------LMK

Query:  DYTYLKEQYDRLSDDFGF----VRQNHATLR----------------KGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTI-----
        DY  +K  YD  + DF      V Q    LR                   TP  HP YN   +       K++ +    Q   +L  +V   + +     
Subjt:  DYTYLKEQYDRLSDDFGF----VRQNHATLR----------------KGTTPKYHPLYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTI-----

Query:  ------------TVPNLDD--------------PEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS
                     + + DD              P ++   T   ++ S +KL+VLEERLRA+E TDV+GNIDAT+LCLVP +I+P KFKVPEF KYDG++
Subjt:  ------------TVPNLDD--------------PEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS

Query:  CPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQ
        CP++HLIMYCRKMA ++ NDKLL+HCFQDSL  PASRWY+QLD+ HI  WK+LAD+FLKQYK NIDMAPD LDLQRMEKKS+ESFKEYAQRWRD AA+VQ
Subjt:  CPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQ

Query:  PPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIID-RLDHRSEKGGSIEEKR------------------------------
        PPL DKE+++MF+NTL++PFY++MIG+ASTNFSDI+ IGERIEY IKHG++ +   ++   K G+I +K+                              
Subjt:  PPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIID-RLDHRSEKGGSIEEKR------------------------------

Query:  ---------------------------------------GRSSDD-------------------------------CKPPYPKWYDPNARCDYHAGAIGH
                                                +++ D                                +PPYPKWYD NARCDYHAG +GH
Subjt:  ---------------------------------------GRSSDD-------------------------------CKPPYPKWYDPNARCDYHAGAIGH

Query:  STENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAIDQVQELLDSHFLTVSQKMVQLPQYGEVDIIEECSRLSLKPKPLTISYREKPS
        STENC ALK  VQ+LI AGWL+FKK     +VN NPLP+ +N +VN +D + E   +    +   M  +  +G    + E   +S +     I Y  K  
Subjt:  STENCTALKHRVQALIKAGWLNFKKE-NGPDVNNNPLPNHQNAQVNAIDQVQELLDSHFLTVSQKMVQLPQYGEVDIIEECSRLSLKPKPLTISYREKPS

Query:  TPNSKP--RPITIQIP---TPFEYKSSKAVPWNYEY-----------------------KDITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHI
          N+K   +   ++IP      EYK         E+                        DI+V+    I+GNIT+S+S+ FTDDE+PPEG GHTKALHI
Subjt:  TPNSKP--RPITIQIP---TPFEYKSSKAVPWNYEY-----------------------KDITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHI

Query:  TVKCKNFV
         +KCK++V
Subjt:  TVKCKNFV

A0A6J1CNY7 Ribonuclease H2.8e-16149.48Show/hide
Query:  GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQL
        G+   S+EK EVL+ERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSL  PASRWYMQL
Subjt:  GEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQL

Query:  DSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERI
        DS+H+ SWKNLADSFLKQYKHNIDMAPD LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+M+GSASTNFSDIM IGERI
Subjt:  DSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERI

Query:  EYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLP
        EY ++HG+I    D    ++K    ++K G  +    D  +PPYP+W D NARCDYH GAIGHS ENCTALK+RVQALIKAGWLNFKKENGPDV+NNPLP
Subjt:  EYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLP

Query:  NHQNAQVNAID---------------------------------------------------------------------QVQELLDSHFLTV--SQKMV
        NH N Q+NAI+                                                                      VQELLDS  LTV  S +  
Subjt:  NHQNAQVNAID---------------------------------------------------------------------QVQELLDSHFLTV--SQKMV

Query:  QLPQYGEVDIIE----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNYEYK-------------------------------
         +    +V + E    E S  +LKPK LTI Y EKP  PN   +PITI +P PFEYKSSKAVPW YE K                               
Subjt:  QLPQYGEVDIIE----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNYEYK-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK
                          DITVDNL N+VGNITASSS+TFTD+E+PPEGTGHTKALHI+VKCKNF+  K
Subjt:  ------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK

A0A6J1DZ90 Ribonuclease H1.3e-15357.09Show/hide
Query:  YHPLYNIPVEQHPFPFFKN-EQVPVHNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPD
        Y+PLY++P+ Q+P  F K  +Q+P +    F  P ++  PP    TV NL     + +    +   S EK EVLEERLRA+EGTDVFGNIDA++LCLV  
Subjt:  YHPLYNIPVEQHPFPFFKN-EQVPVHNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPD

Query:  VILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKS
        +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKM AYVQN KLLIHCFQDSL G ASRWYMQLDS+H+ SWKNLADSFLKQYKHNIDMAPD LDLQRMEK S
Subjt:  VILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKS

Query:  TESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DD
        TESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+MIGSASTNFSDIMTIGERIEY ++H +I    D    ++K    ++K G  +    D 
Subjt:  TESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIIDRLDH--RSEKGGSIEEKRGRSS----DD

Query:  CKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID----------------------------
         +P YP+WYD NARCDYHAGAIGHSTENCTALK+RVQAL+KAGWLNFKKEN PDV+ NPL NHQN Q+NAI+                            
Subjt:  CKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID----------------------------

Query:  -----------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPN
                                                 +VQELLDS  LTV+       Q   ++++E     E S  +LKPK LTI Y EKP  P+
Subjt:  -----------------------------------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPN

Query:  SKPRPITIQIPTPFEYKSSKAVPWNYEYK
           +PITI +P PFEYKSSKAVPW Y+ K
Subjt:  SKPRPITIQIPTPFEYKSSKAVPWNYEYK

A0A6J1E2J7 Ribonuclease H1.0e-15042.38Show/hide
Query:  TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHC
        TV NL     + +L  G+   S+EK EVLEERLRA+EGT VFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHLIMYCRKMAAYVQNDKLLIHC
Subjt:  TVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHC

Query:  FQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIG
        FQDSL GPASRWYMQLDS+++ SWKNLADSFLKQYKHNIDMAPD LDLQRMEKKSTESFKEYAQRWRDTAAQVQPPL DKELS MFINTLK PFYD+MIG
Subjt:  FQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKELSTMFINTLKSPFYDKMIG

Query:  SASTNFSDIMTIGERIEYEIKHGKIIDRLD---------HRSEKGGSIE---------------------------------------------------
        +ASTNFSDIMTIGERIEY ++HG+I   +D         H  +K G ++                                                   
Subjt:  SASTNFSDIMTIGERIEYEIKHGKIIDRLD---------HRSEKGGSIE---------------------------------------------------

Query:  -----------------EKRGRSS---------------------------------------------DDCKPPYPKWYDPNARCDYHAGAIGHSTENC
                         + RG+                                               D  +PPYP+WYD NARCDYHAGAIGHSTENC
Subjt:  -----------------EKRGRSS---------------------------------------------DDCKPPYPKWYDPNARCDYHAGAIGHSTENC

Query:  TALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID---------------------------------------------------------
        TALK+RVQALIKAGWLNFKKENGPDV+ NPLPNHQN Q+NAI+                                                         
Subjt:  TALKHRVQALIKAGWLNFKKENGPDVNNNPLPNHQNAQVNAID---------------------------------------------------------

Query:  ------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNY---
                    +VQELLDS  LTV+       Q   ++I+E     E S  +LKPK LTI Y EKP+ PN   +PITI +P PFEYKSSKAVPW Y   
Subjt:  ------------QVQELLDSHFLTVSQKMVQLPQYGEVDIIE-----ECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNY---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------EYK---------------------------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK
               EYK                                       DITVDNL N+VGNI+ +SS+TFTD+E+PPEGTGHTKALHI++KCKNF+  K
Subjt:  -------EYK---------------------------------------DITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAGATGCTATGTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAACAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACAT
TGAGAAACAAGTAAAGATAAAGGAGAGAACACATGCCTACCCCTGGACTACATCCTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGATTAACTTTACTGGCATTG
TAACAAAAGGAACAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGTCCACCCGAATTCAAATGTCCAGAGATCAAATTTTCA
AGTTCCTGGAATAAGCTGCGAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTCCGAACTAAAGGT
AGAAGATGTAATGTGGAGAGCCCCTTGGATGTCAACTAGGCCAATGATCTATAAATGTGGTAAATTTCAAAGTTTACCTCTTTTGGGTCCTTGGGGATGTATAGCCTATG
CTCCTTTATTGGTAGTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTGCTTATGATAAAGGCTTCTGTAAAGATAGAATT
CAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGGTAGAAGAAAAGAATGGAACCTATGTCGAACAATTGACGACCTACAATTAACGCTCAAGATTAG
AGAAGATCAACTAGGGGAGCTCATCAACGACAACAAGGGTCTAAGAGAGTCCGTTCAGTCACTTAATGTTCGCCTCGGTAAGTATCAGGATGCCACTGATAGATTAATGA
AAGACTATACCTATTTAAAGGAGCAGTACGACAGATTAAGCGATGATTTTGGGTTTGTGAGACAGAACCACGCGACACTACGAAAGGGTACTACTCCAAAATATCATCCA
TTGTACAATATTCCAGTAGAGCAGCACCCATTTCCATTTTTCAAGAATGAGCAAGTGCCTGTACACAATCAACCTGGATTTTCACTACCCACAGAGGTACCTCCCAAGGT
GACCATTACAGTTCCCAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCTAGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGG
CAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGCGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGAT
GGAGCATCCTGCCCTAAAAACCATCTCATCATGTATTGCAGGAAGATGGCAGCATACGTCCAAAATGACAAACTGTTAATTCACTGCTTCCAGGACAGTCTTATTGGTCC
AGCATCTCGATGGTATATGCAGTTAGACAGCACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACC
TCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGAAAGCTTTAAGGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAG
CTGTCAACCATGTTTATTAATACTCTCAAATCTCCTTTCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTGGAGAGAGAATCGAGTA
CGAAATTAAGCATGGAAAGATAATTGATAGGCTGGATCATCGAAGCGAAAAAGGGGGTTCCATCGAAGAAAAAAGAGGGAGAAGTTCAGATGATTGTAAACCACCTTACC
CAAAGTGGTATGACCCAAATGCCCGTTGCGACTACCATGCAGGAGCAATTGGACATTCCACTGAAAACTGTACTGCACTCAAGCATAGGGTGCAAGCATTGATCAAGGCA
GGATGGTTGAACTTTAAGAAAGAAAATGGTCCAGATGTCAACAACAATCCTCTGCCAAACCATCAGAATGCACAAGTAAATGCGATAGATCAGGTCCAGGAGTTGTTAGA
TTCACATTTTTTAACAGTTTCTCAAAAGATGGTTCAGCTCCCTCAGTATGGGGAAGTTGATATTATAGAAGAATGCTCAAGGTTGTCTCTCAAGCCAAAACCGTTAACAA
TTTCTTATCGCGAGAAGCCCAGTACCCCAAATTCCAAGCCAAGACCGATTACCATCCAGATTCCGACTCCCTTTGAATATAAAAGTTCAAAAGCAGTACCTTGGAACTAT
GAATATAAGGATATTACGGTGGACAACTTGGATAACATTGTTGGAAACATAACTGCATCCAGTTCAGTAACCTTCACAGATGATGAGTTACCACCAGAAGGCACCGGACA
CACTAAAGCCTTGCACATTACAGTTAAGTGCAAAAATTTTGTGTGGCAAAAGTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAGATGCTATGTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAACAGTGAAACAGGCTTGGGAAAAATTAACTGTAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCAGCTCATGTATGTAC
AAGTTAATTATTCTGTATTAAAAGCTTTGATTCGACATTGGGATCCAGCCTACAGATGTTTCACATTTGGCTCAATTGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAACACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACAT
TGAGAAACAAGTAAAGATAAAGGAGAGAACACATGCCTACCCCTGGACTACATCCTTACTCTTCAACAAAAATTTGCAAATGAAGACAAGGATTAACTTTACTGGCATTG
TAACAAAAGGAACAGGAAGATTTATAGGGTGTGCACCATTATTGTACATTTGGGTCCTTAGTCATGTGAAGTGTCCACCCGAATTCAAATGTCCAGAGATCAAATTTTCA
AGTTCCTGGAATAAGCTGCGAAATCCCATTTCAGAGTTCGTGCAATCAGGTTGGAGTTCATCCTCCCCTGAAAGGAGCGCTTGGGAAGCCTTTTTCTCCGAACTAAAGGT
AGAAGATGTAATGTGGAGAGCCCCTTGGATGTCAACTAGGCCAATGATCTATAAATGTGGTAAATTTCAAAGTTTACCTCTTTTGGGTCCTTGGGGATGTATAGCCTATG
CTCCTTTATTGGTAGTACGCCAAATTTGGGTCCGACAATTCATCCCTGCTACGCATGAGTTGAAGGATTTCGAATTTGCTTATGATAAAGGCTTCTGTAAAGATAGAATT
CAGAAAATTGTGAAAGCATGGAAAATGATCACTAAAATCCAGAGTGGTAGAAGAAAAGAATGGAACCTATGTCGAACAATTGACGACCTACAATTAACGCTCAAGATTAG
AGAAGATCAACTAGGGGAGCTCATCAACGACAACAAGGGTCTAAGAGAGTCCGTTCAGTCACTTAATGTTCGCCTCGGTAAGTATCAGGATGCCACTGATAGATTAATGA
AAGACTATACCTATTTAAAGGAGCAGTACGACAGATTAAGCGATGATTTTGGGTTTGTGAGACAGAACCACGCGACACTACGAAAGGGTACTACTCCAAAATATCATCCA
TTGTACAATATTCCAGTAGAGCAGCACCCATTTCCATTTTTCAAGAATGAGCAAGTGCCTGTACACAATCAACCTGGATTTTCACTACCCACAGAGGTACCTCCCAAGGT
GACCATTACAGTTCCCAATTTAGATGATCCTGAAATCAGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCTAGTGAAAAGCTTGAAGTCCTGGAGGAAAGATTAAGGG
CAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGCGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCTCCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGAT
GGAGCATCCTGCCCTAAAAACCATCTCATCATGTATTGCAGGAAGATGGCAGCATACGTCCAAAATGACAAACTGTTAATTCACTGCTTCCAGGACAGTCTTATTGGTCC
AGCATCTCGATGGTATATGCAGTTAGACAGCACTCATATATGTTCATGGAAGAATCTAGCCGATTCATTTTTAAAGCAATATAAGCACAACATAGATATGGCTCCTGACC
TCCTAGACCTCCAGAGGATGGAAAAGAAGAGCACAGAAAGCTTTAAGGAGTATGCCCAAAGGTGGAGGGATACTGCTGCTCAGGTGCAACCACCTTTAGCAGATAAGGAG
CTGTCAACCATGTTTATTAATACTCTCAAATCTCCTTTCTATGATAAGATGATTGGGAGTGCCTCTACCAATTTCTCTGACATAATGACAATTGGAGAGAGAATCGAGTA
CGAAATTAAGCATGGAAAGATAATTGATAGGCTGGATCATCGAAGCGAAAAAGGGGGTTCCATCGAAGAAAAAAGAGGGAGAAGTTCAGATGATTGTAAACCACCTTACC
CAAAGTGGTATGACCCAAATGCCCGTTGCGACTACCATGCAGGAGCAATTGGACATTCCACTGAAAACTGTACTGCACTCAAGCATAGGGTGCAAGCATTGATCAAGGCA
GGATGGTTGAACTTTAAGAAAGAAAATGGTCCAGATGTCAACAACAATCCTCTGCCAAACCATCAGAATGCACAAGTAAATGCGATAGATCAGGTCCAGGAGTTGTTAGA
TTCACATTTTTTAACAGTTTCTCAAAAGATGGTTCAGCTCCCTCAGTATGGGGAAGTTGATATTATAGAAGAATGCTCAAGGTTGTCTCTCAAGCCAAAACCGTTAACAA
TTTCTTATCGCGAGAAGCCCAGTACCCCAAATTCCAAGCCAAGACCGATTACCATCCAGATTCCGACTCCCTTTGAATATAAAAGTTCAAAAGCAGTACCTTGGAACTAT
GAATATAAGGATATTACGGTGGACAACTTGGATAACATTGTTGGAAACATAACTGCATCCAGTTCAGTAACCTTCACAGATGATGAGTTACCACCAGAAGGCACCGGACA
CACTAAAGCCTTGCACATTACAGTTAAGTGCAAAAATTTTGTGTGGCAAAAGTTCTAG
Protein sequenceShow/hide protein sequence
MSIEDQATVRQWSENVQQIHGDSLVEDAMSQFKDVSFPESQLETVKQAWEKLTVDRKAKFTSKYGHLAQLMYVQVNYSVLKALIRHWDPAYRCFTFGSIDMTPTIEEYQS
LLHMPTRTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKIKERTHAYPWTTSLLFNKNLQMKTRINFTGIVTKGTGRFIGCAPLLYIWVLSHVKCPPEFKCPEIKFS
SSWNKLRNPISEFVQSGWSSSSPERSAWEAFFSELKVEDVMWRAPWMSTRPMIYKCGKFQSLPLLGPWGCIAYAPLLVVRQIWVRQFIPATHELKDFEFAYDKGFCKDRI
QKIVKAWKMITKIQSGRRKEWNLCRTIDDLQLTLKIREDQLGELINDNKGLRESVQSLNVRLGKYQDATDRLMKDYTYLKEQYDRLSDDFGFVRQNHATLRKGTTPKYHP
LYNIPVEQHPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYD
GASCPKNHLIMYCRKMAAYVQNDKLLIHCFQDSLIGPASRWYMQLDSTHICSWKNLADSFLKQYKHNIDMAPDLLDLQRMEKKSTESFKEYAQRWRDTAAQVQPPLADKE
LSTMFINTLKSPFYDKMIGSASTNFSDIMTIGERIEYEIKHGKIIDRLDHRSEKGGSIEEKRGRSSDDCKPPYPKWYDPNARCDYHAGAIGHSTENCTALKHRVQALIKA
GWLNFKKENGPDVNNNPLPNHQNAQVNAIDQVQELLDSHFLTVSQKMVQLPQYGEVDIIEECSRLSLKPKPLTISYREKPSTPNSKPRPITIQIPTPFEYKSSKAVPWNY
EYKDITVDNLDNIVGNITASSSVTFTDDELPPEGTGHTKALHITVKCKNFVWQKF