; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012975 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012975
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold1:12539482..12542426
RNA-Seq ExpressionSpg012975
SyntenySpg012975
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-3521.28Show/hide
Query:  NPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNV------PE---APNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQ
        N +G +AEI ++      + ++VP G  +  W S ++ I     +  +   +      PE   +P       SY  A+ + +    +          S+Q
Subjt:  NPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNV------PE---APNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQ

Query:  PALTNP--EFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVGEPKVPS
         +  +P      + L ++V++ R+ FH +W  I++ L++      + +    +K  +        + L + KGW  VGKY VRF  W+  +      +PS
Subjt:  PALTNP--EFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVGEPKVPS

Query:  YGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKIP-PS
        YGGW   R  PL  W++ TF+ IG  CGG ++ A +T    +++E  +K++ NY+ F+PA + +     N   +++      ++ +   V +HG     +
Subjt:  YGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKIP-PS

Query:  MANYDE-KDDRDH------PRAAPTRLGSNEGSR--------------LQKPYKWETQGTPIDAKY---NGTHPSKSPTQATLADPI----------QRS
         A++D+   D +          +P  L +  GSR              + KP K+ T  T ++ +    N  H + + ++  +   I          Q+ 
Subjt:  MANYDE-KDDRDH------PRAAPTRLGSNEGSR--------------LQKPYKWETQGTPIDAKY---NGTHPSKSPTQATLADPI----------QRS

Query:  PIPAQSETV-------------TQSKSSTHIKPTHQPDNRTHRKKPIIINNKETFLLTGTMHSTNSELPVSDSEEG----MSSPCFTAMEETPITTRG--
         IP+Q  +              + S  +T   P   P N +  KK  +   +     + T+       P   + +G    ++ P      +   + +G  
Subjt:  PIPAQSETV-------------TQSKSSTHIKPTHQPDNRTHRKKPIIINNKETFLLTGTMHSTNSELPVSDSEEG----MSSPCFTAMEETPITTRG--

Query:  -APQIASPPTI--CKLFED----DQEPLQQIEN--LIP----LRIEEP--------INRCSNQNSIREES--ALIEIDVEDEENDAFPTE----------
            + + P +   K FED    D   +  I N  ++P    L++ +P        +N    ++S R        E   +D  ++AF  +          
Subjt:  -APQIASPPTI--CKLFED----DQEPLQQIEN--LIP----LRIEEP--------INRCSNQNSIREES--ALIEIDVEDEENDAFPTE----------

Query:  ----------ATSTDLAVYLP-------ILFPWLTEHGMCIMHMPGRQKTS------------------TATKKKVKWVKELQNLH--------------
                  AT++  A++         IL  W  +H   +    G+   S                     ++++   ++L NLH              
Subjt:  ----------ATSTDLAVYLP-------ILFPWLTEHGMCIMHMPGRQKTS------------------TATKKKVKWVKELQNLH--------------

Query:  --------TSVNYNKSSTGYIS--TGSFLITDSSITK-----------------------------FSNATARRLDKITSDHFPISL--TLGKEKWGPAP
                T+V ++  S+  ++    + L+ D  +T                              F+    R L + TSDHFP+    +    +WGPAP
Subjt:  --------TSVNYNKSSTGYIS--TGSFLITDSSITK-----------------------------FSNATARRLDKITSDHFPISL--TLGKEKWGPAP

Query:  FRLNNVWLNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISAN
        FRLN++ LN   F   ++ WW+ +   G PG  FIQ+LK L   IK W +  F      K  + +E+ S+D  E    LS  ++ RR+ +KAEL  +S  
Subjt:  FRLNNVWLNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISAN

Query:  EEILWRQ
        E   W Q
Subjt:  EEILWRQ

KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.2e-3923.33Show/hide
Query:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP
        +AEI ++ + G    ++VP G +  GW+  +  I   ++   + +                 + +D +R  SY   L  + ED +  K+ +A    S+  
Subjt:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP

Query:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP
          ++  F    LS      +VII R+ FH +W  IM +L++      S  P Q DKA L    +      +     GW  VG YQV+F  W +       
Subjt:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP

Query:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI
         +PSYGGW++ R  PL  W+  TF+HIG  CGG+L+ A +T+    +++  IKV+ NYT F+PA I +         +         + +   V +HG  
Subjt:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI

Query:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH
            A  DE D  +H     T      G +   P    T G       +  H     TQA    +   +  P   Q     + K    +    Q      
Subjt:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH

Query:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE
        ++   I N K +FL  G +  +S+N+E+           ++D  E   SP      +     +  PQ ++      L E            D  P+  +E
Subjt:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE

Query:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEIDVEDEE-----------------------NDA
        ++I       ++  +NQ    NS   +SA                                    +EID   +E                       + +
Subjt:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEIDVEDEE-----------------------NDA

Query:  FPTEAT--STDLAVYLP------ILFPW----------------------LTEHGMCIMHMPGRQKTSTATK----------------------KKVKWV
        FP   +  + D+A + P      IL  W                       T     +  + G  K +  TK                        V+W 
Subjt:  FPTEAT--STDLAVYLP------ILFPW----------------------LTEHGMCIMHMPGRQKTSTATK----------------------KKVKWV

Query:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW
        +E          + N +  ++ N                 + +  Y     FL++      F   T+R L++  SDHFPI L   + KWGP PFRLNN  
Subjt:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW

Query:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ
        L  K F     +WW N+   G+PG+ FIQ L  L K IK+W  +    Y   K  L KE+  +D  E  G++S     +RI +K++L+SI  N+  +W Q
Subjt:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.4e-3922.78Show/hide
Query:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP
        +AEI ++ + G    ++VP G +  GW+S +  I   ++   + +                 + +D +R  SY   L  + ED +  K+ +A    S+  
Subjt:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP

Query:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP
          ++  F    LS      +VII R+ FH +W  IM +L++      S  P Q DKA L    +      +     GW  VG YQV+F  W +       
Subjt:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP

Query:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI
         +PSYGGW++ R  PL  W+  TF+HIG  CGG+L+ A +T+    +++  IKV+ NY  F+PA I +         +         + +   V +HG  
Subjt:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI

Query:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH
            A  DE D  +H     T      G +   P    T G       +  H     TQA    +   +  P   Q     + K    +    Q      
Subjt:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH

Query:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE
        ++   I N K +FL  G +  +S+N+E+           ++D  E   SP      +     +  PQ ++      L E            D  P+  +E
Subjt:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE

Query:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY
        ++I       ++  +NQ    NS   +SA                                    +EID          +++ E    P        + Y
Subjt:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY

Query:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV
         P++            P   + G+ ++                                 + G  K +  TK                        V+W 
Subjt:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV

Query:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW
        +E          + N +  ++ N                 + +  Y     FL++      F   T+R L++  SDHFPI L   + KWGP PFRLNN  
Subjt:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW

Query:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ
        L  K F     +WW ++   G+PG+ FIQ L  L K IK+W  +    Y   K  L KE+  +D  E  G++S     +RI +K++L+SI  N+  +W Q
Subjt:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-3521.28Show/hide
Query:  NPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNV------PE---APNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQ
        N +G +AEI ++      + ++VP G  +  W S ++ I     +  +   +      PE   +P       SY  A+ + +    +          S+Q
Subjt:  NPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNV------PE---APNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQ

Query:  PALTNP--EFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVGEPKVPS
         +  +P      + L ++V++ R+ FH +W  I++ L++      + +    +K  +        + L + KGW  VGKY VRF  W+  +      +PS
Subjt:  PALTNP--EFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVGEPKVPS

Query:  YGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKIP-PS
        YGGW   R  PL  W++ TF+ IG  CGG ++ A +T    +++E  +K++ NY+ F+PA + +     N   +++      ++ +   V +HG     +
Subjt:  YGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKIP-PS

Query:  MANYDE-KDDRDH------PRAAPTRLGSNEGSR--------------LQKPYKWETQGTPIDAKY---NGTHPSKSPTQATLADPI----------QRS
         A++D+   D +          +P  L +  GSR              + KP K+ T  T ++ +    N  H + + ++  +   I          Q+ 
Subjt:  MANYDE-KDDRDH------PRAAPTRLGSNEGSR--------------LQKPYKWETQGTPIDAKY---NGTHPSKSPTQATLADPI----------QRS

Query:  PIPAQSETV-------------TQSKSSTHIKPTHQPDNRTHRKKPIIINNKETFLLTGTMHSTNSELPVSDSEEG----MSSPCFTAMEETPITTRG--
         IP+Q  +              + S  +T   P   P N +  KK  +   +     + T+       P   + +G    ++ P      +   + +G  
Subjt:  PIPAQSETV-------------TQSKSSTHIKPTHQPDNRTHRKKPIIINNKETFLLTGTMHSTNSELPVSDSEEG----MSSPCFTAMEETPITTRG--

Query:  -APQIASPPTI--CKLFED----DQEPLQQIEN--LIP----LRIEEP--------INRCSNQNSIREES--ALIEIDVEDEENDAFPTE----------
            + + P +   K FED    D   +  I N  ++P    L++ +P        +N    ++S R        E   +D  ++AF  +          
Subjt:  -APQIASPPTI--CKLFED----DQEPLQQIEN--LIP----LRIEEP--------INRCSNQNSIREES--ALIEIDVEDEENDAFPTE----------

Query:  ----------ATSTDLAVYLP-------ILFPWLTEHGMCIMHMPGRQKTS------------------TATKKKVKWVKELQNLH--------------
                  AT++  A++         IL  W  +H   +    G+   S                     ++++   ++L NLH              
Subjt:  ----------ATSTDLAVYLP-------ILFPWLTEHGMCIMHMPGRQKTS------------------TATKKKVKWVKELQNLH--------------

Query:  --------TSVNYNKSSTGYIS--TGSFLITDSSITK-----------------------------FSNATARRLDKITSDHFPISL--TLGKEKWGPAP
                T+V ++  S+  ++    + L+ D  +T                              F+    R L + TSDHFP+    +    +WGPAP
Subjt:  --------TSVNYNKSSTGYIS--TGSFLITDSSITK-----------------------------FSNATARRLDKITSDHFPISL--TLGKEKWGPAP

Query:  FRLNNVWLNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISAN
        FRLN++ LN   F   ++ WW+ +   G PG  FIQ+LK L   IK W +  F      K  + +E+ S+D  E    LS  ++ RR+ +KAEL  +S  
Subjt:  FRLNNVWLNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISAN

Query:  EEILWRQ
        E   W Q
Subjt:  EEILWRQ

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.4e-3922.78Show/hide
Query:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP
        +AEI ++ + G    ++VP G +  GW+S +  I   ++   + +                 + +D +R  SY   L  + ED +  K+ +A    S+  
Subjt:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP

Query:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP
          ++  F    LS      +VII R+ FH +W  IM +L++      S  P Q DKA L    +      +     GW  VG YQV+F  W +       
Subjt:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP

Query:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI
         +PSYGGW++ R  PL  W+  TF+HIG  CGG+L+ A +T+    +++  IKV+ NY  F+PA I +         +         + +   V +HG  
Subjt:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI

Query:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH
            A  DE D  +H     T      G +   P    T G       +  H     TQA    +   +  P   Q     + K    +    Q      
Subjt:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH

Query:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE
        ++   I N K +FL  G +  +S+N+E+           ++D  E   SP      +     +  PQ ++      L E            D  P+  +E
Subjt:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE

Query:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY
        ++I       ++  +NQ    NS   +SA                                    +EID          +++ E    P        + Y
Subjt:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY

Query:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV
         P++            P   + G+ ++                                 + G  K +  TK                        V+W 
Subjt:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV

Query:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW
        +E          + N +  ++ N                 + +  Y     FL++      F   T+R L++  SDHFPI L   + KWGP PFRLNN  
Subjt:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW

Query:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ
        L  K F     +WW ++   G+PG+ FIQ L  L K IK+W  +    Y   K  L KE+  +D  E  G++S     +RI +K++L+SI  N+  +W Q
Subjt:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ

TrEMBL top hitse value%identityAlignment
A0A5A7U495 DUF4283 domain-containing protein5.5e-3528.53Show/hide
Query:  LQHPLNPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLI------------TSIQSLTNLSQQAVNVPEAPNDGNRNVSYKDALKKNQEDHHTHKQKQ
        +Q   N +G+ AEI ++   G    ++VP G  + GW   +            T+ ++L N   +         D  +  SY +A+ K      + ++  
Subjt:  LQHPLNPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLI------------TSIQSLTNLSQQAVNVPEAPNDGNRNVSYKDALKKNQEDHHTHKQKQ

Query:  AYVVVSAQPALTNPEFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVG
        ++     +   +N  F+  +   +V++ R+ FH +W  I+  L + L       P   DKA +  ++EEQ + + K KGW  VG++ V+F  W+ +    
Subjt:  AYVVVSAQPALTNPEFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVG

Query:  EPKVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHG
           +PSYGGWIK+R  PL  W++E+F  IGD CGG++E A +T    D++E SI++K+NY+ FIPA I L     +   I++      ++++     IHG
Subjt:  EPKVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHG

Query:  KIPPSMA-NYDE
              A  +DE
Subjt:  KIPPSMA-NYDE

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein1.7e-3922.78Show/hide
Query:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP
        +AEI ++ + G    ++VP G +  GW+S +  I   ++   + +                 + +D +R  SY   L  + ED +  K+ +A    S+  
Subjt:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP

Query:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP
          ++  F    LS      +VII R+ FH +W  IM +L++      S  P Q DKA L    +      +     GW  VG YQV+F  W +       
Subjt:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP

Query:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI
         +PSYGGW++ R  PL  W+  TF+HIG  CGG+L+ A +T+    +++  IKV+ NY  F+PA I +         +         + +   V +HG  
Subjt:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI

Query:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH
            A  DE D  +H     T      G +   P    T G       +  H     TQA    +   +  P   Q     + K    +    Q      
Subjt:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH

Query:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE
        ++   I N K +FL  G +  +S+N+E+           ++D  E   SP      +     +  PQ ++      L E            D  P+  +E
Subjt:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE

Query:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY
        ++I       ++  +NQ    NS   +SA                                    +EID          +++ E    P        + Y
Subjt:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY

Query:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV
         P++            P   + G+ ++                                 + G  K +  TK                        V+W 
Subjt:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV

Query:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW
        +E          + N +  ++ N                 + +  Y     FL++      F   T+R L++  SDHFPI L   + KWGP PFRLNN  
Subjt:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW

Query:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ
        L  K F     +WW ++   G+PG+ FIQ L  L K IK+W  +    Y   K  L KE+  +D  E  G++S     +RI +K++L+SI  N+  +W Q
Subjt:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein5.7e-4023.33Show/hide
Query:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP
        +AEI ++ + G    ++VP G +  GW+  +  I   ++   + +                 + +D +R  SY   L  + ED +  K+ +A    S+  
Subjt:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP

Query:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP
          ++  F    LS      +VII R+ FH +W  IM +L++      S  P Q DKA L    +      +     GW  VG YQV+F  W +       
Subjt:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP

Query:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI
         +PSYGGW++ R  PL  W+  TF+HIG  CGG+L+ A +T+    +++  IKV+ NYT F+PA I +         +         + +   V +HG  
Subjt:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI

Query:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH
            A  DE D  +H     T      G +   P    T G       +  H     TQA    +   +  P   Q     + K    +    Q      
Subjt:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH

Query:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE
        ++   I N K +FL  G +  +S+N+E+           ++D  E   SP      +     +  PQ ++      L E            D  P+  +E
Subjt:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE

Query:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEIDVEDEE-----------------------NDA
        ++I       ++  +NQ    NS   +SA                                    +EID   +E                       + +
Subjt:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEIDVEDEE-----------------------NDA

Query:  FPTEAT--STDLAVYLP------ILFPW----------------------LTEHGMCIMHMPGRQKTSTATK----------------------KKVKWV
        FP   +  + D+A + P      IL  W                       T     +  + G  K +  TK                        V+W 
Subjt:  FPTEAT--STDLAVYLP------ILFPW----------------------LTEHGMCIMHMPGRQKTSTATK----------------------KKVKWV

Query:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW
        +E          + N +  ++ N                 + +  Y     FL++      F   T+R L++  SDHFPI L   + KWGP PFRLNN  
Subjt:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW

Query:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ
        L  K F     +WW N+   G+PG+ FIQ L  L K IK+W  +    Y   K  L KE+  +D  E  G++S     +RI +K++L+SI  N+  +W Q
Subjt:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein1.7e-3922.78Show/hide
Query:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP
        +AEI ++ + G    ++VP G +  GW+S +  I   ++   + +                 + +D +R  SY   L  + ED +  K+ +A    S+  
Subjt:  SAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVP-------------EAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQP

Query:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP
          ++  F    LS      +VII R+ FH +W  IM +L++      S  P Q DKA L    +      +     GW  VG YQV+F  W +       
Subjt:  ALTNPEFAAIFLS-----SSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAK--IKGWYKVGKYQVRFYPWSAETMVGEP

Query:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI
         +PSYGGW++ R  PL  W+  TF+HIG  CGG+L+ A +T+    +++  IKV+ NY  F+PA I +         +         + +   V +HG  
Subjt:  KVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKI

Query:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH
            A  DE D  +H     T      G +   P    T G       +  H     TQA    +   +  P   Q     + K    +    Q      
Subjt:  PPSMANYDEKDDRDHPRAAPTRLGSNEGSRLQKPYKWETQGTPIDAKYNGTHPSKSPTQA--TLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTH

Query:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE
        ++   I N K +FL  G +  +S+N+E+           ++D  E   SP      +     +  PQ ++      L E            D  P+  +E
Subjt:  RKKPIIINNKETFLLTGTM--HSTNSEL----------PVSDSEEGMSSPCFTAMEETPITTRGAPQIASPPTICKLFED-----------DQEPLQQIE

Query:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY
        ++I       ++  +NQ    NS   +SA                                    +EID          +++ E    P        + Y
Subjt:  NLIPLRIEEPINRCSNQ----NSIREESA-----------------------------------LIEID----------VEDEENDAFPTEATSTDLAVY

Query:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV
         P++            P   + G+ ++                                 + G  K +  TK                        V+W 
Subjt:  LPILF-----------PWLTEHGMCIM--------------------------------HMPGRQKTSTATK----------------------KKVKWV

Query:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW
        +E          + N +  ++ N                 + +  Y     FL++      F   T+R L++  SDHFPI L   + KWGP PFRLNN  
Subjt:  KE----------LQNLHTSVNYN-----------------KSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVW

Query:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ
        L  K F     +WW ++   G+PG+ FIQ L  L K IK+W  +    Y   K  L KE+  +D  E  G++S     +RI +K++L+SI  N+  +W Q
Subjt:  LNHKSFLITVDSWWKNTPSRGWPGHGFIQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQ

A0A5D3CFS8 DUF4283 domain-containing protein5.5e-3528.53Show/hide
Query:  LQHPLNPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLI------------TSIQSLTNLSQQAVNVPEAPNDGNRNVSYKDALKKNQEDHHTHKQKQ
        +Q   N +G+ AEI ++   G    ++VP G  + GW   +            T+ ++L N   +         D  +  SY +A+ K      + ++  
Subjt:  LQHPLNPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLI------------TSIQSLTNLSQQAVNVPEAPNDGNRNVSYKDALKKNQEDHHTHKQKQ

Query:  AYVVVSAQPALTNPEFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVG
        ++     +   +N  F+  +   +V++ R+ FH +W  I+  L + L       P   DKA +  ++EEQ + + K KGW  VG++ V+F  W+ +    
Subjt:  AYVVVSAQPALTNPEFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVG

Query:  EPKVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHG
           +PSYGGWIK+R  PL  W++E+F  IGD CGG++E A +T    D++E SI++K+NY+ FIPA I L     +   I++      ++++     IHG
Subjt:  EPKVPSYGGWIKIRNFPLDKWSIETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHG

Query:  KIPPSMA-NYDE
              A  +DE
Subjt:  KIPPSMA-NYDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCAGACTTGTTTCGATCGGCTCTGCAACATCCCCTTAACCCAGAAGGACACTCGGCCGAAATAGCTAAGCTTGGATCAAATGGGGGCATTAATAAATTGATTGT
ACCTGTGGGCGAGAATAGAAAAGGATGGCAGAGCCTCATAACCAGCATTCAATCCCTCACCAACCTCAGCCAACAAGCAGTTAATGTGCCTGAAGCCCCAAATGATGGCA
ACCGAAATGTCTCATATAAAGATGCATTAAAGAAGAACCAAGAGGACCACCACACCCACAAACAAAAGCAAGCCTACGTAGTTGTTTCAGCCCAACCAGCCTTGACCAAT
CCCGAGTTCGCTGCCATCTTCTTGTCATCCTCAGTTATAATCCAAAGAAAGCACTTCCACGGTAACTGGTATGATATCATGAGAGCTCTCCAACAACATTTATCTGCCTT
TGCATCAGTCAGCCCTCTCCAACCCGATAAAGCCCGGCTAGCTTGCGAGGATGAAGAACAAACCCATACCCTTGCCAAAATAAAAGGATGGTATAAGGTGGGTAAATACC
AGGTCCGTTTTTACCCTTGGAGCGCCGAAACCATGGTTGGTGAACCTAAGGTTCCCTCGTATGGAGGATGGATAAAGATCAGGAACTTTCCTCTTGATAAATGGTCTATA
GAAACGTTTAAGCACATTGGTGATGAATGTGGAGGGTACTTGGAAACCGCTAATAAAACCCTAGCACGTATGGACATGATGGAAGTCAGTATTAAAGTCAAGGAGAACTA
CACAAGCTTCATCCCAGCTGAAATTCACCTCCCATCAGCATCATCAAACCCTACGACCATCAAAATCGACCCCTTCTTCATGGAAGAATATAACATTGGTTACATTGTCG
GAATACATGGCAAAATCCCTCCCAGCATGGCGAATTATGATGAGAAAGATGATCGCGACCACCCACGCGCCGCCCCAACGCGATTAGGATCCAATGAAGGTAGTAGGCTG
CAGAAACCGTACAAGTGGGAAACCCAAGGCACCCCCATTGATGCAAAATATAATGGGACCCACCCATCAAAATCCCCCACTCAAGCTACGCTAGCCGACCCAATCCAAAG
AAGCCCAATACCAGCCCAATCTGAGACAGTCACCCAATCAAAGAGCTCCACCCACATTAAGCCAACCCACCAGCCAGACAACCGCACGCACAGGAAAAAGCCCATCATCA
TTAATAACAAGGAAACCTTCCTCCTCACGGGTACAATGCACTCCACAAACTCTGAGTTACCCGTCTCTGATTCCGAAGAAGGGATGTCTTCACCTTGCTTCACAGCCATG
GAAGAAACACCAATAACCACTCGAGGGGCCCCCCAAATCGCATCTCCTCCGACCATATGCAAGCTTTTCGAAGATGATCAGGAGCCGCTACAACAGATAGAAAATCTTAT
CCCCCTGAGAATTGAAGAACCTATTAACCGATGCTCCAACCAAAACTCTATCAGAGAAGAATCTGCTCTTATAGAGATTGATGTGGAAGATGAGGAAAACGATGCATTCC
CAACAGAAGCAACAAGCACAGACCTAGCGGTTTATCTCCCTATCTTATTCCCTTGGCTAACCGAGCACGGTATGTGCATCATGCATATGCCCGGCAGACAAAAGACCTCC
ACTGCCACCAAGAAGAAGGTCAAATGGGTTAAGGAATTACAAAATCTCCACACATCTGTGAACTACAATAAGTCCTCTACAGGTTACATATCAACGGGGTCGTTTCTTAT
AACCGATAGCAGTATTACCAAATTCTCTAACGCCACAGCCCGTCGATTGGACAAAATCACCTCTGATCATTTCCCTATCAGCCTCACATTGGGGAAAGAAAAATGGGGAC
CAGCCCCTTTTAGGCTCAACAACGTTTGGCTTAACCACAAATCCTTCCTCATTACCGTGGATTCATGGTGGAAAAACACTCCATCTAGGGGTTGGCCCGGTCATGGATTC
ATCCAGAAATTAAAGGATCTCAAAAAGGAGATTAAACAGTGGAATCAATCGGTTTTCGGTCAGTATAGAGAGAAAAAATCTTGCCTGAATAAAGAGCTATCTTCCTTAGA
CCATAGGGAGGAACACGGTCAGCTATCTGCCCACGATGCTATCAGAAGAATAGAGATCAAGGCTGAACTCATCTCTATATCGGCCAATGAAGAGATCCTGTGGAGGCAAA
ATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCAGACTTGTTTCGATCGGCTCTGCAACATCCCCTTAACCCAGAAGGACACTCGGCCGAAATAGCTAAGCTTGGATCAAATGGGGGCATTAATAAATTGATTGT
ACCTGTGGGCGAGAATAGAAAAGGATGGCAGAGCCTCATAACCAGCATTCAATCCCTCACCAACCTCAGCCAACAAGCAGTTAATGTGCCTGAAGCCCCAAATGATGGCA
ACCGAAATGTCTCATATAAAGATGCATTAAAGAAGAACCAAGAGGACCACCACACCCACAAACAAAAGCAAGCCTACGTAGTTGTTTCAGCCCAACCAGCCTTGACCAAT
CCCGAGTTCGCTGCCATCTTCTTGTCATCCTCAGTTATAATCCAAAGAAAGCACTTCCACGGTAACTGGTATGATATCATGAGAGCTCTCCAACAACATTTATCTGCCTT
TGCATCAGTCAGCCCTCTCCAACCCGATAAAGCCCGGCTAGCTTGCGAGGATGAAGAACAAACCCATACCCTTGCCAAAATAAAAGGATGGTATAAGGTGGGTAAATACC
AGGTCCGTTTTTACCCTTGGAGCGCCGAAACCATGGTTGGTGAACCTAAGGTTCCCTCGTATGGAGGATGGATAAAGATCAGGAACTTTCCTCTTGATAAATGGTCTATA
GAAACGTTTAAGCACATTGGTGATGAATGTGGAGGGTACTTGGAAACCGCTAATAAAACCCTAGCACGTATGGACATGATGGAAGTCAGTATTAAAGTCAAGGAGAACTA
CACAAGCTTCATCCCAGCTGAAATTCACCTCCCATCAGCATCATCAAACCCTACGACCATCAAAATCGACCCCTTCTTCATGGAAGAATATAACATTGGTTACATTGTCG
GAATACATGGCAAAATCCCTCCCAGCATGGCGAATTATGATGAGAAAGATGATCGCGACCACCCACGCGCCGCCCCAACGCGATTAGGATCCAATGAAGGTAGTAGGCTG
CAGAAACCGTACAAGTGGGAAACCCAAGGCACCCCCATTGATGCAAAATATAATGGGACCCACCCATCAAAATCCCCCACTCAAGCTACGCTAGCCGACCCAATCCAAAG
AAGCCCAATACCAGCCCAATCTGAGACAGTCACCCAATCAAAGAGCTCCACCCACATTAAGCCAACCCACCAGCCAGACAACCGCACGCACAGGAAAAAGCCCATCATCA
TTAATAACAAGGAAACCTTCCTCCTCACGGGTACAATGCACTCCACAAACTCTGAGTTACCCGTCTCTGATTCCGAAGAAGGGATGTCTTCACCTTGCTTCACAGCCATG
GAAGAAACACCAATAACCACTCGAGGGGCCCCCCAAATCGCATCTCCTCCGACCATATGCAAGCTTTTCGAAGATGATCAGGAGCCGCTACAACAGATAGAAAATCTTAT
CCCCCTGAGAATTGAAGAACCTATTAACCGATGCTCCAACCAAAACTCTATCAGAGAAGAATCTGCTCTTATAGAGATTGATGTGGAAGATGAGGAAAACGATGCATTCC
CAACAGAAGCAACAAGCACAGACCTAGCGGTTTATCTCCCTATCTTATTCCCTTGGCTAACCGAGCACGGTATGTGCATCATGCATATGCCCGGCAGACAAAAGACCTCC
ACTGCCACCAAGAAGAAGGTCAAATGGGTTAAGGAATTACAAAATCTCCACACATCTGTGAACTACAATAAGTCCTCTACAGGTTACATATCAACGGGGTCGTTTCTTAT
AACCGATAGCAGTATTACCAAATTCTCTAACGCCACAGCCCGTCGATTGGACAAAATCACCTCTGATCATTTCCCTATCAGCCTCACATTGGGGAAAGAAAAATGGGGAC
CAGCCCCTTTTAGGCTCAACAACGTTTGGCTTAACCACAAATCCTTCCTCATTACCGTGGATTCATGGTGGAAAAACACTCCATCTAGGGGTTGGCCCGGTCATGGATTC
ATCCAGAAATTAAAGGATCTCAAAAAGGAGATTAAACAGTGGAATCAATCGGTTTTCGGTCAGTATAGAGAGAAAAAATCTTGCCTGAATAAAGAGCTATCTTCCTTAGA
CCATAGGGAGGAACACGGTCAGCTATCTGCCCACGATGCTATCAGAAGAATAGAGATCAAGGCTGAACTCATCTCTATATCGGCCAATGAAGAGATCCTGTGGAGGCAAA
ATTGA
Protein sequenceShow/hide protein sequence
MAPDLFRSALQHPLNPEGHSAEIAKLGSNGGINKLIVPVGENRKGWQSLITSIQSLTNLSQQAVNVPEAPNDGNRNVSYKDALKKNQEDHHTHKQKQAYVVVSAQPALTN
PEFAAIFLSSSVIIQRKHFHGNWYDIMRALQQHLSAFASVSPLQPDKARLACEDEEQTHTLAKIKGWYKVGKYQVRFYPWSAETMVGEPKVPSYGGWIKIRNFPLDKWSI
ETFKHIGDECGGYLETANKTLARMDMMEVSIKVKENYTSFIPAEIHLPSASSNPTTIKIDPFFMEEYNIGYIVGIHGKIPPSMANYDEKDDRDHPRAAPTRLGSNEGSRL
QKPYKWETQGTPIDAKYNGTHPSKSPTQATLADPIQRSPIPAQSETVTQSKSSTHIKPTHQPDNRTHRKKPIIINNKETFLLTGTMHSTNSELPVSDSEEGMSSPCFTAM
EETPITTRGAPQIASPPTICKLFEDDQEPLQQIENLIPLRIEEPINRCSNQNSIREESALIEIDVEDEENDAFPTEATSTDLAVYLPILFPWLTEHGMCIMHMPGRQKTS
TATKKKVKWVKELQNLHTSVNYNKSSTGYISTGSFLITDSSITKFSNATARRLDKITSDHFPISLTLGKEKWGPAPFRLNNVWLNHKSFLITVDSWWKNTPSRGWPGHGF
IQKLKDLKKEIKQWNQSVFGQYREKKSCLNKELSSLDHREEHGQLSAHDAIRRIEIKAELISISANEEILWRQN