; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041157 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041157
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr13:12981228..12985154
RNA-Seq ExpressionLag0041157
SyntenyLag0041157
Gene Ontology termsGO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
InterPro domainsIPR001207 - Transposase, mutator type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KYP42750.1 Sporozoite surface protein 2 [Cajanus cajan]5.8e-6334.79Show/hide
Query:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA
        SNPG   ++ ++   + E     P    R++ TFQR+YIC  GCK+ FLK CRPIIGLD C LKG YGGQ++AA GRD N+Q   + F+VVE ETKESW 
Subjt:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA

Query:  WFLQLLLGAI-EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIK
        WFL+LL+  +  +     YTFISDQQKGL+P++N+++PGV QRFCVRH+Y+N  KKFPGK++K++MW AA ATY   WEREM  +KK+D  A++ L  I 
Subjt:  WFLQLLLGAI-EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIK

Query:  PSLWSKHAFTTNNICG--------------------------------------------------------RMLQKEA---------------------
        P  WSK  F  N  C                                                         + ++KE+                     
Subjt:  PSLWSKHAFTTNNICG--------------------------------------------------------RMLQKEA---------------------

Query:  ------------------------------------------------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRK
                                                              Y   Y+ +IYP NG  LW  T    + PP  ++ PGRP K R    
Subjt:  ------------------------------------------------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRK

Query:  MNQRK-GSSIGKRNVSIKCSRCKKFGHNKRSCKE
           +K  + + +R    KCSRCK FGHNK +CKE
Subjt:  MNQRK-GSSIGKRNVSIKCSRCKKFGHNKRSCKE

PNX98087.1 hypothetical protein L195_g021327 [Trifolium pratense]1.9e-6134.67Show/hide
Query:  FQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPN--RKYTFISDQQKGLVPS
        FQR+YIC+ GCKQ FLK CR IIGLD C LKG YGGQ++AA GRD NDQ   +AF+VVE ET++SW WFLQLL+  +  G N    YTFISDQQKGL+P+
Subjt:  FQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPN--RKYTFISDQQKGLVPS

Query:  LNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTTNNICGRML-------------
        +++++PGV QRFCVRH+Y+N  K+FPGK +K +MW AA ATY + W REM+ +KKI   AY++L  I P  WSK  F     C  +L             
Subjt:  LNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTTNNICGRML-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------QKEAYGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQ--RKGSSIGKRNVSIKCSRCKKFGHNKRS
                          +KE Y + Y P+IYP NG ++W  T Y+ +QPP IRR PGRP K +RNR+  +  R+   + K+ +   C RC   GHNK +
Subjt:  ------------------QKEAYGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQ--RKGSSIGKRNVSIKCSRCKKFGHNKRS

Query:  CKEPIEQGSESQMMGEDVACKTENLHTSVVLAAPTPM-LKSENRSTRARE
        C+ P    + +          T N  +S    APT     S  +S R +E
Subjt:  CKEPIEQGSESQMMGEDVACKTENLHTSVVLAAPTPM-LKSENRSTRARE

XP_023877950.1 uncharacterized protein LOC111990411 [Quercus suber]2.4e-6135.56Show/hide
Query:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA
        +NPG  A + +N  H +            +   F+R Y+C+D CK+GFL  CRP IG+D CHLKG + GQL+ A G+D N   + +AF+VVEAETKESW 
Subjt:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA

Query:  WFLQLLLGAIEYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKP
        WFL  LL  I     R++TF+SDQQKGLVP+ ++++PGV  RFCVRH+Y+N  K+  GK +K+ MW AA AT    +  EME +KKI+  A+ W  A  P
Subjt:  WFLQLLLGAIEYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKP

Query:  SLWSKHAF--------TTNN---------------------------------------------ICGRMLQK---------------------------
         LW++ AF         TNN                                             IC R+  K                           
Subjt:  SLWSKHAF--------TTNN---------------------------------------------ICGRMLQK---------------------------

Query:  ------------------------EAYGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQRKGSSIGKRNVSIKCSRCKKFGHNK
                                E+  + +EP +YPT+GSNLWP +N   I PP  RR PGRP KLR+      R    + + N  I+CS+C   GHNK
Subjt:  ------------------------EAYGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQRKGSSIGKRNVSIKCSRCKKFGHNK

Query:  RSCKE
        R CK+
Subjt:  RSCKE

XP_028791136.1 uncharacterized protein LOC114747018 [Prosopis alba]1.7e-5943.21Show/hide
Query:  FQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPNRKYTFISDQQKGLVPSLN
        F+RLY C+   K+GF+ GCRPIIGLD C LKGPYGG L+ A GRDANDQ+F LA++VVE+++++SW WFL+ L   I     R++ FISDQQKGLVP+  
Subjt:  FQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPNRKYTFISDQQKGLVPSLN

Query:  DVIPGVVQRFCVRHIYSNMGKKF-PGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFT--------TNNICGRMLQKEAYG
        +++ G   RFCVRH+YSN+  K+  G  I++++  AA AT +  W   M  +K I  GA+  L AI PS WS+HA           ++   + L  +++ 
Subjt:  DVIPGVVQRFCVRHIYSNMGKKF-PGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFT--------TNNICGRMLQKEAYG

Query:  STYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQRKGSSIGKRNV--SIKCSRCKKFGHNKRSCKEP
        + Y P+I P NG  LW   +   I+PP   + PGRP K R+     Q+   +  ++    +  CS C K GHN+R+CK+P
Subjt:  STYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQRKGSSIGKRNV--SIKCSRCKKFGHNKRSCKEP

XP_029124609.1 uncharacterized protein LOC114914968 [Cajanus cajan]5.8e-6334.79Show/hide
Query:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA
        SNPG   ++ ++   + E     P    R++ TFQR+YIC  GCK+ FLK CRPIIGLD C LKG YGGQ++AA GRD N+Q   + F+VVE ETKESW 
Subjt:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA

Query:  WFLQLLLGAI-EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIK
        WFL+LL+  +  +     YTFISDQQKGL+P++N+++PGV QRFCVRH+Y+N  KKFPGK++K++MW AA ATY   WEREM  +KK+D  A++ L  I 
Subjt:  WFLQLLLGAI-EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIK

Query:  PSLWSKHAFTTNNICG--------------------------------------------------------RMLQKEA---------------------
        P  WSK  F  N  C                                                         + ++KE+                     
Subjt:  PSLWSKHAFTTNNICG--------------------------------------------------------RMLQKEA---------------------

Query:  ------------------------------------------------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRK
                                                              Y   Y+ +IYP NG  LW  T    + PP  ++ PGRP K R    
Subjt:  ------------------------------------------------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRK

Query:  MNQRK-GSSIGKRNVSIKCSRCKKFGHNKRSCKE
           +K  + + +R    KCSRCK FGHNK +CKE
Subjt:  MNQRK-GSSIGKRNVSIKCSRCKKFGHNKRSCKE

TrEMBL top hitse value%identityAlignment
A0A151RJK5 Sporozoite surface protein 22.8e-6334.79Show/hide
Query:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA
        SNPG   ++ ++   + E     P    R++ TFQR+YIC  GCK+ FLK CRPIIGLD C LKG YGGQ++AA GRD N+Q   + F+VVE ETKESW 
Subjt:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA

Query:  WFLQLLLGAI-EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIK
        WFL+LL+  +  +     YTFISDQQKGL+P++N+++PGV QRFCVRH+Y+N  KKFPGK++K++MW AA ATY   WEREM  +KK+D  A++ L  I 
Subjt:  WFLQLLLGAI-EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIK

Query:  PSLWSKHAFTTNNICG--------------------------------------------------------RMLQKEA---------------------
        P  WSK  F  N  C                                                         + ++KE+                     
Subjt:  PSLWSKHAFTTNNICG--------------------------------------------------------RMLQKEA---------------------

Query:  ------------------------------------------------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRK
                                                              Y   Y+ +IYP NG  LW  T    + PP  ++ PGRP K R    
Subjt:  ------------------------------------------------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRK

Query:  MNQRK-GSSIGKRNVSIKCSRCKKFGHNKRSCKE
           +K  + + +R    KCSRCK FGHNK +CKE
Subjt:  MNQRK-GSSIGKRNVSIKCSRCKKFGHNKRSCKE

A0A2K3N4X6 SWIM-type domain-containing protein9.0e-6234.67Show/hide
Query:  FQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPN--RKYTFISDQQKGLVPS
        FQR+YIC+ GCKQ FLK CR IIGLD C LKG YGGQ++AA GRD NDQ   +AF+VVE ET++SW WFLQLL+  +  G N    YTFISDQQKGL+P+
Subjt:  FQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPN--RKYTFISDQQKGLVPS

Query:  LNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTTNNICGRML-------------
        +++++PGV QRFCVRH+Y+N  K+FPGK +K +MW AA ATY + W REM+ +KKI   AY++L  I P  WSK  F     C  +L             
Subjt:  LNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTTNNICGRML-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------QKEAYGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQ--RKGSSIGKRNVSIKCSRCKKFGHNKRS
                          +KE Y + Y P+IYP NG ++W  T Y+ +QPP IRR PGRP K +RNR+  +  R+   + K+ +   C RC   GHNK +
Subjt:  ------------------QKEAYGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQ--RKGSSIGKRNVSIKCSRCKKFGHNKRS

Query:  CKEPIEQGSESQMMGEDVACKTENLHTSVVLAAPTPM-LKSENRSTRARE
        C+ P    + +          T N  +S    APT     S  +S R +E
Subjt:  CKEPIEQGSESQMMGEDVACKTENLHTSVVLAAPTPM-LKSENRSTRARE

A0A2N9FLE9 Uncharacterized protein3.6e-6332.34Show/hide
Query:  IRHEQEEIKGAPIER--MRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAI
        ++ + E +    IER    +   F+RLY+C+D CK+GF+  CRP IG+DACHLKGPYGGQL+AA  RD N+QFF LAF+VVEAETK+SW WFL  L+  +
Subjt:  IRHEQEEIKGAPIER--MRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAI

Query:  EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTT
             R  TFISD+QKGLVP+  +V  G+  R CVRH+Y+N  KKFPG  +K++ W  A+ATY++QWER M+ +K++D  A+ W+ +     W KH F  
Subjt:  EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTT

Query:  NNICGRML--------------------------------------------------------------------------------------------
        ++ C  ++                                                                                            
Subjt:  NNICGRML--------------------------------------------------------------------------------------------

Query:  -----------------------QKEA-------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQR-KGSSIGKRN
                               Q+EA             Y + Y+P++ P NG ++W +T  + ++PP IRR PGRP KLRR      R + + + KR+
Subjt:  -----------------------QKEA-------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQR-KGSSIGKRN

Query:  VSIKCSRCKKFGHNKRSCKEPI--EQGSESQMMGED
        + +KC +C + GHN+R+CK  +   QG ++   G +
Subjt:  VSIKCSRCKKFGHNKRSCKEPI--EQGSESQMMGED

A0A2N9HNE0 CCHC-type domain-containing protein1.2e-6137.67Show/hide
Query:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA
        SN G    M +   +E +E++   + ++R    FQRLY+C + CK  F   CRP IGLDACHLKGPYGGQL+AA GRD N+++F LAF+VVEAET +SW 
Subjt:  SNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWA

Query:  WFLQLLLGAIEYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKP
        WFL+LL    + G N+  T++S+QQKGLV    D  P    R C RHIY+N+ ++ PG  IKE+ W AA ATY +++E+ M  +K++D GA++WL  +  
Subjt:  WFLQLLLGAIEYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKP

Query:  SLWSKHAFT--------TNNICGRMLQK----------------------------------------------EAYGSTYEPLIYPTNGSNLWPSTNYS
          W++  FT         NN+C     K                                                Y + Y+ +I P NGS +W  T   
Subjt:  SLWSKHAFT--------TNNICGRMLQK----------------------------------------------EAYGSTYEPLIYPTNGSNLWPSTNYS

Query:  IIQPPTIRRAPGRPHKLRRNRKMNQRKGSSIGKRNVSIKCSRCKKFGHNKRSCKEPIEQGSE-SQMMGE
         ++PP +RR PGRP K         + G+ +G+   + KC +C K GHNKRSCK  +   S+  Q  GE
Subjt:  IIQPPTIRRAPGRPHKLRRNRKMNQRKGSSIGKRNVSIKCSRCKKFGHNKRSCKEPIEQGSE-SQMMGE

A0A2N9HUS0 Uncharacterized protein3.6e-6332.34Show/hide
Query:  IRHEQEEIKGAPIER--MRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAI
        ++ + E +    IER    +   F+RLY+C+D CK+GF+  CRP IG+DACHLKGPYGGQL+AA  RD N+QFF LAF+VVEAETK+SW WFL  L+  +
Subjt:  IRHEQEEIKGAPIER--MRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAI

Query:  EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTT
             R  TFISD+QKGLVP+  +V  G+  R CVRH+Y+N  KKFPG  +K++ W  A+ATY++QWER M+ +K++D  A+ W+ +     W KH F  
Subjt:  EYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTT

Query:  NNICGRML--------------------------------------------------------------------------------------------
        ++ C  ++                                                                                            
Subjt:  NNICGRML--------------------------------------------------------------------------------------------

Query:  -----------------------QKEA-------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQR-KGSSIGKRN
                               Q+EA             Y + Y+P++ P NG ++W +T  + ++PP IRR PGRP KLRR      R + + + KR+
Subjt:  -----------------------QKEA-------------YGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQR-KGSSIGKRN

Query:  VSIKCSRCKKFGHNKRSCKEPI--EQGSESQMMGED
        + +KC +C + GHN+R+CK  +   QG ++   G +
Subjt:  VSIKCSRCKKFGHNKRSCKEPI--EQGSESQMMGED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase1.1e-1427.37Show/hide
Query:  TFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPNRKYTFISDQQKGLVPSL
        +F+ L+       QGF + CRP+I +D  +L G Y  +LM A   DA +Q+F LAF+V +  + +SW WFL  +   +     +    IS     ++  +
Subjt:  TFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGAIEYGPNRKYTFISDQQKGLVPSL

Query:  ND-----VIPGVVQRFCVRHIYSNMGKKFPG--KQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWS
        N+       P    RFC+ H+ S +    PG    +  ++  A +++ K +++  M+ +K+ +  A++WL    P  W+
Subjt:  ND-----VIPGVVQRFCVRHIYSNMGKKFPG--KQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAATCCTGGGATCGTAGCGCAAATGAATCTGAATATAAGACATGAGCAAGAGGAAATTAAAGGTGCACCAATTGAGAGAATGAGAATAATCCAAACTTTTCAACG
TCTTTATATTTGCATGGATGGATGCAAGCAAGGATTCTTGAAAGGTTGTAGACCAATTATAGGGTTGGATGCTTGTCACCTAAAGGGACCTTATGGAGGACAACTTATGG
CTGCATTTGGAAGAGATGCAAATGATCAATTCTTTTCGCTTGCTTTCTCAGTGGTTGAGGCTGAAACCAAAGAATCTTGGGCATGGTTCCTACAACTATTGTTAGGTGCT
ATAGAGTATGGGCCAAACCGAAAATATACCTTCATCTCTGACCAACAAAAGGGATTAGTGCCTAGCTTAAACGATGTCATTCCAGGTGTGGTTCAACGATTTTGTGTCAG
ACATATATACAGTAATATGGGAAAGAAGTTTCCAGGAAAGCAAATAAAGGAAATAATGTGGTGGGCTGCGAATGCCACATATAAACGACAATGGGAGAGGGAAATGGAAG
CAATGAAAAAGATAGATGATGGAGCTTATAGATGGTTATCTGCAATTAAGCCAAGTCTCTGGAGCAAGCATGCCTTCACTACAAACAATATATGTGGCCGAATGTTACAA
AAAGAGGCGTATGGATCTACTTATGAACCTCTCATATATCCAACCAATGGTTCTAACCTATGGCCATCCACCAATTATTCAATAATTCAACCACCCACAATTAGAAGAGC
TCCTGGTAGACCACATAAACTGAGAAGAAATAGGAAAATGAACCAAAGAAAGGGGTCTTCCATTGGCAAGAGAAACGTCTCAATCAAATGCAGTAGATGTAAGAAATTTG
GTCATAACAAGAGAAGCTGTAAAGAGCCCATTGAACAAGGAAGTGAATCTCAGATGATGGGGGAAGACGTGGCCTGCAAGACGGAAAACCTGCACACTAGTGTGGTGCTA
GCCGCACCGACTCCGATGCTTAAGTCAGAAAACAGAAGTACGAGGGCTAGAGAGAATTCGGAGGCGTTTTGGGACGAACTAGGCGAAACCGGGGCGGCCAAAGGCGGTAT
GGACCGAACAGAGTCGGTGGGACTCGGCTCGCGCAAGCAGGCCGGGCAGAGGCCGAGCATGGGGTCGGGCCTTGGTCTTGGCCTGATCCACTGGCCCGTTTCCCCGCCCG
AGTCCATCTTCCAGTCCGATTTCTGCCCGGTTGTCCTCATCAGCTCCATGTGCATCGGGGTGGTCCAAAATTACCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAATCCTGGGATCGTAGCGCAAATGAATCTGAATATAAGACATGAGCAAGAGGAAATTAAAGGTGCACCAATTGAGAGAATGAGAATAATCCAAACTTTTCAACG
TCTTTATATTTGCATGGATGGATGCAAGCAAGGATTCTTGAAAGGTTGTAGACCAATTATAGGGTTGGATGCTTGTCACCTAAAGGGACCTTATGGAGGACAACTTATGG
CTGCATTTGGAAGAGATGCAAATGATCAATTCTTTTCGCTTGCTTTCTCAGTGGTTGAGGCTGAAACCAAAGAATCTTGGGCATGGTTCCTACAACTATTGTTAGGTGCT
ATAGAGTATGGGCCAAACCGAAAATATACCTTCATCTCTGACCAACAAAAGGGATTAGTGCCTAGCTTAAACGATGTCATTCCAGGTGTGGTTCAACGATTTTGTGTCAG
ACATATATACAGTAATATGGGAAAGAAGTTTCCAGGAAAGCAAATAAAGGAAATAATGTGGTGGGCTGCGAATGCCACATATAAACGACAATGGGAGAGGGAAATGGAAG
CAATGAAAAAGATAGATGATGGAGCTTATAGATGGTTATCTGCAATTAAGCCAAGTCTCTGGAGCAAGCATGCCTTCACTACAAACAATATATGTGGCCGAATGTTACAA
AAAGAGGCGTATGGATCTACTTATGAACCTCTCATATATCCAACCAATGGTTCTAACCTATGGCCATCCACCAATTATTCAATAATTCAACCACCCACAATTAGAAGAGC
TCCTGGTAGACCACATAAACTGAGAAGAAATAGGAAAATGAACCAAAGAAAGGGGTCTTCCATTGGCAAGAGAAACGTCTCAATCAAATGCAGTAGATGTAAGAAATTTG
GTCATAACAAGAGAAGCTGTAAAGAGCCCATTGAACAAGGAAGTGAATCTCAGATGATGGGGGAAGACGTGGCCTGCAAGACGGAAAACCTGCACACTAGTGTGGTGCTA
GCCGCACCGACTCCGATGCTTAAGTCAGAAAACAGAAGTACGAGGGCTAGAGAGAATTCGGAGGCGTTTTGGGACGAACTAGGCGAAACCGGGGCGGCCAAAGGCGGTAT
GGACCGAACAGAGTCGGTGGGACTCGGCTCGCGCAAGCAGGCCGGGCAGAGGCCGAGCATGGGGTCGGGCCTTGGTCTTGGCCTGATCCACTGGCCCGTTTCCCCGCCCG
AGTCCATCTTCCAGTCCGATTTCTGCCCGGTTGTCCTCATCAGCTCCATGTGCATCGGGGTGGTCCAAAATTACCTATAA
Protein sequenceShow/hide protein sequence
MSNPGIVAQMNLNIRHEQEEIKGAPIERMRIIQTFQRLYICMDGCKQGFLKGCRPIIGLDACHLKGPYGGQLMAAFGRDANDQFFSLAFSVVEAETKESWAWFLQLLLGA
IEYGPNRKYTFISDQQKGLVPSLNDVIPGVVQRFCVRHIYSNMGKKFPGKQIKEIMWWAANATYKRQWEREMEAMKKIDDGAYRWLSAIKPSLWSKHAFTTNNICGRMLQ
KEAYGSTYEPLIYPTNGSNLWPSTNYSIIQPPTIRRAPGRPHKLRRNRKMNQRKGSSIGKRNVSIKCSRCKKFGHNKRSCKEPIEQGSESQMMGEDVACKTENLHTSVVL
AAPTPMLKSENRSTRARENSEAFWDELGETGAAKGGMDRTESVGLGSRKQAGQRPSMGSGLGLGLIHWPVSPPESIFQSDFCPVVLISSMCIGVVQNYL