; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031777 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031777
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationchr11:14339463..14345842
RNA-Seq ExpressionLag0031777
SyntenyLag0031777
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB2595522.1 hypothetical protein D8674_030972 [Pyrus ussuriensis x Pyrus communis]1.7e-6454.39Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA
        MHRI LEEGS  S E QRR NP M EVVKKEVIK LD G+IYPI+D+ WVSPVQCVPKK G TVV N +NEL+PTR  TGWRVC+DYR+LN  T +    
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA

Query:  FAMIQQHF---------------------------RDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEE
           I Q                             ++C  +F  LK +L +API+  P+WSLPFE+MCDAS   +GA+LGQ++ K  H IYY S+ LN+ 
Subjt:  FAMIQQHF---------------------------RDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEE

Query:  QINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI
        Q+NY+TTEKELLAIVFA +KF  YL+G++V +FTDHAA+
Subjt:  QINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI

XP_019236126.1 PREDICTED: uncharacterized protein LOC109216433 [Nicotiana attenuata]5.6e-6862.91Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA
        MH+I LE+G   S+EQQRR NP MKEVVKKEVIK LD GII+PI+D+NWVSPVQCVPKKGG TVV N+ NELIPTRTVTGWRVC+DYRRL +        
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA

Query:  FAMIQQHFRDC-RKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLV
           +  +F D    AFE LK  L++API+ AP+WSLPFE+MCDAS   +GA+LGQ++ K  + IYY SK L++ Q NYTTTEKELLA+V+AFEKF  YL 
Subjt:  FAMIQQHFRDC-RKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLV

Query:  GSEVTIFTDHAAI
        G++V + TDHA I
Subjt:  GSEVTIFTDHAAI

XP_019258694.1 PREDICTED: uncharacterized protein LOC109236911 [Nicotiana attenuata]3.2e-7163.11Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA
        MH+I LE+G   S+EQQRR NP  KEVVKKEVIK LD GII+PI+D+NWVSPVQCVPKKGG  VV N+ NELIPTRTVTGWRVC+DYRRLNKAT +   A
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA

Query:  FAMIQQ------------HFRD-CRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAI
           I Q            +F D C  AFE LK  L++A I+ AP+WSLPFE+MCDAS   +GA+LGQ++ K  H IYY SK L++ Q+NYTTTEKELLA+
Subjt:  FAMIQQ------------HFRD-CRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAI

Query:  VFAFEKFCPYLVGSEVTIFTDHAAI
        V+AFEKF  YLVG++V + TDHAAI
Subjt:  VFAFEKFCPYLVGSEVTIFTDHAAI

XP_020250927.1 uncharacterized protein LOC109828314 [Asparagus officinalis]6.2e-6762.26Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA
        MHRI LE+    SIE QRR NP MKEVVKKEV+K LD G+IYPI+D+ WVSPV  VPKKGG TVV N +NELIPTRTVTGWR+C+DYR+LNKAT +    
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA

Query:  FAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVG
           I Q         E LK  LI+API+  P+W+LPFEVM DAS   VGA+LGQ++ K +H IYY S+ L+E Q+NY TTEKELLA+VFAF+KF  YLVG
Subjt:  FAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVG

Query:  SEVTIFTDHAAI
        S+V ++TDH+AI
Subjt:  SEVTIFTDHAAI

XP_023753304.1 uncharacterized protein LOC111901651 [Lactuca sativa]8.1e-6750.19Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKA-------
        MH+I LEEG+  S+E QRR NP MK+VVKK++IKWLDVGIIYPIADN W+SPV CVPKKGGATVV N+  E+I  RTVTGWR+C+DYR+LN A       
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKA-------

Query:  ---------------------------TCECLLAFAMIQQH-----------------FRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVG
                                    C  ++   ++  H                    C++AFE LK+ L SAPI+ APNW  PFE+MCDAS   +G
Subjt:  ---------------------------TCECLLAFAMIQQH-----------------FRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVG

Query:  AMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI
        A+LGQ++GKF H IYY S  LN  Q+NYTTTEKE LA+VF+ EKF  YL+G++V ++T HAAI
Subjt:  AMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI

TrEMBL top hitse value%identityAlignment
A0A1S3ZVZ8 uncharacterized protein LOC1077910891.4e-6152.4Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA
        MH+I LE+G   S+EQQRR NP MKEVVKKEVIK LD GII+PI+D+NWVSPVQCVPKKGG TV++NK N+LIPTRTV GWRVC+DYRRLNKATC+    
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA

Query:  FAMIQQ-------------------------------------------------------HFRD-CRKAFETLKVVLISAPILCAPNWSLPFEVMCDAS
           I Q                                                       +F D C KAFE LK  L++API+ AP+WSLPF++MCDAS
Subjt:  FAMIQQ-------------------------------------------------------HFRD-CRKAFETLKVVLISAPILCAPNWSLPFEVMCDAS

Query:  GAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKF
           +GA+LGQ++ +  + IYY SK L++ Q+NYTTTEKELLAIV+AFEKF
Subjt:  GAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKF

A0A1U7XLI6 uncharacterized protein LOC1042389354.5e-6361.61Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA
        MH+I LE+G   S+EQQRR N  MKEVVKKEVIK LD GII+PI+D+NWVS V+CVPKKGG TVV N+ NELIPTRTVTGWRVC+DYRRLNKAT +    
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA

Query:  FAMIQQ------------HFRD-CRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAI
           I Q            +F D C KAFE LK  L++A I+ AP+WSLPFE+MCDAS   +GA+LGQ++ K  + IYY SK L++ Q+NYTTTE ELLA+
Subjt:  FAMIQQ------------HFRD-CRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAI

Query:  VFAFEKFCPYL
        V+AFEKF  YL
Subjt:  VFAFEKFCPYL

A0A2Z6MUU6 Reverse transcriptase5.0e-6247.24Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT------
        MH+I +E+     ++ QRR NP MKEVVK EV+K L+ G+IYPI+D+ WVSPV  VPKKGG TV+ N  NELIPTRTVTGWR+C+DYRRLNKAT      
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT------

Query:  ---------------------------------------------------CECLLAFAMIQQH--------------------FRD-CRKAFETLKVVL
                                                           C  ++   ++  H                    F D C  AFE LK  L
Subjt:  ---------------------------------------------------CECLLAFAMIQQH--------------------FRD-CRKAFETLKVVL

Query:  ISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI
        +S P++ AP W LPFE+MCDAS   VGA+LGQ Q KF H IYY SKVLNE QINYTTTEKELLAIVFA EKF  YL+GS+V +FTDHAA+
Subjt:  ISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI

A0A5N5F2P3 Uncharacterized protein8.2e-6554.39Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA
        MHRI LEEGS  S E QRR NP M EVVKKEVIK LD G+IYPI+D+ WVSPVQCVPKK G TVV N +NEL+PTR  TGWRVC+DYR+LN  T +    
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLA

Query:  FAMIQQHF---------------------------RDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEE
           I Q                             ++C  +F  LK +L +API+  P+WSLPFE+MCDAS   +GA+LGQ++ K  H IYY S+ LN+ 
Subjt:  FAMIQQHF---------------------------RDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEE

Query:  QINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI
        Q+NY+TTEKELLAIVFA +KF  YL+G++V +FTDHAA+
Subjt:  QINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI

A0A5N5FT77 S ribonuclease5.9e-6353.94Show/hide
Query:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT------
        MHRI LEE +  S E QRR NP M EVVKKEVIK LD G+IYPI+D+ WVSPVQ VPKK G TVV N++ EL+PTR VTGWRVC+DYR+LN  T      
Subjt:  MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKAT------

Query:  -----------------------CECLLAFAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLN
                               C  LL   ++ +    C  AF+ LK  L SAPI+  P+WSLPFE+MCDAS   +GA+LGQ++ K  H IYY S+ LN
Subjt:  -----------------------CECLLAFAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLN

Query:  EEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI
        + Q+NY+TTEKELLA+VFA +KF  YL+G++V IFTDHAA+
Subjt:  EEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.7e-1534.44Show/hide
Query:  VSNKDNELIPTRTVTGW--RVCMDYRRLNKATCECLLAFAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHP
        +  K  E+     +TG+  +   ++  + K   +CL     I     +   AF+ LK ++   PIL  P+++  F +  DAS   +GA+L Q      HP
Subjt:  VSNKDNELIPTRTVTGW--RVCMDYRRLNKATCECLLAFAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHP

Query:  IYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAIS
        + Y S+ LNE +INY+T EKELLAIV+A + F  YL+G    I +DH  +S
Subjt:  IYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAIS

P10394 Retrovirus-related Pol polyprotein from transposon 4123.8e-1137.5Show/hide
Query:  DCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDH
        +C+KAF  LK  LI+  +L  P++S  F +  DAS    GA+L Q       P+ Y S+   + + N +TTE+EL AI +A   F PY+ G   T+ TDH
Subjt:  DCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDH

Query:  AAIS
          ++
Subjt:  AAIS

P10401 Retrovirus-related Pol polyprotein from transposon gypsy3.8e-1141.35Show/hide
Query:  RKAFETLKVVLISAP-ILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGS-EVTIFTDH
        R AF+ L+ +L S   IL  P++  PF++  DAS + +GA+L Q+      PI   S+ L + + NY T E+ELLAIV+A  K   +L GS E+ IFTDH
Subjt:  RKAFETLKVVLISAP-ILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGS-EVTIFTDH

Query:  AAIS
          ++
Subjt:  AAIS

P20825 Retrovirus-related Pol polyprotein from transposon 2978.8e-1635.37Show/hide
Query:  VSNKDNELIPTRTVTGW--RVCMDYRRLNKATCECLLAFAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHP
        +  KD E+     +TG+  +   +Y  + K    CL     I     +  +AFE LK ++I  PIL  P++   F +  DAS   +GA+L Q      HP
Subjt:  VSNKDNELIPTRTVTGW--RVCMDYRRLNKATCECLLAFAMIQQHFRDCRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHP

Query:  IYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDH
        I + S+ LN+ ++NY+  EKELLAIV+A + F  YL+G +  I +DH
Subjt:  IYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDH

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.5e-1237.25Show/hide
Query:  KAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGS-EVTIFTDHAA
        ++F  LK +L S+ IL  P ++ PF +  DAS   +GA+L Q       PI Y S+ LN+ + NY T EKE+LAI+++ +    YL G+  + ++TDH  
Subjt:  KAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGS-EVTIFTDHAA

Query:  IS
        ++
Subjt:  IS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAGAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGGAGGTTTAACCCTGCAATGAAAGAGGTTGTTAAGAAGGAGGTGATTAAATGGTTGGA
TGTTGGGATTATTTATCCAATTGCAGATAACAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGCCACAGTAGTGTCTAATAAGGACAATGAATTGATCC
CAACCAGGACAGTGACTGGCTGGAGGGTTTGCATGGATTACAGGAGGCTTAACAAAGCCACCTGTGAATGTCTTTTGGCCTTTGCAATGATCCAGCAACATTTCAGAGAT
TGTAGAAAGGCTTTTGAAACTTTAAAGGTTGTTTTAATCTCAGCACCCATTCTTTGTGCACCTAATTGGAGTTTACCATTTGAGGTAATGTGTGATGCGAGTGGTGCTGT
AGTAGGTGCCATGCTAGGACAAAAGCAGGGCAAATTTATCCATCCTATTTACTATACAAGCAAGGTTTTAAATGAGGAACAAATAAACTATACAACTACTGAGAAGGAGT
TGTTAGCTATTGTGTTTGCTTTTGAGAAATTCTGTCCATATTTGGTTGGATCCGAAGTCACAATCTTCACGGATCATGCAGCAATAAGTCGTCGACCAGCTTGTCTCCTC
CACCTCCGACGACCGGTGATCTTCCACTCTTGCGAGCAGTTCCGGCAACGACATAGGCAGAGCAGTGGTGGCGTCGCGCAAAAATTCCAGTGCAGCGGCAGCGTCGGGCA
GAAAATTCTGCATCTTCTTCGACGAGCGACGACGTGGACAACAGTGGTTCAAGGTCGTTTTCAGCATTTCCAACACGTTTTAAGCAGTGGGTCTTCAAGTTCTTTCATTT
CCCGGCAATTTCGGCTTTGGGTGTCTACAATTTTTGGGATAGTTGTAGTTACGTCGTTATGCTACCAAATTTTTGATAGCGTTGCAGAAGTTAACGGGGCTAAATATAAG
ATGCAGAATTCCAATAAGGGTTCACTACCTTTCAGACATGGAATTCTTTTGTCTAAGAAACTGTTTCCTAAGACACCTCAAGAGGTTGAGGACGTGAGACGAATGCCTTA
TGCAATAAGGATTGTTAGATACACTGACACGGATTTTAAGACTGATAAGGATTTGAGGAAATCTACATCAGTATCCATGCATGGTCAATCTTCACGAAGGATCTATAGTA
TGACATAG
mRNA sequenceShow/hide mRNA sequence
ATGCATAGAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGGAGGTTTAACCCTGCAATGAAAGAGGTTGTTAAGAAGGAGGTGATTAAATGGTTGGA
TGTTGGGATTATTTATCCAATTGCAGATAACAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGCCACAGTAGTGTCTAATAAGGACAATGAATTGATCC
CAACCAGGACAGTGACTGGCTGGAGGGTTTGCATGGATTACAGGAGGCTTAACAAAGCCACCTGTGAATGTCTTTTGGCCTTTGCAATGATCCAGCAACATTTCAGAGAT
TGTAGAAAGGCTTTTGAAACTTTAAAGGTTGTTTTAATCTCAGCACCCATTCTTTGTGCACCTAATTGGAGTTTACCATTTGAGGTAATGTGTGATGCGAGTGGTGCTGT
AGTAGGTGCCATGCTAGGACAAAAGCAGGGCAAATTTATCCATCCTATTTACTATACAAGCAAGGTTTTAAATGAGGAACAAATAAACTATACAACTACTGAGAAGGAGT
TGTTAGCTATTGTGTTTGCTTTTGAGAAATTCTGTCCATATTTGGTTGGATCCGAAGTCACAATCTTCACGGATCATGCAGCAATAAGTCGTCGACCAGCTTGTCTCCTC
CACCTCCGACGACCGGTGATCTTCCACTCTTGCGAGCAGTTCCGGCAACGACATAGGCAGAGCAGTGGTGGCGTCGCGCAAAAATTCCAGTGCAGCGGCAGCGTCGGGCA
GAAAATTCTGCATCTTCTTCGACGAGCGACGACGTGGACAACAGTGGTTCAAGGTCGTTTTCAGCATTTCCAACACGTTTTAAGCAGTGGGTCTTCAAGTTCTTTCATTT
CCCGGCAATTTCGGCTTTGGGTGTCTACAATTTTTGGGATAGTTGTAGTTACGTCGTTATGCTACCAAATTTTTGATAGCGTTGCAGAAGTTAACGGGGCTAAATATAAG
ATGCAGAATTCCAATAAGGGTTCACTACCTTTCAGACATGGAATTCTTTTGTCTAAGAAACTGTTTCCTAAGACACCTCAAGAGGTTGAGGACGTGAGACGAATGCCTTA
TGCAATAAGGATTGTTAGATACACTGACACGGATTTTAAGACTGATAAGGATTTGAGGAAATCTACATCAGTATCCATGCATGGTCAATCTTCACGAAGGATCTATAGTA
TGACATAG
Protein sequenceShow/hide protein sequence
MHRITLEEGSFRSIEQQRRFNPAMKEVVKKEVIKWLDVGIIYPIADNNWVSPVQCVPKKGGATVVSNKDNELIPTRTVTGWRVCMDYRRLNKATCECLLAFAMIQQHFRD
CRKAFETLKVVLISAPILCAPNWSLPFEVMCDASGAVVGAMLGQKQGKFIHPIYYTSKVLNEEQINYTTTEKELLAIVFAFEKFCPYLVGSEVTIFTDHAAISRRPACLL
HLRRPVIFHSCEQFRQRHRQSSGGVAQKFQCSGSVGQKILHLLRRATTWTTVVQGRFQHFQHVLSSGSSSSFISRQFRLWVSTIFGIVVVTSLCYQIFDSVAEVNGAKYK
MQNSNKGSLPFRHGILLSKKLFPKTPQEVEDVRRMPYAIRIVRYTDTDFKTDKDLRKSTSVSMHGQSSRRIYSMT